ENST00000563938.1

From LncRNAWiki

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomenclature

Please input transcriptomic nomenclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.


Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

ENST00000563938.1

Source

Gencode19

Same as

lnc-H2BFM-1:1,NONHSAT138080

Classification

intergenic

Length

4484 nt

Genomic location

chrX+:103197629..103202112

Exon number

1

Exons

103197629..103202112

Genome context

Sequence
000001 ATGATAATGG AGATGGAGTA CAAATGGGAC AACTCTGAGA TAGAGTACAT GAAAGCTCAA GACACCTGTA TCCAACTGCC 000080
000081 TACTTGATGT TTTCACTGAC ATATAGGCAT TCCAAACTTA ATATGTCAAA GATTAAACTA TTTATTCTCC TTACCAATCT 000160
000161 GTCCCTAGTC TTTCCACTTC CAATGGTATC ATCATTTTCT GGTACTTTCA ACACAAAGGA TACTATCATA TAGGGAACGT 000240
000241 GACTCTCTCA CTCATTACTC TATTTCCAGT GCCCAGATTG TCATGCACAT AGTAAGTGCT CAATAGATAT TTGTAGAATG 000320
000321 AATGGATGAA TTGGCCCTGC CCTCCAAACA TGTTCGATTA AAAATCATTC AAGAATATAC ACAAAGTAGT ATATGAAAAA 000400
000401 ATGACAGACT TCTGTCCTTT CTTTTCAATT ATTAGACCCA CAGCCTTAGT TCAGTCTTTC CTGATTTTAT ACCTGAAGAA 000480
000481 CTGCCACAAC TCTTTTTATT GGCTCTTCTT GTCACCTTCC AATGTTAATA GCTTACAAGA TAAAATCCAA ACCACTTGGC 000560
000561 GTGACATTCA AGGTCCTCCA TAATCTGACA CCAGCCTACC TTTTTCACCT CTTTTTCCTC TACTTAAATA TTATGAATCT 000640
000641 CGATGTCCCA ACTACACAAG ACTACTGGAT GTTTCCCAAA TGTGCCCTAC ATTTCTCTTT GTTCTTCTTT TCTCCTCTTC 000720
000721 CTGGAATTCC ATTGCCCCCA TTTCTTCCTT CAACAAATGC TATCCATTTT TAAAGGCCCA ATTCAAATGT CATTTCCTCT 000800
000801 ATGCAGGCTT CTTGGACTCT CTCAGATAGA ATTAGGCTTC CCTCTTTTCC ACTTTCCCCT GGTACTTATC AACCTTGATG 000880
000881 TAACATTTAA AATCATAGAT GTCTCAGAAT TATAATATTA GATTTAGAAG TGGTCTTGTT GATCATGTGA TCCGACTTCT 000960
000961 TCATGTTATC GAGGGCAAGA GCTAGAAGAA TCCAGGTCTT TTAATTTGCC TTGACTTGTG ATTATGTATG TATGTATTTT 001040
001041 TTATCCTCTA CCTGACATCA CTGGGCAGAA ATAGGGTGTT ATTCATCTTT GTGTTCTTAT TCATGATACT TAATAAAATC 001120
001121 TATTGCACAT AGTGGATATA CAATAAGTAT GTTTCAATTG ACTTAAGATC AGAGGGATGG ATAAAATAGA GGTACCATTC 001200
001201 ATAGTATTGT ACTCTGTACC AGGGTACTAT GCCAAAGTTT TACAAGAATT ACCTGATTGA TCAAACAATC CTATGAGATA 001280
001281 AGCACCATTT GTAGCTTTAT TTGCCCAAGA CTTCATAGCC AGTAAGTTGT AGAATCTGGA CAAAAATAAA ATTCCTGTGG 001360
001361 CTCCAGATTC TTTGTTCTTA GTCATTGTGC TATAGTGCAA AACAACAAGG TGGGAACTCA AACATTTTGG GTTTAGAATA 001440
001441 CGACTGATGG ATGAATTTGG CATGAAAGAC TAACTTTTAT TTCCTGGAGG ACATAGTAGC TGGGTTTGGG AAGATAGTTT 001520
001521 GGGACAATAT TGTAGAAGGC TCTGAATGCC AAGCATGGTA TGAGAAGTTA CCTCCCTTCC CTAGCTCTCC ATCTGCACCA 001600
001601 GTCTAGATCA CCGTATGAGG CGCAACTCAA ACAGTACTTA TTGTTTGAAG ACTTCCAGAA TCTTCCAAGT CACAATGAAT 001680
001681 TTCCCCTTCT CCCACAGCAC TTTGTTCATA CCTTTGTTAT AATACATAAT ATTACTTTCT TTCTTTTATG TTTTTTTGAG 001760
001761 ACGGAGCCTC TCTCTGTTAC CCAGGCTGGA GCGCAGTGGT GTGATCTCAC CTCCCCTGGG TTCAAGCGAT TCTCCTGCCT 001840
001841 CAGCTTCTGA GTAGCTGGGA CTACAGGCAT GCACCACCAT GCCCCGCTGG CTTTTGTATT TTTAGTAAAA ACGAGGTTTC 001920
001921 ACCATGTTGC CCAGGCTGGT CTCGAACTCC TGGCCTCAGG AGATCCACCA GCCTCGGCCT CCCAGAGTGA TGGGATTACA 002000
002001 GGCGTGAGCC ACTGAGCCCG GCCAATATGA CTTTCTTAAC AGAAATATTT TGTGGCTGTA TCTGTTTTCT TCACATTTTG 002080
002081 TAAATTCCTT GAGAGCAGGC ACTTTATTTA CAATTGTATA CCTCTTTTTA TTTCACAACT GATGCATAGT AGATGCTTGT 002160
002161 CTTCGTCCAT TTTGTGTTGC TATAAGTGAA TACCTGAAAC TGGGCAATTT ACAAAGAAAA GAGGTTTATT TAGCTCATGG 002240
002241 TTCTGCAGGC TGGGAAGTTT AAGATTGGGC AGCTGCATCT GGTAGCTTCT GATGAGGGCC TTGTACTGTG GCAAAACATG 002320
002321 GCAGAGAAAT GGAAGGGGAA CCAGGTCCCT GCAAAGAGAG TAAAACACCA AACACGAGAT GCAACCTCAC TTTACAACAA 002400
002401 CTTGCACTTG CATAACCAAT TCAGTCCCAC AAAGAGTAAG AACTGACTCA CTCCCACCAG ACTGCTTTAA TCCCTTCATG 002480
002481 AGAGTGAATC CTTAATGACC CAGAAGCCTG TTAAATATTC CACCCCCTCT CAACACTGTT ACACTGGGGA CCAAGTCTCA 002560
002561 ATATGAGTTT TGGTGGGCAC AAACCATATT CAAATCATAA CAATGCTCAA TAAATATTAG TTGAATTAAA TGTTTCCTTG 002640
002641 GGAGAATATA GACATAAATA AGGCATGGTT TCTGCTCTCA AGAGAAGCCA ACATTCTATC AAGGGAAACA GTACATTATT 002720
002721 CCATATGAGA AGCTATACTA GTGACATGTA TAAAGCACGA GGAATTCAGC GATCAATTAG AAAATGCAAT AGGAAAAATA 002800
002801 ACCCATTCAT GATTTTCCCA AAATCTATAA GGCAGCTAGG AAGATAGCTA ACAGATGTAT AAACCATTAT ACAGAAAAGT 002880
002881 ATGAGATTGT ACTAAAGGAA ATTTTATAAA AATCAGAAAG TGGAGGAATA TTCTTGTCAA GAACTACAAA AGGTCTAGAA 002960
002961 TTTGACCCTA TTTACAAACT TACAAATTAG CCTGACACAC ACAGTTTCAT GTATTCTGTC AGAAGACATG AGAATCTTGG 003040
003041 GTGAGAGGCA AAGGACTTTA TTACTCAGAA AAAGCAGTAG CTAGAACTTC CTGTTGGCTT GTGTTCGTTC CCAATGCCCC 003120
003121 CCAAATCCCA CAAGGGCAAT GCAAAGGACC AATGATGGAT GGCTGCACAG TGGGTTGCAT TACAGGAAAG AAACAATACA 003200
003201 TTTGGAGGAT CCACTGCTTT TATAGAACAT GGAAATAAAC CTGCTGTTTG TGATGTTTTC CCATCCTTCA GGGTTTCTCA 003280
003281 CTGAAAACAT GGTCCTGAGA ATGGCCAAGG TGAAGAGTCA TCAGTGCCTT ACCTTTTTGG CATACTCAGC AAGAGTACGC 003360
003361 ATAAATGCTT AGGGACCATG ATGGGTTGTC TTTCCCAACA ATATTCCATT TGTATGTGTG GAAACTTTAA GTCAGCAAAT 003440
003441 TGGTAATTCT TCTCAAATTA GTCCATAAAT CCAATATTAT TCCAATCAAA ATATTAACAG GCCAATTTTT TGTGGAATTT 003520
003521 GAAACACTGA TTACAAAACT GGAAAAGGCA CATATAGTAC ATACAGTAAA TTGTTATATG AAAGAAGTGA CATTACATAG 003600
003601 GAGTGAAAAA TGGATGAATA ATTTAATAAA TGTGATTGGA CAGTTGTTTC CCTTCTCGGA GAAAAATCTA GATCCATGCC 003680
003681 TTATACAAAA TAAAATCCAG ATATATAAAA GACCAAAGTA CACAACCAAA ATCTTAAAAC TAATAAAAAA TCATGTCCTC 003760
003761 AGAGTAGATA ATTTCCTAAA CAAAATCCAA AAACACAAGT TTTAAAGGAA AAGATTGATA TATTTGATTC GACAAACATT 003840
003841 GTATAATAAA AGACGTGAGA AGTAACCTTA GTGAGAAAAG ACGAGCAAAG ATTGGTACAG GCTTTTTGGA AAACTAGTTG 003920
003921 GCTCTATCTA CTGAAGCTGA ACCTATGCAT TCCCTATGAC CAGCAACTCC ACTTCTTGGT ATATACCCAA GAAAAGTGCA 004000
004001 TATGTATATT CACTAAGGAC ATTATGAAAA GCATTATTTG TAATAGCCCC ACACTGAAAA AAAAAACCAA GAGCCCACAA 004080
004081 GCAGTAGAAT GAATAAATAA ATTGTAGTAT ATTCATACAT TAAAATGTAG CACATAAAGG AGAATTAACT ACAAATTCAC 004160
004161 GACAACATGG ATGAATCTCA CAAACCTAAT GTTGAGCAAA AGAATCCAGA CACAGAGGAG TATATACTGA AAGATCCCAC 004240
004241 TTATTTAAAG CAGAAAAGAG GCAAAACTAA TAATCAGGAG AGTGGTTATC ATCCTTGGGC ATAGTGACTG GAAAGGAGCA 004320
004321 CAAAAGAAGC TTATGGCAGG GAGGGGAGAG GTTGATAATG TACTGTGTCT TGATCTGAGA GCTGGTTATA TGGGTGTCTT 004400
004401 TACTTTGTGA ACATTTATCA AGATATACAC TTAAGATATG TATGTATGTA TATTGTACTA AAATAAAACA TTTTGAAAAT 004480
004481 CTGC
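
The length, genomic location, and numbered listing above can be cross-checked against each other. The following is a minimal sketch, not part of the LncRNAWiki record: the function names are hypothetical, the location is assumed to use 1-based inclusive coordinates, and each listing line is assumed to hold position counters at both ends with 10-nt groups in between.

```python
import re


def parse_location(loc):
    """Split a string such as 'chrX+:103197629..103202112' into its parts."""
    m = re.fullmatch(r"(chr\w+)([+-]):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    chrom, strand, start, end = m.groups()
    return chrom, strand, int(start), int(end)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1


def parse_listing(lines):
    """Join numbered listing lines into a single uppercase sequence string."""
    chunks = []
    for line in lines:
        for token in line.split():
            # Purely alphabetic tokens are sequence; numeric tokens are counters.
            if token.isalpha():
                chunks.append(token.upper())
    return "".join(chunks)
```

With the values from this record, `span_length(103197629, 103202112)` gives 4484, which agrees with the stated length and with the final counter (004481) plus the four trailing bases.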

Predicted Small Protein

Name ENST00000563938.1_smProtein_1559:1849
Length 97
Molecular weight 11264.8152
Aromaticity 0.166666666667
Instability index 71.471875
Isoelectric point 6.25872802734
Runs 15
Runs residual 0.022014604811
Runs probability 0.0560501678149
Amino acid sequence MRSYLPSLALHLHQSRSPYEAQLKQYLLFEDFQNLPSHNEFPLLPQHFVHTFVIIHNITF
FLLCFFETEPLSVTQAGAQWCDLTSPGFKRFSCLSF
Secondary structure LLLLLHHHHHHHHLLLLHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLEEEEEEEEELLLL
EEEEEEELLLLLHHHLLLEEEELLLLLLEEEEEELL
PRMN LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHH
HHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL
PiMo ooooooooooooooooooooooooooooooooooooooooooooooooTTTTTTTTTTTT
TTTTTTiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
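
The predicted protein can be related back to the transcript with a short sketch. Everything here is illustrative and not part of the record. The suffix 1559:1849 in the name is read as 0-based, inclusive coordinates; that reading is an assumption, but it is consistent with the ATG at 1-based position 1560 and the TGA ending at position 1850 in the listing above. The span is then 291 nt, or 97 codons including the stop, which may explain why Length is given as 97 while the printed sequence has 96 residues. Aromaticity is assumed to be the fraction of F, W and Y residues. The function names are hypothetical, and the codon table is the standard genetic code.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def extract_orf(transcript, start, end):
    # Assumption: start and end are 0-based and both inclusive.
    return transcript[start:end + 1]


def translate(orf):
    """Translate an ORF codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(orf) - 2, 3):
        aa = CODON_TABLE[orf[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def aromaticity(protein):
    # Assumed definition: relative frequency of Phe, Trp and Tyr.
    return sum(protein.count(aa) for aa in "FWY") / len(protein)


# Amino acid sequence as listed in the record (two wrapped lines joined).
PROTEIN = (
    "MRSYLPSLALHLHQSRSPYEAQLKQYLLFEDFQNLPSHNEFPLLPQHFVHTFVIIHNITF"
    "FLLCFFETEPLSVTQAGAQWCDLTSPGFKRFSCLSF"
)
```

Under these assumptions, `translate("ATGAGAAGTTACCTCCCTTCCCTA")` (the first 24 nt of the span) returns `MRSYLPSL`, matching the start of the listed protein, and `aromaticity(PROTEIN)` evaluates to 16/96, which matches the listed value of 0.1667.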