ENST00000564237.1

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

ENST00000564237.1

Source

Gencode19

Same with

lnc-HIST2H2AA3-1:1

Classification

intergenic

Length

4527 nt

Genomic location

chr1-:149816065..149820591

Exon number

1

Exons

149816065..149820591

Genome context

Sequence
000001 AGAGTTCCGA CTGAAATTTG AGAAGCTCTT TGCTATTCAA GTGGATATGT GCAGTTGACA GTTTGAGAGA TGCATCTAGG 000080
000081 GTTCAGTAAA GACAACACAA GCCTGTCTTT AGGGTCTACC TGTGAACTGT GAACACAGCA ATGAGAATGA TGGACATCAC 000160
000161 CTTTAAGTAT TTTTCTAGAC TTTATTACTC ATGTGTTTGT CATGAGGTGT AACTTAGTAG TTCATAGTCC TATAATGTAT 000240
000241 GTTATTGACT AGGTAGCATT TATTTTTCTA ATTGTTTCTG TTATAGTGCT GCCACATGTG TTTCCCAGAA ACGCATTTTA 000320
000321 CCCACAGTTC TTAGGGTTGG CCTGATTAGT TTAATTGCTG TCTGAACCTG CTTCTTACTG TGATTAGTTC AGGAATCTAG 000400
000401 ATCAAACTCA TTGGCATTTA ACATTTCAGG AAGTGAACTG AGTAACAACT AACTCAGCAG GGGAGTGTAG TATGCTATTA 000480
000481 TCTTTTGGGA AAGCAGCTTA TTTGCTTTCA AGAGGCAGCA GGAGGATGGA CTGTCTTTAG GTATGAAGGC AAGAATGATT 000560
000561 ATAGACAATA TGCAGAGGAG GACACAGTGT GGGAGAATCA GGGAACTGAG CCACTCATGG AGTTCAGCCA ACCTTGGATC 000640
000641 CCTGCTGCAC CTTTGGACTT CCAGTTAAGC CAATTTGTCT GACATATTTA CTTATACCAG TTTGAATCTT GAAATATTTC 000720
000721 AGGAATAATA ATTTCCTAGA TAAAAGGAAA GACCTTTCAT GAAAGGTCTC AAGTCAAATA GGGTCAATTA GGACAGAGTT 000800
000801 GCTCCAATTA CATATTTGGA ACAGATGTCC AAATGTTAAT ACTTGACTAA GGCTAAAGAC TAATATTACC ATCACAGGAA 000880
000881 AAATGTCCAG GGTTTTTTTT CAGATGTGAA ATTTTATTTA AAAATTTTAA ATAAACTAAA TCAAAAAATT TTAGTAGTTG 000960
000961 TACTAATTTC CTGGGGCTGT CAAAGTACCA CAAACTGTAT GGCGTAAAAC AACACAAAGT TATTCTTTCA TGGTTTTAGA 001040
001041 GGCTAGAAGT TTGAAATCAA CGTGTTGGTA GGGCCATGCT CTCTCCAAAC CCACTAGGGG AAGACTCCTG TCTTTCAGTG 001120
001121 TCTGGTAGCC CCACTTGTTC TTTGGTTTCT GGCAGCATAA CTGTAATCTC TACCTCAGTT TTTTCATGTA TGTCTCCATG 001200
001201 TTTTTTTACT TTCTTTCTTG AGATGGAGTT TCACTCTTGT TGCCCAGGCT GGAGTGCAGT GGCATGATCT TGGCTTACTG 001280
001281 CAACCTCTGT GCCCCGGGTT CAAGCAATTT TCCTGCCTCA GCCTCCCGAG TAGCTGGGAT TACAGGCATG CGTCAGCACG 001360
001361 CCCGGCTGAT TTTGTATTTT TGGTAGAGAT GGAGTTTCAT CATGTTAGTC AGGCTGGCCT CGAACTGACC TCAGGTGATC 001440
001441 CACCTGCCTT GGCCTCCCAA AGTGCTGGGA TTACAGATGT GAGCCACTGC ACCCGGCTGT CTCCATGTCT TCTTATAAGG 001520
001521 GTATCAGTCA TACTGGATTA GGGCCCACCC TAAAGACCTC ATTTTAACTT GATTACCTCT GTAAAGACCC TGTTTCCAAA 001600
001601 GAAGGCAAAA TTCTAAGCAA CTAGGGGTTA GACTTCAACA TATCTTTCGG GGGGACACAA CTCAACCCAT AACAGTAGTC 001680
001681 AATGGCTGTG GCAGGCTAAA TGTGGCTCCC AAATATGTCC ATATCCTAAT CCCTACAGCC TGTGAATATT ACCTTATATA 001760
001761 GCCAAGAGGA TTTTGCAGAT GTGATTCTGA GATTGAGAGA TTATGCCAGA TTATCCAGGT AGGCCCCAAA TGTAATCACC 001840
001841 ACAGTCCTTA TAGGAGAGGC AAGAAAGTCA AGTGTAGAAG GAGGCGATAG AAGGAGAGAG GGATTTGAAG ATTAATAGGC 001920
001921 TGCTTGCTTT GAAGACAGAG GGAAGGGACC ATAAACCAGA AATAAACCTC TAGAAGCTGG AAAAGGCATG GAAATAGACC 002000
002001 CTCCCTTAAG GTCTCTGGAG GGAGTGCAGC CTTGATTTCT ACCGAGTAAA ATTGATTTTG TACTTCAGAC CTCCAAAACT 002080
002081 GTAAGAGAAT GACTGTTGTT TTAAAACCAT TGAGTTTGTA GTAATTTGTT GCAGCAGCCA CAAGAAACTA ATACAACATC 002160
002161 TATATAGAAT TTTTTCAATA ATTGGAGAAA TTTGAATATG GATTGCATAT TAATATTACT GAATCAGCAT TAAATTTGTT 002240
002241 AGGTGTAATA ATGTGATTGT AGCTATTTAG GAGAATATCC TATTTTTAAG AGACATGCCA CCATATTTAG GGAGAAGTGC 002320
002321 CAACATATTT GCAGTTTATT TTCAAATGGT TCAGAGGCTG TCTGTGTACA TGAGAAGACA AAGATAAGGC AAATGCAGCA 002400
002401 AAATTGTAAT AATTGGTGAA TCCAGGTGAA GGGACTATGG CTGGTCTTTG TACTTTTTTT TCCAACTTTT CTGTAGGTTT 002480
002481 AAAATTTTCA AAATAAAAAA ATGGGAAATA CTTTAAAAAT TGTAATCAAA GACATTAGTA CAGAAACTTT CATAATGTAT 002560
002561 TTTATTTTTA CAGTAAAATT AATTTATGTA AATTGATAGA ATTTTACTAA TTTCACTCCC AAGTTACATT AAAAGGCTTA 002640
002641 CATATGTTTG ATAATAGCAT ATGTAAACTA GAACTCTGAA TGATATCCAT TGGTCATAAT ACGTACTATG TAGCGGTAAT 002720
002721 GGTGACTTTT GTGATTGCAC AAGTCTAGAG ATGCCCCAAA TGACATTGAC TTAGACATCT GGTTATTCTA AGGCTGAAAC 002800
002801 TGAAGTTGAA TAGAAGGTTT TAGTCAAATA CTGAGATGAA AACTGAGGCA GTCCTGGCGG GGGGGAGTGA GTGTGTGTGT 002880
002881 ATATATACAC ACATAGACAT CATGCTTCTA AACATTTACA GAAAGAAAGG GTAGATTATC TACAAAAAAA TAAGAATCAG 002960
002961 ACTGATATGA GATCTTACAA ACCTAACCCC CTTCTCTTTC CTAAACTCCA GATTCTCATA TTTCTGACTT CCTATTTGAT 003040
003041 ATTTACACTT CGATATTTAC CAGGAGTCTT CAACATTTTG TTCAAAACAG TACTCTTGGT TTTCTTCCTC CAAGACTACT 003120
003121 CCTTACTCAT ATCAGCAAAT AGCAGCTCTT TTCAAGTGCT CAGTGTAAAA ACCTACAATT AATCCTTGAT TTCTCTTTCA 003200
003201 GTCAGCCTAT ACTAAATCAA TTTCATTTAA AATATCTCGG CTACTACTCT GCATCTCCAC TGCTACCATC GGCCTCTCCA 003280
003281 GTCACATTCT CCAAGAGCAC TCTATCTCAT TTAAAAGACA AAATCTCTGC AGTGGCCTGT GATGCTCCTT AATGGCCTAC 003360
003361 ATAATCCAGC CCTCAAGCAC CTCCGTGATC TCTGTAAAAC TTTCCCTTGG TCACTGTGCT TCAGCCACAT TAACCAGCTT 003440
003441 GCATATTTCT CACATTCACC AAGCTTGTTC CTGCCTTGGG GCCTTTGTAC TTACCATGTT CTGTTCTGAG AATACTCTGC 003520
003521 CTCAAGATAT CCTACAACTA TCTTACTGTA TTCAGCTCTC TGCTCAAGTA TTAACTGATG AAACCTGTCA TCCCTACTCC 003600
003601 ACTCCATGTT CTGCTTTACT TAACAGCAAT TGCACATATG GCCCCCTGAA TAATATACAT TTAGTCACTT ATTTTTACTT 003680
003681 ATCTGCTAAT TAAAATGTAG ACTTTTTCTA TTCTGTTTAC TGCTGTATTC CCAGCATGTT TTATCCGAAT GTGCAGTGGT 003760
003761 TTCTTTTCTT CTCCCTTATC GTGGGAAGTG ATGTGCACAA ATACACATAA TGGAGCCTGA ATGTCATATT GCTTTCATAC 003840
003841 CTGTGTGAAT TTTGGTAAGA AAGGAAAAGT AGCGATTGAC AGGTAATATA ATTACATTAA GTCACTCTCA TAGTTAGCTG 003920
003921 TTTATTGCTT TCCTGCTCTT ATTCTCAGTC CCCAGGACCA AATGTTGACC ACTACCTTCC CCCACATATA ATTAGGTTAT 004000
004001 TTACCGAACG CCATGCAGGT GGCTGTTAAA AGGAAGATAT ATACTTACCT TATAAACTCA ACTTTTCCCT GTTGTCTTTC 004080
004081 TGTCTCACCC CTACCTCCAT GCTTTAAATT AACTTTTCAG GCTTAGGCCT TATCTCTCAG TAGAGCCATA TAAGGTATGT 004160
004161 GTAAAAGCAG GAAAATGTTT CCTGGGGATG AAGCTTTGAA AAGCTTTTTT TTTTTTTTCT TTTGGCAATA AAATAAGGTA 004240
004241 GATTCAGCAC AATACCTAAT AACTAAAAAA TCTGTTTTTA ATTGGGTGGG GCAGACAGCA AGTGTGTCAT CCTGGAAGAT 004320
004321 ACTATTTGGG ATTTTATGTA GGTACATAAG AGAAAAAAGT GAACAAAAGC AAGGGGCTAC CAGGACGCCG CAGTATGCTT 004400
004401 AACATGTATT TTCTAAGTTT GTATTATGCC TTTATCTTGG TACTTTTATC TTCTGTTCTC ACTTGATCTT TTTGAAATGT 004480
004481 ATTTTAAATC CTAATAAAAA TATATAAAGT CTGGAATTAA TAAAGGA
[back to top]

Predicted Small Protein

Name ENST00000564237.1_smProtein_2966:3181
Length 72
Molecular weight 8299.8447
Aromaticity 0.183098591549
Instability index 44.7492957746
Isoelectric point 9.90020751953
Runs 7
Runs residual 0.0499629355078
Runs probability 0.0503066091301
Amino acid sequence MRSYKPNPLLFPKLQILIFLTSYLIFTLRYLPGVFNILFKTVLLVFFLQDYSLLISANSS
SFQVLSVKTYN
Secondary structure LLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHLLLEEEELLLL
LEEEEEEEEEL
PRMN LLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHHHHLLLLHHHHHHHHHHHHHH
HHHHLLLLLLL
PiMo iiiiiiiiiiiiiiiiiTTTTTTTTTTTTTTTTTTTTTTTTTooooTTTTTTTTTTTTTT
TTTTiiiiiii