ENST00000520259.1

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

ENST00000520259.1

Source

Gencode19

Same with

lnc-AC120194.1-1:2,NONHSAT127159

Classification

intergenic

Length

2367 nt

Genomic location

chr8+:71383369..71397922

Exon number

4

Exons

71383369..71384080,71390328..71390398,71392957..71393543,71396926..71397922

Genome context

Sequence
000001 TTTTCTAAGA CCTCCCTGGC CCGCCATACG CCGATCCTGA ACCTACGAAA ACCCAAGACT GCGGGCACGG TGGCTCATGC 000080
000081 CTGTAATCCC AGCACTTTGG GAGGCCGAGG CGGGCAGATC ACGAGGTCAA GAAGTCGAGA CCAGCCTTGC TAACATGGCG 000160
000161 AAACCCTGAC TCTACTGAAA ATACAAAAAT TAGCTGGGCG TGATGGCGTG CGCGCCTGTA GTCCCAGCTG CTCGGGAGGC 000240
000241 TGAGGTAGGA GAATCGCTTC AATCCGGGAG GTGGAGGTTG CGGTAAGCAG AGATTGCGCC ACTGCACTCT AGCCTGGCGA 000320
000321 CAGAGACTCC TTCAAAAACA ACAACAACAA CAACAACAAC AACAAAACAA AACCTCCCAA GACCCTAGTG GGCAGACACA 000400
000401 CAGCCAGCCA GACATGGAGA GGAGCACATT AGCAGAAGAA GACGCAAGTG GCTGGTCCTC GAGAGGACGT TGAGAGGAGC 000480
000481 ATGCCAGCAG AAGAGCACAC AACGACAGGC ACCGACACAC CACCGGCAGG CCATGCACCA GCAGAATGAT GCGGAGTTTG 000560
000561 ACGGGGCAGT CAGAGAAGAG CCTGGGCCGC CAAGGGGCCC GACTCCAGGG GAAAACCATC TCCCTTCTGG CTCCCCCATC 000640
000641 TGCTGAGAGC TACTTCTACT CAATAAAACC TTGCACTCAT TCTCCAAGCC CACGTGTAAT CCGATTCTTC AGTGTCACCT 000720
000721 CCCAAATGTG GAAATGCATT GTCTTTCTTC TTTCCTAGAC CAGTGGTCCT CAAACTTGAG CAGGTTTAAG CATGGTCATA 000800
000801 GCAGAATCAA TGTGCTCACC CCATCATGAA AAAAAAAATG TGTTAATGAA TTGATTACTA CTTTGAGTGT GCCATATTTT 000880
000881 AAAGACGAAG AGAAACTAAT CAAGTTAGTT TTTATTTGGT CAGAGGCTTG TTTGTGCCTG CTTATCTAAA GTTATAAAGG 000960
000961 CCCCAAATAT GGCTCCGTGT TGCAGAGGGA CTGTCTCTTC TTATATCGAC AGAGCCCATA TTCACAATGT ATCACACTTA 001040
001041 CCTCGATGTA TTGTCTGGAG TTTCCTAGGC TGTGTCCTTT GCCTGGAATG CTCTCTCTTG CTGGCTGGCT TCTTCCCACC 001120
001121 AGCTGTGGCC AGGGCTGCTC CTTATTCCTT CGTCAAAACT CCGTCTGCTG GGTGTGGTGC CTCACACCTG TAATCCCAGC 001200
001201 ACTTTGAGAG GCCAAGGTGG GAGTATCGTT TGAGCCCAGG AGTTCAAGAC AAGCCTGGAC AACAGAGCAA GACCCTGTCT 001280
001281 CTACCAAAGA AAAGAAAAGA AAAGAAAAGA AACAGCCACT TTCTCTTCCA TGAGCCAGTT TATGTTTTCT TCACGGCATT 001360
001361 TGTTACAAAT CACAAATGTG GAGGAAAAAT GAAGGCAAAG CAAAATAACA GGCAAGTCTG AAAACAAAAG TGTTGCCAGT 001440
001441 CACGCTAAGC CTCAGGCACG CATTCTGAAT GAAGCTAAAG TTTAAAGTCC CCTCTGATAG ATTTGTGACT TAAATGTCCA 001520
001521 CACTAGCAAG TGTGTTTCTC ACATGGAGAA GTAGCACGAT TTTATGTCAG GAAAGGGATG AGAACATGCT TCTTGAACAC 001600
001601 ATACAATGCT GAAGACATTT CCTGGAAAAC CATTGGCCTT TGCTTAGGAC AGTTGCATCA CACAGAGATG TTTGTGGTTC 001680
001681 AGATTTATAA GAAACATGGG AACTGAAGCC AAAGCACTAC TTTTGCTCAG TATAACCAGG TGCCAGATCT AAAGTGTTAG 001760
001761 GCAGGTTCCT ATATGTATGT TAGAAAAAAG TTGCCCAAGT TTTGGACAAT TCCTTTTCCA TCTCTCTTAC AAGTCAACAA 001840
001841 ATCGAAGCAA TAAACTATTC TCTGAAGAAA ACAAATTTAT TTGGTTGATT CAAACTTTTA ATGAGACTAT TACTCCCCTG 001920
001921 TGTGCCTTTC CTTTCTCTGG TTCTAGAAAT ACATCTAGAA AGGGGGAAAA TGTTAATATA TTTTCTACTG ACTTTATATC 002000
002001 ACAAATGCTT AGCTTAGAGG GTGGACTCTG AGGAACAGTG TCCTTTCCAA ATATTATAAA ACTTTGTGTT CATTCTGGAA 002080
002081 AAACAAACAA AATTAATCAC AATGATTTCC TGCTTTGGAG CAATGAGAAA TGGGAGGGAA ACTTCACCTT TTCATAGCTT 002160
002161 TTGATGTTTG CACCATGTGA ATTACCCACC CCCAAATTTA AAATACGAAT TTTTGAAAAA GTGTTTACCT CTGAGAAGAG 002240
002241 AAACTAGTAG TGAAGAATGA GGAGCTGTAG ATTTTTAACG ATTCTGATGT ATTTGCATTT TTCCCAATGT TCATGAATAA 002320
002321 ATAAATGTGC ATTCATAATT AAAAAGAATA AACAACCTGG AAAAAAC
[back to top]

Predicted Small Protein

Name ENST00000520259.1_smProtein_968:1192
Length 75
Molecular weight 7878.234
Aromaticity 0.0945945945946
Instability index 41.4986486486
Isoelectric point 8.57891845703
Runs 15
Runs residual 0.0600527356625
Runs probability 0.0408467290821
Amino acid sequence MAPCCRGTVSSYIDRAHIHNVSHLPRCIVWSFLGCVLCLECSLLLAGFFPPAVARAAPYS
FVKTPSAGCGASHL
Secondary structure LLLLLLLLLLHHHLHHHEELLLLLLLEEEHHHHLHHHHHHHHHHHLLLLLHHHHLLLLLL
EELLLLLLLLLLLL
PRMN LLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHHHHLLLLLLLL
LLLLLLLLLLLLLL
PiMo iiiiiiiiiiiiiiiiiiiiiiiiiiiTTTTTTTTTTTTTTTTTTTTTTTTToooooooo
oooooooooooooo