ENST00000439598.2

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

ENST00000439598.2

Source

Gencode19

Same with

lnc-TM4SF20-1:3,NONHSAT077205

Classification

antisense

Length

2568 nt

Genomic location

chr2-:228085768..228189880

Exon number

14

Exons

228085768..228086200,228087098..228087276,228092486..228092608,228093197..228093348,228102802..228102890,228123894..228123999,228129385..228130154,228130591..228130651,228133069..228133159,228133344..228133485,228144199..228144291,228146420..228146505,228170176..228170371,228189834..228189880

Genome context

Sequence
000001 AGCGATGCGC AGACATCTTT GTTGTGGGCG AAACGAGGTG ACCGCAGAGT CTGTCACCCA GGTTGGAGTG CGGTGGTGTG 000080
000081 ATCTTGGCTC CCTGCAACCT CCACCTCCCA GGTTCAAGCG ATTCTCCTGC CTCAGCCTCC CAAGTAGCTG CGATTACAGG 000160
000161 CACCCACCAC CATGAAGAAT GGCATTTTTA AAAGATGGCT TTCTTAAAAA AGGATACTCA GTTGCTCTTC TACTCATACC 000240
000241 AAGTCCTGCA GACACATTCT GGGACTGCAC CTTATGAGTG CTGCTCACTC TGTTTCCTGC AGAAGACATA CAGAGCCGTT 000320
000321 CGCATAGAGG GCCATGATTT TGATTCAGCA ATTCTTCAAA CCACTGATGC TGGTTGACCA TTTGACTTTG TTTTTGAAGT 000400
000401 ATTTTTTACA TACAGTGTAC AGAACATTTC CGCTGGATAT TTGGATACTG AACAGGAGTA ACAAGAGGAA CAGCTCCTAC 000480
000481 CTCCCCCCTG CCCCTTCCCT CACAAGCCTG TGAGTCAGGG CAGGGGCAGC CACTGAAAGT AGCAGAGGCC TTCATAAAGG 000560
000561 AGAAGTTGTC AGGTAGACCT TTGGATTAAC TCCCCAAAAG CAGCCCCCTA GGAGACAGGC GACAGCACCG CAGTCAGGGG 000640
000641 ATGCAGGCGT GGAGGCACTT CTCCATGTTG CCATCATGTG AAGAAGGACA TGTTTGCTTC TGCTTCTGCC ACGAGTGTTG 000720
000721 AAGGCAATGG TGATGAAAGT AAATATTTAA AAACACTACC ATAGCTGTCC CAAATTTCAG AAAAACAAGG TACCACTGGA 000800
000801 ACCATTTAAT TTTGTTGGAA CACAGAATGA CCATCTTGTG ATCAGTTTTC TGCATCTTTA AAATACCTGG AAGGTCATCT 000880
000881 TTAAATAGTA CAATCTGATA AAAGTTTAAT ACTTAGTGTA GACTTATATG AATACTTCTA CATTCATACA TTTTTGTATA 000960
000961 CTTATACATT TTCATACTCA ATTAATGCTT AATTCAAAAA ATATCTGAGT ACCTGCAGTG TTCCCCATGC TACATACTGG 001040
001041 GCTTAAACAG CCAAAAAGAC ACAGTCTTTG ACCTCAAAAT ACTTTGATTC TTGTAAGAAA AACTGACAAA GACAAAGAGA 001120
001121 AGGCAATGTG AAAACTTCTA CAGTTGGGAT AACCTTGGAG GTTACAGGAT GTGGACTCAC TGAAAGCTTT CTGCTGGACT 001200
001201 TGAACTTGAA GCTGAAGTCT GAAGGACAGA TGGGTGTTGC TTAGCGAAGG GAAAGTAGGA AGCAAAGACC CCTGGACAGA 001280
001281 GTGTAGCAGG CACAGAGGGA GCATGACACC TTCGGGAGAG AGCCGCAAGC ACCTGGGCCC AGATCCCAGC TTTAGCTTCA 001360
001361 TTAGCCCTGT GAACTCTGTA TGTCAGCCTT GACATAGAAA AAAAGAGGAT ACTAAACCTC CATCTTAAAA GAGACTTGGA 001440
001441 AGAAGCTGGG ATTATAGGCA CACCACCACA CCTGGCCCCA GAAAAGGGAA TCCTTGACCT GCAAGAATCG AGGAATATTG 001520
001521 TTTCTGACAT GCTCAGTGGA CCTACTGCTG GTGTTCTTAA CCATTTCCTT CCTAAGTGGA ACTTCTTTTG AGGTTAATGT 001600
001601 TTATATACAA TTACTTTAAC ATTGGATTTG GGAAATGGAA CAACATACTG CGGATCTTTC TGCTACCGAT GAAAGAAGAA 001680
001681 GCCAGCATGC TCGTTCCTGT GCATGCTGGT GAAACAGACG TGGCCTGCCC TGCCTTTCTC CAATCTCTCT CCCGCAGCCC 001760
001761 AGCCCAGCCC AGCCTGGTTT AATAGCAGGG GGAGTTGGCA CCGTTCACAG CTGGTTAACA CCCTACTGAG CAGGTTTTGA 001840
001841 ATGCAGAAAG ACTGAGAAAA CTTCACTATT CATGTAAGAA AAATAGGCTC CACGGGATCC CTTGATCAGG ATGGTGGTAG 001920
001921 AGGAAATGAA CTTCCATCAG CACTGTAATG ATCTCAGCCT TCTCCACCAC CCCACGAAGT TTCTCATAGC AAGCCACATC 002000
002001 TTACTCATCT CTACATTCCT AGGAGGGGGG ACAAGTTCAC AGTGTTGGGG AGAGGAAAGT GTCCTCTTAG CTGCCTCCCA 002080
002081 TGACTTCAGG GGCCCATTTG ATGATAATAG GACACTCTCT GGACAAAGCC TGCAGATGGA TTTGATCAAG CAAGAGAGCT 002160
002161 GAGGATGAAT GCGAAAAAAA AAAAAGTGGT CCTGATGGCT CCTTGACCAT GAAATAAAGG TCATCAATGA GGGAAGACAG 002240
002241 GAGACTCCAG GAGCACAGAG ACAGTGACAA AGCAATGGAT AAAGGAATTG TGGGTACCAG TGAGATGGAA GGATTATGGG 002320
002321 ATTTGGAGTC CTAGGGACAA AGATGGAAAG ACTGGAGGGT GCTTGGAGAG GAACTGTGAT TGGTGATGTC AAGATCAAGA 002400
002401 GTACTGTCTT ACAGGGGAAC CAGTGACAGA GGTCAAGAAG AAGATGAGAC CACTGCAATG GTAATTTCAT GACATGTACA 002480
002481 GGTGTTGATA ATTCCTTTGT GCTGGACAAG GTAAAACAAA GAGGGGTGAT GACTTTGAAA TGTTCTTAAT AAAAAATTCC 002560
002561 AGCAATAA
[back to top]

Predicted Small Protein

Name ENST00000439598.2_smProtein_1910:2161
Length 84
Molecular weight 9268.3557
Aromaticity 0.0722891566265
Instability index 50.9409638554
Isoelectric point 5.26043701172
Runs 9
Runs residual 0.031593038822
Runs probability 0.0340929164459
Amino acid sequence MVVEEMNFHQHCNDLSLLHHPTKFLIASHILLISTFLGGGTSSQCWGEESVLLAASHDFR
GPFDDNRTLSGQSLQMDLIKQES
Secondary structure LEEELLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHLLLLLLLLLLLHHHHHHHHLLLLL
LLLLLLLLLLLHHHHHHHHHHLL
PRMN -
PiMo -