ENST00000416209.2

From LncRNAWiki

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomenclature

Please input transcriptomic nomenclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

ENST00000416209.2

Source

Gencode19

Same as

lnc-ARIH2-1:5, NONHSAT089609

Classification

intergenic

Length

3447 nt

Genomic location

chr3+:48885390..48889414

Exon number

2

Exons

48885390..48885841,48886420..48889414
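
The exon coordinates can be checked against the reported transcript length with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the page does not say so explicitly); the variable and function names are illustrative only.

```python
# Exon coordinates as listed on this page (chr3, plus strand).
# Assumption: positions are 1-based and inclusive at both ends.
exons = [(48885390, 48885841), (48886420, 48889414)]


def exon_lengths(exon_list):
    """Return the length in nucleotides of each (start, end) exon."""
    return [end - start + 1 for start, end in exon_list]


lengths = exon_lengths(exons)
transcript_length = sum(lengths)

# The single intron lies strictly between the end of exon 1 and the start of exon 2.
intron_length = exons[1][0] - exons[0][1] - 1

# Genomic span from the first exon start to the last exon end.
genomic_span = exons[-1][1] - exons[0][0] + 1

print(lengths, transcript_length, intron_length, genomic_span)
```

Under that assumption the two exons are 452 nt and 2995 nt long, which sums to the 3447 nt given under Length, with a 578 nt intron between them.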

Genome context

Sequence
000001 GGGCTGTCGG GGGGTCGAGA CAGTGGAGTG TTCCTGATGC CAGCTCACGG AGGGAATCCA GGGAATTAGG TAATGAACGG 000080
000081 TCTACCCATG AAGGGGACCG ATTAGTGCCT CTCGCCTGGT CCTGAAGACC TTTAAATTAT TCTTTCAGCT GATGCCAAGA 000160
000161 GCCTCTTTTT GGCCAGCACT TACTAAGCAC CAGAGACACA GCGAGAAGCA GAAGCATACG AGCACATCAG TGTGATTCTG 000240
000241 CTGTGATGAA GGGCAAGAGA GGGCTCACTT TCCGGCGTGA CATTCAAGCT AGCCCCGAGG CCTGAGCAGG ACCCGCCCAG 000320
000321 GGGAAGATCT GGAGGAAAGC CATCTGAACA TACAACAAGA GCAAAGGCCT GGTGGAGTCA AGACCACCTT GGCTGCCAGG 000400
000401 TGGAATGGTG GAATAACTCA GGAGTCTGTC AGGTCTCTGA GATGTCAAAC GGGCTTGGCA CACACCCAGG AGCTCAATGC 000480
000481 TGCTGGGAGC TGAGCAGCTG GGAGCAAGGA GCCCTCATCC CCACTGATGA AGAGGCCAGC CCAGGTTCTA GGCTAAGCAG 000560
000561 GCCTGTGATT ATAAACAGCA GTGTCTCCCT GGACAAGTTT CTTGACCCTG AGAAGAATGT TAGACCATCA ACTTTGTGAG 000640
000641 GGGACAGGCA GTCTCCAACT TTTTTCCTGC TCAGTCCTTA TTGCATACTC TTGGAGAAGG TCTACTGACT TGTTCTCAGA 000720
000721 TGCCCCACGG CTCAGCCAAC CACCCATCCA GCAGAGCATT CAAAAGGCAT CTGGAAGGAA TGAGGGCGAC CGTTCTGCCA 000800
000801 GGCCCCTATT TTTTTTTGAG AGGAGTCTCA CTCTGTCGCC CAGGCTGGAG TGCAATGGCA TGACCTCTGC TCACTGCAAG 000880
000881 CTCTGCCTCC CAGGTTCAAG CGATTCTCCT GTCTCAGCCT CCCAAGTAGC TGGTACTACA GGCGTGCACC ACCACACCCA 000960
000961 GCAAATTTTT TCTATTTTTA GTAGAGATGG GGTTTCTCCA TGTTGGCCAG GATGGTCTCA ATCTCTTGAC CTCGTGATCC 001040
001041 ACCCGCCTCA GCCTCCCAAA GTGCTGGGAT TACAGGCGTG AGCCACCGCA CCTGGCTTTT TTTTTTTCTT TTAGATGGAG 001120
001121 TCTTCCTTTT GTCGCCCAAG GTGGAGTGCA ATGGCACGAT CTCAGCTCAC TGCAACCTCC ACCTCCCAGG TTCAAGCAAT 001200
001201 TCTCCTGCAT CAGCCTCCTG TGTAGCTGGG ATTATAGGCA CCTGCCACCA TGCCCAGTTA ATTTTTGTAT TTTCAGTAGA 001280
001281 GACGGGGTTT TGCCATGTTG GCCAGGATGG TCTCGAACTC CTGACCTAGG GATCTGCCCG CCTTAGCCTC CCAAAGTTCT 001360
001361 GGGATTACAG GCGTGAGGCA CTGCACCTGG CCATTTTGTT TGTTTGGGTT TGTTTTTGAG ATGGAGTCTT GCTCTGTTGC 001440
001441 CCAGGCTGGA GTGCAGTGGC ATGATCTCGG CCCATTGCAA GCTCCACCTC CCGGGTTCAT GCCATTCTCC TGCCTCAGCC 001520
001521 TCCAGAGTAG CTGGGACTAC AGGCGTGCGC CACCATGCCC AGCTAATTTT TTGTATTTTT TAGTAGAGAC GGGGTTTCAC 001600
001601 CATGTTAGCC AGGATGGTCT GGATTTCCTG ACCTCATGAT CCGCCTGCCT TGGCCTCCCA AAGTGCTGGG ATTACAGGCG 001680
001681 TGAGCCACCA CGCCTGGCCT GTTTGTTTTT TTGAGACACA GTCTTACTCT GTTGCCCAGG CTGGAGTGCA GTGACGCGAT 001760
001761 CTCGGCTCAC TGCAACCTCC ACCTCCCAGG TTCAAGCGAT TCTCGTGTCT CAGCCTCCCA AGTAGCTGGG ATTATAGGTG 001840
001841 CGCACCACCA CGCCCGGCTT ATTTTTTGTA TTTTTAGTGC AGATGGGGTT TCACCATATT GGCCAGGCTG GTCTTGAACT 001920
001921 CCTGACCTCA GGTGATCCGC CTGCCTCAGC TTCCCAAAAT GCTGGGATTA CCAGCATGAG TCACCACGCC CGGCCAAGAA 002000
002001 AGACCCATAT TTTGTTTTGT TTTCTTTTTT GAGATGGAGT CTTGCTCTGT CGCTCAGGCT GGAGTGCAGT GGCGCAATCT 002080
002081 CTGCTCACTG CAACCTCCAC CTCCTGGGTT CAAGCCATTC TCCTGCCTCA GCCTCCCGAG TAGCTGGGAC TACAGGCGCA 002160
002161 CACCACCACG CCTGGCTAAT TTTTGTATTT TTAGTACAGA CAGGGCCAGA CTGGTCACGA ACTCCTGACC TCAGGCGATC 002240
002241 CACCCGCCTC AGCCTTCCAA AGTGCTGGAA TTATAGGTGT GAGCCATCGC ACCTGGCCAC CAGACCCCCA TTAACTTCAG 002320
002321 TAGGGATGGC ACCAGGTTTG AGAGGCCAAA AGAGATCCAG AGCCAGCAAA CAAGACTTAG GTTTGATTGA GGGGAATTTG 002400
002401 CATACAGAGC AGTCCAGTGG AGGTGGGCTA GATAGGAGAA CTGCCCCACC TGCAGAAAGT ATGCAGTATA TATAGCATTT 002480
002481 TCACTTAACA CCCTCCCCCT AACAACTTTG ATTTAACCCA AAACAAAGGG GCTAAATCCC CTGTACATCC ACAGGACAGA 002560
002561 ATGGGGGCTC AGATATTCCT CATGGGTAAG TAATGAATCT CTGGCTTGTC CTCACTTGGA ACTCCTAACA CATTCAGGTG 002640
002641 CATCTGCCAT ACAGGGTCAT TCTCAGAGTA TGCTTAAGTT ATTGCTGTCA GGTGCAGCTA CCATACACAG GTGTGTCTGC 002720
002721 CATATAGCCA CAGAAAGCAG GAGTCCTACA GCTGCTCCTC ATGGGTCTCC ATAGACTAGT CTTAGAGCAA GGATGGGCAG 002800
002801 CCAGCCTCCA GGTTTAACCT GGCACTCTCC CCCAGCACTT GGTCCCCAGA GCAGGGGTGC ATCTCTCCCA CTTCTGGTCA 002880
002881 ACCCTTCAGT AACAAGTAGA GCCCCCAGGG TCTTCATGAA TATCTAGTTG AACTGGCTAA AGAAGAGGGC CCTGTAACAG 002960
002961 GACAGAACAC CAGGAGCCTG AACCCCGCAT CTTCTACCCT TATCCTGCAT TTATCCTGCC TGCATTCAAG CCTGTCCAGC 003040
003041 TGCACCACTG GAGGTTACCA TGGCAACTAA ACCCAGAGGC AGCCGGTAAC TGAGATACAG CTGCCAAGAG CAACTGATGG 003120
003121 TGGAAAAGGC CAACAGCAGA GACTCTTAGA AGGAAGAGAA GTACACAGGA GGAGGCCCCA GAACCCTGAG GGTGCTACAC 003200
003201 TCAAATCTAA TCCTACTCAC AACTCAGCAG GCAGTCAACG TCTTGCACAA ACTGCCATAT CACCAACCAG AGGCACAGAT 003280
003281 ATGGACACAA TGGGGCCAAT TTGCAGAGAT GTTTGCAACT ATGGATGCAG AGTAGCACAG GTGGGACACA TGTTGTTTAG 003360
003361 CCCAAAAGCA ACGGGTCTTG GTGAAAACGA AAATGGGTAG GGTACCATTG CTGTTTCCAA TGAAACAATG AATTTTGTGT 003440
003441 GCTCTTG
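
The sequence above is laid out in rows of up to eight 10-nt blocks, with a position counter at the start of each row and, on full rows, at the end. A minimal sketch for turning that layout back into one contiguous string follows; the function name is hypothetical, and the sample holds only the first and last rows of the listing, not the whole transcript.

```python
def parse_numbered_sequence(text):
    """Join blocked sequence rows into one string, dropping position counters."""
    allowed = set("ACGTN")
    blocks = []
    for line in text.splitlines():
        for token in line.split():
            if token.isdigit():
                # Position counters such as 000001 or 000080.
                continue
            token = token.upper()
            if not set(token) <= allowed:
                raise ValueError(f"unexpected token in sequence row: {token!r}")
            blocks.append(token)
    return "".join(blocks)


# Excerpt only: the first and the last row of the listing above.
first_row = ("000001 GGGCTGTCGG GGGGTCGAGA CAGTGGAGTG TTCCTGATGC "
             "CAGCTCACGG AGGGAATCCA GGGAATTAGG TAATGAACGG 000080")
last_row = "003441 GCTCTTG"

first_seq = parse_numbered_sequence(first_row)
last_seq = parse_numbered_sequence(last_row)
print(len(first_seq), last_seq)
```

Applied to the full listing, the length of the returned string should equal the transcript length reported under Length.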

Predicted Small Protein

Name ENST00000416209.2_smProtein_2033:2227
Length 65
Molecular weight 7189.1812
Aromaticity 0.109375
Instability index 54.5578125
Isoelectric point 9.29852294922
Runs 10
Runs residual 0.00765625
Runs probability 0.0382529588413
Amino acid sequence MESCSVAQAGVQWRNLCSLQPPPPGFKPFSCLSLPSSWDYRRTPPRLANFCIFSTDRARLVTNS
Secondary structure LLLLHHHHHLLEEEHHHHLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLEEEEELLLEEEEELL
PRMN -
PiMo -
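
The predicted small protein can be cross-checked against the transcript sequence. The sketch below rests on two assumptions that the page does not state: that the `2033:2227` suffix in the name gives zero-based, inclusive offsets into the transcript, and that aromaticity is the fraction of F, W and Y residues. The nucleotide blocks are copied from the sequence rows starting at 002001, 002081 and 002161; all function names are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nt):
    """Translate a nucleotide string in frame 0; '*' marks a stop codon."""
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, len(nt) - 2, 3))


def aromaticity(protein):
    """Fraction of aromatic residues (F, W, Y); an assumed definition."""
    return sum(protein.count(aa) for aa in "FWY") / len(protein)


# Transcript positions 2034..2228 (1-based), i.e. offsets 2033..2227 (0-based, inclusive).
orf_nt = "".join("""
    ATGGAGT CTTGCTCTGT CGCTCAGGCT GGAGTGCAGT GGCGCAATCT
    CTGCTCACTG CAACCTCCAC CTCCTGGGTT CAAGCCATTC TCCTGCCTCA
    GCCTCCCGAG TAGCTGGGAC TACAGGCGCA CACCACCACG CCTGGCTAAT
    TTTTGTATTT TTAGTACAGA CAGGGCCAGA CTGGTCACGA ACTCCTGA
""".split())

# Amino acid sequence as listed on this page.
listed_protein = ("MESCSVAQAGVQWRNLCSLQPPPPGFKPFSCLSLPSSWDYRRTPPRLANFCIFSTDRARL"
                  "VTNS")

translated = translate(orf_nt)
codon_count = len(orf_nt) // 3

print(codon_count, len(listed_protein), aromaticity(listed_protein))
print(translated == listed_protein + "*")
```

Under these assumptions the span covers 195 nt, or 65 codons: 64 residues followed by a TGA stop codon. That would be consistent with the listed Length of 65 if the value counts the stop codon, and the 7 aromatic residues out of 64 reproduce the listed aromaticity of 0.109375.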