ENST00000416209.2
ENST00000416209.2 is a 3447 nt, two-exon intergenic lncRNA transcript on chr3 (+ strand), annotated in GENCODE 19.
Annotated Information
Transcriptomic Nomenclature
Please input transcriptomic nomenclature information here.
Function
Please input function information here.
Regulation
Please input regulation information here.
Expression
Please input expression information here.
Allelic Information and Variation
Please input allelic information and variation information here.
Evolution
Please input evolution information here.
Labs working on this lncRNA
Please input related labs here.
References
Please input cited references here.
Basic Information
Transcript ID | ENST00000416209.2 |
Source | Gencode19 |
Same with | lnc-ARIH2-1:5, NONHSAT089609 |
Classification | intergenic |
Length | 3447 nt |
Genomic location | chr3+:48885390..48889414 |
Exon number | 2 |
Exons | 48885390..48885841,48886420..48889414 |
Genome context | |
Sequence |
000001 GGGCTGTCGG GGGGTCGAGA CAGTGGAGTG TTCCTGATGC CAGCTCACGG AGGGAATCCA GGGAATTAGG TAATGAACGG 000080
000081 TCTACCCATG AAGGGGACCG ATTAGTGCCT CTCGCCTGGT CCTGAAGACC TTTAAATTAT TCTTTCAGCT GATGCCAAGA 000160
000161 GCCTCTTTTT GGCCAGCACT TACTAAGCAC CAGAGACACA GCGAGAAGCA GAAGCATACG AGCACATCAG TGTGATTCTG 000240
000241 CTGTGATGAA GGGCAAGAGA GGGCTCACTT TCCGGCGTGA CATTCAAGCT AGCCCCGAGG CCTGAGCAGG ACCCGCCCAG 000320
000321 GGGAAGATCT GGAGGAAAGC CATCTGAACA TACAACAAGA GCAAAGGCCT GGTGGAGTCA AGACCACCTT GGCTGCCAGG 000400
000401 TGGAATGGTG GAATAACTCA GGAGTCTGTC AGGTCTCTGA GATGTCAAAC GGGCTTGGCA CACACCCAGG AGCTCAATGC 000480
000481 TGCTGGGAGC TGAGCAGCTG GGAGCAAGGA GCCCTCATCC CCACTGATGA AGAGGCCAGC CCAGGTTCTA GGCTAAGCAG 000560
000561 GCCTGTGATT ATAAACAGCA GTGTCTCCCT GGACAAGTTT CTTGACCCTG AGAAGAATGT TAGACCATCA ACTTTGTGAG 000640
000641 GGGACAGGCA GTCTCCAACT TTTTTCCTGC TCAGTCCTTA TTGCATACTC TTGGAGAAGG TCTACTGACT TGTTCTCAGA 000720
000721 TGCCCCACGG CTCAGCCAAC CACCCATCCA GCAGAGCATT CAAAAGGCAT CTGGAAGGAA TGAGGGCGAC CGTTCTGCCA 000800
000801 GGCCCCTATT TTTTTTTGAG AGGAGTCTCA CTCTGTCGCC CAGGCTGGAG TGCAATGGCA TGACCTCTGC TCACTGCAAG 000880
000881 CTCTGCCTCC CAGGTTCAAG CGATTCTCCT GTCTCAGCCT CCCAAGTAGC TGGTACTACA GGCGTGCACC ACCACACCCA 000960
000961 GCAAATTTTT TCTATTTTTA GTAGAGATGG GGTTTCTCCA TGTTGGCCAG GATGGTCTCA ATCTCTTGAC CTCGTGATCC 001040
001041 ACCCGCCTCA GCCTCCCAAA GTGCTGGGAT TACAGGCGTG AGCCACCGCA CCTGGCTTTT TTTTTTTCTT TTAGATGGAG 001120
001121 TCTTCCTTTT GTCGCCCAAG GTGGAGTGCA ATGGCACGAT CTCAGCTCAC TGCAACCTCC ACCTCCCAGG TTCAAGCAAT 001200
001201 TCTCCTGCAT CAGCCTCCTG TGTAGCTGGG ATTATAGGCA CCTGCCACCA TGCCCAGTTA ATTTTTGTAT TTTCAGTAGA 001280
001281 GACGGGGTTT TGCCATGTTG GCCAGGATGG TCTCGAACTC CTGACCTAGG GATCTGCCCG CCTTAGCCTC CCAAAGTTCT 001360
001361 GGGATTACAG GCGTGAGGCA CTGCACCTGG CCATTTTGTT TGTTTGGGTT TGTTTTTGAG ATGGAGTCTT GCTCTGTTGC 001440
001441 CCAGGCTGGA GTGCAGTGGC ATGATCTCGG CCCATTGCAA GCTCCACCTC CCGGGTTCAT GCCATTCTCC TGCCTCAGCC 001520
001521 TCCAGAGTAG CTGGGACTAC AGGCGTGCGC CACCATGCCC AGCTAATTTT TTGTATTTTT TAGTAGAGAC GGGGTTTCAC 001600
001601 CATGTTAGCC AGGATGGTCT GGATTTCCTG ACCTCATGAT CCGCCTGCCT TGGCCTCCCA AAGTGCTGGG ATTACAGGCG 001680
001681 TGAGCCACCA CGCCTGGCCT GTTTGTTTTT TTGAGACACA GTCTTACTCT GTTGCCCAGG CTGGAGTGCA GTGACGCGAT 001760
001761 CTCGGCTCAC TGCAACCTCC ACCTCCCAGG TTCAAGCGAT TCTCGTGTCT CAGCCTCCCA AGTAGCTGGG ATTATAGGTG 001840
001841 CGCACCACCA CGCCCGGCTT ATTTTTTGTA TTTTTAGTGC AGATGGGGTT TCACCATATT GGCCAGGCTG GTCTTGAACT 001920
001921 CCTGACCTCA GGTGATCCGC CTGCCTCAGC TTCCCAAAAT GCTGGGATTA CCAGCATGAG TCACCACGCC CGGCCAAGAA 002000
002001 AGACCCATAT TTTGTTTTGT TTTCTTTTTT GAGATGGAGT CTTGCTCTGT CGCTCAGGCT GGAGTGCAGT GGCGCAATCT 002080
002081 CTGCTCACTG CAACCTCCAC CTCCTGGGTT CAAGCCATTC TCCTGCCTCA GCCTCCCGAG TAGCTGGGAC TACAGGCGCA 002160
002161 CACCACCACG CCTGGCTAAT TTTTGTATTT TTAGTACAGA CAGGGCCAGA CTGGTCACGA ACTCCTGACC TCAGGCGATC 002240
002241 CACCCGCCTC AGCCTTCCAA AGTGCTGGAA TTATAGGTGT GAGCCATCGC ACCTGGCCAC CAGACCCCCA TTAACTTCAG 002320
002321 TAGGGATGGC ACCAGGTTTG AGAGGCCAAA AGAGATCCAG AGCCAGCAAA CAAGACTTAG GTTTGATTGA GGGGAATTTG 002400
002401 CATACAGAGC AGTCCAGTGG AGGTGGGCTA GATAGGAGAA CTGCCCCACC TGCAGAAAGT ATGCAGTATA TATAGCATTT 002480
002481 TCACTTAACA CCCTCCCCCT AACAACTTTG ATTTAACCCA AAACAAAGGG GCTAAATCCC CTGTACATCC ACAGGACAGA 002560
002561 ATGGGGGCTC AGATATTCCT CATGGGTAAG TAATGAATCT CTGGCTTGTC CTCACTTGGA ACTCCTAACA CATTCAGGTG 002640
002641 CATCTGCCAT ACAGGGTCAT TCTCAGAGTA TGCTTAAGTT ATTGCTGTCA GGTGCAGCTA CCATACACAG GTGTGTCTGC 002720
002721 CATATAGCCA CAGAAAGCAG GAGTCCTACA GCTGCTCCTC ATGGGTCTCC ATAGACTAGT CTTAGAGCAA GGATGGGCAG 002800
002801 CCAGCCTCCA GGTTTAACCT GGCACTCTCC CCCAGCACTT GGTCCCCAGA GCAGGGGTGC ATCTCTCCCA CTTCTGGTCA 002880
002881 ACCCTTCAGT AACAAGTAGA GCCCCCAGGG TCTTCATGAA TATCTAGTTG AACTGGCTAA AGAAGAGGGC CCTGTAACAG 002960
002961 GACAGAACAC CAGGAGCCTG AACCCCGCAT CTTCTACCCT TATCCTGCAT TTATCCTGCC TGCATTCAAG CCTGTCCAGC 003040
003041 TGCACCACTG GAGGTTACCA TGGCAACTAA ACCCAGAGGC AGCCGGTAAC TGAGATACAG CTGCCAAGAG CAACTGATGG 003120
003121 TGGAAAAGGC CAACAGCAGA GACTCTTAGA AGGAAGAGAA GTACACAGGA GGAGGCCCCA GAACCCTGAG GGTGCTACAC 003200
003201 TCAAATCTAA TCCTACTCAC AACTCAGCAG GCAGTCAACG TCTTGCACAA ACTGCCATAT CACCAACCAG AGGCACAGAT 003280
003281 ATGGACACAA TGGGGCCAAT TTGCAGAGAT GTTTGCAACT ATGGATGCAG AGTAGCACAG GTGGGACACA TGTTGTTTAG 003360
003361 CCCAAAAGCA ACGGGTCTTG GTGAAAACGA AAATGGGTAG GGTACCATTG CTGTTTCCAA TGAAACAATG AATTTTGTGT 003440
003441 GCTCTTG |
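The exon coordinates, transcript length, and genomic span listed under Basic Information can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive at both ends (an assumption; the page does not state its coordinate convention). The function name `exon_lengths` is hypothetical and used only for illustration.

```python
# Exon coordinates copied from the "Exons" row (chr3, + strand).
# Assumption: 1-based, inclusive start and end positions.
exons = [(48885390, 48885841), (48886420, 48889414)]


def exon_lengths(exon_list):
    """Return the length of each exon, counting both endpoints."""
    return [end - start + 1 for start, end in exon_list]


lengths = exon_lengths(exons)

# Spliced transcript length is the sum of the exon lengths.
transcript_length = sum(lengths)

# The single intron lies strictly between the two exons.
intron_length = exons[1][0] - exons[0][1] - 1

# Full genomic span from the first exon start to the last exon end.
genomic_span = exons[-1][1] - exons[0][0] + 1

print(lengths, transcript_length, intron_length, genomic_span)
```

Under that assumption the two exons are 452 nt and 2995 nt long, which sum to the 3447 nt given in the Length row, with a 578 nt intron inside a 4025 bp genomic span.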
Predicted Small Protein
Name | ENST00000416209.2_smProtein_2033:2227 |
Length | 65 |
Molecular weight | 7189.1812 |
Aromaticity | 0.109375 |
Instability index | 54.5578125 |
Isoelectric point | 9.29852294922 |
Runs | 10 |
Runs residual | 0.00765625 |
Runs probability | 0.0382529588413 |
Amino acid sequence | MESCSVAQAGVQWRNLCSLQPPPPGFKPFSCLSLPSSWDYRRTPPRLANFCIFSTDRARLVTNS |
Secondary structure | LLLLHHHHHLLEEEHHHHLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLEEEEELLLEEEEELL |
PRMN | - |
PiMo | - |
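Some of the values in the Predicted Small Protein table can be recomputed directly from the listed amino acid sequence. The sketch below assumes that aromaticity is defined as the fraction of Phe, Trp, and Tyr residues, and that the two numbers in the entry name (`2033:2227`) are inclusive ORF coordinates on the transcript; both are assumptions, since the page does not document how its values were derived.

```python
# Amino acid sequence copied from the "Amino acid sequence" row
# (the space introduced by line wrapping is removed).
protein = "MESCSVAQAGVQWRNLCSLQPPPPGFKPFSCLSLPSSWDYRRTPPRLANFCIFSTDRARLVTNS"


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe (F), Trp (W) or Tyr (Y)."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


# ORF coordinates taken from the name "ENST00000416209.2_smProtein_2033:2227".
# Assumption: both ends are inclusive, so the span is end - start + 1.
orf_start, orf_end = 2033, 2227
orf_nt = orf_end - orf_start + 1
codons = orf_nt // 3

print(len(protein), aromaticity(protein), orf_nt, codons)
```

The sequence contains 64 residues, 7 of which are aromatic, giving 0.109375 as in the Aromaticity row. Under the inclusive-coordinate assumption the ORF spans 195 nt, or 65 codons, so the Length value of 65 may count the stop codon in addition to the 64 residues.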