NONHSAT081901

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT081901

Source

NONCODE4.0

Same with

,

Classification

intronic(S)

Length

3230 nt

Genomic location

chr21-:36314114..36317337

Exon number

2

Exons

36314114..36317198,36317204..36317337

Genome context

Sequence
000001 TGGCCTACTG GTAACTGTGC TCATATTAAT CTTAGTTATG AAACAGATCA TTTAGAAGGT AACATTGGAA ACAAAAGGCT 000080
000081 AAACTCAGAT ACATAATTTG CCAGGCCTCA TAAACTGTGC TTTGATGTTC TAACAACTTC CCAATAGAAG CCATCTGTTA 000160
000161 AACAAGCAAT TCATTCAGAG TGGAGGTATT AACATTAAAC TGTCTTTCAT ATAGTTAGTC ATTAGTTATT ATACTATCTG 000240
000241 TCTCTCATAT AGTATAGTTA GTAATTACTG AAAATAAGAT AATCCTATTA ATGGCCACAA CATGATTCTG ATGGTTAGTA 000320
000321 TATTCTCTAG CCACAAAAAC CCAGACCTAG GCCTCCCCAA TTCTGCCATT CACACTGCAG CATGTGCCAT CTGTCCTAAG 000400
000401 AGGAGACGAC AGGCTCTCCT CTTCCTCCAT AGATTTCATT TGAACTGTGA GAACCAAGTT TGCCAAATGA CATTTGTGCT 000480
000481 AGAGGAAGCA CTCACATGAG GCCAACCACC CAAAGCAATA CAATATTTAA CCACATATTT CAGTGCTGGC ATTCGTACCC 000560
000561 ACACAAATAT TAACCAATCA AAACTTATAG TCTCCTTGCC AAATTCTCAG AAGAATGGCT GAAGAAGTAT CACATGGTCT 000640
000641 GAATCTATAC AACTTTACCC AGATTCCTGT GTAATTCAGG GGAATTCTCT AGTTTTGTTT AGTACGAGTG AGAAACTCAA 000720
000721 AGAATAAGAA AATTGTTGGG GACAGGCACA GTGGCTCATG CCTGTAATCC CAGCACATTA GGAGGCCAGC TGGAGGACTG 000800
000801 CTTGAGTCCA GGAGTTCGAG ACAAGCCTGG CCAACATAGT GAGACCCCAC CTTTACAAAA GACAAAAATA ATTAGTAAGG 000880
000881 CCTGGTGGCA TGCATTTGTA GTCCCAGGTA CTTGAGAGGC TGAGGCAGGA GGATCACTTG AGCCCAGGAG TTTGAGCCTA 000960
000961 CAGTGAGCTG TGATCACACC ACTGCACTCC AGCCTGGGTG ACAGAGCAAG ACCTCATCTT TAAAAAGTAA AAAAAAGTCT 001040
001041 TGAGGGCTAT AGTAAGCACA AGAAATTATT TCCTTAAACT CCTTTTTTTT CTTCCTACGC TATTAAAAAG GAAAAGATTT 001120
001121 GGGGAAAAGA GAAAGGGTGT GATATAAGCA AATGCTCTAT ATCGGCAAAT CTCTCTGTAA CTTTCACCAC TACTATCTCT 001200
001201 GTCAGGCGAG CATAAAGTGA AAATATTTAG CTCAACATTT GTGTAGGAGG ATTTGCGAGG GTTCCAAATT TTCTCCCCAC 001280
001281 AATTTTGCAA AATCCAGTTG CCAATAATCA GGTAATTTTT ACTTGAGTGC TTTAGAGCAA CCCCAAAGAC TGAATTTTAT 001360
001361 TTATCTCTGA GTCTCTTTAG GGGGTGAAAA AACATCAACA TAAACAACTG AAGGAATATC ACCTGGTAAA ATCATTGACC 001440
001441 CCACACCGGG ACTGACTCCA TGTTCCCTCC CCAAGCCATA TTGGCTTTCT ATAACAACCT TCTGTCTTTT GTATGAAGTA 001520
001521 AGATGGATTT TGATCCGTTG ATCCCCTTAT CTTTCTTTGG TGTAACAACT GGAAAACTGG CAATTCATGT GAAGTACTTT 001600
001601 GCCAAGAGGA TAGTAAAGAT CCTTTTAATG TTTTAAGCAT CCTTTGTTGA GTTATTGTGT GCTGGAATGG GCCTAAGGTC 001680
001681 CATTGTGTGG TTCATGTTGG AGCCTGGATG GGTACATCTG GTAAAGACAG CAAAGAAGTT AAAGGGGGAA ATGTTCCTCA 001760
001761 AGACAGATTC TTTTCTGTTT TTCAGTTCCT GGGCAGTTCT TTGTATTTTT CCACACAGGG TGAAAAACAC AGTCATAAAA 001840
001841 CCCCGAACAG TAGAACAGTC AGAATGAAAT TAGTTTGTTT TCACTGAAGC ATTTGTGCTT GTTTCCTTCA GAAGCTGATC 001920
001921 TTGATGACAT GATTGATTGT CCCTGTGGAT CCTTCCCCTT TATGCTTTCC ATATCCCCCA ACTGCTCAGA GATTGACCGC 002000
002001 GTGCCAATGC AGGCAGAAGG GACACGGGTT AAAAGCTGCC AAGGCGTACT CGCTGCCCTC AGCCCAACAC AGGGAAGCAG 002080
002081 TCCTAGGAGC TTGACTTGAC TTCAGGGTTC ACAGTTGATA TGTGCATCTG GTCCGGGGAG AAAGAGGGAA ACTTTCCTTC 002160
002161 AACCTGACCC AAGGCTCTAA GCTCACTTGC TGCCTCACTC TTTTCCTCTC ATAACACAAA GTTGTGCCAT GCTGAGTGGA 002240
002241 GCCTCCTGGC TCTGAAACCA ATGTGGTTTG CCATCTAGAT CAATGCAGAC TTCATTGTGC ACTGAATGCC AGCAAGAGAG 002320
002321 GGGCATCTGG TGTCATCTCA ACCATGGAAA GGGTACATTA GCCCTGCTAA TTGTCCTCAC CACTATGCCT TAAAAATCCC 002400
002401 CACTCACCCT TCCCAGGTGG CTGCAAAGGA TCACTCATAT TCACTTTGTG TCTGATCCTG GTATTTAATG CCATTTGTAT 002480
002481 TCATCAGACT AATGGAAATA TCAATTAAGC CCAGTCAAGA CTAGCTCTTA GAGCAATAAT GGACACTTGT ACCTGAGCTC 002560
002561 AGAGCAGAGG GAAGAAAGGC ATCTGGAGGT AAATGGTGAA TGACGGTCAA GTGACTATGA AACCTGCAAG TGAACCAATA 002640
002641 AAGTTAAAGT ATTGGTACAA ATTTGTACAC ATTTATATAG CCTTACCAGG TATAGTCATC AGTGTACAGG CAAATGATTT 002720
002721 TTTTTTTTTT GCTTGAGTGC ATTAGAGAAA GTCCAAAAAT GCAGTCTTAT AAATCTTGGA AAACTGCAAT TTAAAATAAG 002800
002801 CTGTGGAAGA TCACCTAGCC TAATAATTGA ACCAACACTG GGATCTAGCC CCAAATTCTA TCACTGTGAG CACACAGGCT 002880
002881 CACTCCAACA TAGGCGGAAC CTCGGGGAAT GATTTTTAAA TATCCCATTA CCATAAAATC TCTGGTATTT GAATCTCACT 002960
002961 AATTCAATCT GGTATTTGAT GGATGCAGCA GGGGGTGGCC GAGACATTCA TAAAGTTTGA AGGTACTCTT TCTTCATTGT 003040
003041 TTCCTAGTTT TCCTTGAGGA ATCAGATGAT ATTTTGCATA GTTCAGAGTG CCCTCCAAAA GAAAATCAAC AGTTTAAGGA 003120
003121 GTTTTCTGCA TTACTGAGAT GAAGTAAGCT TACAAGCTTT AACTTGCTGC ATATGTCTCT CATTAAAGAA TCTTCCAACC 003200
003201 TTCAAAAAAA AAAAAAAAAA AAAAAAAAAA
[back to top]

Predicted Small Protein

Name NONHSAT081901_smProtein_1928:2098
Length 57
Molecular weight 5902.7786
Aromaticity 0.0357142857143
Instability index 59.1339285714
Isoelectric point 5.96209716797
Runs 7
Runs residual 0.0282982045277
Runs probability 0.0293486764075
Amino acid sequence MIDCPCGSFPFMLSISPNCSEIDRVPMQAEGTRVKSCQGVLAALSPTQGSSPRSLT
Secondary structure LLLLLLLLLLEEEELLLLLLLLLLLLLLLLLLEEEEHHHHHHEELLLLLLLLLLLL
PRMN -
PiMo -