NONHSAT100801

From LncRNAWiki

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomenclature

Please input transcriptomic nomenclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT100801

Source

NONCODE4.0

Same with

-

Classification

intergenic

Length

1535 nt

Genomic location

chr5+:27472391..27485935

Exon number

5

Exons

27472391..27472626,27475642..27475802,27476194..27476265,27477746..27477908,27485033..27485935
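
As a consistency check, the exon coordinates above can be summed and compared with the stated transcript length of 1535 nt. The sketch below assumes the coordinates are 1-based and inclusive, so each exon spans end - start + 1 bases. The helper name `exon_lengths` is hypothetical and not part of LncRNAWiki or NONCODE.

```python
# Exon coordinates copied from the "Exons" field of this entry.
EXONS = (
    "27472391..27472626,27475642..27475802,27476194..27476265,"
    "27477746..27477908,27485033..27485935"
)


def exon_lengths(exon_field):
    """Return each exon's length, assuming 1-based inclusive coordinates."""
    lengths = []
    for exon in exon_field.split(","):
        start, end = (int(x) for x in exon.split(".."))
        lengths.append(end - start + 1)
    return lengths


lengths = exon_lengths(EXONS)
total = sum(lengths)
print(len(lengths), lengths, total)
```

Under that assumption the five exon lengths add up to the 1535 nt given in the Length field, and the first start and last end match the Genomic location field.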

Genome context

Sequence
000001 GTGGTGACCT GGGAATCAAT GTGTGAGGTG GTGCTATGTA GCAGGAACCC CTCTTGCTTT GCAAATAGTT TTTTGTTTGT 000080
000081 TTTCCTTTTT GCCCAATAGA GCCCTGCTCT ACTGACCCTT CAATGTGCCC GTGTGCCTAA ATATTCCTGG TCGTGTGAAA 000160
000161 AGAACCCAGG TATTAGCTGA ACTAAGGAGC ACAATTCTGC AACATTTTGG CACCCAAACA CGGGGCTTGA GAAATGAATG 000240
000241 CAATTGGAGA AACTGGTTGT TTTACCAGGC GTTGATTGGA AATGTGTGCT TCCCTTTAAG CAGTCAAGCT CAACTTGCAG 000320
000321 AACTGATGGG AACCCCTTGG GAAAACTGGC CTCAAATGTT TGTCTACACA GTCCACATAC AGGGTTCTTA ACCTGCGACT 000400
000401 GACCCTGCTT GTGCCTGTGA ACCAACCAAC AATCTCTGGC TGCAGCTCAG AAAGGACAAA AGAGAATGGA TGAGGGAAAC 000480
000481 TTCCCAGGGC TTGTCTGGGT ATGCCCACAG TGGACTGGAG CCCAAAATGC ACACTGGAGG AAGTGGATGG AGCCACGTGG 000560
000561 ATGTCATGCC TTATGCAGGG GAGGAGCCTG CTCTCTTCAG CTCCTGTGGT AATGTGGGAA TCGATCTGTG AGGAAACACT 000640
000641 TTCTGAAAAT GTCAGAACTC TTGAAAACAT GGATGAGGAA AAGCATAAGT ACTGCTTTAT GCTTTTCTTA TGAATTTTAA 000720
000721 AAGATGTAAT TTAAAGGTAC TTTAAAGTGA TATTTGTGCC CTTGTAACTG ATTCAAAGTA GACTGCTAAG GACAATAAGT 000800
000801 AATAAGAGTC TACAAATATC AGTAAATTAT GAGAAAAAAC GGTCTAATGG TACTTAAAAT TGCATTGTGA TTTATTAAAT 000880
000881 AACATTGAAC AGCTCCACCT GTAACATTAA GATTCCATTT GAGCATATAG AAAATATATT TATCGCAACC TGAAGTGATT 000960
000961 TGGAAAGTAT GCTGGGAAGA AAATATATTT ACAAATTAGT CTCAGAGGGT AGGTATCCTG TAGCCAGATG GAATAAGAAG 001040
001041 AGAGGGAAGG GAGATTTTGT TCAAAGGGGG AGCATAAGCT AAGGAATAAA GCCACAAACA ATGATTAAAT TAGGTAAAGT 001120
001121 GCAATTAGCT GAATATTGCA TAAGCATACA TATGGGAAGA AGTGGAAAAA GGAGTGACAA TATGAAGCTA AAGGGTTTGA 001200
001201 AGTGTTCAAG ATAATACTAG GCTTCCTAGC CTATTCCGAG ACATCTTGTT TTAGAGTGCA ATAGATAAGA ACTCTAAGGA 001280
001281 TGAAAGTAGG AGACATGCAT TCTAGTCTAC GTTTTGGTTA ATGGACTGAA AGTGAAAGAT GGACTGAAAT TGGGGGATGG 001360
001361 AATAAAAATG TGATAGAATA GAAAATAGAA AAAGCAATTA GCCTAATGTC ATTGATACTT AGAGAAAATA TGGAAAATAG 001440
001441 TAAATTAGTG ATATATATCT TTTCTAATTC ACCAGGGTGA GTGAACAAAG GAAATCGGAT TTAAAAGTAT TCATGACATA 001520
001521 GATGCAATAA GGTTT

Predicted Small Protein

Name NONHSAT100801_smProtein_500:712
Length 71
Molecular weight 8108.3016
Aromaticity 0.0857142857143
Instability index 72.7942857143
Isoelectric point 4.36846923828
Runs 13
Runs residual 0.0408769448373
Runs probability 0.042395336513
Amino acid sequence MPTVDWSPKCTLEEVDGATWMSCLMQGRSLLSSAPVVMWESICEETLSENVRTLENMDEEKHKYCFMLFL
Secondary structure LLLLLLLLLLLHHHLLLHHHHHHHHLLLLLLLLLLEEEEHHHHHHHHHHLHHHHHHLLHHHHHHEEEEEL
PRMN -
PiMo -
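
The predicted small protein can be cross-checked against the transcript sequence. The sketch below is an illustration, not the pipeline LncRNAWiki used. It assumes that the `500:712` suffix in the name gives 0-based start and end positions with the end inclusive (positions 501..713 in the 1-based numbering of the Sequence field), that the standard genetic code applies, and that aromaticity is the fraction of F, W, and Y residues. The names `ORF`, `CODON_TABLE`, and `translate` are hypothetical.

```python
# ORF nucleotides copied from the Sequence field, positions 501..713 (1-based).
ORF = (
    "ATGCCCACAGTGGACTGGAGCCCAAAATGCACACTGGAGGAAGTGGATGGAGCCACGTGG"
    "ATGTCATGCCTTATGCAGGGGAGGAGCCTGCTCTCTTCAGCTCCTGTGGTAATGTGGGAA"
    "TCGATCTGTGAGGAAACACTTTCTGAAAATGTCAGAACTCTTGAAAACATGGATGAGGAA"
    "AAGCATAAGTACTGCTTTATGCTTTTCTTATGA"
)

# Amino acid sequence as listed in the table above.
LISTED = (
    "MPTVDWSPKCTLEEVDGATWMSCLMQGRSLLSSAPVVMWESICEETLSENVRTLENMDEE"
    "KHKYCFMLFL"
)

# Standard genetic code, built from the usual compact TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


protein = translate(ORF)
codons = len(ORF) // 3
aromaticity = sum(protein.count(r) for r in "FWY") / len(protein)
print(protein == LISTED, codons, len(protein), aromaticity)
```

Under these assumptions the translation reproduces the listed amino acid sequence. The ORF spans 71 codons, of which 70 encode residues and the last is a stop codon, which may be why the Length field reads 71. Six aromatic residues out of 70 give the listed aromaticity.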