NONHSAT091001

From LncRNAWiki

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomenclature

Please input transcriptomic nomenclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT091001

Source

NONCODE4.0

Same as

,

Classification

intergenic

Length

2732 nt

Genomic location

chr3-:106555658..106959479

Exon number

8

Exons

106555658..106557629,106558322..106558439,106639813..106639907,106640905..106640940,106646682..106646841,106758922..106758963,106848735..106848839,106959290..106959479
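The exon list can be checked against the other fields with a few lines of arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (so an exon `a..b` spans `b - a + 1` nt); the names `EXONS`, `LOCUS`, and `parse_ranges` are illustrative only. Under that assumption the eight exons sum to 2718 nt, which is 14 nt less than the 2732 nt given under Length; the record itself does not explain the difference.

```python
# Exon coordinates and locus exactly as listed in this record.
EXONS = (
    "106555658..106557629,106558322..106558439,106639813..106639907,"
    "106640905..106640940,106646682..106646841,106758922..106758963,"
    "106848735..106848839,106959290..106959479"
)
LOCUS = "chr3-:106555658..106959479"


def parse_ranges(text):
    """Turn 'a..b,c..d' into [(a, b), (c, d)]."""
    return [tuple(int(x) for x in r.split("..")) for r in text.split(",")]


exons = parse_ranges(EXONS)

# Assumption: coordinates are inclusive, so length = end - start + 1.
lengths = [end - start + 1 for start, end in exons]
total = sum(lengths)

# The locus should run from the first exon start to the last exon end.
locus_start, locus_end = parse_ranges(LOCUS.split(":")[1])[0]

print(len(exons), lengths, total)
print(exons[0][0] == locus_start, exons[-1][1] == locus_end)
```

The exon count agrees with the Exon number field, and the outermost exon boundaries coincide with the Genomic location range.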

Genome context

Sequence
000001 GGCGTTGGCC TCCGCGGCCC GTGCGAGTGC GCCCTTCTCG GGTCCCCTAG GTCGCGGGGC AGCCGGGGCG GAAGGGGCTT 000080
000081 GTTCATTATT TTGAGAGTTT GTTGTGAACG TCATAACCTT TCGTCCTTAA ATATTGCCGA TACTTGACCT ACGCAAGAGA 000160
000161 CAATGTCATG TGATTCAGCC TAATATCTCA GAGGATGCAG CATTCAAGGT TCTATCTTGG AAGCAGAGAC TGTGCCCTCA 000240
000241 CCAGATGCTG AACCTGCTGA GCACCCTGAT CTTCCACTTC ACCTTCATCA GAACTGTTTT CGTATCAGGA TGATGCTGGC 000320
000321 CTCATACAAT GAGTTAGGGA CGCGCATGAA ATTTGGTGCC GTGACTCGGA TCGGCGGACC TCCCTTAGGA GATCAATCCC 000400
000401 CTGTCCTCCT GCTCTTTGCT CCATGAGAAA GATCCACCTA CGACCTAAGG TCCTCAGACC AACAAGCCCA AGAAACATCT 000480
000481 CACCAATTTC AAATCCGAAA TACTTCATCT AATCTGGCAA ACACATGTGG CAGGTTTTTC TCACCGCATA AAATATGTTA 000560
000561 TAGAGTAGTC CATCTATTTC TCTTCCATGG AAGAGTTCAT CTGGTAGATT GGACTACCAC TTTTCCTTGT GGAGACCTAT 000640
000641 TCTGTTGGGT TAGAGATGAC AGAAAATCTT ATTTGTCGGT TGGTTGGTTT GTTCAAGAAT ACACTGACCT TCTACAAGAA 000720
000721 AGGAGAGCTC TTTGACACCT GGAAATTTTA CAGAAGCTTG GGACAGCTCT GAATGGCAAC ACGGTCCTGA GATATGTGAT 000800
000801 TACCTGGGAC TTTCATTGGA CCTGCATTTC AGCTCTACTC TCCTCACTGC CCTATGCTGC TTTGACCCTA AAAGTGAAGC 000880
000881 CAGCTTGGCC AGGTGCAGTG ACTCACACCT GTAATCCCAG CACTTTGGGA GGCCGAGGTG GGCAAATCAA GAAGTCAGGA 000960
000961 AATCGAGACC ACCCTGGCCA ACATTGTGAA ACACCATCTC TACTAAAACA CAAAAAAATT AGCCAGGTGT GGTGGCACGC 001040
001041 ACCTGTAGTC CCAGCTACTC GGGAGGCTGA GGCAGGGGAA TCACTTGAAC CCGTGAGGAG GAAATTGCAG CGAGCCAAGA 001120
001121 TCGCGCCACT GCACCCCAGC CTGGGCGACA GAGAGAGACT CCATCTCAAA AACAAAACAA AACAAAACAA AAAAAACAAA 001200
001201 AAGTGAAGCC AACTCAATCG TCTCATAGAA CTGATCTTTA CGGTGTTTTT AAATAAACAT AGAAATTGGC CCTCCCAGTC 001280
001281 TTAAAACTTC AAAAACTTAC ATTTGTCTTA CCTCAGTTCC TTTCTCAGGA ATCCAATCAT CAAGGCTCCC AGATAGTATC 001360
001361 AATGAACGGA AACTTACCAG ATCACGATAT CTGGACAATG AGATGTCAGA TCCCCCTCAT CCATCCTGAT TACCTAACTG 001440
001441 ACTACCTGCT TCCTGTTGGC CAACTACTCT TCCTTACCCC TCCCTAATTC TTGTTTTCCC ACAAATGGTT ACATTTCTGT 001520
001521 CCTGCTATAG AAGCCCTTAA TTTTAATTGG TCGAGGAGAT GGATTTGAGA CTGATTTTTT GCTCTCCTTG GCTGTTGCAC 001600
001601 CTGAATAGCC TTTTTCCCTG GCAATACTTT TTGTCTCAGT GATTGGCTTT CTGTGCAATG AGCAATGGGA CCAAACTTCT 001680
001681 GGTGTTTTGG TAACAAATTT TGGTTTCCTG ACTGTGAATG CATTGCTTGT GACTCAGCTG CCATGAGCTG GGAGAGTTTC 001760
001761 AAAAGCCCTC TAAGCAGCTA CCTGACCATT TTGGCCAGAG GTGGATTTTC GTCTCTCCCC ATTTGGGCCC ACTTCTGCCA 001840
001841 ACCCCAACCA TGTTCCTGAT TGCCTAGGAA GAACAGCCTT TGAAATTTGA CGTCTATGTA CGGAAAGGTG AGTGTCTTTT 001920
001921 ATGGGTACTA GACAGTGGGA ATGGTTCCTC TCCATTTGGG AAATCCCAAA GGCATATCAG TTTGCAGGTT AAACAAGCCA 002000
002001 AACTGATGGA GAGAGGATAA TACCTTGACT GATTCAGTTT GAACACTCTT GGGGCTTGTT CATTGCTACA GAAGTTGGAT 002080
002081 TGTGTTTTGG TGATTGTTTG CTTGTGTATG TCTATGTAGT TATGGGATGT CAGGGTTTGA TCCCAGAGTG CAGCCCACAT 002160
002161 GGGTGCATTC TTTGGGTTAG TCTGTATGTA GTTGTGGGTT GACAGAGCAG GGGAAATTCT ACAGCCATAC CACTCTGAGT 002240
002241 GTGGCTGATC TTGTCTGATC GCGATGTCTT TGCATTGTAG TGCTGTCTTA CAATTGAATT CATTTTGCTG TTGAATGGGA 002320
002321 AAGCAGGATG GAGGTCCATG TATGTATGCT TTTATGCTGC TGCTCTGAAC AGGGTTGGGC CTGATTAGTA CTGTGATGCT 002400
002401 CTTCTGTGGT GCTGTTTGGC CCCAGTGTTC TTTGGAATCC GGGGAATTTT GGCCTTTAAA AGTTAAACTG CCATGGGAAC 002480
002481 TGATTTACCC AGTTTTAGTT CATAGCCTTC ATTGTATTAT CTATCAGGGC AAAAAAGTTA GCCGTGTGAA CATGTTCGTA 002560
002561 AACCAGTGAG TTTGTATTGC TATCTCATGC CTAGAATTCT AAAGTAAAAA CTATTGGGTC TTTGTTTGTA TATATGTATA 002640
002641 TGGGTCTAAA TGTTATGTGT TGTTTTTACA GGGTACAGAT TGGCTCATAA ATAAAAGAGC ACTCATAAAT TAAGTGAAAA 002720
002721 AAAAAAAAAA AA
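The sequence above is printed in numbered rows: a zero-padded start position, groups of ten bases, and (on full rows) a closing end position. As a minimal sketch of how such a block could be read back into a plain string, the helper below (a hypothetical `parse_numbered_rows`, not part of any LncRNAWiki tool) keeps only the alphabetic fields of each row. It is demonstrated on the first and last rows only, copied from the block above.

```python
def parse_numbered_rows(rows):
    """Return (start_position, bases) for each numbered sequence row."""
    parsed = []
    for row in rows:
        fields = row.split()
        start = int(fields[0])  # leading label, e.g. "000001" -> 1
        # Keep base groups; drop the trailing numeric end label if present.
        bases = "".join(f for f in fields[1:] if f.isalpha())
        parsed.append((start, bases))
    return parsed


# First and last rows of the Sequence block above.
rows = [
    "000001 GGCGTTGGCC TCCGCGGCCC GTGCGAGTGC GCCCTTCTCG "
    "GGTCCCCTAG GTCGCGGGGC AGCCGGGGCG GAAGGGGCTT 000080",
    "002721 AAAAAAAAAA AA",
]
parsed = parse_numbered_rows(rows)

first_start, first_bases = parsed[0]
last_start, last_bases = parsed[-1]

# Position of the final base, implied by the last row.
transcript_length = last_start + len(last_bases) - 1
print(len(first_bases), transcript_length)
```

The position of the final base implied by the last row matches the 2732 nt given under Length.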

Predicted Small Protein

Name NONHSAT091001_smProtein_1664:1858
Length 65
Molecular weight 7203.2188
Aromaticity 0.1875
Instability index 59.8578125
Isoelectric point 5.46319580078
Runs 10
Runs residual 0.00765625
Runs probability 0.0330242006712
Amino acid sequence MGPNFWCFGNKFWFPDCECIACDSAAMSWESFKSPLSSYLTILARGGFSSLPIWAHFCQPQPCS
Secondary structure LLLLLLLLLLLLLLLLLLLLLLLLHHHLHHHHLLLLHHHHHHHHHLLLLLLHHHEEEELLLLLL
PRMN -
PiMo -
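The predicted small protein can be cross-checked against the transcript sequence. The sketch below rests on three assumptions that the record does not state: that the `1664:1858` suffix in the protein name is a 0-based inclusive range on the transcript (1-based positions 1665 to 1859), that the standard genetic code applies, and that aromaticity is the fraction of F, W, and Y residues. The 195 nt stretch is copied from the Sequence block; `ORF_TEXT`, `translate`, and the other names are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Transcript positions 1665..1859 (1-based), grouped as in the Sequence block.
ORF_TEXT = """
ATGGGA CCAAACTTCT
GGTGTTTTGG TAACAAATTT TGGTTTCCTG ACTGTGAATG
CATTGCTTGT GACTCAGCTG CCATGAGCTG GGAGAGTTTC
AAAAGCCCTC TAAGCAGCTA CCTGACCATT TTGGCCAGAG
GTGGATTTTC GTCTCTCCCC ATTTGGGCCC ACTTCTGCCA
ACCCCAACCA TGTTCCTGA
"""
ORF = "".join(ORF_TEXT.split())


def translate(nt):
    """Translate a nucleotide string codon by codon ('*' marks a stop)."""
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, len(nt) - 2, 3))


translated = translate(ORF)       # includes the terminal stop codon
peptide = translated.rstrip("*")  # residues only

# Assumed definition: fraction of aromatic residues (F, W, Y).
aromaticity = sum(peptide.count(aa) for aa in "FWY") / len(peptide)

print(len(ORF), len(translated), len(peptide))
print(peptide)
print(aromaticity)
```

Under these assumptions the translation reproduces the listed amino acid sequence (64 residues), the listed Length of 65 corresponds to the codon count including the stop codon, and the computed aromaticity equals the listed 0.1875.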