NONHSAT128939

From LncRNAWiki

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomenclature

Please input transcriptomic nomenclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT128939

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

2523 nt

Genomic location

chr8+:125955738..125963329

Exon number

2

Exons

125955738..125957988,125963058..125963329
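
As a quick consistency check, the exon coordinates can be summed and compared with the reported transcript length. The sketch below assumes the coordinates are 1-based and inclusive, which this page does not state explicitly; the function and variable names are illustrative only.

```python
# Exon coordinates as listed under "Exons" (assumed 1-based, inclusive).
exons_field = "125955738..125957988,125963058..125963329"


def exon_lengths(field):
    """Return the length of each exon in a 'start..end,start..end' string."""
    lengths = []
    for part in field.split(","):
        start, end = (int(x) for x in part.split(".."))
        lengths.append(end - start + 1)  # +1 because both ends are included
    return lengths


lengths = exon_lengths(exons_field)
total = sum(lengths)
print(lengths, total)
```

Under the inclusive-coordinate assumption the two exons are 2251 nt and 272 nt long, which adds up to the 2523 nt given under Length.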

Genome context

Sequence
000001 GAAAGAGTGA GTTAAATGTT GGTAGGTGAT ATAGAGATAG TAATTAGGCA ATATAATTCG ATGTTGTTGA AAGAGCTGAG 000080
000081 CTATGAAATG CATTGTGATA AAGGGGCTGC CTGTGTAAGT AAATGGGTAT AATGAGGTGT AAAACTCCCT TTCTCTCTCT 000160
000161 CTTTCTGCTT CATAAGCCTT TCTGCTTCTC AAGTCCAGAC CTGCTGTTTT TGGGGTTTCC TTCCAATTTT TTTGCTGCTT 000240
000241 CCCTGGGGCC TTGGCTCTTG GGCCTGCTGC CTCTTCCCAC TCTACACCTT TGGATTCCTT ATCTTTGGCT CCACCTTATT 000320
000321 GCTCCATTCT CCAAAGACAA ATCCACAAGG CATAGGATTT TACAGAATAT GGGGTTACAG AAACAGCATA GGGAAGTGTA 000400
000401 AAAGCCTAGA AGCTTCTTAC AGACGCAGCC CCTTGTCTGT AGCTCCTGAC TCCCACTTAA ATGTCTGTTG GAAGCACCCT 000480
000481 GGGGTAGCAG AACCCAGCTC AGAAGATGGG GCTTATCCAC CATGCCTTTT CCTAACCTGA CACCcaccca gataatcttt 000560
000561 tcaatatgca aactagagcg tatcaccccc tgctcagagc ctttctgtga cctcccattt tgcttcccag gactccttgc 000640
000641 tccttagcca ggtctccagg tcctccgtga tcaggtctct gctcttctgc agcctcatct ctgccattct tcaccttttg 000720
000721 tgctccaacc caatctggtt ttcttctcat gctttgaaca ccccaagtct gtctgcttca ggagcttcgt actggctaat 000800
000801 cattcaccac agcatactcc catctatacg gctggctctt atattatggg tctcagctaa aatgtcatcc cttcaaagag 000880
000881 accttcttgg accacccccg ccattctcta tcctatttct ctcttttatt ttctttgaag cacttaccca ctttcagaaa 000960
000961 ctatctggct tgctcatgtg tttgtctcta gattgtctac ttcccctcct agactgtact ctccacaaga gagcaaagct 001040
001041 gtgtctgttg cattcactgt tgtcttccca gAAGCAAGCA GCATGCTGGA GACCCTCAAA TACATGTGAA ATGAAGGGAC 001120
001121 ACCTTGTGTT ATTTCACCTG TTTCCACCTG GTTCCCAAGT GGCAAGCTTG CATTGAGGTT ACAAAGTGGC AAAGCTGGGA 001200
001201 TATCACATGA TGTCACCATG ATGCTATATG AAATGTAACA TTAATTGAGT ACCTATGTGA CCAGTTCTTT AATTCTGTAA 001280
001281 GTGaactctc aaaaaaccct aggtaagtat tatcaactcg aactttaacg gatgaggaaa ctggaatttt ttaaaaaatg 001360
001361 acttaaccaa acatcacaga gctgtaaacc aggatttgaa TGGGTGAGCT GCACTCAGCC TTCATCTCAA GTCAGTTTGC 001440
001441 CTCTTTCTCC AAGAAGCCAC AGTTGATCCT GACCACATCA CATATGAGTT GCCTGGATTG CAGTCATTCA TTCACTAAGC 001520
001521 ATCTTTTAGG AGGTCACATG CCAGGCACCG TGTTAGGTAA TGATTTCTCC AAGTCACAGG AGCCGTGGTT CTGACTTTAG 001600
001601 AAAGTTTCTC ATCTGATGGG AGGGCAGGCA AGTAAATCAG TAAGGGTGGT CCCTGTCGGG ACACTGGTTG AGAAATACCC 001680
001681 AGGAACACTC TGATTGAGAA ATACCCAAAG TGCAAAGTGA CAGGGACTAG AGGCATCTCC ACTGGACCAG AGAGCTCTGG 001760
001761 ATTCGTCTTG AGCTAAGTCT CTGGAGGTAG GTGTCAGCTA CAAGGAGAAA AGCTGTATAT TTTGAGATAT GGGATGCAGG 001840
001841 GCCTTTTAGT TCCATAAATA AGAGAACTAT GGTGTGAATA CCACATGCCC TGGGCTGAAC TAagtgggct ctaggggaag 001920
001921 gtatactggg cagggagttt ggagaactgg tttctgtcta atccttctct aactctgtga ccttgaatgc atcatttccc 002000
002001 ctctgagtct cagattctct ttctgGCTGG CATTTAATGA AACCTAAATA TTCTAGACCT TCTACTTGTT CCATAAGTGC 002080
002081 TCAGTCCTAC ACTCACTCTC CTCTGGCACC AATATACTGA TTACTCATTT TATGCCCAGC AGAGCACTGC GTGTTTTGAA 002160
002161 ATATACAAAT AGTGGGAAAC GAAATCTTTG TTCTGGATGC AGTTATAATT TGGAAAGCGA GGAAATAAAT TATAACAAGG 002240
002241 GGAAAAAACA Gaatactact aaagtcatgc cgtgtgactt ctgaagctag ctcctaagaa gtccttgcag cttctgcctg 002320
002321 cgactcttag aacacttgct ttgaaggatg ccagatgctg tcttaaaagt ctgagcactc tgaaaccatc atgctgtgag 002400
002401 gaagcccaag ctaaccacat ggGCAAAAAA GATGCCCTCT GACTGCCACC ATTTGAGACA CTCTAATTAA GAACTGCCTC 002480
002481 ATTGAGCCCA TGAGGCCCTG TGAATGATAA TAATAAATTG TTT
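
The sequence is displayed with position counters at both ends of each row and a space after every 10 bases. A minimal sketch for recovering the plain nucleotide string from rows in this layout is shown here; the helper name `clean_row` is hypothetical, and folding lower-case bases to upper case is a choice made for this example, not something the page prescribes.

```python
import re


def clean_row(row):
    """Strip position counters and spaces from one displayed sequence row."""
    # Keep letters only; digits (counters) and whitespace are dropped.
    return re.sub(r"[^A-Za-z]", "", row).upper()


first_row = ("000001 GAAAGAGTGA GTTAAATGTT GGTAGGTGAT ATAGAGATAG "
             "TAATTAGGCA ATATAATTCG ATGTTGTTGA AAGAGCTGAG 000080")
last_row = "002481 ATTGAGCCCA TGAGGCCCTG TGAATGATAA TAATAAATTG TTT"

print(len(clean_row(first_row)), len(clean_row(last_row)))
```

The last row starts at position 2481 and holds 43 bases, consistent with the total length of 2523 nt.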

Predicted Small Protein

Name

NONHSAT128939_smProtein_122:400

Length

93

Molecular weight

10687.7318

Aromaticity

0.141304347826

Instability index

51.1923913043

Isoelectric point

10.0366821289

Runs

18

Runs residual

0.0604472943384

Runs probability

0.0520324245815

Amino acid sequence

MGIMRCKTPFLSLFLLHKPFCFSSPDLLFLGFPSNFFAASLGPWLLGLLPLPTLHLWIPY
LWLHLIAPFSKDKSTRHRILQNMGLQKQHREV

Secondary structure

LLLEEELLLLLLHHHHLLLLLLLLLLHHHLLLLLHHHHHHHLHHHHHLLLLLLLEEHHHH
HHHHHHLLLLLLLLHHHHHHHHLLLLHHHHLL

PRMN

LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHH
HHHLLLLLLLLLLLLLLLLLLLLLLLLLLLLL

PiMo

ooooooooooooooooooooooooooooooooooooooTTTTTTTTTTTTTTTTTTTTTT
TTTiiiiiiiiiiiiiiiiiiiiiiiiiiiii
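
The predicted small protein can be cross-checked against the transcript sequence. The sketch below translates the open reading frame named in `NONHSAT128939_smProtein_122:400` with the standard genetic code and recomputes the aromaticity as the fraction of F, W and Y residues. Reading `122:400` as a 0-based start with an inclusive end (Python slice `[122:401]`) is an assumption made for this example; the page does not define the coordinate convention. The variable and function names are illustrative.

```python
import re

# First 480 nt of the transcript, copied from the Sequence section.
SEQ_BLOCK = """
000001 GAAAGAGTGA GTTAAATGTT GGTAGGTGAT ATAGAGATAG TAATTAGGCA ATATAATTCG ATGTTGTTGA AAGAGCTGAG 000080
000081 CTATGAAATG CATTGTGATA AAGGGGCTGC CTGTGTAAGT AAATGGGTAT AATGAGGTGT AAAACTCCCT TTCTCTCTCT 000160
000161 CTTTCTGCTT CATAAGCCTT TCTGCTTCTC AAGTCCAGAC CTGCTGTTTT TGGGGTTTCC TTCCAATTTT TTTGCTGCTT 000240
000241 CCCTGGGGCC TTGGCTCTTG GGCCTGCTGC CTCTTCCCAC TCTACACCTT TGGATTCCTT ATCTTTGGCT CCACCTTATT 000320
000321 GCTCCATTCT CCAAAGACAA ATCCACAAGG CATAGGATTT TACAGAATAT GGGGTTACAG AAACAGCATA GGGAAGTGTA 000400
000401 AAAGCCTAGA AGCTTCTTAC AGACGCAGCC CCTTGTCTGT AGCTCCTGAC TCCCACTTAA ATGTCTGTTG GAAGCACCCT 000480
"""

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(nt):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(nt) - 2, 3):
        aa = CODON_TABLE[nt[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Drop position counters and spaces, then normalize case.
seq = re.sub(r"[^A-Za-z]", "", SEQ_BLOCK).upper()

# ASSUMPTION: "122:400" means 0-based start 122, inclusive end 400.
orf = seq[122:401]
protein = translate(orf)

# Aromaticity computed here as the fraction of F, W and Y residues.
aromaticity = sum(protein.count(aa) for aa in "FWY") / len(protein)

print(protein)
print(len(orf) // 3, len(protein), round(aromaticity, 6))
```

Under that coordinate assumption the slice spans 93 codons: 92 residues that match the amino acid sequence listed above, followed by a TAA stop codon. This would account for the listed length of 93 if the stop codon is counted, though the page does not say so.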