NONHSAT104558

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT104558

Source

NONCODE4.0

Same with

,

Classification

sense

Length

3119 nt

Genomic location

chr5-:149823801..149829290

Exon number

3

Exons

149823801..149823916,149825172..149825248,149826365..149829290

Genome context

Sequence
000001 TGTGGAGTCT GGAGACGACG TGCAGGTAGG AGGCCCGGGC GCGACAATCG GGGGGCATCC TGCGGCGAGG GGACCCTGTG 000080
000081 GGGCTTGGGA CGAGAGACGG GGGTCTTTCC GTGGGAACCG AGCTAGGTGC CGGGCAAGAG ACGCGCGGCT GGCCCACCTG 000160
000161 GATCCTGGCC AACTCGGGAT TGAGTTCGTT CCTGGTCTCA GAAGGCCCGT TTTGCTTTCA GGGAGGAGCT TGTGAAGTAA 000240
000241 GGGTGAGTGC GGGTCCAGCC TTTTAAGGCC TCGGCCCCGC AATACGGCCA CGGCCACGGC CGCGTTTGAG CTGCACAGCG 000320
000321 TAGTTGAGGG AACCCGGGAC AGACGTGGGC TCCCGCCTCT ACCTCGCCAA ACTTTTTTCT TGGTGATCGC AGGCCCACGC 000400
000401 CTAATCTCGT TTgtttcctc gtttgcaaaa taggaataac aatagcaccg atcccatggg tttgtagtga tcattcaaag 000480
000481 aaggaaagca gggaaaactc tCGCACTATG TTGGATGCTT CTAAATCTGG GCGATTCTTT CCGTTGCTGA GTCGGGCACG 000560
000561 TTGCAAGTTC TGGGCGCTCA GGGCCCAGCA CCAATGTTTG CTGTGCGCTT GCCCTGCGTT TATATACTCC TGATGACCAC 000640
000641 TCTGCTCGTT ACTTTAGGGG ATGTTAGGAA CGGAGAAACT GCAAATTGGC ATTTACTGAA TGGCCATCAT GCCGAAACAT 000720
000721 ACTCATGCTT ACGTATTTGA GATATGCTGG AGCCTTTCTA TGCTTCTCGA GGCAGAACTT TGGGCTTCTC TCCTGTGGCG 000800
000801 CGTTCCTTAC AATAGTTAAC GCACTGGGTC GTGCTCATTG GTCTGATTTG AAGATAGGAA CATTTAACTT CGTACACCCA 000880
000881 AGACTTACAC TTGAAGTACT TACTGTGGTC ACACACTTAA CTAAAGTTTA TATAGGGAAG GCAGAGGAGA GATCTAGGAA 000960
000961 CACCTGAGAC AGACTAGGGT AGTTATTTGT GGTGTAgagg catcttggca ggaggcagag agtgctgggc tgggagtcag 001040
001041 gacacgttgt ttctatgttt ggttttgcca ctgtctttac tgtgaccata ggctgagaca ctCTCTATCT TCACCATCTG 001120
001121 CAGGATATCG ATTAGTGTTA GTGTCTTCTG AAAATGTTAA ATTGTTCTGA ACACCAGGAG GTTCAGAAGG CCTGGGGCTT 001200
001201 TAGGCCGGAA GCAGTCTTAT CCTAGCCTTC CACTTTCTCA TTCATTCTCA CCCGCCTTCC TCCTGTCAGT TTCTCTCTCC 001280
001281 CTTAGATCTT GTGACAGTAT GATTGCAATT ATTATGAAGG TCATAATAGG TTAGAGTTAT TTAGTCTCAG GACCCATCAG 001360
001361 ATTTCTGAGA TGGGTCTTTT CCATTAGCCT GCCTTTTGGA TATATTTTAG TGCCTTTGAT ATTTGATGTT GGTGACAAAG 001440
001441 ATTTAGTTCT AAATAGTTGT GGGAGTCAAA TCAGTTGCAT TTTTGAACAT TTTGAAGGGA TCAGGTGGAG CACAGGAAAA 001520
001521 CAGGAGATGA GAGGCTTGAG GGCTGTGTTG GTAAGGGTCC CTTGAATTCT AAGCTGAGGA GTTCATGCAG GAATCAAGCA 001600
001601 GCTCCCTCGC CCCATGAAGG GTGTTAGGAA TGGTACCAGT ACATGGGTAG CTGTTGGTCT TGGGTTTCTT CTTGCCTTTT 001680
001681 GGGGTGCAGC CTTCAGGCAT CTGCTGCTAG ATCCCTAGGC TTCTATCTTG TAGGTGCTGT GAGCTGAGCT GTTGGTAGTT 001760
001761 TCTGGGGAAA GGATACTTCT CATGGTACTT TGCCATGTGG CCACAGTTGA TACCCCCCCA AGTGCAAGGA GAAAGAAGTT 001840
001841 TTAGTGAGGC AGAAATGAGA GAATACAGTA GAGTATTGGA TGGAAGAAGA CAGGGACATG AAGCAGGACT AGGGTTTCAT 001920
001921 TTAATAGCCA GGGGAAAGGG ATTCTGGAAA TGTTTTGTCT GTTAACCTTG AGTTTCTTTT CCTTCCACTC AGAAATGGCA 002000
002001 CCTCGAAAGG GGAAGGAAAA GAAGGAAGAA CAGGTCATCA GCCTCGGACC TCAGGTGGCT GAAGGAGAGA ATGTATTTGG 002080
002081 TGTCTGCCAT ATCTTTGCAT CCTTCAATGA CACTTTTGTC CATGTCACTG ATCTTTCTGG CAAGTGAGTA CCTGGGTGGA 002160
002161 GAGGCATCCA GCTGGCAAAA GGCTGAGGAA GGCAATGGCT GGGACGGGCT AGCAGTTCAG GGGATTCTCT CTAAAGAAAT 002240
002241 GCTGTTTTGT CCAGGTAAGA AAATGTGCTT GTCCATTTAG CCCACAAATA TTGTGATTTC CCAGGGGTTA CAAGAGAGGA 002320
002321 GACACATTCT TCGTCCTTAC AGCGTGATGG TCATTGAATC CTTGGTTTCT TGGGAATTAT CTTCTTTCCC TAGTTCCCTT 002400
002401 TGCAGGAGCA GCAGtggact gcaagtgggc tctgggtgcg ggttcgacgg ccatttagca agttgtgact tgtgtgaaat 002480
002481 cactcagatt ctgagttttg gtttcctcat atgtaaaatt aggacagtaa tgtctacctt gtggagtaat ggtaaggatt 002560
002561 aaatgcaaaa gtagatatat aaaatgttaa tacTGAGTAT AGCTATATTG GGCCATTCAA CCCCACTGGA CTGAACTCCA 002640
002641 TGAGGGCAGA GCCCATGACC ATCATCCTCC ACCATGACTG AGCTGTGTTG CTCTTCACTC ATTAGGATAT CGTAAGCACT 002720
002721 CCTTGTTGGA CGAGGAAGAA ATGACCCTGT GCTTTTGTCC CCAGGGAAAC CATCTGCCGT GTGACTGGTG GGATGAAGGT 002800
002801 AAAGGCAGAC CGAGATGAAT CCTCACCATA TGCTGCTATG TTGGCTGCCC AGGATGTGGC CCAGAGGTGC AAGGAGCTGG 002880
002881 GTATCACCGC CCTACACATC AAACTCCGGG CCACAGGAGG AAATAGGACC AAGACCCCTG GACCTGGGGC CCAGTCGGCC 002960
002961 CTCAGAGCCC TTGCCCGCTC GGGTATGAAG ATCGGGCGGA TTGAGGATGT CACCCCCATC CCCTCTGACA GCACTCGCAG 003040
003041 GAAGGGGGGT CGCCGTGGTC GCCGTCTGTG AACAAGATTC CTCAAAATAT TTTCTGTTAA TAAATTGCCT TCATGTAAA
[back to top]

Predicted Small Protein

Name NONHSAT104558_smProtein_2792:3070
Length 93
Molecular weight 9897.3433
Aromaticity 0.0108695652174
Instability index 61.9347826087
Isoelectric point 11.3439331055
Runs 11
Runs residual 0.0156396621833
Runs probability 0.0243211125564
Amino acid sequence MKVKADRDESSPYAAMLAAQDVAQRCKELGITALHIKLRATGGNRTKTPGPGAQSALRAL
ARSGMKIGRIEDVTPIPSDSTRRKGGRRGRRL
Secondary structure LLEEELLLLLLHHHHHHHHHHHHHHHHHLLLEEEEEEEELLLLLLLLLLLLHHHHHHHHH
HHHLLLLEEEELLLLLLLLLLLLLLLLLLLLL
PRMN -
PiMo -