NONHSAT033357

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT033357

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

2635 nt

Genomic location

chr13-:41447704..41455429

Exon number

2

Exons

41447704..41449604,41454696..41455429

Genome context

Sequence
000001 TCAGCCTGAT CTAAGGAAGG AGACCACCTC TCTTATTGTC TCATACCTCA GAAAAAGAAT GAGGAAGTAA AAGTTAAAGA 000080
000081 AAGGCAAAAA TGAGATCAAT AGACAGCCCA GCACTGCACT TCAGGCCTGG TCTGGTAGTT AAAAATCAAC CCCTGACCTA 000160
000161 ACCACTTGTA TTAGCTATAG ATTCCAGACA TTGTATGAGG AAGCATTGTG AAACTTTCTG CTCTGTTCTG TTCTGTCTTG 000240
000241 ATTACCGATG CATGCAGCCC CAGCCACGTA CCCCATGTTT GCTCAATCGA TCACGACCTT TTCATGTGGA CCTCCTTGGA 000320
000321 ATTGTAAGCC CTTAAAAGGG GCAGGAATTT CTCTCTCAGG GAGCTTGGTT TTTGAGACAC AAGTCTGCTG ATGCTCCTGG 000400
000401 CCGAATAAAG CTACTTCCTT TCTCAACCTG GTGTCTGAGG GGTTTTGTCC ACAGCTTGTC CTGCTACATT TCTTGGTTCT 000480
000481 CTGACTGGGA AGCGAGGTGA TTAGCAGACA GTCAAGGCAG CCCCTTAGGT GGCTCAGGCC TGCCCTGTGG AGCATCCCTG 000560
000561 CAGGGGACTC CGGCCAGCTT GAGTGACACG GATCCTGAGA GCACTCCTGG GTAGGCATTT GCCCCGGTGG AACGCCTCAA 000640
000641 CAGAGGAGTG CATGGCAGGC CCCTGCGGAG GATCAACGCA GTGGCAGAAC ACCAGAAAGG AACTGGCACT TGGAGTCCGG 000720
000721 ACATCTGGAA TATGGTGATC AAGTGTGGAT CAAGGATTGG AATGTAGTCC CCTTGTGACC ACGGTGGAAA GACCCCAGAC 000800
000801 CGTCATCTTG ACCACCCCCA CAGCTGTAAA GGTTGAAGGA ATCCAAGCCT GGATCCACCA CAGCCATGTG AAACCTGCAG 000880
000881 CCTCTGAGAC CTGGAAGGTG AGACCAAGCC CAGACAACCC CTGCAAAGTG ACTCTGAGAA GGACGACAAG CCCTGCTGCA 000960
000961 GTCACACCCA GAAGCTGACT CATCTATGCA CAGCCAAAGC ATGAGGAAAC TCATCGTGGG ACTTATTCTC CTTAAAATTT 001040
001041 GGACTTATGT AATAAGGACT TCCACTGATT TTCCCCACAT GGAGGACTGT TCCCAATGTA TTCATCAGGT CACTGAGTAG 001120
001121 GGCAACAAGT TAAAACAGTT TCTGTTTTAT AGTTATTATT GTTGCGGGAC AATCAAAGAT GGGAGAGACC AAACAAAGTG 001200
001201 AGTTCAGGAA AGGTCTTTAT TAAAAGATGA TCACTTGGCT CAGTAGGATT AGTGTACAGG AAAGTCTGAG CCCTGGACAA 001280
001281 AGAAAGCAGC CACCTTTTAA GCAGTCAGTG GCTGGGAGCT ACGTGATGCA GGAAGCACAC TTAGAGAAGC AAGAACAAAG 001360
001361 GCAGTTGATC AGTCTTTTAC ATTTATCTAT ATTACATGTT CCAAATCCTT GGGAAACCAT GTTTCTGTAT AAACCTTGTA 001440
001441 ACTTTGCAGC TGCACGGGGG AGGTGAAGCA GAAACTCGCT GAGCCTCAAG GAATGTGAAA CTAGCAAGTA CAGATAAGGC 001520
001521 TCAAATGTAT TACATGAATC ATTATCAATC TGTCTTGCAG GAAGACCTAG GTAGTGAAGA TGAAAGTGAG AACTCCCACT 001600
001601 AATAAGTGAG ATTCTCAAAG GGGGGGAATG AGGAAGGAGA CCACCTCTCT TATTGTTTCA TACCTCAGAA AAAGAAGGAA 001680
001681 GAAATAAGTT AAAGAAGGCA AAAATGAGAT AAATAGTCAG GCAGCCTGGC CCCGCACCCC AGGCCTGGTC TGCTAATTAA 001760
001761 ATATCAACCC CTGACTAACT GCTTGTACTA TCTATAGATT CCAGACATTG TATGAGGAAG CATTGTGAAA CTTTCTGTTC 001840
001841 TGTTCTGTCT TAACTACCAA TGCATGGGGC CCCAGCCATG TACCCCATGC TTGCTCAATT GATCACAACC CTTTCACGTG 001920
001921 GACCCCCTTA GAACTGTAAG CCCTTAAAAG GGGCAGGAAT TTCCCTCTTG GGGAGCTCAG TTTTTGAGAT GCAGGTCTGC 002000
002001 TGACGTTCCT GGCCAAATAA AGCTACCTCT TCCCTCAACC CAGTGTCTGA GGGGTTTTGG CCATGGCTTA TCCTGCTACA 002080
002081 GATCCAACTC TGTGTTCCTT CCACCATTAT CCCATAACCT TAATATCCAT TCCCATGCCT GTTCTCCAGA TTGCTGTTTA 002160
002161 TATAAATCAG AGAACTCCAA CAGTTCAAGT ATAGTGCACT TCCTCATGGG TCACACTCTC AACCTCATCC TTAGGGGGCC 002240
002241 ACCAGGACTT TAGTTATAGG TTTAGAAGCA AACAGGTGTT GGGGGTGGCC TCTGAGGAGA CTCAACATTA TCTTGCCTGG 002320
002321 CAACTGGCTC AGGGGAGGAC ATCACTGTTG CTGCAGGCAG CATAGGGTTT TTCTCCTTGG ACAAAGGTGG AAAGACTGAT 002400
002401 AACAGCATAG GTCAGGGAGG GGATGTTGCC ACTACTGGGG ATGGGGAAGC TGTTTCTTCT GGCAAAAAAG GTTCACCGGA 002480
002481 GTTTACAAAC TCAGTGTCCT CAGCTTCATC AGGGTCCTCC CACATGTCCC CATTCCAAGT TGCCAGGTCG CATGCTTTTC 002560
002561 CAACAAATGC CCTCACTTTA ACAGTAGACA CCTGGTGAGG CTGTGCATGC ATCTTTCATT GCTGGTCAGC CACTC
[back to top]

Predicted Small Protein

Name NONHSAT033357_smProtein_1226:1465
Length 80
Molecular weight 9134.5118
Aromaticity 0.0886075949367
Instability index 49.3430379747
Isoelectric point 9.74029541016
Runs 11
Runs residual 0.00104040228889
Runs probability 0.0290022642963
Amino acid sequence MITWLSRISVQESLSPGQRKQPPFKQSVAGSYVMQEAHLEKQEQRQLISLLHLSILHVPN
PWETMFLYKPCNFAAARGR
Secondary structure LLEEEEEEEEELLLLLLLLLLLLLLLEELLHHHHHHHHHHHHHHHHHHHHHHHHHHLLLL
LLLLEEEELLLLHHHLLLL
PRMN -
PiMo -