NONHSAT103903

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT103903

Source

NONCODE4.0

Same with

,

Classification

sense

Length

2769 nt

Genomic location

chr5-:134670791..134681677

Exon number

2

Exons

134670791..134670831,134678950..134681677

Genome context

Sequence
000001 GCCCTTGGAA GTAGCTGGAG GTGAGGTGGG AAAGCCACCG TGCTTATCTG GTTGTGACAC ATGCAGTATT AGGAAAGCCA 000080
000081 TTTTGTGAGA GGGCTTAGAA TTCAGAGGTG TGTGGCAGCA CAGCCGCATC TTCAAGATGG AGGCCAAGGA CAGCCTGTAG 000160
000161 CTTTACATCT TAGAGTTTCA CCACATGCAG GTTGTCTGGG AAAATCACCT TTGCATTTTT TTTTATTACT TAAAATAGCT 000240
000241 CCTATATGTT TGATGTCTAG ATCACAAATT TTAAGAAATT ATAGGAGCTG AGTTTGGTAC ATCTAAGAGC TGAGAGTGAG 000320
000321 TGATACTGAG CCATCTGCAA AACTGCTGGT GAGAGATACA AATTAAAAAT CTTCAGCCAT CTGTGTGTGT AAGAGGAAAC 000400
000401 ATATAAATAC ACATTTCTCA TTTGCCAATT ACACCTTCCC TGGTATAAGG GGAAGAAAAC GACTTGGCAT CTGATGTGGG 000480
000481 GAGAATTATT GCTAAGACCT GGAGGCAGGG TACTTCTTAC TGCTTCAGGC CTGAATTTAG GTGGTCATTT Ggggtgtgca 000560
000561 ggacagagca cctcctaagg acctgagagt ccagtcagtg gggaggccca gctcccccac caactgtgtg aggacagatc 000640
000641 acccacttcc tcacttgggg ctctggtttt ctcttctTAC TACCCGATTC CAATGCCTAT TGTGTCTTTT AAATCCTCTG 000720
000721 TTTTAAAAAT GAATCTTAAG CTCTCTCGGA GGGTCCACCA GGCCGTGCAC CTCTATTCCC AGTGACAAAT GACACCCCTT 000800
000801 CCAGGCTAGT CTCAGGATCC AGTAGGATTC CCTTTGCTGA TCTGTGCCTT TGACCTCTGC TCTGCCACTT CTGTTTTACC 000880
000881 TGCAAGGGGC ATGACATACA GGGTAGTAAT TATGAAGTTC TCCCAGGCAG TTCCACCAAA GTCCCACATT GTACAAGGGA 000960
000961 AGGCATACAC AGTTTTAACT ACCTAGCCAG GGGCAGCTGG CCTTTCCCCA GGCCCTTTAA AACACCTGTT TGACCAGCCT 001040
001041 GTGGGTCCTT TCCCATCGGG GTTCCAGCTT GAGTCCTGCT GGTAGCCCTT GGCACAAGGC CTGGTAGGCA GCCTGGTGCT 001120
001121 GCCATGTACC AAACTCTGCT GGGGAAACCT CAGCCACAGC AGATGCAAAC CCATTAGGTA TAGCATGGTT CAAGATATTT 001200
001201 TTGCTTGTTT CAATAACACT TTTGTTCTTG ATTTTAAGGG CTATTTTAGA AACTGCTTTA AAATATTTTT gaaagtataa 001280
001281 agaggtgaat aaaacatact cctgttccac tgtgcaaaga tagtcactgt taatgtttct ttccagtcgt tttctatgca 001360
001361 taGTTTAGCC ATGACTTCTC ATGGGGACAA ACCCACTGGT TTAATTCAAC CCCCAATGTC AGGTTTTTAT AGGTCAGCCC 001440
001441 AGAGTTCTCA AGCCCAGTCA CAAAgcagag ccaggactca aattcacacc ctgtgactga gctgtttcca ctgtaccatg 001520
001521 agacactacc CACAAATCGC TCCTTCTCCA GTATCTATGA CACGACCATA ACAAACCATA CCCCTTGGAC AGATATTCCT 001600
001601 TAAAAAAAAA AAAAGTTTCA GAAATAGCCA ACCAAATACA AAAAGTTTGG ATGTTTTACT TTAGCAGTGG TGTGGAGACT 001680
001681 CTGGTGTGCT CTTGTATAGA AGCGTAGCAT GCAGCTAAAC CCAGTGCCCT GGAAACCTTC TTTGCAGAGC ATCTGTGGAC 001760
001761 AGGGGTTCTG GAACATGCTC TGGGAGACAT GGTACTTTGC AGTTTGTGGC AGACCCCAGG CCTGTGTCCT TAGATCTGGC 001840
001841 AAGGGAAAGG CAAGTCTTTG GTGGAGATGT ATAGCTAGAA AGGGGCCCCT CTTCTAGGAA AAGAAAAACA CAGTTCCTTA 001920
001921 AAAACTAATC TAATTCTGCC ACAAATAAAC AGGATCCGTT ATTCACACAG TGAGGCCATC CTCAGCAGGG AGAAGACAAT 002000
002001 GGCAGAGTGG AGATGCTGGT CAGGTTGCTT AGCAGATCAT TTAAAGCAGT TGCTTTCCTA GTAGCTTTGT CCTGAATTGG 002080
002081 GTTTGGGAGA TTTAGAGAAT ACATCATAGT AGAGACAAAT ACATCATATT GGTGTGCAAG TCGATAATTA GCACACCAGT 002160
002161 CGACTTGAAG AAGAAGGGTT GGAAACCCAA AACAATTATT TTGAGGCTGC AAATGTTCTG ACACGTTGAG TTTGCGTTCA 002240
002241 GTTGAGTTTT GATGGATAAG GCTCTGCCTC TGGGAGAACC GTGTATGCTT TCTGAGAGTG GAGAGAGGCT TACTACGAAC 002320
002321 TTGGTCAAAA TTCCTGGTGG TGACACATCA TAATGCCAAC TTTCCAAGGG TGTGGCTTGG CTCCAGCTCA CGTTGGAATC 002400
002401 ACAACGCAGC AGACAGCTGT GGAACTCCTC GGACACAGGG CATTGGAGGG CCCAGGAAGA TATACCGTGG CAGGGGAGTG 002480
002481 GGTCTGTGTT TGGCATCTTG TGCTTTGCAG TGAACCAGAG CTCACTGTTT CTGTTTCATC CCCGCACCAC TAGCTGCTGT 002560
002561 CAGCGCAGGC CATGGCCTGC CTGCCAAGTT TGTGATCCAC TGTAATAGTC CAGTTTGGGG TGCAGACAAG TGTGAAGAAC 002640
002641 TTCTGGAAAA GACAGTGAAA AACTGCTTGG CCCTGGCTGA TGATAAGAAG CTGAAATCCA TTGCATTTCC ATCCATCGGC 002720
002721 AGCGGCAGGA ACGGTTTTCC AAAGCAGACA GCAGCTCAGC TGATTCTGA
[back to top]

Predicted Small Protein

Name NONHSAT103903_smProtein_1355:1495
Length 47
Molecular weight 5027.5933
Aromaticity 0.0434782608696
Instability index 88.082826087
Isoelectric point 10.9032592773
Runs 8
Runs residual 0.0125517598344
Runs probability 0.0129173290938
Amino acid sequence MHSLAMTSHGDKPTGLIQPPMSGFYRSAQSSQAQSQSRARTQIHTL
Secondary structure LLEEEELLLLLLLLLLLLLLLLLEEEHHHLHHHHHHHHHHHHHLLL
PRMN -
PiMo -