NONHSAT093312

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT093312

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

5203 nt

Genomic location

chr3-:176004974..176042942

Exon number

7

Exons

176004974..176005817,176006882..176007862,176018318..176018527,176031566..176031819,176032065..176032721,176036216..176037373,176041844..176042942

Genome context

Sequence
000001 CACAAGAACT TCCAAACGCC TGAACCGCAG CGGCCAGGCG TTCCTCCAGA ACCTCCTCCC CCAGGAGCTT GCTGCAAGTG 000080
000081 CCAGAAATCT GACCACCAGG CCAAGGAATG CCTTCAGCCC AGGATTCCTC CTAAGCCGTG TCCCATCTGT GTAGGACTCC 000160
000161 ACTGGAAATC GGACTGTTCA ACTCACCTGG CAGCCACTCC CAGAGCCCCT GGAACTCTGG CCCAAGGCTC TCTGACTGAC 000240
000241 TCCTTCTTGG CTTAGCGGCT GAAGACTGAT GCTGCCTGAT CGCCTCGGAA GCCCCGTAGA CCATCACGGA TGCCGAGCTT 000320
000321 TAGGTAACTC TCACAGCGGA AGGTAAGTCC GTCCCCTTCT TAATCAATAC AGAGGCTACC CACTCCACAT TACCTTCTTT 000400
000401 TCAAGGGCCT GTTTCCCTTG CCTCCATAAC TGTTGTGCAT TTTGACAGCC AGGCTTCTAA ACCTCTTAAA ACTCCCCAAC 000480
000481 TCTGGTGCCA ACTTAGACAA TATTCTTTTA TGCACTCTTT TTTAGTTATC CCCACCTGCC CAGTTCCCTT ATTAGGCTGA 000560
000561 GATATTTTAA CCAAATTATC TGCTTCCCTG ACTATTCCTG GACTATAGCT GCATCTCATT GCTGCCCTTC TTCCCAATCC 000640
000641 AAAGCCTCCT TTGCATCCTC CTTTTGTATC CCCCCACCTT AACCCACAAG TATAAGACAC CTCTACTCCC TCCTTGGCGA 000720
000721 CCGATCACGC ACCTCTTACC ATCTCATTAA AACCTAATCA CCCTCACCCT GCTCAAAGCC AAGATCCCAT CCCGCAGCAC 000800
000801 GCTTTAAAAA GATTAAAGCC TGTTATCACT CGCCTGCTAC AGCATGGCCT TTTAAAGCCT ATAGACTGTC CTTACAATTC 000880
000881 CCCCATTTTA CCTGTCCTAA AACCAGACAA GCCTTACAAG TTAGTTCAAG ATCTGCGCCT TATCAACGAA ATTGTTTCGC 000960
000961 CCATCCACCC CGTGGTGCCA AACCCATATA CTCTCCTATC CTCGATACCT GCCTCTACAA CCCATTATTC TCAAACATGC 001040
001041 TTTCTTTACT ATTCCTTTGC ACCCTTAATC CCAGCCTCTC TTGGCTTTCA CTTGGACTGT TCTCCTCCTT TTCTTTTCTT 001120
001121 TTCTCCTCCC TCCCCTCCTC CTCCTCCTCC TTCTTCTTCT TCCTCTTCTT CTTCTTCTTT TAAATAGGGT CTCACTCTGT 001200
001201 CATCCAGACT GTAATGCAAC TTTTAGTAGT TCCCTACTTG CTGAGGTTTT ACCTCAGTCT TGGCCTTTGA TGAGTGACCA 001280
001281 AAATCTTTGC TTAGGGCACT GAGAGAAGGA ATAGTTTTAT TCCCATGAGG TTGGCCATTT AAATTCATAC TATATTTGGC 001360
001361 TTAGATTGAT TCTCAGTTAT TGTCATAATA AGTTTCTTGT TTATATGATG CACAGCGAAA TGAAAATGGT CAGTTGTTAC 001440
001441 TGTGGAATCA GATAGCTACT GAAAATTCAG GCATGCCAAT AGGTTCCTTT CTTGGACCTT CTCTTTTTCT CAATTGATAA 001520
001521 ATCTCAGAGT GTCTTTTCAG TGCCTCCACA TTTTCAGCTC GAACACAAAT AAGTTTCCAG TATAATGTCC TATTGCGCTT 001600
001601 CATCCCAACT TGCTTCCTCT TCTTCTTATA ATTATGCTTT CTCCTATCTC CATGATTATT GATGACATCT CACTCTCCTG 001680
001681 GATTATTTTC TTAACTGTTC TAACCAAATC TTGTAAGACT GAGATCAATT TTCACCTCTT TCCTAAAGCC ATGCATGTCC 001760
001761 TTCCCCAAAC TTCAGCCTGA AATAGTATTT CCAGTTTATG AATAGTACTT CTAGCATTTA TTCAGGAGTG AATTATGTCT 001840
001841 ATATCTTATA TTATTTAAAA TATCTTAAAA ACTTTTACAC CTACTTACAC TTCTAGTTTA ACACTTATCT AACTTTATTA 001920
001921 GAGGGTGCAG CTTGAAGGCA TACTTACTGT TATGGGCTGA ATCGTGTTAC CCAAAACTCA TATTGAGGTC CTAACTCCTA 002000
002001 GTATCTCCGA ATGTGACTGT ATTTGAAGAT AAGCGTTGTC AAGAGGTAAT TAAGTTAAAA CAGTGTCGTT ACATTGGGCC 002080
002081 TTAACCAATA TGACTAGTGT CTTTATGAGA AGGGGAAATT AGAAGGTAGA CACACGCAGA GAAAAGACCA TGTGAAGACA 002160
002161 CGGGAGAAGA CCGCCATGCA CTACCCAAGG AAAGAGACCT CAGAAGATAC CAACCTTGAT GATCTTGAAC TTCTAGATCA 002240
002241 CACTCTGCAG CAAGAAGGTC CTGAGCCTCT CATTGTGATC TCAACACTTT TATTACAGTG ATGAGCTCAA GATTGGGCAC 002320
002321 CTGTCTTCAG CTGTTTTTAT CTGGATGAGG TCCTATTTTT CTTTAAATGA ATGGATGATG AAAGCTATTT CTTTTCCCAT 002400
002401 GTAGTATAAG AACAAGAGAG CATGCAGGAG CCTCTGGCTG CCTTTTAGGG CCAAGAGTTT ACAGCTGACC TAAGAATCAT 002480
002481 AAGCAAAACC AAGAAAGAAA AGTTGAGAAG GATAAACCAG GGTGTGATCA CATTGCTTTA ACAAAAATAA AGCCACAATG 002560
002561 AAAACGGGTT TATCTTTAGC CTCTTCAGTA ACATAAACCA ATATATTCCC CTACCAGAGA GTCCCATTCT TTGATATCCT 002640
002641 AGCTTCTGTC AGGTGGTCTC TACAGAGGTG TGATTCTGAA TCCTGCTATA CAGTTATAGT AACTACCTCC TCTTGCATCC 002720
002721 TTCCCATTCC AGATGGTTGT GGCTGTTTTG TGTTGTTTCT AATCCCCAGA ATATCTCCCT GTGTTCTTTT TAACTTCTCA 002800
002801 GCTCTTCTTT CACATATGGT GCAGATCCAT ATATTAAATT CTCCTTGCTT GAAATACCTA GAGTCGTTTC TGTTTTCCTA 002880
002881 ACTGTACTAT GACTGCTTTA GCACTCTTAA AATGAAGAGA AAGAGAGAGA CACTCAGAGT GTACATGCAC AGAGGAAAGA 002960
002961 CAATGTGAAG ACACAGTGAG AAGGTGGCCG TCTGCAAGTC TAGGGGCGAG GCCTCAGGAG AAACCAAACC TGACAACACT 003040
003041 TGGATATTGG ATTTCCAGCC TCTGGAACTA TGAAAAATAA AGATCTGTTT TTCAAGCCTT CATTTTTTAT GGCAGCCCTA 003120
003121 GAAGACTATT ACAGATGTCA TTTAACTATG AGTCTATGAT GATGTCTAAT TTGAAATTCA TGAGCTTTGC AACATCTTCA 003200
003201 CATCAGGACA AATAACCCCC CTGCTGTTGA AATCACTTGC ATAATTTATA ACGTTCAAAA TTGGAATTGC ATCATTGCCT 003280
003281 CCTTTGCAAT TACCCTGTAG TCAAGCCTTT CACCTTACTG AAAGGTCTTT AATTTTTCTT GATTTTTAAG CATTTCTCAA 003360
003361 TTTCTAATGG GCTCATTTTG ACGCAAAAGT CAAATGCAAA TAATTTGTTG ATAATTTAGT GGCAATTCAC TATTTGCTAT 003440
003441 GATCATTTCA AACACATTAG AAACACATTG CCATATTTTA TATTCTTTAG ACCTCCTATC TTTAAAATAA AAGTGTGTGT 003520
003521 GTGTGTGTAA TTTTATATAT GTGTGATTTT TTTTTTCAGG CTAAAATGAA ACAATAACTT GTCTGGCTGA TTAGATTTCC 003600
003601 TTGGAGGTGA ACTTTCACAA ATAGCCAAAA ATGTTTGCAA GTAAAGAGTA TAACTGGGCT TTTAAATAAC TCCTCTAAAA 003680
003681 TCAAAATGAA GAACAAAAGC TTCCTTCCTT TCAGAAAGTA ATCTGCAACC ACTCTCATGC ATTTGTGTGC TTTTTAAAAA 003760
003761 AAATATTGAG GATGGAAGGG AAGCATGGTA CATGTAAGTT CAAATCTTAC CTGTCACATG CCTAATCTAT GAATCCAAAT 003840
003841 GGATTTTTTC ATTAGTGGTT TTTAGACTAA AGTAACGTGA ATTAATTTAA CAAAGTAAAT GATGACATTG GAAACTAATA 003920
003921 AAGAACATAG ATACAGAACT TTTTCCTTTA TTTTGCATGC CTCACTTACT ATTTTAAATG AGTAAACCTC CCTCATGGCA 004000
004001 TCCATGAGAT GAGAGACAAT TCAAGGCATT TAAGGCAATC TTAGAGAAAC TTTCTTCACT ACTTTCTACC TTTTATTTTC 004080
004081 TCAATTCTTT CAAAAACGAG TCAACAGATT TCTGACATTT TAAACTTATT GTATTGAAGA TACATAAAAC GCCTGCTCCC 004160
004161 TTTACTCTCC TCTGCATGTA TGAAAATGAA AATCAGGCTA TTTCCATAAC CAACAAACTT TATTTCTTAT GAAATATTTC 004240
004241 AGATACAAAT GGTAGTAAGG CTCCTATTAA AACAAAAAAA TTTTACCCTC ACCACTAAAA AAAGAAATCA ACATTACCAA 004320
004321 GGTTGATCTT GAGTATCTCT CCTCTTCTCT CCACCAGAGG CATAGAATTA CACAATTTTA ATGACTTGAG CCCAAAAGAC 004400
004401 AGTAACAAAA CAGAAGAATT GGCAACTCCA TAGAGACTGC ATTTCATAAC TGGAATTCGT CACCTATGGA AGCAGACAAC 004480
004481 AGAGGATAAT TAAATGAATG ATCACATAAA ATGAACTGAA GTCACTGAAC ACTTAAACAG CTATCGCATG TTAATCATAA 004560
004561 TTTGTATCAG CTCTCATTCT TTGATCTTGG GATGACTTTC TTCTGCATAA TAATGAGGTT ATATTACTTG TGTTGACATC 004640
004641 ATTTAAAGAA GTGAATACAG TAGACTGCAA CTGGATTTGA ATCTAGATTT TTTTCTTCAA CACAGTCATC TAGCTTTAAG 004720
004721 AAATCATTTA AAAGGTTGAA TGTGAAACAG CTCTGACCAT TGTAAGAGTA AACATAAAAT GTACAAAGAA TTATTTTAAT 004800
004801 GGTTTGATTA TCTGTACTTT AGAGATAGTT ATTATATCCC ATCTTTGATT TGTGCATGTT CACTTAACTC CCTTTAAATG 004880
004881 TCTCGACCAC AGCTCATGTT ACTTTACTAT CATTTTTGGG TAATATTTTT ACTTCTCTGG AGTTTTAACC ATTTTAATGA 004960
004961 CAAAAAGTTC ACATATGTAT GAAAGCAATT AAATATATTA GATTTGTAAG AACTTACAAA TATGCAGTAC TTACATATTA 005040
005041 CAAATAATGC AGTAATTTGT TATGACATTT TGAGAGAAAT ACACTGTCAT CATGCTTAAT TTGATAAGGA AAGTGAAAGT 005120
005121 TGTCTTATAT CATCACAATT AAATAATGTT GCTTAAAGTT GTAAATATGT GACATTGGAA AAGTAACTGA TGAAAGGTAT 005200
005201 AAA
[back to top]

Predicted Small Protein

Name NONHSAT093312_smProtein_4877:4981
Length 35
Molecular weight 4399.1853
Aromaticity 0.294117647059
Instability index 42.7529411765
Isoelectric point 9.39910888672
Runs 5
Runs residual 0.0390915860015
Runs probability 0.0233527645293
Amino acid sequence MSRPQLMLLYYHFWVIFLLLWSFNHFNDKKFTYV
Secondary structure LLLHHHHHHHHHHHHHHHHHHHHHHLLLLLEEEL
PRMN LLLLLLHHHHHHHHHHHHHHHHHHLLLLLLLLLL
PiMo ooooooTTTTTTTTTTTTTTTTTTiiiiiiiiii