NONHSAT114953

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT114953

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

4299 nt

Genomic location

chr6+:132455118..132490514

Exon number

7

Exons

132455118..132455623,132456803..132457146,132467360..132468510,132475796..132475917,132481904..132482054,132485037..132486685,132490138..132490514

Genome context

Sequence
000001 CAGGAAGCCA GCATTTTTAA TAAATAGGAA GTAAATTATA TTTAATGACT AAATTCTCAC CTAAAGTTGA TACTAATATT 000080
000081 ACAATAACTC TCTTGTGCCC TATTTTGGTG TTTTAAACTT CTTTGCTGTA CATTTAAGAA AATCAGAATT TCAGTAGACT 000160
000161 CCTCGTGCAG TCTGAGGAGG CAGTTTCCTA TCTGGTTTGA CTTAAGGGGC TGCAGAAGAC ATTCTTATCC CACATATTTC 000240
000241 TTCTGCTGAT CAGTCAGCAA CTGAAATTCC AACTAATGGA TTCCATTCTT CCAGTTAAGA CATCATCTCC AGAAAACTGT 000320
000321 GCTAGTTGGC ATCCAGCAGT GCCCAAGTAT TATTTAACCC ACTAGAACCT GTGACTGCAT CAGCTTTATA TCCTGGACCT 000400
000401 CAGCCTGGAA CAAAGGAGAG CTACACTTGG ATACTAAACA CAAGTGGTCT GGATAGATTA TGCTGGGAAA AGAAGCCTGG 000480
000481 AGCAGGCAGT ACAGTGGGAG AACAAGCAAA GATGCAGGTG CGTCTATGAA AGCGAACTTT GTCGAGATCT CAGTGACACA 000560
000561 CTCTGCTTTT CCAAATTGCC TCAATTGGCA GTGAATATTA GAAAGAGTCA TGGAATGTCT CTGGAAAGAT ACATAGGAGA 000640
000641 ACTAGTGACA TTGGATCCTT GTGGGTAGGG AGACACTGTG TAAGTGCATC AAAGAGGCTG ACTTTTCACT GTGCACACTC 000720
000721 CTTCAGACCA CTGGAAATTC GGAATTGTAT ACGGTGTGCA TGTATTACCT CCTTAAAACT AGCTAGCTAC CTATCTACCT 000800
000801 ACCTACCTAC CCACCCACCT ACCTTCCTAC CTGGCTAATT CATTAGCTAA GAGAGAATGG AGTTGCAGCA GACTGCTCAG 000880
000881 GAATTGTACA TCACCAGTAG CTTCTGTCTC AGCTAAGTGT GGGAGACTGA TACTGATGAG CCATTAACAA TGGTGGAGAT 000960
000961 TCTGAATCAC TGAAGCACTT GATAAAAGAA ATGTCGTGGG TGATGTGAAA ATCTACATAT GCCCATGAGA GTATATTGAG 001040
001041 TTTGTAAGTG TTGTTGCTTA GTGTGTCACA GAGAAGTTTA TAAGAAACTA TTTTGGAGTA AGAAACAACA CAGAAGAACA 001120
001121 CAGTTTCAGC CTGAAATGGC TTATTTAGCC ACTGACCATC AGAAACAATT ACTTTTTACC TTAACCTAGA TTTGTAATAG 001200
001201 CAGGATGCAT TGAATTCCTT CATTGAAGTG AAACCTTAGC AAATCTTTAA GTAGTACTCT AAAGTGACAA AAGGTGTTTA 001280
001281 CATTTACTAA TGTAAATTTC TTGTGGAGAT AAAGTTTTTA TTCTTCTTGG GTCTCAAGAC TGCTCAAGAA GGAACCATCG 001360
001361 TTTGTTCTAC ACTGCTCTGT TCTAAATGTT TGTTCCCCCA ATATTCATAT GTTGAAATTC TAACCCCCAA GGAGATAGTA 001440
001441 TTAGGAGGTG GGGCCTTCGG GACATGATAA GGTCATGAGG ATGAAGCCCT CAAGAATAAG ATTCATTGCC TTATAAAGAA 001520
001521 AAACCAGAAA GATCCCTCAT TTCTTCAATC ATAAGAAGAC ACATGAGCAA GAAGACGGCC AGCTATGAAG CAGGTCGTGG 001600
001601 GCCCTCGCCA GACACTGAAT CTGCCAACAC TTTGATCTTG GATTTCCTAG CCTGCAGAAC TGTGAGAAAT AAATTTCTTT 001680
001681 TCTTTATAAT CCACTCAGTC TAAGGTACTT TGTTCTAGCA GCCTGAATGT ACTGCGACAT AACACTCCCT TGCTGTTTTA 001760
001761 TTGTTGAGAA AGGAGGAAGG GTAGGTCTTT TATTGTTGAA TAAGGTGGGG AGGAGAGGAC TTTTGGGGTG AGCACCGTGT 001840
001841 GCTTATTGCT GCTTGGAGTG GACTCTTTCT CACCCTATAT TTATGTGTTA AATGGAGTAG GATGTGATTG GGCATGTGGT 001920
001921 TTTCAGCTAC ATTAAAGTAG CTTCTTTGGG CTGTGTGAGA GACACTAAAT ACCATTTCAT AAACACTGAA AAAGAAGTCT 002000
002001 CACAGAAGTG CTTGCCTTGG TGATCCAAAA AAGGCAGGAT CCGGCCGGGG GCAAGGTGAT TATTCTGTCT GATGAGGAGA 002080
002081 ATAAGAGCCA GTTTGTGGTA TTGACAGAAG AATGAAAGTA AAGCAGTCTT GAAGATGGGA AGATCAGGAG CAGTCTGGAG 002160
002161 TCATGGTATA TGAATGAACT AGTGAGAGTG GGCACAAAGT CTGAAGATCT TTGTAGTGCA GCTAATGTCT AAGGGAGAAT 002240
002241 ATACCTACCC TAGAAGAGAC ATTAAACAAC CAGGAAGCTT ACTGCAGGAA AGAAATCCCT CAGAGTCCTG AAGATACAGC 002320
002321 TTTCAGAACT TAGGTGAAGC AAGAGGACTC AAGGTGTGCA AGAATGGACA GTACAGAATG CTCTGCTACA CTGCTCAGAT 002400
002401 TCCCCCTTCA GGAAGGAAGG ACAAATTCAC TCAACTTCTG GGAGCATTGA TGGTTGAGCT CTCAGCCATC AGACCTTTTT 002480
002481 GGAAGTTGCC CTCTTCTGAA AAGAGGCACA TTGCCCAAGA TCTTTCCTCC TCCAGAGCGC AGGCTGCACC TGAAGTCTGA 002560
002561 TTGATGTGGG GTATGATGGT CTGGTCTCCT TTCCCCATCT AAAGGACTGT TCTAACCTCA GGGCTCTCCT TGAGTTTCCT 002640
002641 TAGTTAGTTC TTATTACTAT GTTGCAGTCC AACTTCTTCT TTGGCCTAAT TCTACTTTCC TCCCATTCCC CAATTGTTGA 002720
002721 TCCTGACAGC AATTTCCCCC CAAATTTCTG TAAACTAAGC ATGATCTCAG AGTCACCTGA CCTGCAACAT TATTACATAC 002800
002801 ACTATAGTCC GTGCACTTAA AGATTTAGTG AAAGTATGCT GCTGAAGCTT GCCAGAACTT CCCTTCTTCC ACTGAAAATT 002880
002881 TGTCTGTTCA TAGATATACA CCATTCGTTC TCTTCTCTAC TTTGGCCAAT TATCTTATGT TTGGGAAGGT GTTCTGCTAG 002960
002961 AAGACAAAAT AGCAATTCTT AGAACTAGAA AGAAAAACAC ATCAAAGTCT GTCCTTTCTT ATGTCTTTTC TATCTACAAC 003040
003041 AAATACCCTG TGATAAAAAG GATGATAGGA AATATAAGTA GTTTGAGTCC ACCAAATATT TGGCTTGAAG TATTTGGGTA 003120
003121 TGAAATACAA ACTCACTTGA CCTGGGCTCA AATTTGCCAG CACTTCTTTT CTTTCTTCTC TTTTCAGAAA CAGACTTTAT 003200
003201 TTATATCAAT GTTAACATCA ATCACATAAC CATCAGGAAG GCTGTAACTT TGATTTTTTT TTGTCCAATA GTTGGGCCAA 003280
003281 GTTTACATTT CTCTTATTTT ATTTATTTAT TATTATTATT TTTTTGAGAT GGAGTCTTAC TCTGTCACCC AGGCTGGAGT 003360
003361 GCAATGGCGT CATCTCAGCT CACTGCAACC TCCGCCTCCT GGGTTCAAAC AATTCTCCTG CCTCAGCCTC CTGAGAAGCA 003440
003441 GGGATTACAA GTGTGTGCCA CCACGCCCGG CTAATATTTG TACTTTCAGT AGAGAACGGG GGTTTCACCA TGTTGCTCAG 003520
003521 GTGGTCCCGA ACTCCTGACC TTGCGATCCA CCCGCCTTGA ACTCCCAAAG TGCTGGGATT ACAGGTGTGA GCCACCGTGC 003600
003601 CTGGACCATT TCTTTTCTCA TATATCAGCC TATGCCAGGG GGGAGACTGG CTGAGGTTAA AGGCTAATGT CAAGTACTAA 003680
003681 CTGAACAACA AATGGATAAC CTCAACAATG CAATATCCTG GTTGAAATTG TTAGTGAATT TTTTTAGCAG ACCTGAGATC 003760
003761 TATGTATCTC CATTCAACAA GTAAAGGTAG ACTATTTCCT TAAATGCAAA TTCCTATAAA AAATATCAGA AGTATGTTAT 003840
003841 GTATTTAAAG TTAAAAAGCA TATAACTTTT TAAATGTTTA ATAAAATTCA CCAGTATACA CCATCATTTT CATTGAGAAA 003920
003921 AGTAGTTCAA CCAATCTACA AAGGCCAAAA ATATTTTCAC AGGCCCCCAA AAGATGAACT CCAATCTGCT GCCCCTAAAC 004000
004001 GCATTCTCCA CTGTCTGGAA GATGTGCAAA GGCAACGGCA CAGCGTTCTC CATCTCAAAT CCTTCAAAGT GCTGTCAGAA 004080
004081 TAAAAGAATC CCCACCAAAA GCCCCAGGGA AATCTATCTG TCTGTATGGT ATGGTAATGA CTGAGGTTAT TCCAGATAGG 004160
004161 ATTCTGGGAG AAACTCAGCT ATTGAAATCT AATGAATGGA ACTGAGAATT TCTTGTTATA ATACTGTTTG TTTGATTGAA 004240
004241 TATATCCTTG CATATAGGAA GACATAAAAA TAAATAATAA AAGCAAGATA TTATTTGAA
[back to top]

Predicted Small Protein

Name NONHSAT114953_smProtein_1562:1702
Length 47
Molecular weight 5133.8558
Aromaticity 0.108695652174
Instability index 53.8239130435
Isoelectric point 9.10028076172
Runs 7
Runs residual 0.00918737060041
Runs probability 0.036351477528
Amino acid sequence MSKKTASYEAGRGPSPDTESANTLILDFLACRTVRNKFLFFIIHSV
Secondary structure LLLLLEEELLLLLLLLLLHHHHHHHHHHHHHHHHLLLEEEEEEEEL
PRMN LLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHL
PiMo iiiiiiiiiiiiiiiiiiiiiiiiiiiTTTTTTTTTTTTTTTTTTo