NONHSAT114953
Please input one-sentence summary here.
Contents
Annotated Information
Transcriptomic Nomeclature
Please input transcriptomic nomeclature information here.
Function
Please input function information here.
Regulation
Please input regulation information here.
Expression
Please input expression information here.
Allelic Information and Variation
Please input allelic information and variation information here.
Evolution
Please input evolution information here.
You can also add sub-section(s) at will.
Labs working on this lncRNA
Please input related labs here.
References
Please input cited references here.
Basic Information
Transcript ID |
NONHSAT114953 |
Source |
NONCODE4.0 |
Same with |
, |
Classification |
intergenic |
Length |
4299 nt |
Genomic location |
chr6+:132455118..132490514 |
Exon number |
7 |
Exons |
132455118..132455623,132456803..132457146,132467360..132468510,132475796..132475917,132481904..132482054,132485037..132486685,132490138..132490514 |
Genome context |
|
Sequence |
000001 CAGGAAGCCA GCATTTTTAA TAAATAGGAA GTAAATTATA TTTAATGACT AAATTCTCAC CTAAAGTTGA TACTAATATT 000080
000081 ACAATAACTC TCTTGTGCCC TATTTTGGTG TTTTAAACTT CTTTGCTGTA CATTTAAGAA AATCAGAATT TCAGTAGACT 000160 000161 CCTCGTGCAG TCTGAGGAGG CAGTTTCCTA TCTGGTTTGA CTTAAGGGGC TGCAGAAGAC ATTCTTATCC CACATATTTC 000240 000241 TTCTGCTGAT CAGTCAGCAA CTGAAATTCC AACTAATGGA TTCCATTCTT CCAGTTAAGA CATCATCTCC AGAAAACTGT 000320 000321 GCTAGTTGGC ATCCAGCAGT GCCCAAGTAT TATTTAACCC ACTAGAACCT GTGACTGCAT CAGCTTTATA TCCTGGACCT 000400 000401 CAGCCTGGAA CAAAGGAGAG CTACACTTGG ATACTAAACA CAAGTGGTCT GGATAGATTA TGCTGGGAAA AGAAGCCTGG 000480 000481 AGCAGGCAGT ACAGTGGGAG AACAAGCAAA GATGCAGGTG CGTCTATGAA AGCGAACTTT GTCGAGATCT CAGTGACACA 000560 000561 CTCTGCTTTT CCAAATTGCC TCAATTGGCA GTGAATATTA GAAAGAGTCA TGGAATGTCT CTGGAAAGAT ACATAGGAGA 000640 000641 ACTAGTGACA TTGGATCCTT GTGGGTAGGG AGACACTGTG TAAGTGCATC AAAGAGGCTG ACTTTTCACT GTGCACACTC 000720 000721 CTTCAGACCA CTGGAAATTC GGAATTGTAT ACGGTGTGCA TGTATTACCT CCTTAAAACT AGCTAGCTAC CTATCTACCT 000800 000801 ACCTACCTAC CCACCCACCT ACCTTCCTAC CTGGCTAATT CATTAGCTAA GAGAGAATGG AGTTGCAGCA GACTGCTCAG 000880 000881 GAATTGTACA TCACCAGTAG CTTCTGTCTC AGCTAAGTGT GGGAGACTGA TACTGATGAG CCATTAACAA TGGTGGAGAT 000960 000961 TCTGAATCAC TGAAGCACTT GATAAAAGAA ATGTCGTGGG TGATGTGAAA ATCTACATAT GCCCATGAGA GTATATTGAG 001040 001041 TTTGTAAGTG TTGTTGCTTA GTGTGTCACA GAGAAGTTTA TAAGAAACTA TTTTGGAGTA AGAAACAACA CAGAAGAACA 001120 001121 CAGTTTCAGC CTGAAATGGC TTATTTAGCC ACTGACCATC AGAAACAATT ACTTTTTACC TTAACCTAGA TTTGTAATAG 001200 001201 CAGGATGCAT TGAATTCCTT CATTGAAGTG AAACCTTAGC AAATCTTTAA GTAGTACTCT AAAGTGACAA AAGGTGTTTA 001280 001281 CATTTACTAA TGTAAATTTC TTGTGGAGAT AAAGTTTTTA TTCTTCTTGG GTCTCAAGAC TGCTCAAGAA GGAACCATCG 001360 001361 TTTGTTCTAC ACTGCTCTGT TCTAAATGTT TGTTCCCCCA ATATTCATAT GTTGAAATTC TAACCCCCAA GGAGATAGTA 001440 001441 TTAGGAGGTG GGGCCTTCGG GACATGATAA GGTCATGAGG ATGAAGCCCT CAAGAATAAG ATTCATTGCC TTATAAAGAA 001520 001521 AAACCAGAAA GATCCCTCAT TTCTTCAATC ATAAGAAGAC ACATGAGCAA GAAGACGGCC AGCTATGAAG CAGGTCGTGG 001600 001601 GCCCTCGCCA GACACTGAAT CTGCCAACAC TTTGATCTTG GATTTCCTAG CCTGCAGAAC TGTGAGAAAT AAATTTCTTT 001680 001681 TCTTTATAAT CCACTCAGTC TAAGGTACTT TGTTCTAGCA GCCTGAATGT ACTGCGACAT AACACTCCCT TGCTGTTTTA 001760 001761 TTGTTGAGAA AGGAGGAAGG GTAGGTCTTT TATTGTTGAA TAAGGTGGGG AGGAGAGGAC TTTTGGGGTG AGCACCGTGT 001840 001841 GCTTATTGCT GCTTGGAGTG GACTCTTTCT CACCCTATAT TTATGTGTTA AATGGAGTAG GATGTGATTG GGCATGTGGT 001920 001921 TTTCAGCTAC ATTAAAGTAG CTTCTTTGGG CTGTGTGAGA GACACTAAAT ACCATTTCAT AAACACTGAA AAAGAAGTCT 002000 002001 CACAGAAGTG CTTGCCTTGG TGATCCAAAA AAGGCAGGAT CCGGCCGGGG GCAAGGTGAT TATTCTGTCT GATGAGGAGA 002080 002081 ATAAGAGCCA GTTTGTGGTA TTGACAGAAG AATGAAAGTA AAGCAGTCTT GAAGATGGGA AGATCAGGAG CAGTCTGGAG 002160 002161 TCATGGTATA TGAATGAACT AGTGAGAGTG GGCACAAAGT CTGAAGATCT TTGTAGTGCA GCTAATGTCT AAGGGAGAAT 002240 002241 ATACCTACCC TAGAAGAGAC ATTAAACAAC CAGGAAGCTT ACTGCAGGAA AGAAATCCCT CAGAGTCCTG AAGATACAGC 002320 002321 TTTCAGAACT TAGGTGAAGC AAGAGGACTC AAGGTGTGCA AGAATGGACA GTACAGAATG CTCTGCTACA CTGCTCAGAT 002400 002401 TCCCCCTTCA GGAAGGAAGG ACAAATTCAC TCAACTTCTG GGAGCATTGA TGGTTGAGCT CTCAGCCATC AGACCTTTTT 002480 002481 GGAAGTTGCC CTCTTCTGAA AAGAGGCACA TTGCCCAAGA TCTTTCCTCC TCCAGAGCGC AGGCTGCACC TGAAGTCTGA 002560 002561 TTGATGTGGG GTATGATGGT CTGGTCTCCT TTCCCCATCT AAAGGACTGT TCTAACCTCA GGGCTCTCCT TGAGTTTCCT 002640 002641 TAGTTAGTTC TTATTACTAT GTTGCAGTCC AACTTCTTCT TTGGCCTAAT TCTACTTTCC TCCCATTCCC CAATTGTTGA 002720 002721 TCCTGACAGC AATTTCCCCC CAAATTTCTG TAAACTAAGC ATGATCTCAG AGTCACCTGA CCTGCAACAT TATTACATAC 002800 002801 ACTATAGTCC GTGCACTTAA AGATTTAGTG AAAGTATGCT GCTGAAGCTT GCCAGAACTT CCCTTCTTCC ACTGAAAATT 002880 002881 TGTCTGTTCA TAGATATACA CCATTCGTTC TCTTCTCTAC TTTGGCCAAT TATCTTATGT TTGGGAAGGT GTTCTGCTAG 002960 002961 AAGACAAAAT AGCAATTCTT AGAACTAGAA AGAAAAACAC ATCAAAGTCT GTCCTTTCTT ATGTCTTTTC TATCTACAAC 003040 003041 AAATACCCTG TGATAAAAAG GATGATAGGA AATATAAGTA GTTTGAGTCC ACCAAATATT TGGCTTGAAG TATTTGGGTA 003120 003121 TGAAATACAA ACTCACTTGA CCTGGGCTCA AATTTGCCAG CACTTCTTTT CTTTCTTCTC TTTTCAGAAA CAGACTTTAT 003200 003201 TTATATCAAT GTTAACATCA ATCACATAAC CATCAGGAAG GCTGTAACTT TGATTTTTTT TTGTCCAATA GTTGGGCCAA 003280 003281 GTTTACATTT CTCTTATTTT ATTTATTTAT TATTATTATT TTTTTGAGAT GGAGTCTTAC TCTGTCACCC AGGCTGGAGT 003360 003361 GCAATGGCGT CATCTCAGCT CACTGCAACC TCCGCCTCCT GGGTTCAAAC AATTCTCCTG CCTCAGCCTC CTGAGAAGCA 003440 003441 GGGATTACAA GTGTGTGCCA CCACGCCCGG CTAATATTTG TACTTTCAGT AGAGAACGGG GGTTTCACCA TGTTGCTCAG 003520 003521 GTGGTCCCGA ACTCCTGACC TTGCGATCCA CCCGCCTTGA ACTCCCAAAG TGCTGGGATT ACAGGTGTGA GCCACCGTGC 003600 003601 CTGGACCATT TCTTTTCTCA TATATCAGCC TATGCCAGGG GGGAGACTGG CTGAGGTTAA AGGCTAATGT CAAGTACTAA 003680 003681 CTGAACAACA AATGGATAAC CTCAACAATG CAATATCCTG GTTGAAATTG TTAGTGAATT TTTTTAGCAG ACCTGAGATC 003760 003761 TATGTATCTC CATTCAACAA GTAAAGGTAG ACTATTTCCT TAAATGCAAA TTCCTATAAA AAATATCAGA AGTATGTTAT 003840 003841 GTATTTAAAG TTAAAAAGCA TATAACTTTT TAAATGTTTA ATAAAATTCA CCAGTATACA CCATCATTTT CATTGAGAAA 003920 003921 AGTAGTTCAA CCAATCTACA AAGGCCAAAA ATATTTTCAC AGGCCCCCAA AAGATGAACT CCAATCTGCT GCCCCTAAAC 004000 004001 GCATTCTCCA CTGTCTGGAA GATGTGCAAA GGCAACGGCA CAGCGTTCTC CATCTCAAAT CCTTCAAAGT GCTGTCAGAA 004080 004081 TAAAAGAATC CCCACCAAAA GCCCCAGGGA AATCTATCTG TCTGTATGGT ATGGTAATGA CTGAGGTTAT TCCAGATAGG 004160 004161 ATTCTGGGAG AAACTCAGCT ATTGAAATCT AATGAATGGA ACTGAGAATT TCTTGTTATA ATACTGTTTG TTTGATTGAA 004240 004241 TATATCCTTG CATATAGGAA GACATAAAAA TAAATAATAA AAGCAAGATA TTATTTGAA |
Predicted Small Protein
Name | NONHSAT114953_smProtein_1562:1702 |
Length | 47 |
Molecular weight | 5133.8558 |
Aromaticity | 0.108695652174 |
Instability index | 53.8239130435 |
Isoelectric point | 9.10028076172 |
Runs | 7 |
Runs residual | 0.00918737060041 |
Runs probability | 0.036351477528 |
Amino acid sequence | MSKKTASYEAGRGPSPDTESANTLILDFLACRTVRNKFLFFIIHSV |
Secondary structure | LLLLLEEELLLLLLLLLLHHHHHHHHHHHHHHHHLLLEEEEEEEEL |
PRMN | LLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHL |
PiMo | iiiiiiiiiiiiiiiiiiiiiiiiiiiTTTTTTTTTTTTTTTTTTo |