NONHSAT100620
Please input one-sentence summary here.
Annotated Information
Transcriptomic Nomenclature
Please input transcriptomic nomenclature information here.
Function
Please input function information here.
Regulation
Please input regulation information here.
Expression
Please input expression information here.
Allelic Information and Variation
Please input allelic information and variation information here.
Evolution
Please input evolution information here.
Labs working on this lncRNA
Please input related labs here.
References
Please input cited references here.
Basic Information
Transcript ID | NONHSAT100620 |
Source | NONCODE4.0 |
Same with | - |
Classification | intergenic |
Length | 3316 nt |
Genomic location | chr5-:17130137..17217531 |
Exon number | 3 |
Exons | 17130137..17132009,17162578..17162674,17216186..17217531 |
Genome context | |
Sequence |
000001 CTGCTCCCCT CCCCTCCTCC GGCGCAGACC CTCCCCTCTC CCCTCCAGCC TGGACACGCC CGCCTCCCCT TGACTCCCCC 000080
000081 CAGCTCTGGG CCCCCACCTC CCCTCCCCTC CAGCACAGTC ACCCCCATTT CTCTCCTATC CGCCATCCTG GTTCCTCCCT 000160
000161 TCCCCCCACC TCCCAACTCT GTGCCCCGCC AACGTTTCCT AAATGCCCTC TATTCAGATC CCCCCTCCGC CTCCCCTCTC 000240
000241 CTCTCCTCCA TTCCTGCGTC CCCCTTCCCC CGCCGCGCCG CCTGGGCTGT CCGTGGACTT CTCCCACTCT CTCACTCTCT 000320
000321 CACTCACTCT CTCTCTCTCT CTCTCTCTCT CACTCTCTCT CTCTCTCTCC CCCTCATTTA TTTGGAACCG TTGGATAAGA 000400
000401 AGTGCTCGGG CTCTCGCTCA GACTTAGGGA GCTGCCTCGA GGTGATGAAT GACACCCCCT GGCACCAGCT ACCCTTCTCA 000480
000481 GACCCCAGTC CAGCCCGCTC CCGACGTCGA CTACGATTCC GCTACCTCGG CTGGCAGCGA GGTTGGGGTG AGCCCCAGCT 000560
000561 GCAGGCGCGT CTGGGCTGCG CCGCTGCAAA CGAGTTGCGC ACCTTGGGCG GCTCCGCACC TGCACCCGCA CCCGCGGGGC 000640
000641 TCAGCCCCGA AGGCTGCAGC TTCGGGGGAG GCGCGGTCGC CGAGGTCCAG CTGGTGGGGC GAGAGACGTC GCCCCTCGGA 000720
000721 GGATGCTCTC GGAACTTGGG AGAGGAAGGA GGGAAGAGAA GAGGGGAAAG GGGCCGTCGA TGTTTTTGAT GTCTGTGCTT 000800
000801 TAATGGAGGC CACCAATATT GAGAAGACGG GGTTGGCCGA GGCAGCCCGC ACGCTGCTGC TTGCGAGCGC TCGAGTCAAA 000880
000881 GCTAGGGCCA ACCGCGGCTT GTCCGGGTGC CCTAAGGGGG CGGACACTTG GTTTAGCACC GGGACACAGA ATAGCCACCG 000960
000961 GGGTAGGAAG ATGCGTTCAC TTTGCTTACC TGTTGGCAAG AGGGACATAC AAAAATAACG TAACGTGACA TCGTTGACAA 001040
001041 CGGTAGCTCT TTGATTACAC AAAAGCCAAT TTTACCTTCC CGCAAAAGCC AGTTGACGCC TTTGGAACTT TTATTTGCGG 001120
001121 CATTTTGGCG CCCTCTGGCT GTGTTTGGAT CGCTTTCATG CTCGCCTGCG TCCCAGCCAA GAAAAAATCG ATGGAGCTGC 001200
001201 AGGTTGTCCT CAGGATGGTT CTGCCCTCAG GACCTGGGCG TGAATTCAGG GACAGGGTGG CCCTCCAGAA CCGGAGTGAC 001280
001281 AAACTGTAAC CATACTTAGG GAGGCAGACG TCAAAGGCAA GTACATCTGT ATTCAACTGG GTAAAGCCAT GTGAAGACGT 001360
001361 GCCTGCCTCC CTGTTACCTT CCACCATGAT TGTAAGCTTC CTGAGGCCTC CCCAGCCATG CCTGCTGTAT AGCCTGCAGA 001440
001441 ACAGTTTCAG TCGATTTGAT TATAAAGCAA AGGTTTGGGA AGAAAGACTC ATATTCCTCC TGAGATGCAA CCTCAAGTCT 001520
001521 TAGGAAAAGA AAGTTTTGAA CTCAGGTTCC GGGAATGTGG AAGAGGAAGC TCTTAAAGGG CAAGTAGACT TTGCAACATG 001600
001601 CTCGTTTTCA GGATTCTCCT CCTTTGTCCT CAACCCTGCC CAGTGTTCCC CTAACTCCAC CTAAGCCACT CCATAAAGTT 001680
001681 AGGTCCCATT CCTCTATTCC TTCACTACTA ATGACAGCTA ATAGAGCTAA ACAGAATACC ACAATAAGCC AGAAGCTGCC 001760
001761 TTTTTTTATT ATTATTATTA TTCATTGCCC AATTTGTAGA ACTCTCTGAA TTCAATTAAA GTTTGGCAAA GCCTGTGCAA 001840
001841 ATGAAACCAA GTACCTATTT TTTGCTCATC ATAAACCCAA AAGTTTTCTA GAGAACTAAC TGAAAGAGAT TTTCACCAAA 001920
001921 TCTTTTTATT TTTTTAATCT AGAGAATACA ATTGAGAACC AAATCAATAA ATATCCTCAA CTGTTACCTT TGTTATAAGG 002000
002001 GAACTACGAT GAACCGTGCT TGCCCCACAT TTACCCTAGC AGCAACTATG CTTTTTCTAT CTCTGGCCTT ACCCTGCCTT 002080
002081 CCTGCCTCCA GAGTCTGAGA TGGAGAAAGG CAAAGTCAGA TGGAGGATAG AGCTGGGCAG GGAGTTGCTG CCAGCAACAA 002160
002161 TTGGAGTTGC TGGTTTGCTT TCAACACTGA CCCCACTTTA TTGGCCATGA GTTAAGGCAG TCAAATGGGC ATCTAGGAAA 002240
002241 CATGACCAAG ATCTGCATTA GGGAAGCAAA GCAGATTAAA AGGCACAATT GCTGGCCAGG CATGGTGGCT CACACCTCTA 002320
002321 ATCCTAGTGA GAGGTGAAGC CAGTTGGACT TCCTGGGTCG AGTGGGGTGG GGTCTTGGAG AAATTTTCTA TCTAGCTAGA 002400
002401 GGATTATAAA TGCACCAATC AGCTCTGTGT CTAGCTAAAG GTTTGTAAAC GCACCAATCA GCACTCTGTA AAAACGCACC 002480
002481 AATCAATGCT CTGTGTCTAG CTGAAGGTTT GTAAATGCAC CAATCAGCAC TCCGTAAAAC GGACTGATCA GTGCTTTGTA 002560
002561 AAATGGACCA ATCAGCAGGA TGTAGGCAGG GCCAAATAAG GGAAAAAAAG CTGGCACCCA AGCCAACAGG GGCAACCTGC 002640
002641 TTGAGTCCCT TTCCACATTG TGGAAGCTTT GTTCCTTCAC TCTTCATGAT AAATCTTGCT GCTGCTCACT CTTTGGGTCC 002720
002721 GCACTACCTT TATGAGCTGT AACACTCACC ACGAGGGTCT GTGGCTTCAT TCCTTAAGTT AGCAAGACCA CGAACCCACA 002800
002801 GGGAGGAACA AACAACTCCG GGCACACCAC CTTTAAGAGC TATAACACTC ACTGCGAAGG TCTGTGGCTT CACTCCTGTA 002880
002881 GTCAGCAAAA CCACAAACCC ACCGGAAGGA AGAAACTCCA GACACATCTA AACATCTGAA GGAACAAACT CTGGACACAC 002960
002961 CATCTTTAAG AACTGTAACA CTCACCGTGA GGGTCCGTGG CTTCATTCTT GAAGTCAGGG AGACCAAGAA CCCACCGGAA 003040
003041 GGAACAAACT CCGGACACAC TAGCACTTAC AGGAGGCCCA GGTGGGAAGA TCACTTGAGG CCAGGAGTTT GAGACCAGCC 003120
003121 TGGGCAACGT AGTCAGACCC CATCTCTACA AAAAAATTTA AAAAATTAGC CGAGCATGGT GGCAAGTGCC AGTAATCCCA 003200
003201 GCTACTCTTG GAGGCTGAGG CAGGAAGAGC CCTTGAGCCC AGGAGCTGGA GGCTGCAGTG AACTATGATC AAACCACTGT 003280
003281 AGTCCAGCCT GAGTGACAGA GTGAGACCCT GCCTCT |
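The Length, Exon number, and Exons rows above can be cross-checked against each other. The following is a minimal sketch, assuming the exon coordinates are 1-based with inclusive ends (an assumption, not stated in this entry); the helper names `parse_exons` and `exon_lengths` are hypothetical and used only for illustration.

```python
# Exon coordinates copied from the "Exons" row of this entry.
# Assumption: coordinates are 1-based and both ends are inclusive.
exons_field = "17130137..17132009,17162578..17162674,17216186..17217531"


def parse_exons(field):
    """Parse 'start..end,start..end' into a list of (start, end) integer pairs."""
    exons = []
    for part in field.split(","):
        start, end = part.split("..")
        exons.append((int(start), int(end)))
    return exons


def exon_lengths(exons):
    """Length of each exon, treating both coordinates as inclusive."""
    return [end - start + 1 for start, end in exons]


exons = parse_exons(exons_field)
lengths = exon_lengths(exons)
total = sum(lengths)

print("exon count:", len(exons))
print("exon lengths:", lengths)
print("spliced length:", total)
```

Under that assumption the three exon lengths sum to the 3316 nt given in the Length row, and the first exon start and last exon end match the Genomic location span.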
Predicted Small Protein
Name | NONHSAT100620_smProtein_1385:1522 |
Length | 46 |
Molecular weight | 5451.4353 Da |
Aromaticity | 0.155555555556 |
Instability index | 53.9133333333 |
Isoelectric point | 9.58856201172 |
Runs | 7 |
Runs residual | 0.00669191919192 |
Runs probability | 0.0283644989528 |
Amino acid sequence | MIVSFLRPPQPCLLYSLQNSFSRFDYKAKVWEERLIFLLRCNLKS |
Secondary structure | LEEEELLLLLHHHHHHHHLLLHHHHHHHHHHHHHHHHHHHHHLLL |
PRMN | - |
PiMo | - |
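The predicted small protein can be checked against the transcript sequence. The sketch below translates the open reading frame with the standard genetic code and recomputes the aromaticity as the fraction of F, W, and Y residues. Two points are assumptions rather than statements from this entry: the `1385:1522` in the protein name is read as 0-based inclusive coordinates (1-based positions 1386..1523 of the Sequence field), and the Length of 46 is read as 45 residues plus the stop codon. The function names `translate` and `aromaticity` are illustrative, not a description of how the database computed its values.

```python
# ORF copied from the Sequence field in its ten-base chunks.
# Assumption: "1385:1522" in the protein name is 0-based and inclusive,
# i.e. 1-based positions 1386..1523 of the transcript.
ORF = (
    "ATGAT" "TGTAAGCTTC" "CTGAGGCCTC" "CCCAGCCATG" "CCTGCTGTAT" "AGCCTGCAGA"
    "ACAGTTTCAG" "TCGATTTGAT" "TATAAAGCAA" "AGGTTTGGGA" "AGAAAGACTC"
    "ATATTCCTCC" "TGAGATGCAA" "CCTCAAGTCT" "TAG"
)

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nt):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(nt) - 2, 3):
        aa = CODON_TABLE[nt[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def aromaticity(protein):
    """Fraction of aromatic residues (F, W, Y) in the protein."""
    return sum(protein.count(x) for x in "FWY") / len(protein)


peptide = translate(ORF)
print(peptide)
print("codons incl. stop:", len(ORF) // 3, "residues:", len(peptide))
print("aromaticity:", aromaticity(peptide))
```

With these assumptions the translation reproduces the listed amino acid sequence, and the aromaticity agrees with the tabulated value when computed over the 45 residues.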