NONHSAT104558
Please input one-sentence summary here.
Contents
Annotated Information
Transcriptomic Nomeclature
Please input transcriptomic nomeclature information here.
Function
Please input function information here.
Regulation
Please input regulation information here.
Expression
Please input expression information here.
Allelic Information and Variation
Please input allelic information and variation information here.
Evolution
Please input evolution information here.
You can also add sub-section(s) at will.
Labs working on this lncRNA
Please input related labs here.
References
Please input cited references here.
Basic Information
Transcript ID |
NONHSAT104558 |
Source |
NONCODE4.0 |
Same with |
, |
Classification |
sense |
Length |
3119 nt |
Genomic location |
chr5-:149823801..149829290 |
Exon number |
3 |
Exons |
149823801..149823916,149825172..149825248,149826365..149829290 |
Genome context |
|
Sequence |
000001 TGTGGAGTCT GGAGACGACG TGCAGGTAGG AGGCCCGGGC GCGACAATCG GGGGGCATCC TGCGGCGAGG GGACCCTGTG 000080
000081 GGGCTTGGGA CGAGAGACGG GGGTCTTTCC GTGGGAACCG AGCTAGGTGC CGGGCAAGAG ACGCGCGGCT GGCCCACCTG 000160 000161 GATCCTGGCC AACTCGGGAT TGAGTTCGTT CCTGGTCTCA GAAGGCCCGT TTTGCTTTCA GGGAGGAGCT TGTGAAGTAA 000240 000241 GGGTGAGTGC GGGTCCAGCC TTTTAAGGCC TCGGCCCCGC AATACGGCCA CGGCCACGGC CGCGTTTGAG CTGCACAGCG 000320 000321 TAGTTGAGGG AACCCGGGAC AGACGTGGGC TCCCGCCTCT ACCTCGCCAA ACTTTTTTCT TGGTGATCGC AGGCCCACGC 000400 000401 CTAATCTCGT TTgtttcctc gtttgcaaaa taggaataac aatagcaccg atcccatggg tttgtagtga tcattcaaag 000480 000481 aaggaaagca gggaaaactc tCGCACTATG TTGGATGCTT CTAAATCTGG GCGATTCTTT CCGTTGCTGA GTCGGGCACG 000560 000561 TTGCAAGTTC TGGGCGCTCA GGGCCCAGCA CCAATGTTTG CTGTGCGCTT GCCCTGCGTT TATATACTCC TGATGACCAC 000640 000641 TCTGCTCGTT ACTTTAGGGG ATGTTAGGAA CGGAGAAACT GCAAATTGGC ATTTACTGAA TGGCCATCAT GCCGAAACAT 000720 000721 ACTCATGCTT ACGTATTTGA GATATGCTGG AGCCTTTCTA TGCTTCTCGA GGCAGAACTT TGGGCTTCTC TCCTGTGGCG 000800 000801 CGTTCCTTAC AATAGTTAAC GCACTGGGTC GTGCTCATTG GTCTGATTTG AAGATAGGAA CATTTAACTT CGTACACCCA 000880 000881 AGACTTACAC TTGAAGTACT TACTGTGGTC ACACACTTAA CTAAAGTTTA TATAGGGAAG GCAGAGGAGA GATCTAGGAA 000960 000961 CACCTGAGAC AGACTAGGGT AGTTATTTGT GGTGTAgagg catcttggca ggaggcagag agtgctgggc tgggagtcag 001040 001041 gacacgttgt ttctatgttt ggttttgcca ctgtctttac tgtgaccata ggctgagaca ctCTCTATCT TCACCATCTG 001120 001121 CAGGATATCG ATTAGTGTTA GTGTCTTCTG AAAATGTTAA ATTGTTCTGA ACACCAGGAG GTTCAGAAGG CCTGGGGCTT 001200 001201 TAGGCCGGAA GCAGTCTTAT CCTAGCCTTC CACTTTCTCA TTCATTCTCA CCCGCCTTCC TCCTGTCAGT TTCTCTCTCC 001280 001281 CTTAGATCTT GTGACAGTAT GATTGCAATT ATTATGAAGG TCATAATAGG TTAGAGTTAT TTAGTCTCAG GACCCATCAG 001360 001361 ATTTCTGAGA TGGGTCTTTT CCATTAGCCT GCCTTTTGGA TATATTTTAG TGCCTTTGAT ATTTGATGTT GGTGACAAAG 001440 001441 ATTTAGTTCT AAATAGTTGT GGGAGTCAAA TCAGTTGCAT TTTTGAACAT TTTGAAGGGA TCAGGTGGAG CACAGGAAAA 001520 001521 CAGGAGATGA GAGGCTTGAG GGCTGTGTTG GTAAGGGTCC CTTGAATTCT AAGCTGAGGA GTTCATGCAG GAATCAAGCA 001600 001601 GCTCCCTCGC CCCATGAAGG GTGTTAGGAA TGGTACCAGT ACATGGGTAG CTGTTGGTCT TGGGTTTCTT CTTGCCTTTT 001680 001681 GGGGTGCAGC CTTCAGGCAT CTGCTGCTAG ATCCCTAGGC TTCTATCTTG TAGGTGCTGT GAGCTGAGCT GTTGGTAGTT 001760 001761 TCTGGGGAAA GGATACTTCT CATGGTACTT TGCCATGTGG CCACAGTTGA TACCCCCCCA AGTGCAAGGA GAAAGAAGTT 001840 001841 TTAGTGAGGC AGAAATGAGA GAATACAGTA GAGTATTGGA TGGAAGAAGA CAGGGACATG AAGCAGGACT AGGGTTTCAT 001920 001921 TTAATAGCCA GGGGAAAGGG ATTCTGGAAA TGTTTTGTCT GTTAACCTTG AGTTTCTTTT CCTTCCACTC AGAAATGGCA 002000 002001 CCTCGAAAGG GGAAGGAAAA GAAGGAAGAA CAGGTCATCA GCCTCGGACC TCAGGTGGCT GAAGGAGAGA ATGTATTTGG 002080 002081 TGTCTGCCAT ATCTTTGCAT CCTTCAATGA CACTTTTGTC CATGTCACTG ATCTTTCTGG CAAGTGAGTA CCTGGGTGGA 002160 002161 GAGGCATCCA GCTGGCAAAA GGCTGAGGAA GGCAATGGCT GGGACGGGCT AGCAGTTCAG GGGATTCTCT CTAAAGAAAT 002240 002241 GCTGTTTTGT CCAGGTAAGA AAATGTGCTT GTCCATTTAG CCCACAAATA TTGTGATTTC CCAGGGGTTA CAAGAGAGGA 002320 002321 GACACATTCT TCGTCCTTAC AGCGTGATGG TCATTGAATC CTTGGTTTCT TGGGAATTAT CTTCTTTCCC TAGTTCCCTT 002400 002401 TGCAGGAGCA GCAGtggact gcaagtgggc tctgggtgcg ggttcgacgg ccatttagca agttgtgact tgtgtgaaat 002480 002481 cactcagatt ctgagttttg gtttcctcat atgtaaaatt aggacagtaa tgtctacctt gtggagtaat ggtaaggatt 002560 002561 aaatgcaaaa gtagatatat aaaatgttaa tacTGAGTAT AGCTATATTG GGCCATTCAA CCCCACTGGA CTGAACTCCA 002640 002641 TGAGGGCAGA GCCCATGACC ATCATCCTCC ACCATGACTG AGCTGTGTTG CTCTTCACTC ATTAGGATAT CGTAAGCACT 002720 002721 CCTTGTTGGA CGAGGAAGAA ATGACCCTGT GCTTTTGTCC CCAGGGAAAC CATCTGCCGT GTGACTGGTG GGATGAAGGT 002800 002801 AAAGGCAGAC CGAGATGAAT CCTCACCATA TGCTGCTATG TTGGCTGCCC AGGATGTGGC CCAGAGGTGC AAGGAGCTGG 002880 002881 GTATCACCGC CCTACACATC AAACTCCGGG CCACAGGAGG AAATAGGACC AAGACCCCTG GACCTGGGGC CCAGTCGGCC 002960 002961 CTCAGAGCCC TTGCCCGCTC GGGTATGAAG ATCGGGCGGA TTGAGGATGT CACCCCCATC CCCTCTGACA GCACTCGCAG 003040 003041 GAAGGGGGGT CGCCGTGGTC GCCGTCTGTG AACAAGATTC CTCAAAATAT TTTCTGTTAA TAAATTGCCT TCATGTAAA |
Predicted Small Protein
Name | NONHSAT104558_smProtein_2792:3070 |
Length | 93 |
Molecular weight | 9897.3433 |
Aromaticity | 0.0108695652174 |
Instability index | 61.9347826087 |
Isoelectric point | 11.3439331055 |
Runs | 11 |
Runs residual | 0.0156396621833 |
Runs probability | 0.0243211125564 |
Amino acid sequence | MKVKADRDESSPYAAMLAAQDVAQRCKELGITALHIKLRATGGNRTKTPGPGAQSALRAL ARSGMKIGRIEDVTPIPSDSTRRKGGRRGRRL |
Secondary structure | LLEEELLLLLLHHHHHHHHHHHHHHHHHLLLEEEEEEEELLLLLLLLLLLLHHHHHHHHH HHHLLLLEEEELLLLLLLLLLLLLLLLLLLLL |
PRMN | - |
PiMo | - |