NONHSAT100003
Annotated Information
Transcriptomic Nomenclature
Function
Regulation
Expression
Allelic Information and Variation
Evolution
Labs working on this lncRNA
References
Basic Information
Transcript ID | NONHSAT100003 |
Source | NONCODE4.0 |
Same with | |
Classification | intergenic |
Length | 5716 nt |
Genomic location | chr5+:1345309..1351024 |
Exon number | 1 |
Exons | 1345309..1351024 |
Genome context | |
Sequence |
000001 GCCGCCCTCC TGGGGTGCCT CTCAGGGACC CAGGCGCTCA GCTGCTGGGG CCGCGGAGGG CGTTCGCGGG GCGCCGGGGA 000080
000081 AGGCTAGGGC CGCGGGGTCC CCGGGGCGCC GTCCCCAACC GCCGACGCCT GCCAGACCAC CAAGTGGGGG CCATGTCGAC 000160 000161 GTGTGAGGAG CCTGGTGGAA GGTGATACTG GAATTCCGGC ACTGGGCCTG CAACCCTTTT AGTCCACAGT TGTCAGAATC 000240 000241 TAAAGGGAAG AAAAACGAAG TGCTCAGAAC CCTCACAGAA GCCGGGCACC TCATCCCGTG TGTGGGTTTT TTCCCCAGTA 000320 000321 CAAACCCATT GTGGGGACAG GCGCTTTTGA AGAAAAAATT CAGATTCCCG GGCTGAAGTG GCTCCTCACA GCCTTGAACC 000400 000401 TGACCCCCAG CTTCCCGCTA AAGCCGAGCG TGCCCAGCAC GGCGAGGTCA GCAGAGGCCC CTCGGAGGGC GGCCGGGGGA 000480 000481 GGTCAGTGGG AGGCCCTCGG GCAGCCGGTG GATGGGCCGG CGACCTCCCC GACTCCATCA GAAGCGCTCA GTCTGTGCCC 000560 000561 TTCAGTTACT GGAGCAGCTC CCGTCCTTTG GCCAGTTCTC TGGAAGGTCG TTTCCGTTTC TGTGTCTGCC ACACGTGCAG 000640 000641 CACAATCGAA AAACCTTGTT CGACATTTCT AGAAAGAGTA GATGCTACTG AAGACCCAAT AGATTGCTGA GGGTTTTCAG 000720 000721 AGCAGGGACA ATTGCATCAG GGTGCTGGGC TCCGGGAAGA ACTCAGAGGA CTTGGCTGAC GGGCGGATGG GACAGTGATT 000800 000801 GGTGCCGAGA ATGGGCCAGA GAAGCAGGTC TGCCTGGAGC AGGCACCCGA GGCCGCGTGC AGGGGCGGTC CTGGGCCACC 000880 000881 AACCCCTGGA AGCTGGGCCA GGAGCTTCGG AGCCATCCGA GGCTTTTTTC GCTCCTCTCA CTCTCATGCA CCTAGTTACC 000960 000961 CTCCCCACAG AGTCTTCTCA CGGCGCGGTT TTCGGCTACC ATTGCAGTGG TGTCTGACTA CACCCAGTGC CTCTATCGTT 001040 001041 CCCCCCTCAG TCTGTTCTCG CTTCTTGGCA AACCTGTCCT TGGAAAGCCC TCTGCTGACT CCTGTTACCC TGATCAAATG 001120 001121 CATCCTCCTC AGGAGGCTCG TGAGCCCTTT GCAGGGTGGC CCAAGAGGCA TCACGCCCTG CTCTCCAGCC ACTCTTGGGG 001200 001201 TCCCCTACAG GGCCAGGCGT CCCACACCAG CAGCGGGTGC TGGTCCTGCC ACAGTCTGTG CGACACGTGA ATGAAGGCTG 001280 001281 TAGACCTGTG GTCTGCGGGA GGCAAATTCC AACTCCAGCT GTCAAACATA AGGGGACTGG TAGGCTCTCC AGGGTGGATT 001360 001361 GTCTCACGGC ATCCAGAGGC AGGGACCGAG GCAGGGACCT AGCCAGGCCT CGGGAAGGAG CTGCAAACCC AGGAGCTCTG 001440 001441 GGGTGTGGCT TCATCTCCCT CCCTCCCTAC CACCCTACCC CCATCTCTCC GTGTTCTTCA GATGCCTCAT TTTCCTGTCA 001520 001521 GGCACCTGGC CAGCTCTAGG AGCTGGAATC AGGTTGGTGC CTCCTTGGTC AGTGTCCACC CCTAGTGTCA TCCACAGTGA 001600 001601 CCAACGGAAG GGGTGGCCGG TGGTACAGGA CATTGGCCAT CTTGGGAAAG 
ATCACAGAGA CAGGAAGGGC CTCTCCAGCT 001680 001681 GAGCAGATGA GAGGTTCTTC AGTGGTCTGG ACACATTTGC CCACTCATGG CCGTCCCCCT GCCCTGTTGA GAGAACCGAT 001760 001761 GAGGAAAGGG AGCCCTGCAT CTGTCGCCTT CCGACTCCGA CCGCCTGAGG CTGGGCTGCC GGCAAGCCGC GGCAGCATAT 001840 001841 GACAGCACTT GAAAGAAAGT CCCTGGAAAG AATGTGGATC ATTGCAGAAC TCCCTCAAGA AAGCGCCAGA GGAGGACCCC 001920 001921 ATTCAGCACC CCGCAGAACT CCTCAAACTC CTGCGTCCAC ACTCCCCCAG CGTGGTGGCT GCTTCTATCT CCCCAGGCTT 002000 002001 AGCTGGCCTG GGTCTTGAAG GCCTCTCCTC CTGCTGTCCC CACGCAGCAT CCTCTCTGCC ACTGCACAGC TGGATGCAGT 002080 002081 GAAGGCCAGG CGGCTGTGGC TGCTTGGGCA ACAGGAGCTG CTTGACCCTG TTCTGCTGGC CTTGTTGGCC TCACTCCCGT 002160 002161 AGAGAGCCAA GGTTACCCAC GTCTCTGAAG CTAAGGCTTT CTTTCTAGGC CTGGGGGAGT CACATCTTGC ATTCGTACTC 002240 002241 ACTGAGCTCC ACGTCTATGG TTAGAGAATT TATCAACATT TTATAATGCT TTTATCAATT CCATCATTTG AGGGCCCTCT 002320 002321 GGCAAGGAAC CTGTTAGTAC ATGGAGTGCC TGGAATGTTC CAGAAAGCTA ACATGAACAG CTGGCCCTTG CTGTCCTTAT 002400 002401 ACATTGAGGA CTCCAAAGGC CACTCAGGAT GACAGGAAAT GCAGGCTCCA AAAGATTCGG TTCCTTTGTG AGGCCCTCGG 002480 002481 GCAGTGGTCA GACACGAAGG AAATAGGAAA TCACCCCCAA ACCACACCTC TCCCTATGAG GCCACAGCTG CCGATGCAGC 002560 002561 CTTGGGAGGG GGATGGAGGG AGAGAGGAGG GCATCTGAAT GTGGACACAC CCTGCTGTGA GGGCTGTGGG CGGTCCTGAG 002640 002641 ACTGGAGCCA GCCCCCAGGC TCAGCTCAGC TCGGGGACCT CGTGCCCGGG CTTTCCTGGG GAGGAAGGTG TTGGATCTCC 002720 002721 TGCCTCTCTG TATTCATTCA CACTTTGGGT TTCTCTCTCT TCCGCGCATT CTAACATCTG CAGGAttttt tttttttttt 002800 002801 tttttttttt gagacagagt ctcgctctgt tgcccaggct ggagtggagt ggtgcgatct cagctcgctg caagctccgc 002880 002881 ctcctgggtt cacgccattc tcgtgcctca gcctccagag taactgggac cacaggcgcc cgccacgacg cccagctaat 002960 002961 tttttgtatt tttagtagag acggggtttc accgtgttag ccaggatggt ctcgatctcc tgaccttgtg atccgcccgc 003040 003041 ctcggcctcc caaagtgctg ggattacagg cgtgagccac cgggcctggc ACATCTGCAG GATTTATGGA TGAAGGGCCA 003120 003121 CAAACCCACA TTTGGTCTCA TTCAACAGAT GTTGTGTCGG CAGGGCGAGG TGGGACTCTG TCTCTGCCCA CGCTACCTGC 003200 003201 CCCTGGGGTT 
GTAGTTCTAG GAAAGCAAGT CTCAGACTGA GTTTTGATCT GGATTAAATT CTGTCCCAGG TGCAGATATT 003280 003281 TATAGGAAAG CAGGGACAAG AGCCCTCAGG CAAAGCTGGG TATCCTGCGA TGGGTGCCTC TGGGAGTTCG CAGATAACAA 003360 003361 GGCCTTTTCC TGGAGGCAAA GCAGAAGCGT TCTGTGGCAG ACCAGTCAGC TTTCTAGCAG GTCAGGGGCA GGGACATCTA 003440 003441 GGCCCAAGGT GTCACACCGT GCCTGGCTGC TGGCTCTCCT CTTGCAAGGA CACAGCTCTT ACAGGACTTC CTAGCCCACC 003520 003521 CAAGAACATC TTTCCATCAT CGAAAAGTTA TTGATAAGAT TTTTCCATCA CTGAAAAGTT ATTGATAAGT GCCACCATGT 003600 003601 GCATGAGCAG CCCTCCAGGT GCTGGCTTGT GGGCCGGTGT TTGAGAATCT TGCCAGTTGA ATGTGGGCAT CTGTTGAGGG 003680 003681 TTTCCCAAAG GCTAACCGTT CGTGCCAGAG GGAGGCCGTG GCTCATTCCT AGGGCCCTGG GTGAGACTGG GGCTCATGCA 003760 003761 CACACAGGTA TCGATGGCAT GGGCTGTAAC GAGCTGAGTG CCTGCTGCCC TTTCCTAGTG GGGGGGGTGG GCTCCTGGAG 003840 003841 AAAGAGGGGA GGCTGCTACT CTCCAGGGCC ACAGGGAGCC CAGAACGCCA CCTCCTGGTg tcagcagcag caaatccata 003920 003921 taagtctgca gcaacttagc ttttgcctcc tcagaggaaa taattcatcc agggggcaca aggcagggtc agagactgag 004000 004001 gtaagtttca gagcaagagt gaaagtttat taaaaaagtg ttagagcagg aacgaaagga aataaagtac acttggagga 004080 004081 gggccaggtg ggcatcttga gagagcaagt acacggtttt gacctttgac ttggggttta tatcttggca tgcttctggg 004160 004161 ggctgcgtcc cttctcccct gattcttccc ttggggtggg ctgtccgcat gcgcagtggc ctgccgacac ttgggagggc 004240 004241 cgcgtgcaca gtgtgcttac tggagttgtg cggtgccctc ttgaggcagt cttcccttac cagttcctag gggaaggtca 004320 004321 cacgctggtt aaactttgcc actttgcctc gtagtgtgca tgcttgacct cactcaccaa ctcctgagat tttttttaaa 004400 004401 ttttttaatt taagttctag ggtacatgta cacaacgtgc aggtttgtta cataggtatg catgtgccat gttggtttgc 004480 004481 tgcatccatt aactcgtcat ttacattagg tatttctcct aatgctatcc ttcccacccc acgacaggcc ctggtgtgtg 004560 004561 atgttgcccg cactgtgtcc aagtgttctc attgttcaat tcccacctgt gccaactcct gggatcttat cgggaagtgg 004640 004641 ctcatcatca gctttaggtg ttttctatct attgggagcc tgcctttctc tggcaccagc tgcaaccaat aattatttta 004720 004721 gacagtttaa caacagcttg accatcacct gatgatcacc tgacatttct gttggggcgc 
gggcccatct cctgccccgc 004800 004801 tcatgtGGGG CTCAGTCTGC CCAACCTTTC CCCAAGGGCC TGTCTTTACT TCATTTGTGG TAAGCCCCAA GCAGAATGGA 004880 004881 CTCTTGTTAT GATCCCAGGA ACTGCTCACG GCCACCTCTG TGGATGAACA CCTAGCGGCT GCTCTAAATA CATGCtgtga 004960 004961 aagttgtcag aatcaaaatg gagtcactaa tgttaagaaa gccctgacaa agagttggtg gggaaggcca cgaagagagg 005040 005041 acacttatgc ttgcatgcct gataccagaa aggactacaa aaaccacagc ccggcacaga ggccatcaca cccttacaca 005120 005121 aaaaatattt ctgcaaggac aactgcccag caattgcctg tccaacctca gactggaatg acctttgtta ttgatgtttg 005200 005201 tagccaagga gaattatctc aaaaccactg tgatcctgct cgcttttcct ttaaagacct ttgtcttcct tgacctccct 005280 005281 gaataggcat atggtttact atggcgtgtg tactctcctt gtaatgttct gttctcaagc taacatctat tcttctagag 005360 005361 aacctctctc ttaggctgac aACGCTGTGG ATCTGGGCTC CCAGGAACTG GGGCTGACAT CAGAGGGGTG GAAAGAGAAC 005440 005441 AGCAGCTCTC CCTGAGGGGA TGCTTGCCAA GGATAAGGAG ACGATGGTTC TCTAAGGATT ATTTTATATT CATCTAAAAG 005520 005521 GGTATTTTTG TTGTTACCCC TAGCCTTTTT TCCCTTTTAA ttttctgatg aataatttat ataaagaaaa cccattctaa 005600 005601 gtgtgcaatt ctatgagttt tgacaaatgt cttcacccat gcaaatcctg ttacaacgga gatatagaac atttccatta 005680 005681 cttaaaaaca gccaggtgct gtagctcatg cctgta |
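The genomic location and the reported length in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive on both ends; the helper names `parse_location` and `span_length` are hypothetical and are not part of NONCODE or any other tool.

```python
import re


def parse_location(location: str):
    """Split a 'chrN+:start..end' string into (chrom, strand, start, end)."""
    match = re.fullmatch(r"(chr\w+)([+-]):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, strand, start, end = match.groups()
    return chrom, strand, int(start), int(end)


def span_length(start: int, end: int) -> int:
    # Assumption: both coordinates are 1-based and inclusive,
    # so the span covers end - start + 1 nucleotides.
    return end - start + 1


chrom, strand, start, end = parse_location("chr5+:1345309..1351024")
length = span_length(start, end)
print(chrom, strand, length)
```

Under that assumption the span is 1351024 - 1345309 + 1 = 5716 nt, which agrees with the Length field and with a single exon covering the whole transcript.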
Predicted Small Protein
Name | NONHSAT100003_smProtein_4457:4711 |
Length | 85 |
Molecular weight | 9076.5104 |
Aromaticity | 0.0952380952381 |
Instability index | 42.375 |
Isoelectric point | 8.58807373047 |
Runs | 17 |
Runs residual | 0.0683955110707 |
Runs probability | 0.0440111391669 |
Amino acid sequence | MHVPCWFAASINSSFTLGISPNAILPTPRQALVCDVARTVSKCSHCSIPTCANSWDLIGK WLIISFRCFLSIGSLPFSGTSCNQ |
Secondary structure | LLLLLEEEEEELLLEELLLLLLLLLLLLLEEEEEHHHHEELLLLLLLLLLLLLHHHHHHL LEEEEEELEEEELEELLLLLLLLL |
PRMN | LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHH HHHHHHHHHHHHLLLLLLLLLLLL |
PiMo | iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiTTTTTTT TTTTTTTTTTTToooooooooooo |
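Some of the values in the table above can be recomputed from the amino acid sequence alone. The sketch below is illustrative only: it assumes that aromaticity means the fraction of F, W and Y residues, and that the two numbers in the small-protein name (4457:4711) delimit an inclusive nucleotide span within the transcript. The function names are hypothetical, and the page does not state which tool produced the original figures.

```python
# Amino acid sequence copied from the table, with the wrap space removed.
AA_SEQ = (
    "MHVPCWFAASINSSFTLGISPNAILPTPRQALVCDVARTVSKCSHCSIPTCANSWDLIGK"
    "WLIISFRCFLSIGSLPFSGTSCNQ"
)


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def codons_in_span(start: int, end: int) -> int:
    # Assumption: start and end are both inclusive positions.
    nucleotides = end - start + 1
    if nucleotides % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    return nucleotides // 3


residues = len(AA_SEQ)
codons = codons_in_span(4457, 4711)
print(residues, codons, aromaticity(AA_SEQ))
```

With these assumptions the sequence has 84 residues while the span holds 85 codons, so the reported Length of 85 may include the stop codon. The aromaticity works out to 8/84, which matches the listed value when computed over the 84 residues.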