NONHSAT100003

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT100003

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

5716 nt

Genomic location

chr5+:1345309..1351024

Exon number

1

Exons

1345309..1351024

Genome context

Sequence
000001 GCCGCCCTCC TGGGGTGCCT CTCAGGGACC CAGGCGCTCA GCTGCTGGGG CCGCGGAGGG CGTTCGCGGG GCGCCGGGGA 000080
000081 AGGCTAGGGC CGCGGGGTCC CCGGGGCGCC GTCCCCAACC GCCGACGCCT GCCAGACCAC CAAGTGGGGG CCATGTCGAC 000160
000161 GTGTGAGGAG CCTGGTGGAA GGTGATACTG GAATTCCGGC ACTGGGCCTG CAACCCTTTT AGTCCACAGT TGTCAGAATC 000240
000241 TAAAGGGAAG AAAAACGAAG TGCTCAGAAC CCTCACAGAA GCCGGGCACC TCATCCCGTG TGTGGGTTTT TTCCCCAGTA 000320
000321 CAAACCCATT GTGGGGACAG GCGCTTTTGA AGAAAAAATT CAGATTCCCG GGCTGAAGTG GCTCCTCACA GCCTTGAACC 000400
000401 TGACCCCCAG CTTCCCGCTA AAGCCGAGCG TGCCCAGCAC GGCGAGGTCA GCAGAGGCCC CTCGGAGGGC GGCCGGGGGA 000480
000481 GGTCAGTGGG AGGCCCTCGG GCAGCCGGTG GATGGGCCGG CGACCTCCCC GACTCCATCA GAAGCGCTCA GTCTGTGCCC 000560
000561 TTCAGTTACT GGAGCAGCTC CCGTCCTTTG GCCAGTTCTC TGGAAGGTCG TTTCCGTTTC TGTGTCTGCC ACACGTGCAG 000640
000641 CACAATCGAA AAACCTTGTT CGACATTTCT AGAAAGAGTA GATGCTACTG AAGACCCAAT AGATTGCTGA GGGTTTTCAG 000720
000721 AGCAGGGACA ATTGCATCAG GGTGCTGGGC TCCGGGAAGA ACTCAGAGGA CTTGGCTGAC GGGCGGATGG GACAGTGATT 000800
000801 GGTGCCGAGA ATGGGCCAGA GAAGCAGGTC TGCCTGGAGC AGGCACCCGA GGCCGCGTGC AGGGGCGGTC CTGGGCCACC 000880
000881 AACCCCTGGA AGCTGGGCCA GGAGCTTCGG AGCCATCCGA GGCTTTTTTC GCTCCTCTCA CTCTCATGCA CCTAGTTACC 000960
000961 CTCCCCACAG AGTCTTCTCA CGGCGCGGTT TTCGGCTACC ATTGCAGTGG TGTCTGACTA CACCCAGTGC CTCTATCGTT 001040
001041 CCCCCCTCAG TCTGTTCTCG CTTCTTGGCA AACCTGTCCT TGGAAAGCCC TCTGCTGACT CCTGTTACCC TGATCAAATG 001120
001121 CATCCTCCTC AGGAGGCTCG TGAGCCCTTT GCAGGGTGGC CCAAGAGGCA TCACGCCCTG CTCTCCAGCC ACTCTTGGGG 001200
001201 TCCCCTACAG GGCCAGGCGT CCCACACCAG CAGCGGGTGC TGGTCCTGCC ACAGTCTGTG CGACACGTGA ATGAAGGCTG 001280
001281 TAGACCTGTG GTCTGCGGGA GGCAAATTCC AACTCCAGCT GTCAAACATA AGGGGACTGG TAGGCTCTCC AGGGTGGATT 001360
001361 GTCTCACGGC ATCCAGAGGC AGGGACCGAG GCAGGGACCT AGCCAGGCCT CGGGAAGGAG CTGCAAACCC AGGAGCTCTG 001440
001441 GGGTGTGGCT TCATCTCCCT CCCTCCCTAC CACCCTACCC CCATCTCTCC GTGTTCTTCA GATGCCTCAT TTTCCTGTCA 001520
001521 GGCACCTGGC CAGCTCTAGG AGCTGGAATC AGGTTGGTGC CTCCTTGGTC AGTGTCCACC CCTAGTGTCA TCCACAGTGA 001600
001601 CCAACGGAAG GGGTGGCCGG TGGTACAGGA CATTGGCCAT CTTGGGAAAG ATCACAGAGA CAGGAAGGGC CTCTCCAGCT 001680
001681 GAGCAGATGA GAGGTTCTTC AGTGGTCTGG ACACATTTGC CCACTCATGG CCGTCCCCCT GCCCTGTTGA GAGAACCGAT 001760
001761 GAGGAAAGGG AGCCCTGCAT CTGTCGCCTT CCGACTCCGA CCGCCTGAGG CTGGGCTGCC GGCAAGCCGC GGCAGCATAT 001840
001841 GACAGCACTT GAAAGAAAGT CCCTGGAAAG AATGTGGATC ATTGCAGAAC TCCCTCAAGA AAGCGCCAGA GGAGGACCCC 001920
001921 ATTCAGCACC CCGCAGAACT CCTCAAACTC CTGCGTCCAC ACTCCCCCAG CGTGGTGGCT GCTTCTATCT CCCCAGGCTT 002000
002001 AGCTGGCCTG GGTCTTGAAG GCCTCTCCTC CTGCTGTCCC CACGCAGCAT CCTCTCTGCC ACTGCACAGC TGGATGCAGT 002080
002081 GAAGGCCAGG CGGCTGTGGC TGCTTGGGCA ACAGGAGCTG CTTGACCCTG TTCTGCTGGC CTTGTTGGCC TCACTCCCGT 002160
002161 AGAGAGCCAA GGTTACCCAC GTCTCTGAAG CTAAGGCTTT CTTTCTAGGC CTGGGGGAGT CACATCTTGC ATTCGTACTC 002240
002241 ACTGAGCTCC ACGTCTATGG TTAGAGAATT TATCAACATT TTATAATGCT TTTATCAATT CCATCATTTG AGGGCCCTCT 002320
002321 GGCAAGGAAC CTGTTAGTAC ATGGAGTGCC TGGAATGTTC CAGAAAGCTA ACATGAACAG CTGGCCCTTG CTGTCCTTAT 002400
002401 ACATTGAGGA CTCCAAAGGC CACTCAGGAT GACAGGAAAT GCAGGCTCCA AAAGATTCGG TTCCTTTGTG AGGCCCTCGG 002480
002481 GCAGTGGTCA GACACGAAGG AAATAGGAAA TCACCCCCAA ACCACACCTC TCCCTATGAG GCCACAGCTG CCGATGCAGC 002560
002561 CTTGGGAGGG GGATGGAGGG AGAGAGGAGG GCATCTGAAT GTGGACACAC CCTGCTGTGA GGGCTGTGGG CGGTCCTGAG 002640
002641 ACTGGAGCCA GCCCCCAGGC TCAGCTCAGC TCGGGGACCT CGTGCCCGGG CTTTCCTGGG GAGGAAGGTG TTGGATCTCC 002720
002721 TGCCTCTCTG TATTCATTCA CACTTTGGGT TTCTCTCTCT TCCGCGCATT CTAACATCTG CAGGAttttt tttttttttt 002800
002801 tttttttttt gagacagagt ctcgctctgt tgcccaggct ggagtggagt ggtgcgatct cagctcgctg caagctccgc 002880
002881 ctcctgggtt cacgccattc tcgtgcctca gcctccagag taactgggac cacaggcgcc cgccacgacg cccagctaat 002960
002961 tttttgtatt tttagtagag acggggtttc accgtgttag ccaggatggt ctcgatctcc tgaccttgtg atccgcccgc 003040
003041 ctcggcctcc caaagtgctg ggattacagg cgtgagccac cgggcctggc ACATCTGCAG GATTTATGGA TGAAGGGCCA 003120
003121 CAAACCCACA TTTGGTCTCA TTCAACAGAT GTTGTGTCGG CAGGGCGAGG TGGGACTCTG TCTCTGCCCA CGCTACCTGC 003200
003201 CCCTGGGGTT GTAGTTCTAG GAAAGCAAGT CTCAGACTGA GTTTTGATCT GGATTAAATT CTGTCCCAGG TGCAGATATT 003280
003281 TATAGGAAAG CAGGGACAAG AGCCCTCAGG CAAAGCTGGG TATCCTGCGA TGGGTGCCTC TGGGAGTTCG CAGATAACAA 003360
003361 GGCCTTTTCC TGGAGGCAAA GCAGAAGCGT TCTGTGGCAG ACCAGTCAGC TTTCTAGCAG GTCAGGGGCA GGGACATCTA 003440
003441 GGCCCAAGGT GTCACACCGT GCCTGGCTGC TGGCTCTCCT CTTGCAAGGA CACAGCTCTT ACAGGACTTC CTAGCCCACC 003520
003521 CAAGAACATC TTTCCATCAT CGAAAAGTTA TTGATAAGAT TTTTCCATCA CTGAAAAGTT ATTGATAAGT GCCACCATGT 003600
003601 GCATGAGCAG CCCTCCAGGT GCTGGCTTGT GGGCCGGTGT TTGAGAATCT TGCCAGTTGA ATGTGGGCAT CTGTTGAGGG 003680
003681 TTTCCCAAAG GCTAACCGTT CGTGCCAGAG GGAGGCCGTG GCTCATTCCT AGGGCCCTGG GTGAGACTGG GGCTCATGCA 003760
003761 CACACAGGTA TCGATGGCAT GGGCTGTAAC GAGCTGAGTG CCTGCTGCCC TTTCCTAGTG GGGGGGGTGG GCTCCTGGAG 003840
003841 AAAGAGGGGA GGCTGCTACT CTCCAGGGCC ACAGGGAGCC CAGAACGCCA CCTCCTGGTg tcagcagcag caaatccata 003920
003921 taagtctgca gcaacttagc ttttgcctcc tcagaggaaa taattcatcc agggggcaca aggcagggtc agagactgag 004000
004001 gtaagtttca gagcaagagt gaaagtttat taaaaaagtg ttagagcagg aacgaaagga aataaagtac acttggagga 004080
004081 gggccaggtg ggcatcttga gagagcaagt acacggtttt gacctttgac ttggggttta tatcttggca tgcttctggg 004160
004161 ggctgcgtcc cttctcccct gattcttccc ttggggtggg ctgtccgcat gcgcagtggc ctgccgacac ttgggagggc 004240
004241 cgcgtgcaca gtgtgcttac tggagttgtg cggtgccctc ttgaggcagt cttcccttac cagttcctag gggaaggtca 004320
004321 cacgctggtt aaactttgcc actttgcctc gtagtgtgca tgcttgacct cactcaccaa ctcctgagat tttttttaaa 004400
004401 ttttttaatt taagttctag ggtacatgta cacaacgtgc aggtttgtta cataggtatg catgtgccat gttggtttgc 004480
004481 tgcatccatt aactcgtcat ttacattagg tatttctcct aatgctatcc ttcccacccc acgacaggcc ctggtgtgtg 004560
004561 atgttgcccg cactgtgtcc aagtgttctc attgttcaat tcccacctgt gccaactcct gggatcttat cgggaagtgg 004640
004641 ctcatcatca gctttaggtg ttttctatct attgggagcc tgcctttctc tggcaccagc tgcaaccaat aattatttta 004720
004721 gacagtttaa caacagcttg accatcacct gatgatcacc tgacatttct gttggggcgc gggcccatct cctgccccgc 004800
004801 tcatgtGGGG CTCAGTCTGC CCAACCTTTC CCCAAGGGCC TGTCTTTACT TCATTTGTGG TAAGCCCCAA GCAGAATGGA 004880
004881 CTCTTGTTAT GATCCCAGGA ACTGCTCACG GCCACCTCTG TGGATGAACA CCTAGCGGCT GCTCTAAATA CATGCtgtga 004960
004961 aagttgtcag aatcaaaatg gagtcactaa tgttaagaaa gccctgacaa agagttggtg gggaaggcca cgaagagagg 005040
005041 acacttatgc ttgcatgcct gataccagaa aggactacaa aaaccacagc ccggcacaga ggccatcaca cccttacaca 005120
005121 aaaaatattt ctgcaaggac aactgcccag caattgcctg tccaacctca gactggaatg acctttgtta ttgatgtttg 005200
005201 tagccaagga gaattatctc aaaaccactg tgatcctgct cgcttttcct ttaaagacct ttgtcttcct tgacctccct 005280
005281 gaataggcat atggtttact atggcgtgtg tactctcctt gtaatgttct gttctcaagc taacatctat tcttctagag 005360
005361 aacctctctc ttaggctgac aACGCTGTGG ATCTGGGCTC CCAGGAACTG GGGCTGACAT CAGAGGGGTG GAAAGAGAAC 005440
005441 AGCAGCTCTC CCTGAGGGGA TGCTTGCCAA GGATAAGGAG ACGATGGTTC TCTAAGGATT ATTTTATATT CATCTAAAAG 005520
005521 GGTATTTTTG TTGTTACCCC TAGCCTTTTT TCCCTTTTAA ttttctgatg aataatttat ataaagaaaa cccattctaa 005600
005601 gtgtgcaatt ctatgagttt tgacaaatgt cttcacccat gcaaatcctg ttacaacgga gatatagaac atttccatta 005680
005681 cttaaaaaca gccaggtgct gtagctcatg cctgta
[back to top]

Predicted Small Protein

Name NONHSAT100003_smProtein_4457:4711
Length 85
Molecular weight 9076.5104
Aromaticity 0.0952380952381
Instability index 42.375
Isoelectric point 8.58807373047
Runs 17
Runs residual 0.0683955110707
Runs probability 0.0440111391669
Amino acid sequence MHVPCWFAASINSSFTLGISPNAILPTPRQALVCDVARTVSKCSHCSIPTCANSWDLIGK
WLIISFRCFLSIGSLPFSGTSCNQ
Secondary structure LLLLLEEEEEELLLEELLLLLLLLLLLLLEEEEEHHHHEELLLLLLLLLLLLLHHHHHHL
LEEEEEELEEEELEELLLLLLLLL
PRMN LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHH
HHHHHHHHHHHHLLLLLLLLLLLL
PiMo iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiTTTTTTT
TTTTTTTTTTTToooooooooooo