NONHSAT119847

From LncRNAWiki

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomenclature

Please input transcriptomic nomenclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT119847

Source

NONCODE4.0

Same with

,

Classification

sense

Length

5583 nt

Genomic location

chr7+:30118702..30124266

Exon number

1

Exons

30118702..30124266
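
The Genomic location and Length fields can be compared directly. The following is a minimal sketch, assuming the location string has the form chromosome, strand, then 1-based inclusive start..end coordinates (the page does not state the coordinate convention). The function names parse_location and span_length are illustrative and not part of any LncRNAWiki or NONCODE tool.

```python
import re

def parse_location(loc):
    """Split a location string such as 'chr7+:30118702..30124266' into parts."""
    match = re.fullmatch(r"(chr\w+)([+-]):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    chrom, strand, start, end = match.groups()
    return chrom, strand, int(start), int(end)

def span_length(start, end):
    # Assumption: both coordinates are 1-based and inclusive.
    return end - start + 1

chrom, strand, start, end = parse_location("chr7+:30118702..30124266")
genomic_span = span_length(start, end)   # length of the single exon on the genome
listed_length = 5583                     # value of the Length field on this page
difference = listed_length - genomic_span
```

Under this assumption the single exon spans 5565 nt on the genome, 18 nt fewer than the listed transcript length of 5583 nt. The page gives no explanation for the difference.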

Genome context

Sequence
000001 ATTGTATTGT TTTTATATTT TAGTCTAATG GGCCACCCAA ACCCAAGCTG AAAATCAGCA AATTCCATAT TAAGTACCAT 000080
000081 AATTCATAGC CAGTGTTTCA GCCAACTTAG ACTAGACATT TGGAGGTAGT ATAAGCTGCT TTGTTGAAGC TGTGTAGAGT 000160
000161 TTGCTGTTCC TAGATGTTCT TCAGTGGACC CTCTTCACTG CAACTCTGTC AGTGATAAGG GCCTGTGTAG TAAAGATGTT 000240
000241 CAGGGCATTC ACATGACCAT GCAATTGTGG GAGGCGAAGA AGACGTGGAC AGGAGTCCCA TCCTTGCTGA CAGGCATGAA 000320
000321 ACCGTTGCTC TGAGAAGATT AATGGTGTGC CCTAGCCCCA AGTTGGAGGG GAGAATATGA GAGAGGTGGG GCAGGTCATT 000400
000401 TGAGATGACA CCTCCCAACT GCCTACCATT TACCAGCATG TTCCCCATGC ATTATCTCAA TTGGACATCA CAAGTAATGA 000480
000481 TACCCAGAGG GATTATTACT CCACTTCAAA AGCAAGGTTT AGAAGTTGAG GGATCTGTTC ACAGTCACAT AGTTTTTAAG 000560
000561 CAGAGGAGCC AGATAATTTC CAAGTGTGAC CTGGACTGCC TCTGCATCAA AGTCATATGG AGTGCTTGTT CAAACAGCAG 000640
000641 ATTCCCAGGC CTTATTTTGG CCTAAAGAAC CAGAGTCTAG GTGGTGGGAC ATAGGAATCT GCATTTCAGT AAACTTTACA 000720
000721 CGTGATTCTT CTGCACACAG TATTGAAGAG CAACTAGATT AAATTCTAGT TTACAAAATT ACCAGTTTTC TTCAAGAACT 000800
000801 AAATGATATG TCCTTTTTTT TTTTTTCAAA GAGGATAAGG CTGCTATTTA AATAAAATAG CTAAATGGAG AGTGAGAAGT 000880
000881 GGAGCAGGTT CATTCAGCAG CATTCTTAAT TGAGCCAGCA TTGACACCCA GCCAGCAGGC CTTTGCATTG CATTCGGGGA 000960
000961 CCATGACTCT GAATCTGCTT ACCAATCAAT CTCGGTTTAA TCACCAAAAG TGCAGAGCAG GCAAAATGCA GCTGTTTATC 001040
001041 AATCTCAAAA GCTTTGGGAC AGTGTCATAG TTGAAAGATG AGACTTAAGA AAACAGTTTC TTAAACTTCT TAAAACTTAA 001120
001121 GAAACATTGT TTCATAAAAC AATATTGAGT GGGCATTCTT CTGCACAGTG TGATGCTCCA ACCCTGGCCC TAGTCTCAGT 001200
001201 AGACCATGCT GCCTCGAGTG TGCATCGGAG AGAAGCCATG GGTACCTTCC CCATTAGAGG CTACTTCCTT CTAGTAACAG 001280
001281 GAAGGGAAGT TCCAGCATGA GGTAGTTATC CAGGGTAGAA GGTCCTTTGA GGGGCTTGGT TGAATTGAGA GCATCATCTC 001360
001361 TAGATGATGC TGTTCCTGCT GCAGATCTCT AGGATGGAGA GAATTCTCTC TTTAGTCAGA GAAGTTTATG TAGGGAGGGG 001440
001441 TATTGGTTTT GCCTTTGTGT GTCTTTAAAC AAATGAACAT TTATTTAGCT CAGATTAATT AGGTATTTTG CCCACATAAA 001520
001521 GACTTCTGGA AAATACTTAA ACTTGAAAAA TCAACATCAC ATGTTTTAAA GCTAGGGAGA AAGAAGGGGG GTATTAAAAT 001600
001601 GATGTTGATT ATTTTGTATT TTGCCAAGGT GTGTGTGTGT TATTTCCCTC CCACTCTCAT GAGCAGTGAG TATAGATCTC 001680
001681 CTTCTCTGAT TAGTATGAAT ATGATGGCAG GACTCGGGGA TAGTCCCTGC CCTTGACATA GCCCCCTAAA ACGGAAAAAG 001760
001761 AAAAAGCCAG TTTTTGCTTG TACTTTGAAT GAAGATGTTT TAGGCATTGT ACCATTTAGC GGGGATGATA CCAGGTGGTT 001840
001841 GTTAGAATTG TGCAGTGTGA TCATTCTAAA CAGCTGCTGG TGCTCCCTGT CACCTCAGGT GAACTCTGTG GTCTCTTGGA 001920
001921 GAGGTAGCAC TCTGAAAATA CCTCAGGTTT GCCACCGCAA CCCTGAATAC ACACAAAAGG AAAGCTGCTC AGCATGGCCA 002000
002001 TTTTGCATTT GTATAGGTAG TGACTAGATG TACACAACTT AATTTGCTGG GAATGAGGGG CTTAATATTA TCTGAGATCA 002080
002081 TTGAGAACCC AGATCAGACA GAATAGCTTG AATAAGTTAC ATTTTCCAAT TACCCTTTTT CCACATCTGT AGAAAGAGGG 002160
002161 TAATATTTTT TAATAGGTAT TTTCCCCACT GGAGCATATT ACGTTTGCCT AAGATGTATA AAAGTTTGTT TAAGATGTGT 002240
002241 AAAAGTTTGC TTAAGGAACT GAGGATCCTT AAATAAAAAT ATTAGAAATT AGAAATTGAA CCTAATACTA AACAGTAAAT 002320
002321 TCAGCTTAAC CTGAACCTTG GCATAGTCAG AGCTTCCTCC TACATCTAAA GTATTTGCTC TCTGTTTTAG TTAAAGTCAT 002400
002401 AATTTGCGCT GATGTGTAAT CACTTTCCAA GAAGAGGGCA ATGAGAAAAG ATATTTAAAG CTTTCTCTCC ATAGCCCTCC 002480
002481 AAGACTTCTG GGACAACTAA ATTTACTTTC ACCATTACTG TGAGAGGAGG TGAGAAAACT CTAGTATTTT GTTGGCAGAG 002560
002561 TAATCACTTT GTTCTCATCG CTCAAAGCAT TTTTAGGATT ATTTTTCTAG CGTAACCTTT AGAGAGAACT GGAAGAAAAA 002640
002641 GGAAATTGGT CTACTAGGTA TTGTAGACAC AAATAAGTAA CATTAGGCTA ACCCCTTATG AGACATTTCC ACACAATTTC 002720
002721 ATCGTGCCTG TACTTTTCTC TATGGTAAAA GCCAGTGTTT ACACTTTGTA GGGATCAGGG TGTATTTGTT GAATTAAACA 002800
002801 AAATATTTTC AATGATGGCA AGTCTCTTGA CTTTTGAAAG CAAGTCAGAT TCCTTATAGC TAATGCTGGT GAGAAATGTT 002880
002881 AAATTGGAGA GATCCCTTTT GGGAGTGAAA CCAAATTGTA ACTATGAGGA GAAGATGGTC TTCTCATTGG CTCTTGATGT 002960
002961 AGCTCTGAAG GGAGTTCCAG AAGAGGAGCT CTCACAGAAT GTTGAGCCTG TGGGCCCAAG ACATTGACTT CGAAGGGTAG 003040
003041 TTCTCATTAG GATGTATAAG TAGTGGCTTG AGGCACCTTC TTATCATTTT TTGCATGTTA TTCTGATTAT TAAACTTCCC 003120
003121 CCAATGTCAT ATTCCATGAT GAGGGATTTC TGAACTCCAT AGTCCAGCGT TGTTGCTTTT CTCTCTTCTT TGCTACTGAA 003200
003201 AATTGCCAGC AGTACCGCCA TCAGCACACC AAATCTACCC CCACTTATGT TTGTTCTGCC CCATTTCCCA GGAGCAGCTT 003280
003281 CTAGCACATA TGTAGAGTAT CTGGCACCAC CTTAGCCCAG GGCTGCGTGC CTGATCAGTG GGGATTCTGT TCCCCCACCC 003360
003361 CCCAGACTGC AAGAGCTTCT TAAGAAGGAG CCCATATTCC CATTTGTAGC TGGAAAGCGG GTGAATGACA TGACATGGGG 003440
003441 CGCCTAGGAA AGATGATTAT TAGAGGAGTG CAGCGGAAAA AAATTTGCAC TCTTCTCCTT TTGGTTATTA CTTTCCAAAT 003520
003521 ATATTAACAA AAAGTTGATG CTTTTAACTT TATATTTTCA GAAAAGTGTT TTTAATTAAA AATATGTGAT AGGGACCAAA 003600
003601 TAAGTAAAGT ACATTTTTCT CCACTAAATT TGAAGTGAGG GAAAGAGGAG CCAAAGTAAG AGATTTTTTT TTAAGGAAAC 003680
003681 TTAATCTGAT TGTGAAAATC ATACATATGG AGAAACATCA GATCAGGCAA TAGAGTCAGA GGGTCATGAG CAATAGACGA 003760
003761 TGATGCGAGG CATTTGGGGA GCTTCCTGGA GGAAAATTAA GTTTTTTTCC TAGCAAACTA CCATGTCCTA CAAGAACTTG 003840
003841 GTTATATAAT GGTGCGTCTC TGAATCACTG ATTAAAACCA GTTGCTTCTG ATTTTAGTCA CAGGTTTTAC AAGTATTCAG 003920
003921 CTCTCCCTCA TGTTTCATTT CTTTTTTTAA GATAATCTAT CAACCTTTTT TAAATTTTAA AATTTTTAGA TGTAGAGTTT 004000
004001 ATAAGTAAAA TATATTTTTA GCCATTGTTC TGTTAGCTGA GCTGATGTGT TTGGTCTTAG AGGGCCTGAC TTCAGATACT 004080
004081 CTTTGTGATC TTGTAAGGGC TCTACACAAA CTTCATTATG TATGGTAAAT TTGTATTCTT ATGGATTGTA TATAGAATGC 004160
004161 TTTCGTTAGA AGTACATTCT ACTTCTGTAT GTCCCTTTGT AATCCGCAGT TGCTTACTCA GGGGTTTCAT AGTCATTTCA 004240
004241 TAAAAAATAA TTCACTAGCT GTCTAATGGT ATTTTAAGAC TGTTTATCTG TATCACAACG TCATTAGGAG TTCTTTCAAC 004320
004321 AATTCCATAA ATATACTGTT TACTAGACCT TCCCTGTAAA TGTTCCCAAT TCCCATCCTG TCTCAGACAG TCAATAGTCC 004400
004401 TGTGTACAGT GACTATTTGC ATGATTTCTC ATTGCACTGC TGCATTCAGG CACTCCAGGG CATGATTAAA CAGTCATTAA 004480
004481 CAGTGCCTCT CTGGTACAGT TGATGCATGC ATTGATCTTT CTTCTCTGCT GTTTTTATAT AGCCTTTAAT TAAAAGGAAA 004560
004561 AAAATACCAC TACTCTGCAA TGCAAAAGTC TTCAAAATTC TTTGTTTCCT GTATTAATCA CTTCTGTTCC AGAGTGAACA 004640
004641 AATGTTTTCA GCTAAGCTAT GTGAGAATGT AAGAATAATA TCCTGCTTGT TCTAAATAGT TCATATATTT AAAGTGTGGT 004720
004721 CAGTATTTCC TCCCTGTACC TTACAAACAG AAACCACCCT GGGATGGTTG ATACCCCTTA CAAAGTCGAT CTTACCCACA 004800
004801 CAGACTCCTG TGTATGCGTG TCTGTTTATA GGTGTATATG GAGTCAGTGT TGATAGGAAG GATGCTCTAG AAGTACTTCT 004880
004881 GTTGTTTCCT AGAAGGCTAT GAGCCAGTTC CATGGCATGT TTAATGTATA ATTCCCATGT ATCATGAGAA TTTCACTAGA 004960
004961 ATGTCATTAA ACAGCCCAAC TACCTCATGT GAAATTGGCT GTGGACAATC TGTGTCAGAT GAGAAATGTG TTCAGATAAA 005040
005041 TTTAATCTGG TTAATAGACT TAACAAATTA ATGTCTACAT AAAGAAGAAA CATGATAGAC CAGATGCCAA AGGCTAAAAT 005120
005121 GTACATAGAT TTCCTTGGAT TAATTTTTAA GTCACTGTTT AATTCCATGC CTAGTATTCT TATGAATGTT TGTGGTTTCA 005200
005201 TAGATTTATG CACTTTGAAT ATCTGTCACG TGCAGTGTTA ATGTTACCTG TTCTTGTCTC TCAGCATTTT GAATGAGCAT 005280
005281 CATAATCAGA GTAGAAGGCA AGTTAAACTA TAAAAGTGTC AAGTGGCTTG TTAACTTCTT AATTTAATGG ACCTTTACTT 005360
005361 AGAATATAAT ATGTTGGAGC CTCTTGGGAC CAACCGATGA GCGACAGTTT CATGTTTAGA TTTGTATTGT TTCTCTGTCC 005440
005441 AAGTCCTTAT TCTCTATCTT GTGGGGAGGG GTGGCAGGGG AGGGTTTTAC TTTTTTTGCA AAAATGTTTG AAAATATCTG 005520
005521 TCAGATTTTA TATTCGTTAG TTATAATAAA CTTATTTTTA AAGTAAAAAA AAAAAAAAAA AAA

Predicted Small Protein

Name NONHSAT119847_smProtein_404:664
Length 87
Molecular weight 9704.4357
Aromaticity 0.0813953488372
Instability index 59.8720930233
Isoelectric point 8.73626708984
Runs 15
Runs residual 0.034796945505
Runs probability 0.031837720073
Amino acid sequence MTPPNCLPFTSMFPMHYLNWTSQVMIPRGIITPLQKQGLEVEGSVHSHIVFKQRSQIISK
CDLDCLCIKVIWSACSNSRFPGLILA
Secondary structure LLLLLLLLLLLLLLHHHLLLLLEEEELLLEELHHHLLLEEEEEEEEEEEEEELLLEEEEL
LLLLLEEEEEEEEELLLLLLLLLLLL
PRMN -
PiMo -
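
Some of the values in this table can be cross-checked from the amino acid sequence itself. The following is a minimal sketch, assuming that aromaticity is the fraction of F, W and Y residues and that the 404:664 suffix in the name is an inclusive nucleotide range on the transcript. Neither definition is stated on the page, and the function names are illustrative.

```python
# Amino acid sequence copied from the table above (two wrapped lines joined).
PROTEIN = (
    "MTPPNCLPFTSMFPMHYLNWTSQVMIPRGIITPLQKQGLEVEGSVHSHIVFKQRSQIISK"
    "CDLDCLCIKVIWSACSNSRFPGLILA"
)

def aromaticity(seq):
    """Fraction of residues that are Phe (F), Trp (W) or Tyr (Y)."""
    aromatic = sum(seq.count(aa) for aa in "FWY")
    return aromatic / len(seq)

def codon_count(name):
    # Assumption: the trailing 'start:end' in the name is an inclusive
    # nucleotide range, so its width is end - start + 1.
    start, end = (int(x) for x in name.rsplit("_", 1)[1].split(":"))
    return (end - start + 1) // 3

residues = len(PROTEIN)
codons = codon_count("NONHSAT119847_smProtein_404:664")
arom = aromaticity(PROTEIN)
```

Under these assumptions the sequence has 86 residues and the range covers 87 codons, which would agree with the listed Length of 87 if the stop codon is counted. The computed aromaticity (7 of 86 residues) agrees with the listed value.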