NONHSAT121038

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT121038

Source

NONCODE4.0

Same with

,

Classification

intronic(S)

Length

2760 nt

Genomic location

chr7+:64141511..64147263

Exon number

3

Exons

64141511..64141600,64144465..64144583,64144713..64147263

Genome context

Sequence
000001 CTCGGCTTGG CGGCTGAGGA ATGACACTGC CCGAACACCT CGGAGGCCCC ATGGACCATC ATGGACCTCG GGTAACTCTT 000080
000081 AAGGTGGAGG GAAAGGTTGA AAGAGCCAAC GGCCTCTTAA AAACTCACCC TCCAACTCCG AAAGGACTGG ACTGTTCTCC 000160
000161 TACCACTTGT GCTTCTCAGG ATCTGAGCCA CCGCCCATGA ACCTACCAGA TACCCAGCCA TATGAGGACA CTCTGGCCAG 000240
000241 ACAGCCCATC CTGATAAAGA GCTTAACCCC CTGATCTCTA CAACCTCAGT GGACAGGACC CTTCCTGGTC GTTTATGGTA 000320
000321 CCCCAATGGC TGTCTGCTTA CAAGATCCTC CTCAGTGGAT TCACCCTTCC AGAGTAAAGC TGTGTCCATC TGACACTCAG 000400
000401 CCTGACCCTT CTTCCTCCTC TTGGAAGTCG CAAGTACTCT CCCCGACCTC GTTAAAACTC ACCGAAATCC CTGAAGAAAC 000480
000481 TTAAATGTCC TGCTCCTGTC CGCCCTGCTT CTTCACCCTC TTCCTCCACT CTATTTGCCA AGACATCTCC TGGTTTCATC 000560
000561 CCCAAACTCC CACCTTAGAT TCTCTCTTAA ACTGGATAGA TGATCTCATC TTTTAGGGCA CTCTGTATAA CTTCTTCCCA 000640
000641 GATGAGACGC CTCTGTTTAC CTTCCTACTC ACTCTTTATC TATCCCTCCT GCTCCTTTGG CTACCTGGCA TGGCCGCACT 000720
000721 CCCACTTCCA GTAATGCCTA ATTACCTCTA CAAAACTCTC AACCCACTCT CTGTTAAACC AGTCCAACCC TTCTTTGGCA 000800
000801 AAGGACGCTG GCTTTGCATT TCTCTATCAG CTACCACTTA CGTTGCCACT CTCATTCCCA CAAAAAACTG GGTAGTTACT 000880
000881 AGCTTAACCT ACCACCCTCG TAATGAAGGA AAAAGCCCCT TCTAACTCCT ACATATGCAG TCATTGGCCA ACTTCTCCAT 000960
000961 CAATGAAATG ACCGAAAATA CCCTGACAGG TCGTGCAGTT CAACTTTTAC GCTCCTACAT GTCCAGCCTC ACCCATTACA 001040
001041 CAAGTAATGA AAAGCCCATA CACGGCCCTG TGACTACAAA CGCTGTCTTA ACTTTCCAAG CCCCTTTATG CATCCAACAC 001120
001121 AACCTGTCAT CAGGCCTACC TCTAGGTCAC CTACTGTTCC ATCAGTGCAA CTACGCCTTG CAGCTTCAAG CCCCAACTGA 001200
001201 CTGTATTAAC TTCCAGGTCT CCCAAACAGC TACATTCAAA CAGCCTGTCC GCTTCTCAAA GCCCCCAGAA GTCATCAGCA 001280
001281 CCTCTCTGCT TAACAAACAA TCCGGGTTTT GTAATGGCAG GCATACGCAC TGCATGACCA TTCACCCCTG GACCTCCTGC 001360
001361 AGCAGCACTC CCACCACTGA CGAATGCCTC CTTATCCCCT CTTTCGATTA CCCTTCTGAG TGGCTCCTAG TAGATACAAA 001440
001441 ATGATTTTTC CTCCAATGGG AAAATAAAAC ACAGGGAGCC ACTCAGCTTA TCCCAAACAT CCCTTTTCAG CCGCTCACTG 001520
001521 GGGCCGCCTT GGCAAGTACC CTAGGGGTGT GGGAAAATGA AAACAAATTC ACACACCTTT TTAACATACA CAGCCAGTTC 001600
001601 TGCCTACCCA GCCAAGGCAT ATTCTTCTTG TGTGGAACTT CAACCTATGT CTGCCTCACC ACTAACTGGA CAAGCACCTG 001680
001681 TACCCTAATC TTCCTAAGCC CCAAAATTGA CATTGCCCCT GGAAACCAAA CCTTACCAGT CCCTGTCAGA GCCCAAGTCC 001760
001761 ATCAGCATAG GGCTGTGCAG ATAATACCCT TGCTTATAGG ACTAGGAGTT ACCAATGCCA CAGGAACTGG AATAGCAGGT 001840
001841 TTGTCCCCTT CCCTGTCCTA CTATCATACA CTCTCAAAGG ATCTCTCAGA CAGCCTACAA GACATAGCAA AATCCACCCT 001920
001921 TACTCTCCAA TCCCAAATAG ACTCCTTAGC AGCAGTAACT CTGCAGAACC ACCGTGGCTT AGACCTCCTA ACCGTCGAAA 002000
002001 AAGGTGGGCT ATGCACCTTT TTAGGGGAAG ATTGTTTTTC CACCAACCAG TCTGGACTAG TACGAGATGC TGCCTCGCAG 002080
002081 TTAAATGAAA AGGCTTCTAA AATCAGACAA TGTCTTTCAG ACTTTTAACA CCAATCTTTG GAGCTGGGCG TTGTGGCTTC 002160
002161 TCCCTATAGC TGGGCCCCTC ATTTCCATCA TTCTCCTTCT CCTATTTGGA CCCTGCCTCT TCCGTCTAGC CTCTCAGTTC 002240
002241 CTACAAAATC TCATTCAAGC TATTACCAAT CAATCTATGC GACAAATGCT ACTCCTAACT GCCCCTCAAT GTCACCCCCA 002320
002321 CCCCAAGATC TCTCTCCCAA TTAGGAGTCC ATGCCGCCCC AAGTCCCGCT CGAAGAATCC CTGAGAAACA TCACCCCGTC 002400
002401 CACTTCTTTT CTTATTAAAA CAAAAAGACA GGAATGTCAT ATCCCACAAA TATGGCTGCA TTATTGCCAG CAGGCCATTA 002480
002481 TTGACGGTGG GCCATTAAGA ACTCTGTGAC CAGCACATAT GCCTCCAGAC TTCCTGGAAC CAGAAAACCT GCAACAACCA 002560
002561 AAAACCACAA AAGAAGAACA ACGGGTTTCT GTTCCAACTG GTTGATCAAC TTCTTGAGTC AACAAGTCCC AAAACCATGT 002640
002641 TGTAATTTTC CACCCGCCAC TTTGCCCTGT AAAGACTGCT CTCCCTCACA CCTCCGGGCT GACTCCCTTT TCCGATTCAG 002720
002721 CCCACTAACA CCCAAGTGAA TAAACGGCCT TGTTGCTCAC
[back to top]

Predicted Small Protein

Name NONHSAT121038_smProtein_2084:2383
Length 100
Molecular weight 11347.534
Aromaticity 0.0808080808081
Instability index 51.6929292929
Isoelectric point 11.399597168
Runs 9
Runs residual 0.0427807486631
Runs probability 0.0456731241044
Amino acid sequence MKRLLKSDNVFQTFNTNLWSWALWLLPIAGPLISIILLLLFGPCLFRLASQFLQNLIQAI
TNQSMRQMLLLTAPQCHPHPKISLPIRSPCRPKSRSKNP
Secondary structure LLLLLLLLLEEELLLHHHHHHHHHHHHHLLLHHHHHHHHHLLHHHHHHHHHHHHHHHHHH
HHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLLLL
PRMN LLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLL
LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL
PiMo iiiiiiiiiiiiiiiiiiiiiiTTTTTTTTTTTTTTTTTTTTTTTTTooooooooooooo
ooooooooooooooooooooooooooooooooooooooo