NONHSAT118941

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT118941

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

2821 nt

Genomic location

chr7+:5862791..5894066

Exon number

7

Exons

5862791..5863157,5873158..5873199,5879580..5879706,5880321..5880487,5886397..5887232,5888195..5889147,5893738..5894066

Genome context

Sequence
000001 TGCTTTTACG CAGTGTTTTG AGGTGCAGGA CTGCAATGCC CGCACTTCCG GACCCCAGAC TCTTCCTACG GCGACTGACA 000080
000081 CCTCAGACAC GCGTTGTGGG GGGCGGCGCT CTGGCTCCGG GGCTGTGCCC AGCGCGAGCG TGGCCGGCAC CCACTTGCTG 000160
000161 CAGAGCCGGC ACTGCACTGG GGTTCTTGGG GGCGGCTCCT CCAGACGTCT CTGCGGGTTG TCGGGTGCTG CCGAGTTGGG 000240
000241 TTGGGGGACG CTACAGGTAG TGGGTCCGGA GCAGCGGCTG GCCCCTCATC TCACTGCCCT TCCTCGGACC CCACCGTCCT 000320
000321 CTGGAAGCTT GTGCAGGGCA CCTGCCACTG TGACCACCTG GACGCAGATA CCTGCTTTCC CACAACAGCA AGGAAGAACC 000400
000401 ATGGCCCAGG GGTCACTGTC ATTCGGGGAC GTGGCTGTGG GCTTCACCCG GAAAGAGTGG CAGCAGCTGG ACCTGGAGCA 000480
000481 GAGGACCCTG TACCAGGATG TGATGCTGGA GAACTACAGC CACCTGCTCT CTGTGGGCTG AACTGTCTGC TTGCTTGGAA 000560
000561 GGCCCAGCTG GCTCAGCACA AATCCTTTAA CCGTTTCTCC CCAAGAGGGT GTCAAGTCAG CAAACCAGCT GTGATCTCCA 000640
000641 GTTTGGAGCA GGGGAAGGAG CCATGGATGG AGGAGGAAGA GATAAGGACG TGGAGCTTCC CAGAAGAAGT TTGGCAAGTT 000720
000721 GCTACCCAGC CAGATAGCCA ACAGCAACAC GAAGACCAAC ATTTGAGCCA TACGTTTCTA GACAAGAAAG ACTGGACCGG 000800
000801 AAATGAGCTT CATGAATGTA ACGAACTTGG AAAAAAACTC CATCAGAACC CAAACCTCCT TCCATCAAAA CAGCAGGTCC 000880
000881 GCACACGTGA CTTGTGCAGA AAGAGTTTGA TGTGTAACCT GGACTTCACT CCTAACGCCT ACCTGGCGAG GAGGAGATTT 000960
000961 CAGTGCGACG GCCACGGAAA CTTCTTCTCT GTTCGAAACT TGAAACTCCA CCTTCAGGAG CGAATCCACG CGGAGGTCAC 001040
001041 CAGTGTGGAA GTGCTTTAAG CTGTGACGAG GGAGTTCCTG CAGCTCAGGG AGCCAGTAGT GAGAAACCCC ACGAATGCAC 001120
001121 GAAGTGTGGG AAAGCCTTGT GCTGCAGATC GGACCTCAGG GTACATCACG GGGTCCACGC GGGGGAGAAG TCCTCTGCGT 001200
001201 GCAGTGAACG GGGGAGTGGT TTCAGGGAGA AGCTTTGCCC TGACAAACAG GGAACTCACA CAAAGGAGAA ACCCGCTAGA 001280
001281 GACAGCAGAA GTGGTAAAAC GATCTTCCGG AAGACACGCC TGTGTGTCCC GGGCACAGTT CACGCCGGAG CGAAGCCTTA 001360
001361 CAAGTGTTGG GAGTGTGAGA AAACCTCCCA CAAGTCGCGC CTCATCGAGC ACCTTCGCTC CCACACGGGG GAGAAGCCCT 001440
001441 GCGGCTGCAG GGAATGCGGA AAGGCCTTTT TCCAGAAGTC ACACCTCATC CTGCGTCAGA GGACTCACAC GGGGGAGAAG 001520
001521 CCCTGCGACT GCGCGGAGTG CGTGAGCCAC TTTAGCCAGG ATATACTTAA TTATAAAGAC AACTGTGGAT ATGAAGTTTT 001600
001601 TTTTTCTTTT GAGATGAAGT TTTACTCTTG TTGCCCAGGC TATGGGAAGA AATGAAATCA TTACGACTAA CAGATCTAAA 001680
001681 AAGCGTGAGA ACATCCCAGA GGGACGCAGG GGTGTGTTCC AAGTGTGCTA CAGACAGATA ATTTTTATTT TTGTGTTGCA 001760
001761 CTGGTGAACA GAACATAATT GTGAATCTGT AGACATACAT ACCATAATAT ATAGAAGATC TATGGATACT TTCTATGCTA 001840
001841 AATTATACCC TTCTGTGTTT GTGATATAGA ACTGGAATAA GATGAATATA TTATTAAAGC AGAGAGGCCC ACTGCAGCAG 001920
001921 CACATACCAT TTCTTCTGTT TGTCATCACG TATTACAGAG CCCCTGAACT TGGAAGCTAA CAATTTCATG ACATCTGATA 002000
002001 ACTCACAGTC AACTAGTTTA TAAGTTATAT TTTTGGCTGG GCTCGGTCGC TCACACCTGT AAACCCAACA CTTTGGGAGG 002080
002081 CTGAGGCAGG CAGATCACAT GAGGTCAGGA GTTTGAGACC AGCCTGGCCA ACACGGTGAA ACCCCGTCAC TGAAAAATAT 002160
002161 AAAAATTAGC TGAGCATGCT GGTGATTTTC TGTAATCCCA GTAATCCCAG CTACTTGGGA GGCTCAGGCA CGAGAATTGC 002240
002241 TTGAGCCCAG CAGGCAGAGG TTGCAGTGAG CTGAGATTGT GCCACTTCAT TCCAGCCTGG GCGACAGAGC AAGACTCTGT 002320
002321 CTCAAAACAA GTTATATTTC TATATTAGGA CTCCATTTTC TGGCAATACG GTGGACCGTC TAAAGTGAAA ATCACCGGGG 002400
002401 CAGTGGCTCG TGCCTGTGAT CTCAGCACTT GGGAAAGTCA AGGTGGACAG GTCACTTGAC CCCAGGAGTT CAAGACCAGC 002480
002481 CTGGGCAACA CGAAAGCTGA CTTCATGAGA CCGAGGATCT CGTTCGTTGA TCGATGCTAC ATATTCAGCA TGTTAGATAA 002560
002561 TGAGGTTGGT GAGATTGATG AAATCCCGCC CACCAGGGCC GACCCACAAA GAGCCACTTT ATGAAGAACA TGCTCCTGAA 002640
002641 CTCCACTCCA CAACCAGCTT TTGAAAAGAA AAATCTGAAA AAATTTGACC TCAGTTAAAC TGGTCCAAAT GCCTGAGCAT 002720
002721 AACCAAAATC TTTCCTCATT CACTCTTCAC GTGGACCGTG TTTAATTTGG TTGCTGGTGA TCATATTTGT TTTAGTAAAG 002800
002801 GCTGTCACTC TATCTTGGGC G
[back to top]

Predicted Small Protein

Name NONHSAT118941_smProtein_35:259
Length 75
Molecular weight 7668.9795
Aromaticity 0.0675675675676
Instability index 49.1364864865
Isoelectric point 11.1419067383
Runs 11
Runs residual 0.00599868160844
Runs probability 0.0459122223828
Amino acid sequence MPALPDPRLFLRRLTPQTRVVGGGALAPGLCPARAWPAPTCCRAGTALGFLGAAPPDVSA
GCRVLPSWVGGRYR
Secondary structure LLLLLLHHHHHHHLLLLLEEELLLLLLLLLLLLLLLLLLLLLLLHHHHHHLLLLLLLLLL
LLEEELLLLLLEEL
PRMN -
PiMo -