LINC00996

From LncRNAWiki
Jump to: navigation, search

Annotated Information

Approved Symbol

LINC00996

Approved Name

long intergenic non-protein coding RNA 996

Previous Symbols

_

Synonyms

_

Chromosome

7q36.1

RefSeq ID

NR_034033

OMIM ID

_

Ensembl ID

ENSG00000242258

pubmed IDs

12477932

Sequence

>gi|300360530|ref|NR_034033.1| Homo sapiens long intergenic non-protein coding RNA 996 (LINC00996), long non-coding RNA

000001 GGAAGAAGTG TATTCAGCAT TCTACCTTCT TCCTGATTCT GTGAGCTTAG ACCTGCTTCC ACTTTCAATC TCTCTCATTG 000080
000081 CAGGCTGAAG AGGACTGGCC TTTCAGTGGC CCTCGTGGAA CGGTAGGAAG CTGAGCTGGA CTCTCTGCCA CATCGTTCGG 000160
000161 TTCCCACAAC AGCCTGATGA AGCAGCAGCA GAAAAAATGG AAGCCATTTT GGTTAACGGG TCACATCTTA GCAGTTGGCA 000240
000241 GCGTAAGAAG CGGAAGGATC CACTGGTCAG AAATGGAGAT TGAGGAGAAG CTGAGAGCAC GGAGGGCTTC AGGGGAGAGC 000320
000321 CTGTCCTTTG GAAAAGCGGG CAATTTGCAC TCGTGGTAGA TGAGTGGGGA GGCCGCTCAG TCTCTCTGCC CCTTTGCACA 000400
000401 TGCTGCACCC GTTTAGTTTG ACGTCGTGGA GAGCTCCTCC AGGCACGGGC TGTGTGAAAG GGTTTAAAAG TTTGGGGGCA 000480
000481 TGGCTGGGGG TCCCCGCATG AGGGGTGCTG TGTAGGCTCC CATCTTTTCT GCCGGTCATG CTGGGAGCCA ACAGCACCGG 000560
000561 CTCCTGACGT GGGGACTTGC CCTGTGCCAT GACGTGGGGA CTCGCCCTGT GCCCTGCGCC CCGGTTCCTG CTGGTTCTGT 000640
000641 GCCTTCTGGA CTCATTTCCT TTCCAGGGTC CGCAACCGCT TCCGACACAA TCACTATTTT GGCTGCGGCC ACGGCTTCAT 000720
000721 TCGCTTCAGC TGCGAGCTTT TCTCAGCGCA GAGAGCACAG GGAAGCTGGG AGGAGTCCGC CTAGATGGAA TCTTCCAGAA 000800
000801 GCAACATGAC TCTTTTAGGC CTTTGCTGCC AGCTAAGAAG GTGAAGAAAA GTCACCACTG CAATAAAAAG AGGGCACTTT 000880
000881 GTCTTACTTG GCAATAAACC ACCACCAGCA GCAGCAGCAA TAGCAGCACC CCCACCCCCA CCAAACCCAA AACAAACAAG 000960
000961 AAAAGCAAAA GGAAACCTTC GTTTTCAGTG AGAGGATTGG CATGAAGAAT CCTTGGGACA CAGGAATCAA TCAAGACCTT 001040
001041 TGGATGCCCC TGCCTGCTCT GTCTCTAGAA GCCTCAAGCC CAGCATCCCC ATGCCGACGT GCTCATCTCT AGCAAAACAC 001120
001121 CAGCCTCTCA GCCCTCTCTC CATAAAGGCC ACTAGAAAGT ACTGTGATGC CAGGCTGCTC CACATCCAGG CAGCCTGGGT 001200
001201 GACCACTGCC CTTTATGAGA ATCAGAAAAA GCTGTCTCCC GAGTAGTTGG AATTACAGGT GCATGCCACC ATGCCTGGCT 001280
001281 AATTTTTGTA TTTTTAGTAG AGATGGAGTT TCACTATGTT GGCCAGGCTG CTCTTGAACT CCTGACCTCA AATGATCCAC 001360
001361 CCGACTCGGC CTCCCAAAGA GCTGGAATTA CAGGAGAAGC TAAACGTTTT AAAATAAGTA TATTATTTCT GAATTACTAT 001440
001441 AACAGACATG GACTCCATTT GAACAAGATA CTTAATTCCA TCAAAAACTT GTCAAAATAT ACAAATTTAG TCACAAGACA 001520
001521 CTTTACTACC ATCTACCAAA GCTGTAGTAA ACACACACAT TTTCCATCTT TGAAATGAAA TAACGTATAA TAATGTTTTC 001600
001601 CAAAATAACA AAGATGTCTG CAGTGAATAG CACTCCCTTT GATGTGGCAA CTGGGTCAAT CGTGTTAACA ACTGTGAATT 001680
001681 CATCTTTTTA AGAAATTCTT TTATTGTTTC TTTTCTTTTC CGCGACAGTC TCACTCGTGT TGCCCCGGCT GGAGTGCAGT 001760
001761 GGCGCTATCT TGGCTCACTG CAGCCTCCGC CTCCCGGGTT CAAGCAATTC TCATGCCTCA GCCTCCTGAG TAGCTGGGAT 001840
001841 TATAGGCACG TGCCACCACA CCCAGCTAAT TTTTTTGTAT TTTTAGTAGA GACGGGGTTT GACCATGTTG GTCAGGGTGG 001920
001921 TCTCGAATTC CTGACCTCAG TTGAGCCGCC GGCCTCGGCC TCCCAGGAAG TGCTGGGATT ACAGGCATGA GCCACCAGGC 002000
002001 CCGGCCTTAT TGTTTCTATT TGTAATAAAT CTGTTCCCCT TGC

Predicted Small Protein

Name LINC00996_smProtein_176:358
Length 60
Molecular weight 6962.9373
Aromaticity 0.1
Instability index 70.4483333333
Isoelectric point 10.6051635742
Runs 11
Runs residual 0.0335992907801
Runs probability 0.0201458142634
Amino acid sequence MKQQQKKWKPFWLTGHILAVGSVRSGRIHWSEMEIEEKLRARRASGESLSFGKAGNLHSW
Secondary structure LLHHHHLLLLEEEELEEEEEEEELLLLEEELHHHHHHHHHHHHHLLLLLLLLLLLLLLLL
PRMN -
PiMo -