LINC01252

From LncRNAWiki
Jump to: navigation, search

Annotated Information

Approved Symbol

LINC01252

Approved Name

long intergenic non-protein coding RNA 1252

Previous Symbols

_

Synonyms

_

Chromosome

12p13.2

RefSeq ID

NR_033890

OMIM ID

_

Ensembl ID

ENSG00000247157

pubmed IDs

12477932

Sequence

>gi|299829205|ref|NR_033890.1| Homo sapiens long intergenic non-protein coding RNA 1252 (LINC01252), long non-coding RNA

000001 GGGTTGGGCC AGATAAGAGA ATATAAGCAG GCTGCCCGAG CCAGCAGTGG CAACGCGCTC GGGTCCCCTT CCACACTGTG 000080
000081 GAGGCTTTGT CCTTTCGCTC TTTGCAATAA CTCTTGCTAC TGCTCACTCT TTGGGTTCAC ACTGCCTTTA TGAGCTGTAA 000160
000161 CACTCACCGC GAAGGTCTGC AGCTTCACTC CTGAAGCCAG CGAGACCACG AACCCACCAG AAGGAAGAAA CTCCAAACAC 000240
000241 ATCTGAACAT CAGAAGGAAC AAACTCCGGA CACGCCGCCT TTAAGAACTG TAACACTCAC GCGAGGGTCC GCGGCTTCAT 000320
000321 TCTTGAAGTC GACAGAGTCT TGCTCTGTCG CCCAGGCTGG AGTGATGAGG CGCGATCATC ACTCCACCTG GGCTCACTGC 000400
000401 ACCTCCGCCT CCCGGGTTCA AGCGATTCTC CTGCTTCAGC CTCCTGAGCA GCTGGGATTA CAGAGTCCCT GTCATCCAGA 000480
000481 CTGGAGTGCA GTGGTACAAT CCCGGCTCAC TGCAACCTCC ACCTCCTGGG TTCAAGCGAT TCTCCTGTCT CAGCCTCTCA 000560
000561 AGTACCTGGA ATTACAGGGA ATACAACCAA CCTCTTCATC ACATGAAGAC ATTTTTTTCA TGCTTTTCCA TGTTCCTGAA 000640
000641 ATGCCCTCCC CTAGGAGCTT CCTCCAATTT ATCTACTCTT TCACATTCAA GAAGAGTTGA ATCAAGATGT TTAAGTCTCT 000720
000721 GGAGAGTTAA GTATTTTGTA GGAGGAAGAA TATCAAGGTT GTGGAAAAAA AAAAAAAAAA ACCTTGTTGT GGCTGAGAAA 000800
000801 CTGGGCTGGA GTCAGGAGGC TGGTGATGGC CCCAGTTCTA CCACTAACTA GCCATGTGCC TACGATGCTT CATTTCACCT 000880
000881 CTTGAGATCT AAATTTCCCC GTACGTCAAA GCAAAGAGAC AGATCTCGCT TTGTCGCCTG TCAACGGCTA GAGTGCAGTA 000960
000961 GCGTGATCAT AGCTTACTGC AGCCTCGAAC TCCTGAGCTC AAGCAACCCT CCCTCCTCAG CCTTCCCAGG TCTCCTGATT 001040
001041 TCAAGTACAG TTCTTGCCAA CACACAGTGG TAAAGGGTAT GATTACCTAA GCCTTAGCAC AGAAGGTAGC ACCCCTTACT 001120
001121 GGTGAGCTAT CATGAAACTA TGCGACAATT TTCTCCATCT TTTTTAGAGT TGTTTTCCTT GTCTTCCTCC TCATCACGAT 001200
001201 TATTATTGCT TTTTCATTCA GCAAAAAATC TGGTAGGAGA CTGCACAGAA GCATGATCAC TTGCTATCAA CAGTGTAACC 001280
001281 CTACCTAACT TCCCTAATCC CTAGCAAATG AAGGAAAAAT CATATACAGG CACTGACAAA TAGCAACACA AGAAAGGGCA 001360
001361 ACAAAATTAC TAAAGAGTAA CGCCATCCTC CCCAACTCAA ACCAGGGAAA GTAAGCTTGC CTAGGAACTT AGGCTGAAAC 001440
001441 TCACCACAGT GCCAAAAACC TGTTTTCTTG AAAAGAGGTA GTGCAGATAG TGAATCCACT GAGCCACATA GGGTGGGAGT 001520
001521 TTGCTTTAAG TGCTTTTCTT CTTGCCCACA AACTAGCGTA GGCTGTGACT GGCTCCAGCT GATCCTTATC AAGGCAGCTA 001600
001601 GTTCTGAGGG CTTGGTTAAG AAGCTCTTCG TTTACTTAAG GGAGCAAAGA GAAGACACGT GAATTGGAAA AGGATATCTA 001680
001681 AAATGTCCCC TAAGGCTTCA TAAAGTTGAC ATCTTTCGTC ATGACTTCTA CAGTTCAGGA AACCAGGTAG CAGTAAACAA 001760
001761 TTATGAACTC TACTCACTCG CCCTATCACT TTCACATCAA ACTGGGGGTA CTGTCCTTTG AACAGAAGAC TCATGAGGAA 001840
001841 AGCGCAGATT CCTTCCAGGT GGGAAGAAAG CTTTGTCCCT GCTCCATGTC TGCTGATCTG CAGGAAGCAG AGAGAGCAGG 001920
001921 GCTGTTCGAC TCCGTTTCCC TAATGATGTC CCAGCCTCTT TCAGTTCTGT GAGCAAAACC AAGGCTGGGT TTGTTAGCAC 002000
002001 AGCCTGTGAC AGAGCCTGGG CAGGTCCTTG TATTCTGGCA AAGGCTGAAC CAAAGAGCAG TCTCATTGCA CTATTTACTT 002080
002081 TCTTTTTGTT ATACTCTCTC TGGCTCAGCA CAATCAGTGC AGTCCGATGC TGCAGAAGAC AGACTGTTCC CTCTCTCTTC 002160
002161 CCACAGTGTT TCTCGTCTAC CTTTTGCCCT TATTCAATAG TGGGAAGAAA TTTCTTCTTG ACCTCAGCAA TCCTTTCTGA 002240
002241 CAATTACCAT ATGCCTCAAG AGACCTGGGA AGCTTCGCCC TGTCTGGCAC AGGGAGACAA TTCCTATCCA GAATAAAACT 002320
002321 GAAAGCTTTC AAAAAAAAAA AAAAAAGAAG AAGAAGAAAA GAAAGAAAGA AAGAAAGAGA AAAAGAAAGA AAGGAAAGAA 002400
002401 AGAAAGACTG TAATAAGTTA CCTTGTCTCT GGGAGATTCT GTAACAATGA ATTGCACAGG CTAATTATCC CCTCTATTAA 002480
002481 AAAGTTTTTA AAAGGTTCCT CTTCATCATT TAGAGTATGT TGCCTTGGCA CCGGCCTGGA GAGGGTTAGC ACAGGAGCCA 002560
002561 GGCTGAGTTT AATTAAAGAA TTTCCGTGTG TCAGCAGCAG GAGACTTTGA TTAAAGTGAG AAGCAGCCAA GATTCCGGGA 002640
002641 AGCGCGTCTT TCTCCTGTCT CCTCCCCAAG ATGTGCGGGT GTCCTTCCCC CAACACCATG GAAGAAGGGG TCCAGGAAGA 002720
002721 AAATGTCACA CTTGCCTGTG ACCCCCACCT GGTCTGCAGA AAGGCAGAGA AGACCTGCTT ATGGAGGAAA AAGACGGCCT 002800
002801 CTGTCTGGAG CTTCTATGAA TTACATAACG AATAAGTGAT TTCACTACAA AGCAGTGAAA TTTTTAAAAG TTAACCAGTG 002880
002881 GCAGAAGTGT CAGGACAGTC TGCACGGGCA AGAATGATAC CTAAATCACT ACTCAGGCAC AGGAGAGAGC AGGACTGGCA 002960
002961 TGGCTGGCAG TTCTCTATGT AGCAGGCGCT GTACTTTGGC CAAGCCAATC TCCATTTGCA ACATGTTTCT CCCTTACTTG 003040
003041 ACTCCACTAT AGAGAGAGTA AAAGCTAAAT TTTGATGTTC CACCCCCACC CCCTTATTGA TATGAGAAAT AAGTCTTCTG 003120
003121 GTGCATTCCT CTCCAATTTC TGTCTTCTTT CCACTTGAGC TGCCAGGAGA TGCAACAGTC GTTTTGAAAC CAAAAAGATA 003200
003201 AAAACTGCAT ACTAAGAATG CATGAACCTA AAGATTTACA GAGCCTGGGA CACTGGGGAT ACTTGGAGTC AATGTATCAG 003280
003281 CCTTGGACTG ATTCCCATGG GACTTCTTAT TACCCAAGTG AAACAGTCCT TCCTTTCATC TGTTAAGCCA CTATGGTATG 003360
003361 TTTTGCTGGT TCTCGTAGCC AAATACAGTT TTGACTGCTA TACCCATTTT TTTCTTATGG ATGTTCATGA CTTCTTTTAT 003440
003441 ATTGTGGATT GTATTGTGAA TATGTATCTT ATTTGTTGCA AATATTTTCT CAGTCTTTTT GTCCTTTAAT TTAAGATATT 003520
003521 ATTCAATGCT TGTAAGGTTT TTATGTTTAT GCTGTTATAT ATATCCATTT TTTCTGATTT ATTCAATTGA AAAATTGAGA 003600
003601 AACGTAATTG ATAATTTTGA ACATTATCGA AGAAGACATC AATAAAACAA AACATAGTCT AGCAATTACA AAAAAAAAAA 003680
003681 AAAA

Predicted Small Protein

Name LINC01252_smProtein_3221:3508
Length 95
Molecular weight 11214.3323
Aromaticity 0.189473684211
Instability index 10.5453684211
Isoelectric point 6.67535400391
Runs 14
Runs residual 0.0183174471612
Runs probability 0.0398402604285
Amino acid sequence MNLKIYRAWDTGDTWSQCISLGLIPMGLLITQVKQSFLSSVKPLWYVLLVLVAKYSFDCY
THFFLMDVHDFFYIVDCIVNMYLICCKYFLSLFVL
Secondary structure LLEEEEEELLLLLLHHHHHHHLLLHHHHHHHHHHHHHHHLLLHHHHHHHHHHHHHHLLLE
EEEEEELLLLEEEEHHHHHHHHHHHHHHHHHHLLL
PRMN LLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHLLLLHHHHHHHHHHHHHHHHHHLL
LLLLLLLLLLLLHHHHHHHHHHHHHHHHHHLLLLL
PiMo iiiiiiiiiiiiiiiiiiTTTTTTTTTTTTTTTTTTooooTTTTTTTTTTTTTTTTTTii
iiiiiiiiiiiiTTTTTTTTTTTTTTTTTTooooo