LINC01152

From LncRNAWiki
Jump to: navigation, search

Annotated Information

Approved Symbol

LINC01152

Approved Name

long intergenic non-protein coding RNA 1152

Previous Symbols

_

Synonyms

TCONS_00025128, CMPD

Chromosome

17q24.3

RefSeq ID

NR_110124

OMIM ID

_

Ensembl ID

ENSG00000256124

pubmed IDs

8789441

Sequence

>gi|571026728|ref|NR_110124.1| Homo sapiens long intergenic non-protein coding RNA 1152 (LINC01152), transcript variant 1, long non-coding RNA

000001 ACAGGTTGCC CCATAGTGAC CCTGGAGACT CCCGCCTGCC CCCAGTGAAG GAATTGGTTA CTCTCCCAGA GCCCTCCCGG 000080
000081 GCATCGCCCC CTTCCTCACT TCCTCCTCTT CCTGCCCCCA GCCTCTACCC CTCCAGCCTC CTGTGGCTTC ACAGGCAAGT 000160
000161 GCCGCATCCC AGGAAGAAGG AGTTGGAGCT CCGTGATGCC ATTCAAAAAC AGATGAGATG GGGACAGATT TAACCCAAAG 000240
000241 GACATAGTTC AACTCCTGGT CTTGACAGAG GAGAAGCCCT CAAAGATGGA GTCTCACTCT GTTGCCAGGC TAGAATACAA 000320
000321 TGGCACGAAC TTGGCTCACT GCAACCTCCG CCTCCTGGGT TCAAGAGATT CTCCTGCCTC AGCGTCCTGA GTTGCTGGGA 000400
000401 CTACGGGCAC ACACCATCAT GCCCAGCTAA TTTTTGTATT TTTAGTAGAG ACAGGGTTTC ACCATGTTGA CCAGGATGGT 000480
000481 CTCGATCTCT TGACCTCGTG ATCTGCCCAC GTCGGCCTCC CAAAGTCCTG GGATTACAGG CATGAGCCAG TGCGCCTAGC 000560
000561 AAACGACAGC TTTTTGAGCC TCTGTTTAAC AATCTCTTAA ACCTTATAGG TAGGCAGTTC TGAGACAGCT CCTAATCATC 000640
000641 TCCCCTCCTG GTATTCAAGC CCTGATGGAA TACTCTCTTC TTCAATGTGA GCTGGACTTA GTGGTTTTCT TCTAATCCAT 000720
000721 TGAATACACA CAAGTGATGA GCTGTCATTT CCAAAATTAG TTCACACAAA GCTGTAACTT TTGTTTTGTT CACAGCCTCT 000800
000801 CTCAATTGCC TTCTCAGCTT GTATGCTTTG AAGAAGCAAG CTGCAGTGTT GGAGAGGCCC ATGTGGCAAG GAAATGAGGG 000880
000881 TGGCCTCCAG TCAATGGCTG GTGAGAAACT GAGACCTTCC TTCCAACAAT TGTCAAGTAA CTGAATACTC CCAACAATCA 000960
000961 TATGGGTTAG GTTAGAAGTG AACCTTTCTC TAGTTGAGGT TTCAGATAAG ACCACAGATA TTATACTAAC ACCTTGATTG 001040
001041 CAGGCTTATG AGAAACCTTG AAATGAGAAG ACTAAGTAAA ACCATATCTG GATTCCTGAC TTGCAGAAAC TGTGTGATGA 001120
001121 TAAATGTGTG TTATCTTAAG CCACAAAGTT TTTGGGGTGA TTTGTTCTAC AGCAATTGAT AATGAATACT TCAACCTATA 001200
001201 GTGACAACTG GATGTCTATT TACTCATTTT CCCCACCAAA GTGTAAAGTT CCTACAAGGC AGAGGTTACA ACATATTCAT 001280
001281 CCTGCTACCT CTCACAGCCC ATAATATACA ACCTGACCCA TAAAACATGA TTAATCAATG TTTGTTAAAT TAACATGTAT 001360
001361 TTTGAGTTTC ATTGAAATCA GTTAGTTGAA GTGATAATAA CATTTAAATG TGCATGAGCA GTAGGACAAG GCAGATAGCT 001440
001441 GGTTAGGATA GAAAGTTTTT GAAAATTGTG AGGGGAAAAA TTATTTCTAT TTGTACCCTG TTGTACCCAG CTTAGGTTTC 001520
001521 AAAAACTTTC AGGGGGTTTT TATTCATTTA CCAAAGACTT CAAAGTTGGT GCCTCCCCAC CACTCTAAAG AGGTCTTTGC 001600
001601 CCTTATTGAC CCAACCATAC TGATTTGCCT GGTGCACAGT GGGAGCTATT ATAAGGAGAG GGGAGAAGAA GGAAATAGAA 001680
001681 ACAACTGTGA AAAAGGTTGA AAAAAATAGC GAAAAGGATA GAAAAAGAGC AGAGGAAAGA GAAGAAGTAG TCAACATCCA 001760
001761 ACCCAGATGC AGAGAAATGA TTCAAAAGAA AAAGTCTGAT TTTCCTGTCG GAAATATTTT TAAAAGGAGG TAGGCATTTA 001840
001841 TCAAAAAGGA AAAAAAAAAA AAAAGAATAA CAGAGTTTGT CTTCCAAAGG ACCCAATACT CTTGGAATTC AATGACCTAA 001920
001921 TTCTGTGTAT GGTTGCAGTG TGGAGATTGC CTCTTGTCTG ATTCTCATGG CATTGAGACC TCTGATTTCT CAACAGCACA 002000
002001 TGCCACAGCA CTATCAGAAG TCAGGAATAA AGAACAGAGG GTGTGGGGGG CAGTCTAGAT ATTATGCTCA GCAAACATAT 002080
002081 TTTTAAATCA GTGTAAATTC TATCTTGCTG GTTGCTTATC TTAGCAGTTA AATCAACTTT TCGTTACTTT TGTAGACATT 002160
002161 ATATAATATC TTCTCCCCGC AGCTCCAGCC ACCAGCTGTT AGTCAGCAGA ATGGGAGAGG GACTGGGTCA GCTTAAGCTG 002240
002241 AATAACTTCC ATGGTGACAT TTGAGGGAAC CAATTCCCAG CCACTGAACA AACCTAAATG TTCTGCTATG GCCATAGAAC 002320
002321 CAAATAGATC TTAACAAAGA AACCTGAGAG CATTCAAAGG TCAAAACAAT TGGAGCTAGT TCCTTCAAGC TTAAATATGC 002400
002401 TCATCTGTAA AGTGGGGACA ATAACTGTGT GTCTTATAGG GTTATTTTGA AGGCTAAATA TGTGTAAAAG CCTCAGTCTA 002480
002481 CTGAAAGGTT GCTCAACTGT AACACTCTTT TCATTTTGTG AGGGATCATT CTTTGTTGGA GGTTAGGGAC ACTGTCTCGT 002560
002561 GCATTACAGG ATGTTTAGCA ACATCCTCAG CTTCTATCCA TCAGATGTTA GCACCCCCCC CACGGACCAC CATCTTCTCC 002640
002641 ATCATCACCA TCAAATATGT CTCCAGACAT TTCCAAGTGT CCCCTGGGAG GCTAAAACTG CCCCTGGTTG AGAACCACTG 002720
002721 TTCTAGTACT TGGCATTGTT CCATAAACAT CAGATGATAT TTTTATCATT ATAATTATTA CTAAAGGCAA CATCACCCTG 002800
002801 CTATGGAGAC AGGTCCTTGG TCTCACGGAA TCCCTGCAAT AGTTGCTGCA TTTCTGCATC TTTCTGAGAG TAAGGGTTGA 002880
002881 AAAGAGATAA GGTCAATTCA TCTGACAACA ATCCAATGGG AAACCTACTG TCTCCGTAGA AATTCTTTTC ATGGAAAGAA 002960
002961 AAGGGGCTAC CATTTTGATC TGCTGCAGGA ACTAGGTCTA TTCCTCAGCC TCACAGTTTG CAAAACGCTA CAGACTTTTC 003040
003041 CATGGGCATA AATTCTCCAG CTTTAGGGAT TGCTGGTATA GTTGACTCTT GAAGCTGCTC TTGGATTTCC AAGGGTAGCA 003120
003121 CCATTTTTAC CTGAGAAAAG AAGGAAAAAG ATATTCAAGA AGGAGAACTT TTCAACAGAA GTTACTCCAT TTGCTGTTTT 003200
003201 TGTGGTTTTG TGCTTGTTTA GGTTTGCAAG TGTGTTTTCT GCCCAAATTG TTCTTGCATA TTCTAACAGT GTTTGCTTTA 003280
003281 CGTGTTTTGA TGCCACGCTG TTTTGAAGGT ACAAATGTTT TTCATGGCAT GTCTTGTGTT AAGTGTACAA ACTGAGACGT 003360
003361 GAATGTCAGT GACTGAATAG AGTACAGTAT TCCTCAAAGT TTGTAACTAA ATAAACCCTA AGAGAAAAGT GAAGCGAGAT 003440
003441 TTCTGGAGGT CTGGAAGTGG CCAGCTGACA ATATACATCT CTGTATCTCC TATTTTTATC TTTATGCTAA ATGCAAATTT 003520
003521 TACTTTCTGT GCTAAAATAA TGTTTCATTA GTTTCAAGGT AAAAATGTGT AAATAGAC

Predicted Small Protein

Name LINC01152_smProtein_2570:2710
Length 46
Molecular weight 5222.8246
Aromaticity 0.108695652174
Instability index 71.7436956522
Isoelectric point 6.24249267578
Runs 7
Runs residual 0.00918737060041
Runs probability 0.0291089408737
Amino acid sequence MFSNILSFYPSDVSTPPTDHHLLHHHHQICLQTFPSVPWEAKTAPG
Secondary structure LLLLEELLLLLLLLLLLLLLHHHLLLLLEEEELLLLLLLLLLLLLL
PRMN -
PiMo -