LINC01108

From LncRNAWiki
Jump to: navigation, search

Annotated Information

Approved Symbol

LINC01108

Approved Name

long intergenic non-protein coding RNA 1108

Previous Symbols

_

Synonyms

LncRNA-ES1

Chromosome

6p23

RefSeq ID

NR_108097

OMIM ID

_

Ensembl ID

ENSG00000226673

pubmed IDs

22193719

Sequence

>gi|564112240|ref|NR_108097.1| Homo sapiens long intergenic non-protein coding RNA 1108 (LINC01108), transcript variant 2, long non-coding RNA

000001 GGAGTTGTAT ATTTCCCTAA AGGGTCAAAA ATAGAACATT TATTAAACGA CGAGTTATTC AAAAACCGGC ACAGGCCCCA 000080
000081 GTTGTCTTTC AAAAGCTCTG ATTTACTGCA TATTTCTCCC TTGAGAAAGG CCTTTTTTAA ATGTTCATTT CCATTTTAAG 000160
000161 AGACTGTGGG CTTTGATCTG TTGGGAGCAT TCTTTTTCTT TCCCTGACTG GGGCCTGTCG GCTACTCAGG CCTGTGTTTT 000240
000241 TCCATGGCCT TCCCCAGCAT TCACACGGCA TCTCCTGCAT CTGCGTTCAC CCTGGCTAAT TTGGAGGTCA GAACTGGTGA 000320
000321 GTCCCTGTAG CAGGTGACAC CAAGAACCAA TGAGCCATAT TTGCTCTTCA CACCTCAACA AAGATGCTGC TTCCAGGAGA 000400
000401 TCTGCAAGAA CTCTGAACTC ATAGATTATG TCCAAACTGA GAAGCACTGG ACAGTCATTT TGCTTATCTG CGGTTTCTCA 000480
000481 TTTCTGAGAA GAATCTCCAT ATGGACATTA TCACTTTTGT TATAATCTCG AGGATGGTTA AGGTAGCTTT GATCGCCACT 000560
000561 TTCATATCTC CTGTGATAAT CAGCACCAAT GGAATGTGCA GAAATCCATC CCTATTCCTA AGGACTCCTG ATGTCATCTC 000640
000641 ACCAGGTGGG ACGGTTTGTC CTGCTCGAGT CTGCACGCCC AGCTCCAACA GCTTCGCCCA AGTTTGGGGA CAGGCTTCAC 000720
000721 CCAACACCCA GAATCTGTGC AGTTATTTGG CCCCTCATTG AAAAGCCTCT CACGGTTACA GACAAAATTA ACTTGGTCAT 000800
000801 CCCTGCAGCG CCGGAGGGAT ACTTCTGGAC TCTCTGTTTG GGGAGTGTAC GCTCAAACCG TTTCAAGAAG AGATTTAAAC 000880
000881 CTCTTCTTGG AGAAGACTGA GGCTGGCACA AGGCTAACAA ATCAAATAAC TGCACCGAGG CAAAGCTGCC GTCTCTCCCG 000960
000961 GGCTGTACCC TGCCTGCTTC TCCCATCCCT GCCTTCTGGC CGATTTGTCT TTAAGGTTTT AGCATTATCC CTCTGTTGAC 001040
001041 AAATGAATGC TTTTCTGGAA GATGAGGTAT TTCCAGAAGC AGGCAGACAC TGGCTGGAGC GCCGTAAGAG CTCTGCCTCC 001120
001121 CTCGATTCGT TATGTGAACG GGAGAGGCCT GGAGGTTTTG CGCGAGCATT ACGTTCAAGG TGATGATAAA TGGAGTCTGG 001200
001201 CCGGGGTGGT AATAAGGGAA TTTCTAGCCA GATCCTGTCT AGACCACGGT TTGTTATGGA GGGAAAGCCA CAAAAGGGGG 001280
001281 CTATAGTACC CCGTTCCTCA GAGAAATTGT CCTCAGGGAG AGCCAAGCAC ATTTCCAGTC CGAACCTTCG TCCGGCCCCT 001360
001361 CACTGGGGCC GCTGTAAATA ACATTAGCCT GTCTTTCAGC AAGCTCTGGG AAGCTTATAA ACCGAACACG GTTTTTGTTG 001440
001441 TCTGTAGTTG TGCATTAAGC TCAGTACTGT CAAGTCTAAT TATTCTCATT AAGGCCTGAA AGTTTTAGGG ATTAATTAAG 001520
001521 CAGAGAGCCA AAGCTTATGC AGAACAGCAA TATTTTCCCC CTGAAGGATT AGCCCGTCAT TCTGATGGGG TTTTCCTTGA 001600
001601 GCGTTGCCCA TTACTTTATA AGATAGCAAA GTGTGGAGCT CATATAATAT GGATTAGAAA TCGCCTACAT TTCACATGTG 001680
001681 CTATTCAGCC ATGAAAATAA GTCCTGGCCC TTGTGCCTGT AACCCGGAAG GCTCACTAAT GAGAACCAGG CAGTGGGGCT 001760
001761 AATGAGAAAT TCAATACGTG CCGTCTCTCC TTCCCGTTTC CTTTCAATTT ACTTTCTACA GGAGCCAATA TGTGTAGGAA 001840
001841 AAGTCTGAGT GGATGCTGGT ATTCTCCTCT TTACCTGGTA GGGATCTTAG AAATTATCTG ATTCTAGTCC TTCTTGCTAT 001920
001921 AGAAGTGAAA ACTGGGATTC AGCAACCTGC TTAAAGGGAC CCAGTTAATG GCAAAGCTAC TTGAAGAGAA GGTGACAGGA 002000
002001 AACTTCAGGA GGTGTAGACA ATCATTAGGT ACATTAGGAT GGTGAGGGTA CTTAGTCTTA GGTTTTGGTG TGATGGAAAA 002080
002081 TGCCGCAGAA ATGAGAAGGT CCCACAATGA AAGGAGGATG CTGGTGAGGA CCTAATTTGC AGACGAGCCC CAGGAACTCT 002160
002161 GGTGCCCTGC TTGGCAAAGG ATTTCAAGGC TTCCTGGAGA AATGGGGGTA GGGGGAGGGA TCGTTAAGCC CTGGCACCCC 002240
002241 AGATATAAAA AGGAAAGTCA GACGTGGGTA ATCACCCTGT CTTCACCCAG CAGCTCCAGT TTTCAGCAAG GCAGAGGGGA 002320
002321 CACTTCTGCC TTCAGTCTCC CCTACTTCAC ATCAAGGCGC CGTTGCATTT GATGGCTCGG GACGCGGCCT TCCTGCCTCC 002400
002401 CACACTCCCA AATCCATGCC TTGTTTATCT TTTTAGAATG GATCTCTGAT GCCTGCCCAC CACGCGGCTG GAGTGGCGCC 002480
002481 AGCATCCCCA GGGCACGTCT GTGATGACCG ATGAGGTCCA CCACCTCTAC CCCAGACCCT CCCATACCCA AAGGCTCCCT 002560
002561 CTGCCCAGTG AAGTCCCCTT CGCCCCAGCA TTCCTGCACC AGGCACATCC CAAGGGTGGC CCCGTGTCCT GCAGGTGGCA 002640
002641 GGAGAGTGCA CCCTCCTCAG ACACCAGTCT GATATGTCAC CAGGAGAGAG CGGGCGGTCC TCAGTGTATG CTCTTTTCTT 002720
002721 GTCTCTGGGG AGCTTGTTTT GGCTCCTCAA GAAGACCTTG GAAAAAGCAC AGGTTGTAGA GGCGGCATTC GCCGGGCAGA 002800
002801 ACCATGTCCC AAGACAACTG GTTAGTGAGG CCAGGCGTGA CAATCATAAT CGGAGCAGCA ACTGTGCAAT GGCATAGGCA 002880
002881 AAGGGGGCCG AAGGCGCAGC AGCCACCACC ATTATAGTCA GACTGGGCAC CGTTTCCTAA ATGTCACACA TCCTTCCAGC 002960
002961 CCCTTGATCT TTGCTTATGA AAAGGGTCAT TCCCTCTACA ATCCTCTAGA ACTCTTTGTA GGTTTTTGTT TGTTTTCCAA 003040
003041 ACACTGTACT TGCTTCTTTA TCTGTCTCTC TATGAACCGT GAGTCCCAGA GGACCTCGTG GTCCTCAATT ATCCTTGCTT 003120
003121 CTCTGTGGCC TGTGCCATGG GGCTGAATGT TTGTAGCCCC CAGAATCCAT ATGTGGAAGC CCTAATCCCC ACTGTGCTGG 003200
003201 TATTAGGAGG TGGGGCCTTT GGGATGTGAT TAGGGTTAGA TGAGGGCATG AGGGTGGGGC CCTCATAATG AGATTAGTGA 003280
003281 CCTTATCAAG AGGATGAGAC ACTAGATCCC TTGCTCCCTC TCTGCCTGCA ATCACCAGTG AAAGGCCAAG TGAGGACATA 003360
003361 ATAGAAGCCA TCCCTTCCCA GAACCCAGCC ATGCTGGCAC CCTGATCTCT AGCTTCTCAG CCTCCAGAAC TGTGAGAAAT 003440
003441 AAATGTCTGT TGTCTACTCC A

Predicted Small Protein

Name LINC01108_smProtein_500:760
Length 86
Molecular weight 9224.7319
Aromaticity 0.0697674418605
Instability index 43.9348837209
Isoelectric point 8.93499755859
Runs 22
Runs residual 0.116192294342
Runs probability 0.0248149218738
Amino acid sequence MDIITFVIISRMVKVALIATFISPVIISTNGMCRNPSLFLRTPDVISPGGTVCPARVCTP
SSNSFAQVWGQASPNTQNLCSYLAPH
Secondary structure LLEEEEEEEELLLHHHEELEELLLEEEELLLLLLLLLLEELLLLEELLLLEEELLEEELL
LLLLLLEEEEELLLLLHHHHHHLLLL
PRMN LLLLLLHHHHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL
LLLLLLLLLLLLLLLLLLLLLLLLLL
PiMo ooooooTTTTTTTTTTTTTTTTTTTTTTiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiii