LINC01207

From LncRNAWiki

Annotated Information

Approved Symbol: LINC01207
Approved Name: long intergenic non-protein coding RNA 1207
Previous Symbols: _
Synonyms: _
Chromosome: 4q32.3
RefSeq ID: NR_038834
OMIM ID: _
Ensembl ID: ENSG00000248771
PubMed IDs: 26693067

Sequence

>gi|336285464|ref|NR_038834.1| Homo sapiens long intergenic non-protein coding RNA 1207 (LINC01207), long non-coding RNA

000001 GTTTCCTTCT TGATCCTTCA CAAGGGAAGA TTTTCTTTTT TAGAGGTACA GATTCCTCTT AGTCAAGTCC TGATTAAAAC 000080
000081 TCCAGCTAAG ACATTAGTAA GCCTTGGTTA GTGAAGTGGC ATCAGGAAGT GCCTACATTT TCATGGCCTG GTAGCGTTCA 000160
000161 GTGAAAATGT TCATTAACAG ACACAGGCCA TTCAGTCCCG AATCCCAAGA CACTGAAGAC TCTGTTTGAA TCAGACTCAC 000240
000241 GGGTTCCTTC CTAGCCACTC TCAGGGACAG GAATGCTTCT GGTGAAGAAG TTTTCGGTGG TGGTTCATGG AGCTTCCCTA 000320
000321 CACCAACTTG GAAATGGCAT TCATTTTATT GGCTTTTGTT ATCTTTTCCT TATTTACCCT GGCTTCCATC TACACTACTC 000400
000401 CGGATGACAG TAATGAAGAG GAAGAACATG AAAAAAAGGG AAGGGAAAAG AAAAGGAAAA AGTCTGAAAA GAAGAAAAAT 000480
000481 TGCTCAGAGG AAGAGCACAG AATTGAAGCT GTTGAGCTAT GATCTCATAG CCACCGATAT TTCTCGCTAA GAAGACAGAG 000560
000561 GAAGCAATCC ATGGGAACTA CTTATCCACA GTTACACAAG AGGAGGGGAT AATGAAGAAA GTTAAAATCA CTTACTGATT 000640
000641 AAACACGATG ATAATAACCA TTAATGAACT CAATACTCGG GAAAGGCTTC ACATTTCTGG GACTCAGCAT TATCCAAAAT 000720
000721 ATCTATTAAG AGCCATACAC CATTCTAGCT GCAATTGATT ATACAAAAAA AAAAAGACCA AAGTGGTTAC AATAATAAAA 000800
000801 TAGAACACAG AGAAAGAAGA AAACTACATG TGTTACAATT TGGTAAGATA AACAAACAAA CAAAAAATTT AATCACTTTT 000880
000881 TTTGGTCCTG CGACACACAT GATAATTTTT GTCTTAATTC TCCTAACAAA TGATAATGAA AAGCTATAAG TAACTGTGTT 000960
000961 ATTGCTTCCG TATCTGAAAT AGGTGAAAGA GGGAAAAAAC ACTATATATT TTTTCAGCTT TACAGAAGAA ATTTTGAAAG 001040
001041 GTTTACATTC AATGGAAATA TTAGCATTGC CTCAGTAAGC TTTAAAACAC AAATGTCTAC GTTTTCTGAA GCAATATGGT 001120
001121 TTACAGAGAC ATGAGTGTTT GTAAATCTCT CGTCCCAATA CTATGTAAAT CCTTAATGTG TAAGCATCCA GGAAAATTTG 001200
001201 TCATTCTGTG TCCTTTATTC ATCCTAAAAG TTGAAAGTTT TCTGTTATTT ATATTTTTTT TTTTTTAATT TGAGAGAGAG 001280
001281 GCTGGGTGCA GTGGCTCACG CCTGTAATCC CAGCACTTTG GGAGGCCGAG GTGGGCAGAT CACGAGGTCA GGAGATTGAG 001360
001361 ACCATCCTAG CTAACACAGT GAAACCCCGT CTCTACTAAA AATACAAAAA ATTAGCCGAG CGTGGTGGCG GGCGCCTATA 001440
001441 GTCCCATCTT CTCGGGAAGC TGAGGCAGGA GAATGGTGTG AACCCGGGAG GCGGAGGTTG CAATGAGCTG AGATCATGCC 001520
001521 ACTGCACTCC AGCCTGGGCG ACAGAGCAAG ACTCTGTCTC AAAAGAAAAA AAACAAAGAG AGAGAGAGGG TCTCGCTTTG 001600
001601 TCACCGAGGC TGGAGTGCAG TGGTGTAATC ATAGCTCACT GCAACCTTAA GTTCCTGGGC TCAGGTGATC CTCTTGCTTC 001680
001681 AGCCTCCCAA GTAACTGTGA CTACAAGCAT GTGTCATCAC AGCCAGCTAA TTTTTGTAAT TGTTTGTAGA ACCAAGGCCT 001760
001761 CTCTATGTTG CCTAGGCTGG TCTCCAAATC CTGGACTCAA GCGATCCTCT GCCTTAGCCT CCCAAAGCAC TGGGATTACA 001840
001841 CGCATGAGCC ACCATGCCTG GCCTGTTATT TATAACTCTT ACAAACATTA AACCATCACA ATCAATGCTG GCAGCATTAT 001920
001921 CGTGGCAGGA AGGAATCCAC AATACAACCA AAGATACCTA CCTGCATCTC ATAAGAATCA TAGCTCAGGT CTCCATTGCC 002000
002001 ACAGAGTTTC AGTCATGTGA GAGACTCCCT TTATCTGTAC TTACCCCTTA CTGCTCACTT TCAAGAAAGA TTCAATAAAG 002080
002081 ATGGCTTTCT GCCATACAGA GATAGGCATA CTAGTGTAAA CAGAATGCTT ACAAAGTTAA CCAAGTAGTC ACCCAGGTCA 002160
002161 GAGTACTAAA CATTTAAAAG GGCTCACTAA GTTAGAGAGG ACAGAGAATG GGCTTTGGAG ATGAGCAGAT CTGGGTTTGA 002240
002241 GTCTGCTCCA ACCTTTCCTT ATTATATGAC CTTGGGCAAA GTACTTAATT TCTCTGAGCC TTGGTGTTCT CTCTGAAAAA 002320
002321 ATGAGCATTT TAAAAAATCT TTGCTGGGCT GACAAATGGA TGAGAGATAG TATAAGTAAA ATGTGTAACT CAGTGCCTAG 002400
002401 GATCGTTGCT GAATAACGCT ACCTATTATT ATTGGAAAGC ATTTTGGGAC ATTTGGACTT TTAAGTTTAC TGATACGTTT 002480
002481 TTTCAGAAAA GCAGAAAAAT CAACAACATG GTTTCAAATA CTTGGATGAG GCTCCATTCA CTCCCTTAGC CCATCTTACA 002560
002561 TATATTACTG CATTGACTGG AAATGGGAGG CACTTCACAA TTAGGAATAT TGTCAGGTCA GCCAGGCGCC GTGGGCATGC 002640
002641 TTATAGTCCA AGCTACTCAC AAGGCTGAGG CAGGAGGATC GCTTGAGCCT GGGAGTTTGA GGCTGCAGTG AGCTCCAGCC 002720
002721 TGGGTGAAAG AGAAGACCCT ATCTCAAAAA ATAAAAAATT TTAAAAACCT GTCAAGTTAG ATGTTAAAGA GGACATGTAA 002800
002801 AATAAAATCT CCCTAACATG TTTATGAAAT AGCTGCAAAA GGCTGAGCGT GGTGGCTCAC GTCTGAAATC CCAGCACTTT 002880
002881 GTGAAGCCGA GGTGGGTGGA TCACCTGAGG TCAAGAGTTA GAGACCAGCC TGGCCAAGAT GGTGAAACCT CCTCTCTACT 002960
002961 AAAAATACAA AAATTAGCCA GGCGTGGTGG TCCATGCCTG TAATCCTAGC TACTCAGGAG GCTGAGGCAG GAGAATCACT 003040
003041 TGAACCTGGG AGGCGGAGGT TGTAGTGAGC CAAGATTGCG CCATTGCTCT CCAGCCTGGG GAGCAAATGC AAAACTCCAT 003120
003121 TTC

Predicted Small Protein

Name LINC01207_smProtein_1853:2077
Length 74
Molecular weight 8331.7073
Aromaticity 0.0675675675676
Instability index 47.8162162162
Isoelectric point 8.76947021484
Runs 7
Runs residual 0.0480553724456
Runs probability 0.020087314205
Amino acid sequence MPGLLFITLTNIKPSQSMLAALSWQEGIHNTTKDTYLHLIRIIAQVSIATEFQSCERLPL
SVLTPYCSLSRKIQ
Secondary structure LLLEEEEEELLLLLLHHHHHHHHHHHHHHLLLHHHHHHHHHHHHHHHHHHHLLLLLLLLL
LLLLLLLLLLLLLL
PRMN -
PiMo -
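
The predicted small protein can be cross-checked against the transcript sequence. The sketch below is illustrative and is not the pipeline that produced this entry. It copies the three Sequence lines covering positions 1841 to 2080, translates from the start coordinate in the protein name until the first stop codon, and recomputes the aromaticity. It makes two assumptions: the start coordinate 1853 is 0-based (the ATG sits at position 1854 by the 1-based line labels), and aromaticity is the fraction of F, W and Y residues. All function and variable names are hypothetical.

```python
# Three lines copied verbatim from the Sequence block (positions 1841-2080).
WINDOW_LINES = """\
001841 CGCATGAGCC ACCATGCCTG GCCTGTTATT TATAACTCTT ACAAACATTA AACCATCACA ATCAATGCTG GCAGCATTAT 001920
001921 CGTGGCAGGA AGGAATCCAC AATACAACCA AAGATACCTA CCTGCATCTC ATAAGAATCA TAGCTCAGGT CTCCATTGCC 002000
002001 ACAGAGTTTC AGTCATGTGA GAGACTCCCT TTATCTGTAC TTACCCCTTA CTGCTCACTT TCAAGAAAGA TTCAATAAAG 002080
"""

# Amino acid sequence as listed for LINC01207_smProtein_1853:2077.
PROTEIN = ("MPGLLFITLTNIKPSQSMLAALSWQEGIHNTTKDTYLHLIRIIAQVSIATEFQSCERLPL"
           "SVLTPYCSLSRKIQ")

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def parse_numbered_lines(text):
    """Return (first 1-based position, bases) from 'start blocks... end' lines."""
    first = None
    chunks = []
    for line in text.splitlines():
        fields = line.split()
        if not fields:
            continue
        start, end = int(fields[0]), int(fields[-1])
        chunk = "".join(fields[1:-1])
        # The position labels must agree with the number of bases on the line.
        assert len(chunk) == end - start + 1
        if first is None:
            first = start
        chunks.append(chunk)
    return first, "".join(chunks)


def translate_until_stop(dna):
    """Translate codon by codon, stopping before the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_pos, window = parse_numbered_lines(WINDOW_LINES)

# Assumption: the "1853" in the protein name is a 0-based start coordinate.
orf_start_0based = 1853
offset = orf_start_0based - (first_pos - 1)

translated = translate_until_stop(window[offset:])

# Assumption: aromaticity is the relative frequency of F, W and Y.
aromaticity = sum(PROTEIN.count(x) for x in "FWY") / len(PROTEIN)

print("start codon:", window[offset:offset + 3])
print("translation matches listed protein:", translated == PROTEIN)
print("length:", len(PROTEIN))
print("aromaticity:", aromaticity)
```

Under these assumptions the translation reproduces the listed 74-residue sequence, and the recomputed aromaticity (5 aromatic residues out of 74) agrees with the listed value. The end coordinate in the name is not interpreted here; translation simply stops at the first in-frame stop codon.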