LINC00342

From LncRNAWiki
Jump to: navigation, search

Annotated Information

Approved Symbol

LINC00342

Approved Name

long intergenic non-protein coding RNA 342

Previous Symbols

NCRNA00342

Synonyms

_

Chromosome

2q11.1

RefSeq ID

NR_103734

OMIM ID

_

Ensembl ID

ENSG00000232931

pubmed IDs

27299310

Sequence

>gi|514052684|ref|NR_103734.1| Homo sapiens long intergenic non-protein coding RNA 342 (LINC00342), long non-coding RNA

000001 GGGGCAAGAT AGCCAACTAG AAGCAGCTGC AATTCGAGGC TCCCACTGAG AAGAACTAAA ACAGTGTGCA AATCCTGCAC 000080
000081 CAGCAACTGA CATATCCAGG TTCTATGATC AGGACTGACT AGGTAGTTGG CATGGCCCAT AGAGAACAAG GAAAGATGGG 000160
000161 CTGGTGGATT GGCCCACCTG GGAGCCACAT GGGGCAAGGG GAGCCCTCAC CCTCAGCCAG CCAAGGGAGG CAGTGAGTGA 000240
000241 GCATGCTACC CAGCCTGGGA AACTGCTTTT TCCATGGATC TTTGCAATCC ACAGATCAGA AGATCCCACT CATGAGACCA 000320
000321 CACCACCAGG GCCTTGGGTG CCAACTACAG AGCCATGCAG ATTCTCAACA GCCACTCAGC TGGAGTCTGC CTAAAAAGTT 000400
000401 GGGGAGGGTT GTCATCATCA CTGTGGCTGC CTGCTGCCTA AACCCTCTGA GTTCCCTGGG GGAGGGGGAG CAGTCATCAC 000480
000481 TGTGGTTGCT GGCTGCCTAA GACAACTGAG CTTCCCAAGA GAGGGGCAGT CATCATCACT GCAGCTGCCT GCTGCCTGAG 000560
000561 GAAACTGAGC TCCCCAAGAA GGGACAGCAG CCATCACTGT GGCTGCTAGC TGCCTAAGAC ACTGAACTCC TGGGGAGGAA 000640
000641 GGGCGGCAGC CATTTCTACA GAGCCAGGCT GCTGTTTTTC CTTTGCTGAT GCCAGGGAGA CTGGACGGCT TGGTCCCAAG 000720
000721 AGGTATTCCC CACAGCGCAG CATACTGGCT GTGGCAGATC ATGGCCAGAC TGCCTCTTTA GGCTGACCCT CACCCATCCC 000800
000801 TCCTCACTGG GTGGGGCATC CCTGCAGGAA CTCCAGCAAC TCAAGCCAGG GAATTAGGGA GAGAACTCTG ATCTCTCTAA 000880
000881 GTCTGAGTCC CTAGCAGGAG GGGTGGCTGG CTGTTGTCTC CACAAACCAG AAGACTTATT CTTTCCCCCT GCTCACTCTG 000960
000961 AGGAATCCAG GCATCCCAGA CGAGTGGGAT TTCCCCCAGC ACAGCATACC CCCTTCACAA AGGGACAACT AAAGTGCTTC 001040
001041 ATTAAGCAAG TCCTGGATCC TGTGCCCCCC AACTGGGTGA GACACCCCAA TGGGTCACCA GACACCTTAT ACAAGAGCAT 001120
001121 TTCTACTGGC ATCAGGTGGG TGCCCCTCAA GGACAGAGAT CCCAGAGGAA GGAGTGGGGT CTCATCTTTG CTGTTCTCCA 001200
001201 GCACTCTCTG GTGACATCTT CAGGTGTGGG AGGGACCCAG ATAAGTAGGG CTTGAAGTGA ATCCCCAGCA AACTGCAGCA 001280
001281 GCCCTACAGA AGAGGTGCCT GACTGTTCAA AGGAAAACAG AAAGCAACAA CAACATCAAC CAAAAAGTCC CCACGAAAAC 001360
001361 CTCATCTAAA GGTCAGCAGC CTCAAAGATC AAAATGAGAC AAACTCATGA AGATGAGAAA GGAATGAAAA ACCCCTCACA 001440
001441 ACTCAAAAGG CCAGAGTGGC TTGTTTACTC CAAATGATCA CAACACCTCT ACAGCAAGGG CACAGTGCTG GGAGGAGGTT 001520
001521 GAGATGGATG AATTGACAGA AGTAGGCTTC AGAAGGTGGG TAGTAGCAAA CTTCACTGAG CTAAAGGAGC ACGTTCTAAC 001600
001601 CCAACACATT GGAACGAATC CCAGAACTTA AAGATTGGTT CTCTAAAATA AGACAGACAA AAATAAAAAA GAATAAAACG 001680
001681 GAAGGAACAA AACCTCCAAT AAGTATGGGG TTATGTATAG AGGCCAATTC TACAAATCAC TGGCATCCCT GAAAGGGAGG 001760
001761 TGGAGAAATC AATGCATTGG GTTAGAACAT GCTCCTTTAG CTCTGTGAAA TTTGTTATTA CCCACCTTCT GAACCCTACT 001840
001841 TCTGTCAGCA AAGAAGCTAA GAACCATGTT AAAAGGTTAC AGAAGATGCT AACTAGAATA ACCAGTTTAG AGAGGAACAT 001920
001921 AAATGACCGG AGCCAACTAT AAAGCACATA AGGGGAACTT CGTGATGCAA ACACAAGTAT CAACAGCTGA ATTGATCAAG 002000
002001 CAGAAAAAAG AATATCAGAG CTTGAAGACT ATCTTGCCAA AATAAGGCAG GCAGACAAGA TTAGAGAAAA AAGAATGAAA 002080
002081 AGGAATGAAC AAAACCTCAA AGAACTGTGG AACTATGTAA AAGACCAAAC CTATGACTGC TTGGACTACC TGAAAGAGAC 002160
002161 AAGGAGAATG GAGCCACGTT GGGAAAACAC ACTTCAAGAT ATCATCCAGG AGAACTTCCC CAACCTAGCA AGACAGGCCA 002240
002241 ACATTCAAAT TCAGGAAATC CAGAGAACCC CAGTAAGATA CTCCACAAGA AAATCAACCC TCAAGACACA TAGTCATCAG 002320
002321 ATTCTCCAAG ATCAAAATGA AGGAAAAAAT GTTAAAGGCA GCCAGAGACA AGGGCAGGTC ACCTACAAGG GGAAGCCCGT 002400
002401 CAGACTAACC GTGGGCCTCT CAGCAGAAAC CCTACAAGCC AGAAGACAGT TGGGTCCAAT GGTCAACATT CTTAGAGAAA 002480
002481 AGAATTTCTA ACCTAGAATT TCATATCTGG CCAAACTAAC CTTCATAAGT GAAGGAGAAA TCCTTCTCAG ACAAGCAAAT 002560
002561 GCTGAGGGAA TTTATCACCA CCAGGCCTGC CTTGCAAGAC CTCCTGAAGG AAGCACTAAA GATGGAAAGG AATCAGAATG 002640
002641 ATGCTGGCCT CATAAAATGA GTTAGGGAGG ATTTCCTCTT TTTCTATTGA TTGGAATAGT TTCAGAAGGA ATGGTACCAG 002720
002721 TTCCTCCTTG TACCTCTGGT AGAATTCGGC TGTGAATCCA TCTGGTCCTG GACTCTTTTT GTTTCTCCAC AGACACTACC 002800
002801 CAAAGCAGTC CTTCACTACA GTGGCAGACA GACCTGAAAA TTTTCATCTG AAGCAGCAGA GTGAACTGCA GAGTCAGAGA 002880
002881 TAGAATCTCA CTATGTTGAC CAGGCTGGTC TTGAACTCTT GGCACCCAAG CGATCCTTTT GCCTGGAATC CCAAAGTGTC 002960
002961 AGATTTACTG AAGAATATTT CATTGTAATT ACTTTTTATA CTTTATAGGT CAAGAGCTCT GTTTTAAAGA CAAAATTTAT 003040
003041 TGAATATACT TTTTCAAACA AAGTTTCATG TTCTAATCAT TGTCTGATTT CAGCATTAAA TGAAACACAG TAAAGAAAGT 003120
003121 TGGGCTGA

Predicted Small Protein

Name LINC00342_smProtein_242:499
Length 85
Molecular weight 9178.5611
Aromaticity 0.0470588235294
Instability index 59.8788235294
Isoelectric point 7.73883056641
Runs 10
Runs residual 0.0188948306595
Runs probability 0.036777954425
Amino acid sequence MLPSLGNCFFHGSLQSTDQKIPLMRPHHQGLGCQLQSHADSQQPLSWSLPKKLGRVVIIT
VAACCLNPLSSLGEGEQSSLWLLAA
Secondary structure LLLLLLLLEELLLLLLLLLLLLLLLLLLLLLLLEEELLLLLLLLLLLLLLLLLLLEEEEE
EEHHHLLLLLLLLLLHHHHHHHHLL
PRMN -
PiMo -