LINC01140

From LncRNAWiki

Annotated Information

Approved Symbol

LINC01140

Approved Name

long intergenic non-protein coding RNA 1140

Previous Symbols

_

Synonyms

FLJ41676

Chromosome

1p22.3

RefSeq ID

NR_026985

OMIM ID

_

Ensembl ID

ENSG00000267272

PubMed IDs

12477932

Sequence

>gi|224177485|ref|NR_026985.1| Homo sapiens long intergenic non-protein coding RNA 1140 (LINC01140), transcript variant 1, long non-coding RNA

000001 GGAACATCCT CGCGGCCCGA GGCGCGGTCG CAGCCGGGGA GCACTCGCCA CGGTGCTGTG GAATTCTCTG GTTTTTCACG 000080
000081 CAAGGTCAGG CGTCCTGCTG GCGCCCTCTC GCCACCCTGC CCTCCCGTCA GAAGCCCGGC TCCTCGCCGG GGAAGGCCGG 000160
000161 ATGCTGGCCC GCCGGGACCT GGGACTTGTG CCACATGGAG TGTCGGGAGT CTCCATTGCC GCGAGTTCTA CACCACAGGG 000240
000241 CCAGGCTGTT TGCTCCCCAT CGGTCGCTGC CCCCAGCACC CTGTTGTTAT TAAGGACTCA TTTGCTTGGA GCGGCATCAT 000320
000321 TACAAGGGTG TGGGGTCGCG GCTCCTTTGT CCAGGACTAC GCAGGGGCTT GACCCCAGGG CGCTGGTTTA GGCCGGATCT 000400
000401 GGGGTCCCTT GTCACTCCCA GGCTTCTTCC ACTTCCGAAT TCTGGAGAAC CGGGAATCAA GCCCTGCGCG TTCCTCTTCT 000480
000481 TCCTCCTTCG TGCCGAAAGC ACGCTTCATG TCTGCCAGGG CATCAGTTCT GAAACACCAG ATACTGCTAC ACCATCTCAT 000560
000561 CGGCATGGAC CTATATGTGG CAGCAAGTCC ATCTCATCGC TTTTGGTGAA AGTCAGTCCA GTTTGTAAAG ATTCTCATTG 000640
000641 TCACTGTAGA CGAGGAACTG GGACACCAAA AGGAGAAACT CTGGCCACGC TTGCACCCTG TTCCCAATCC TGGTCCAGTG 000720
000721 TCACCCACAG ATGGTAAGGA GCTCTAGAGA CCTCACCAGC CCCTGGGATT GGTCACCTCA CTCTTCTATG GACAGAGATT 000800
000801 CCTGCTGGGA TCCTTTGAGG GCAAGCAGAC CCTTCTTCCA GCTCGGACTG TGAACTCCAC TGCAGCCGTA AGGACTGTCT 000880
000881 GTGACAGTGA GCCCGAGATG ACTGGGCTCT GTGCTCCCTC CCGGCCCTCC AATCCTTGGC CTGCCACAGA GAACTGAGCT 000960
000961 CTTTTATTAG CACCATGAAT GTGACTGATA CAGCTAGCCA TTCCCTTGTG CGAATGACTC AGTTTATTAA TGCTCTGCTA 001040
001041 AAGATGGCTT CTTTGCTTGC CAGCAGCCTT AAACAGTATT TCATTAAAAC TGGCTTAATT ATTTTGAGAA GACGGCCCAA 001120
001121 TTAAAAGCTA TACACTCCCT CTATGTGAGT GTTTATACAT AGAGCTGTAT ATATAATACA TATTTGTAAG TGTGTATATA 001200
001201 TATATGTGTG TATGTATGTG TCTATAAATA TATAGGCTTA GCAATTTCAT TACATGGGAT AAATTGTTGG AAAAAATACC 001280
001281 CAGGAGCTGG TCCCCTTTCT GTTGCTAGAT TCAGAGTAGA GGCCACCCCT CCACTCTGGG AGAGGCTGGT GTTGGTGATC 001360
001361 TCTCAATGAC TCTGCAATGG AAGTCCCAAC TGCACAGAGC CCTGCCCCAG TTTCAGGAGC CAGCAGCCTC GGAGAGGCGG 001440
001441 ATCCTGACCT CTGCTCTGCT CTTGGGATAG CCTTTCCCTT CCCAGCAGGG TTGAGATACT TGGGCCGGGA AATGTTGTGG 001520
001521 CAAAGTGTTT GCCAAAGCTC AGGAGAGACA CAGACTTGGG GCTTTTGTTT CTTGAGCTGG CTGTCTAGCT TTCCTAATGA 001600
001601 GCAAATATGT TCTCTTTAAG GAAACAAACA AACAAAGCAA AAACACCAAT TCATCTGGAT TTTATTCATT TGTTTTAAAT 001680
001681 ACAAACAAAC AAAAGGAGAG TGGTTATTTC TGCACCAACT ATTTCAAATG CAAGTTACTC CATCGCTCGG GGTGGTTGGA 001760
001761 TGGTGCTTGT CACCATAGGA CCCACAGGGC TAGTTCCAAC TGTTATTCGG TAAGGCTTTT TTCTTTCCAA AATTCCCAGT 001840
001841 GTTCCTTTAA GGCCCATTTA GCTGCGGGTT TTGTTTATTC TCCCGGCAAT CAGCATTTAA AATAAGACAA ACAAGCATTT 001920
001921 TTTCCTGGGC TGTGAATCCC CCCGGCCAGC CTCCACCTGC ACACCTGAAG CCAGCATGTC CAATCAAATT TCTCTGTAAC 002000
002001 CCATATCCCC TTTAGAGACT TGCCCCCGTC GTATACCAGG CTGGAAATAG AGAACTTAAG CAGGGCAAAT GTAATTTTAA 002080
002081 GAATTGCTAA TGATGCTAGA AATCTGCAAT GCAATTAGCG TCATTGGATT TGGCGCTCCT CCGAAGGCAC AAAACTCCTT 002160
002161 GTCATAGCGC AGTGGCAGCA GCGGCAAGTG CCTCCGCATG TGCCGGGCTG TCCGGGTATG CTGGCAGCCG CTTTGCACTG 002240
002241 AGATGTGAGC AGTTGGTTAG GCTTCCTCTC TTTCTTTCTC ACAGATACTG ACTTCTTTGT CTCTTTTCTG GGTTGCAGAG 002320
002321 GGATGGGTAT TTTCCATTGA TTATTACTTT AGCATTTGAC CCTCCAGTGG AGTCACCCTG TTTTTTTTTT AGAAAACTGA 002400
002401 GACTCTCACT TTGTGAATTC ACTGTGCTCT CTGGGATTTC AGTGCTGTAG TTCAACCACC AATCCCCCTG TCCTGAACTC 002480
002481 CAGTACTTCT GATGCTATTA ATTGGTTCCT CAACAATTGT GGCCTTTTCC ATCATTGCCC ACCATAGTAT ATACTTTTTC 002560
002561 TTTCTCTCTC TTTTTTCTAA TTTCCTTGTC TTCTTCACTC TCCATGGAGC CAGAGGTAGT ATGAAGAGTT AAAAATAGGA 002640
002641 ATATAAAGAA AGCCAGAGGG ACAGAGGGAG TGAGAAAGAA AAATTTTAAA AAGGGAGGAA ATGAATTATT GGATTAAAAA 002720
002721 TAAACTTTTA CTTTTTTGCA GAAAAATTAT TTTTGCTCTC TGGGAAAATA ACATGGGCCA GGCATAAAAA GCATGTCAGC 002800
002801 TGGCTAAAAG ATTGCAAAAT CCAGAAGATG ATCTCGATGT GTCTGTTCAA TTTAGCAAGG GTATCTACTA GGGGATCCTC 002880
002881 TTTTAAATAT GGAGGCCCAA ATCAGAAGCT TGTAGAGGGG AGCTATTCTT CCAAGATTCC AGATGTGTCT GTGAGACAAC 002960
002961 ACGTTATGGG GCAAATTGAT TTCACCCTTG GGAAACCAGG GAGATTTTCA AAGTTATGTC TGCAAAGCCA GCTAATGCAA 003040
003041 TTCCCCATTA GTGCATTAAA GTGCGCCCTT ATTAATTCAA ACATAAAGGC AACAAAATAA GCTTTTAAAT TTAAAATATA 003120
003121 ATACATATAT AATGAGCATG TGTGAAAGCC TTATTCAAAT GAAAATACAG GAGTGTTTGA ACTACTGAGG TATCTTTTGT 003200
003201 ATTGAATTAT GAGCATATGT AATAGATTTA ATTATTAATT TCCCCATTGT TCTATGCACA CAGACAGGGT TCAAGGCACA 003280
003281 GTCATTCTCT GGCTTTCATA GATCTAATTT GTATAATTAT TGCCTGAATA AAAAATTGCT CCAAAAAAAA AAAAAAAAAA 003360
003361 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAA
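
The sequence above is printed in blocks of ten bases, with a position counter at the start and (on full lines) at the end of each line. To reuse it in other tools, the counters and spaces usually have to be removed first. The following is a minimal sketch of that clean-up, shown on the first line of the record only; the function name parse_numbered_block is hypothetical and not part of any LncRNAWiki tooling.

```python
def parse_numbered_block(lines):
    """Join numbered, space-separated sequence lines into one bare string.

    Tokens made only of digits are treated as position counters and dropped;
    every other token is treated as sequence.
    """
    pieces = []
    for line in lines:
        for token in line.split():
            if not token.isdigit():
                pieces.append(token.upper())
    return "".join(pieces)


# First line of the record above, copied verbatim.
first_line = (
    "000001 GGAACATCCT CGCGGCCCGA GGCGCGGTCG CAGCCGGGGA "
    "GCACTCGCCA CGGTGCTGTG GAATTCTCTG GTTTTTCACG 000080"
)

seq = parse_numbered_block([first_line])

# Sanity checks: only nucleotide letters remain, and the length agrees
# with the counters printed on the line (1 to 80).
only_bases = set(seq) <= set("ACGT")
print(len(seq), only_bases, seq[:10])
```

Passing all sequence lines of the record to the same function would give the full transcript as a single string.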

Predicted Small Protein

Name LINC01140_smProtein_1376:1618
Length 80
Molecular weight 8560.5923
Aromaticity 0.1125
Instability index 58.03875
Isoelectric point 4.00238037109
Runs 11
Runs residual 0.00131578947368
Runs probability 0.0473896356249
Amino acid sequence MEVPTAQSPAPVSGASSLGEADPDLCSALGIAFPFPAGLRYLGREMLWQSVCQSSGETQTWGFCFLSWLSSFPNEQICSL
Secondary structure LLLLLLLLLLLLLLLLLLLLLLHHHHHHLLLLLLLLLHHHHHHHHHHHHHHHHLLLLLLEELLHHHHHHLLLLLLLEELL
PRMN -
PiMo -