MIRLET7BHG

From LncRNAWiki

Annotated Information

Approved Symbol

MIRLET7BHG

Approved Name

MIRLET7B host gene

Synonyms

linc-Ppara

Chromosome

22q13.31

RefSeq ID

NR_027033

Disease

asbestos exposure-related lung cancer

Ensembl ID

ENSG00000197182

PubMed IDs

15364908, 24381249

Sequence

>gi|582968252|ref|NR_027033.2| Homo sapiens MIRLET7B host gene (MIRLET7BHG), transcript variant 1, long non-coding RNA

000001 TTCTTTCAAC TCGATGAGAG GAGGACTCCT CCTGGGCCTG GTGCTTCGAG GTCCAGACGA GCAGACACTG GCGCCTGTGA 000080
000081 CCCTGCAGCG GACGCCCTTC AAAGTCCGTG TGGCCGCATT TGGAAGCCTG GGCGGCGTGG AGACGGCGCC TTCAGCTTGA 000160
000161 GATAAATGTG GCCCCGTCCC AGAGCACCAC CCGAGACATC AGGAGCCCAT CGTGGGCTAG GGAAGATCCT CCGGGACCTA 000240
000241 ACGGCCCAGG TCTTCCACCC TTGGCCACCT CCCCAGGTGA TGCCTGAAGC TCAAGGGACT GTGTCCACCC TCAGGCCCTG 000320
000321 CCCGTGGCTC TGGATCGGCG GTCCCCATCA GAGGCCTGGG CTGAGTCCTC AGGTCAAGAA GTGGCGCTGA CCTGGAGCCC 000400
000401 CTGCCTGGGG CTGGCCTTCC TCACAGTGAG CTGGGCCTCC TGCCATCCTG GCTCTGGAGG GCGTCTGAGT GGAAGAGGCT 000480
000481 CAGCCTCACC TCCACACCCT GAAGACTCAC TCTCTGGCCC TGGGCTCTGT TGTGTGCACC GCCTGTCTGC AGGGCCCAGC 000560
000561 CTGCCTCGAG CACTCCCTGG GGACAGAAGC TCCTGCGCTG GCCTCTCCTG TCACCTAGCC ACCGGGCATC CTGGAGTGCA 000640
000641 GCCCTCATGC CCCGAGGCCA CCTGCCATCG CCCAGGATGA CTCAAGATGC CACCTGGGTA AGTCTAGCCC AGAACGAGAA 000720
000721 AGGCCGAATG AGGGAAACTG AGTCACATCT GCTTCTCCCA GGTGGGGCTG GGGGCCCAGG GGGAGGGCAC TTGTGGTGAC 000800
000801 GACCAGGGTG GACTAGGGGC CTGTCTCCCA TTCTGTCTCT GCCACACGTT ATAGAGCCCG ACCTGTGGGC TGGGTCCCCC 000880
000881 CATGGTGAGC ACTGTAGGCT CTAGGAGGCT GAAGTCCTGG CCTGGGGGAC CTGTGGGGGC CAGGCTGGGG CCATTGGAAT 000960
000961 CAGTATGGTG GTCCCCACCT GCACTCCAGC TCCGATGGCT GATGACGGGG GCCCTGGCAG CTGACCCCTT GCCTCAGTGG 001040
001041 GGAGACCACG GCTCACGAGG GCATGAGACC TGCCCAGAGC ATAGCCCAGA GGCCGGCCTG GATTCTCACA GCTCCGCCCA 001120
001121 CTGGGCTGTC CTGCCCCACC CCCTTTTTAG CCTGCTGGGA CCATGAGTCA GAACCCAGTA GGTGTGAGGG GACCCAGCCA 001200
001201 GCCCAGGTGG CCAGTCCAGC CTGAGGGGGT GGGGGTCTCA GAGATTAAGT CAGGAGCCCA GAGTGCGTGC TTAGGGTGCA 001280
001281 ACCCACCCCA CAGCCTCCCC CAAGGCACCT CCCGGGGGTG CAGATGGGGG CAGACGCAGG GACACTCAGG GTGGACCACC 001360
001361 TGACTGGGAG CTGATCATGA GGTGTGCGAC AGCCTGAGTC AGCCTCAGCT CAGAGGAGCT GGGATGGCCT TGATCTTCCC 001440
001441 CCTCCGGCCA CTCACCACAG CTGGGAGGGT CCAAGGAGAC CCTCAGCCAC TGTCCCTTCA GCCCTCACCC CACCCTGGCT 001520
001521 GGCTCTGCCC GGACAGTCCT TAGGGACATC TTGGTCCTGC CCACCAGCCA GGGGACCCAG AATTGGAGCA GGGGAGCAGG 001600
001601 ACCCCCAGAC CCTCTCATAC CGCTTTTGTT CTGAGGCCTT GAGGGAACAC GGGGTCTTCG CGACCCCAGG CGAGATGATG 001680
001681 CTGGGACAGA GGAGGGTCGG GCTGCATCGG GGCCCTCCTG GTCCCCACTC CTGGGTGAGT CAGCATCTCA GTCTCCAGTT 001760
001761 CTGGGTGGGC GGCACCAGGT GTGAGCCTCA GCGGTCTCCT TGCTGGCTAA GGACCTGGGA TTTGCTCCAG TGCCCCTCGG 001840
001841 GAGGGAGTAG GACACGTGCC CAGAGAGCAA GCAGACCCCC TCCCCGACCA ACTTCTGAGA GCAAAGACAC CTAGAGCTAG 001920
001921 AGATTCAGCA CCCCCTGAGC CTCAGTTTCT CCACTATGAA GTGGGATCCG TGATCTCAGC CTTACCAGGG CTGTGAGGAT 002000
002001 TAAATGGTCC AGCCCAGCCT GGGGCCTGCC TGGAGTAGGC AGCCTGCTCA GGATGGGCTG TGGAAGGAGG GCTGGGGTAG 002080
002081 ACCTTTCAAG TCCACTTGGG CATGGGGAGC TGAGAGCTAG CATGCCGTTT AACTTGGCAG GAAGCAGGCC GGGCGCAGTG 002160
002161 GCTCATGCCT GTAATCCCAG CACTTTGGGA GGCTGAGGCG GGCGGATCAC GAGGTCAGGA GATCAACACC ATCCTGGCTA 002240
002241 ACACGGTGAA ACCCCCTGTC TACTAAAAAT ACAAAAAATC AGCCGGGCGT AGTGGCGGGC GCCTGTAGTC CCAGCTACTC 002320
002321 GGGAGGCTGA GGCAGGAGAA TGGCGTGAAC CCAGGAGGCG GAGCTTGCAG TGAGCCGAGA TCGCGCCACT GCACTCCAGC 002400
002401 CTGGGGGACA GAGTGAGACT CCATCTCAAA AAAAAAAAAA AAAAAAAAAA AAAAAGCTTG GCAGGGAGCA GGACATTTGG 002480
002481 ACCTCACTCT GCTGCCCCCT TGGCTGTGTG ACATCCAGGT CACGTTGCCT CTCTGGGCCT CGGTCTCCTC ACCTGTTTAA 002560
002561 GAGGGGTTGA CAGTCGTATC TGCCCCCTCA GCTTTTCCCC AGGAAGGTGG TAGCCACAAT TAGCATTTGT TGAGGCTGAC 002640
002641 CCTGCACCAG GCCCAGGATA GGCGGGGCTT AGGGAGGCCC GTCTCTCGCC ACGTTCCCCT GCTAGGGGAG CCCCGAGGCC 002720
002721 CTCTCAGTGT CATCCTCATG CTACACTCTG TCCCAGCCCT GTGCGTCCCA AGCTAGGGCA CTGAGTGTGC CAGCACCCGC 002800
002801 AGGGACAGGC ACTGGACCCT GGGTGGACCT GAGGGTCTGT GACTACCCCC CCAGCTGCTC TCCCCTAGAG GCCACTTCCC 002880
002881 TCAAGGAAGG AAAGAACCTT CCCGCCACCT CCTGCAGTGC GGTCAGCTCA GGCCAGCCTG CACAGCAGGG CCAGAACCAG 002960
002961 GGCCCCTGGG GAGGGATGCC TGCCTGCCCA GTGGGAGGAG ACGGCACGCC CGTGAAGCCG CTACTCAGCC AGCCTGGGGG 003040
003041 CCACGAGTGC TGCTTCTGGT GGCGCTGTGC GGGGAGGGAG GGGGCCGAGC AGGGTGGGCA CTCGCATGCC TGTGTCTTGC 003120
003121 TGGCCTTCGA CAGATGACAG CCCTCCTCCT AGGGTCTCCA GTGCAGAGTT CCTTGGGGAC ATTATGGCCA CTCCTGTCCA 003200
003201 GATGAGAGGG AGCCGGCTGC CTGTGACAGC GTCGCAAAAT GCCGCCAGGG CTTTCCCTCC CTCCTCCTTT CTCTCTTCCT 003280
003281 CGTCCCTCTC TGGTTGGTGG TTTCCTGCAG GCTCCCGTCC CTGCTGGTGC TGGCCACAAT GTCCCCACTC CCAGGGTTTC 003360
003361 GGCGTCCCAG CCCCCTGCGC CCACCGCGCC TGCCCGCCAG AATCCCTGTG CCCTTGGTGC GTGTGGCCTG CCGAGCCTCG 003440
003441 AGCCCCTGTT CTCCTCAGCC CTCTTTCCTC CCGCGTCCCC AGGAGGTGCC TCTGGAAGCC ACGGAGTCCC ATCGGCACCA 003520
003521 AGACCGACTG CCCTTTGGGG TGAGGTAGTA GGTTGTATAG TTTGGGGCTC TGCCCTGCTA TGGGATAACT ATACAATCTA 003600
003601 CTGTCTTTCC TGAAGTGGCT GTAATATCTG CGGTGGACAG AGCGTCTGGA ACCCTGGCTG GGAGCGGGCA GGGCCAGGTT 003680
003681 TGGGGGCAGC CTTGGCAGCA GTCGGGGGCA GGGGCCGCCT ACACTGAGAA GTCTGACAGG CCTAGGTGCC ACTTGCTGTG 003760
003761 TGACCTTGGA CAGGCCCCTG ATCTCTCTGG GTCTCAGTTT CCTCCTCTGT AAAATGGAGG CAAATGAGGA TGGAAGGAGA 003840
003841 TGCAGTGTGG AGCATCGAGG GCAGAGGAGA GCTGAGCCGA CCCCACCCTC TGCCCCAGCC GCACTGAGAG AGGCGATCCA 003920
003921 CGCAGCTGTT TGTCTGACCT CTGTCTCCCA ACACTCCCCA ACACTCCCCC CGCCATCAGG CCCAGGCTCA TGGGTGGCCC 004000
004001 TGAGCCGTAC CCTCCACTGA GCACCAGGAG AAGGCACCGT GGGGCCAGGG GTGGCCGAGA CGTTTGGAGG TCACAGGGCT 004080
004081 GCGAGTATTG GCGTTGCCCA TCACCCCAGG TTCCCAGCAC GTGCCCCAGC CTGGCCAGCT CAGTGGCAGG GCCTCTGCCT 004160
004161 GTGGAGGAAG GGAGGCCAAG GCACCTTTCC TGAGCAGGAA GTGAGAGGAA CAGCTCTGCA TACACTGGGT CCCACATGGC 004240
004241 ACAATCTGAA GGCAGACAGT GGCTCCTCTG TACCTGGGGA AACTGAGGCC CAGAGAGCCA GGGACTTCCC AAGACCAGCC 004320
004321 AGCAGCAGCT GCCCCTTCCT GGGGTGCCAT CTCCCCTGTC CCTCCTGCCC TGCGCCTGCC CAGCCCTCCT GCTCTGGTGA 004400
004401 CTGAGGACCG CCAGGCAGGG GCTGGTGCTG GGCGGGGGGC GGCGGGCCCT CCCGCAGTGC AAGGCCGGGC CTGGCGGGGT 004480
004481 GAGGTAGTAG GTTGTGTGGT TTCAGGGCAG TGATGTTGCC CCTCGGAAGA TAACTATACA ACCTACTGCC TTCCCTGAGG 004560
004561 AGCCCAGTGA CACGACCCCA TGGGAGGGCC GCCCCCTACC TCAGTGACAC GACCCCACGG GAGGGCTGCC CCCCACCTCA 004640
004641 GTGACCTGCA GGGGGCCTGA GCCGAAGCTG GGTGGGCATC TGGGAGCTAG ATTCAATAAA GCTGTTCTGA CCATGAA
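
The sequence above is laid out in numbered rows: a start coordinate, blocks of ten bases, and (on full rows) an end coordinate. To work with it programmatically, the counters and spaces have to be stripped first. The following is a minimal sketch of one way to do that, not a tool provided by LncRNAWiki; the function name `parse_numbered_sequence` and the two-row `sample` (the first and last rows of the record) are illustrative assumptions.

```python
import re

def parse_numbered_sequence(lines):
    """Join the base blocks of numbered sequence rows into one string.

    Assumes each row looks like the rows in this record: whitespace-separated
    tokens that are either all-digit coordinates or runs of nucleotide letters.
    """
    bases = []
    for line in lines:
        for token in line.split():
            # Coordinates are purely numeric; keep only nucleotide blocks.
            if re.fullmatch(r"[ACGTUN]+", token):
                bases.append(token)
    return "".join(bases)

# Hypothetical sample: the first and last rows of the record above.
sample = [
    "000001 TTCTTTCAAC TCGATGAGAG GAGGACTCCT CCTGGGCCTG GTGCTTCGAG GTCCAGACGA GCAGACACTG GCGCCTGTGA 000080",
    "004641 GTGACCTGCA GGGGGCCTGA GCCGAAGCTG GGTGGGCATC TGGGAGCTAG ATTCAATAAA GCTGTTCTGA CCATGAA",
]

seq = parse_numbered_sequence(sample)
print(len(seq))   # 80 bases from the full row plus 77 from the final, shorter row
print(seq[:10])
```

Filtering by token type rather than by column position keeps the sketch tolerant of the final row, which is shorter and carries no end coordinate.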

Predicted Small Protein

Name MIRLET7BHG_smProtein_881:1165
Length 94
Molecular weight 10087.3323
Aromaticity 0.0744680851064
Instability index 54.2021276596
Isoelectric point 6.56719970703
Runs 11
Runs residual 0.0117946345976
Runs probability 0.01911001911
Amino acid sequence MVSTVGSRRLKSWPGGPVGARLGPLESVWWSPPALQLRWLMTGALAADPLPQWGDHGSRG
HETCPEHSPEAGLDSHSSAHWAVLPHPLFSLLGP
Secondary structure LLLEELLEEELLLLLLLLLLLLLLLLEEELLLHHHHHHHHHHHHHLLLLLLLLLLLLLLL
LLLLLLLLLLLLLLLLLLLLLEELLLLLLLLLLL
PRMN -
PiMo -
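
Two of the values listed above, Length and Aromaticity, can be checked directly from the amino acid sequence. The sketch below assumes that aromaticity means the fraction of residues that are phenylalanine (F), tryptophan (W), or tyrosine (Y); the record does not state which definition or software was used, so treat this as an illustration rather than the database's own method. The names `PROTEIN` and `aromaticity` are introduced here for the example.

```python
# Amino acid sequence copied from the record above, with the line wrap removed.
PROTEIN = (
    "MVSTVGSRRLKSWPGGPVGARLGPLESVWWSPPALQLRWLMTGALAADPLPQWGDHGSRG"
    "HETCPEHSPEAGLDSHSSAHWAVLPHPLFSLLGP"
)

def aromaticity(seq):
    """Fraction of aromatic residues (F, W, Y) in the sequence.

    Assumption: this is the definition behind the listed value; the record
    itself does not say how the figure was computed.
    """
    aromatic = sum(seq.count(aa) for aa in "FWY")
    return aromatic / len(seq)

print(len(PROTEIN))                    # residue count
print(round(aromaticity(PROTEIN), 10)) # compare with the Aromaticity field
```

Under this assumption the sequence has 7 aromatic residues out of 94, which agrees with the listed Aromaticity to the precision shown.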