ENST00000457898.1

From LncRNAWiki
Jump to: navigation, search

Annotated Information

Name

LINC01772:long intergenic non-protein coding RNA 1772

ENSG00000226029[1]

Function

LncRNA ENSG00000226029 has significant effects on oesophageal cancer(OSCC) patient survival in the independent dataset(P = 0.03, Coefficient = 2.43). [1]

Diseases

Oesophageal cancer[1]

Sequence

>NR_147210.1 Homo sapiens long intergenic non-protein coding RNA 1772 (LINC01772), transcript variant 1, long non-coding RNA

000001 TGTGGTGGTG GTGCATGCTT GGAATCCTAA CTACTTGCGA GGCTGAGGTG GGAGAATTGC TTGAGCCCAG GAGTTTGAGG 000080
000081 CTGCAGTGAG CTATGATCAC GGCCTCGCAC TCCAGTCTAG GCAGCAGTGA GACCCTAGAG AATTCTCTAA AAGAATTAAA 000160
000161 AATGAAAAAA AGCCACAACA CTTCTTTTGC CTGAGGATTC TGTAAGAAGC AGTTTTTATT GTTATGGAAA TAGCCACTCT 000240
000241 GATTAAGAAA CTGTAGAGAG GAGAAAGAAA ATGAAAATTG AAGATTCTCT TGCCCATTGA ATTAAGGGTA AGGAGATTGC 000320
000321 ATTAAAGGAT CTTCGGGAGC AGTAACTTTT TTATGTCGAT TTCACATAGC ATTACTTCAC ATCGAGTCAG TTTTAAGTAC 000400
000401 TTGGAGGGTG GAAATCAGAA AGCTTAGATG TAGAGGAAGC TTTATTAGAA GTGTGTACCA TGGCTGGGTG TGGTGGCTCA 000480
000481 TGCCCACAAT CCCAGCACTT TGGGAGGCCG AAGTGGGTGG CTTACTTGAG GTTAGGAGTT CGAGACCAGC CTGGCGAACA 000560
000561 TGGTGAAACC CCGTCTACTA AAAATACAAC AATTAGCCAG GCATGGTGGC GCACGCCTGT AATCCCAGCT ACTTGAGAAG 000640
000641 CTGAGGCATG AGAATTGCCT CCAGGAAGTG GAGGTTGCAG TGAGCTGAGA TCATGCCACT GTACTCTAGC CTGGGCAACA 000720
000721 GAGCAAGACT GTCTCAAAAG GAAAAAAAAA AAAAGTATGT ACCATTGAAG ATCAGCACTT GGATTGTGGA GACAGACCTG 000800
000801 GCCTTAGGAT CTAGGCTATC CCCTGGTTAG ACGTGTGGCC ACAGGCTGTT CCTTCACCTG AGCGTCACTC GGATGAGGCA 000880
000881 CTAGCAGATG CACATTGCAT TGTTTGACCT TAATGACCTT TCTCTGGAGT CAGGTAAGTA CCCAGAACAG TTCATCATGG 000960
000961 TAGGGAGGAG GAGGTGGCAC AGCTGATGAC ACAGCCCTCA GGAATCTAGC CTGAAAAGCA CCTTGTGGGG TCTGGGGCCA 001040
001041 GGAGCAAGGG ACAGCTTGAT GCTGTCCATC ACTAACTGGA ATCCCAGCTG GAAAAAACTC ATTACAGGAC CAGAATCTCC 001120
001121 AGGAAAACTC AATCCCAAAC CAGGCATCAC TTCAATTACA CCCCTGACTG GAGTTTACAA ACTGGTGGGT GGATGGTAGA 001200
001201 AGATGGCTGC TTTCTTGGGG GGTATGCTAG ACCTCACTTG CCCTCTGCAC AACAGAATTT GGATGCACTG GGAGGTGTGG 001280
001281 CAGATAGACT CCCAGGTGGT CACTGTGATC CCCACCTGCT GTTCACAGCC TTATGTACTC CCTGCCCCTT GAGATGGCCT 001360
001361 AGACCTGTGA CTGCTAACCA GTAGAGTGCC ACAAAGGTGA CAAGATGTTA TTTTCATGGT TGCCTTATGT AAGACTGCAA 001440
001441 CATCTGCCTT GCTGAGAAAT TCTCTTGCTG GCTTTGAAGA AGGAAGCTGT CATGTTGTGT GAGCTGCCCT TGGGAGAGGG 001520
001521 TCAGGTGGCT AGGAACTGAG GTAGCCTCTG ACAGCCAACA AGAAACTGAA GCTCAGTCCA GCAGTCTGCA AGAAAGCAAA 001600
001601 TGCTGCCAGC AACCACACAA GCTTGGAGGC TGATCACTCC CAGGTAAGCC TTCAGGTGAG ACCCCAGGCC TGACCAACAC 001680
001681 TGACTGCAGC CTTGCAGAGG ACCAGCTAAG CTGTGCCCAG ACTGTCCCAT AGGAACAGAT GGTAAATGTA TTGTGTTAAG 001760
001761 TCGCTAAGTT TCTGGTAAGG TTATGCAGCA ATAGATAACC AACACAAAGG TTAGCAAAGT GTGTTTGAGG TGGGAATCAG 001840
001841 TGTGGAAAAG AGAGAATCTG GATAGACTTA ATAGCTGTAG GGCCTTGGGA GGTCCCTTAT GATGCTCCTT TGGGCCTTCC 001920
001921 TGTTGCCTTT AAGTAATGCT TATTGTTTAT TGCTAAAGTA ATGCTTAACT CTCTGCTAGG CACTATTCTA TGCACTTATA 002000
002001 AAACTCATTG CATCATCTCT ACAACCCCAT GAGGTAAGGA CTTGTTTTTT TTGTGTTTTG AGACAGTCTC GCTCTGTGGC 002080
002081 CCAGGCTGGA ATGCAGTGGC ATGATCTCAG CTCACTGCAA CCTCTGCCTC CTGCGTTCAA GCAGTTCTTC TGCCTCAGCC 002160
002161 TCCGAGTAGT TGTGATTACA GACGTGTGCC AGTACTTCCG GCTAATTTTT TTGATTTTTA GTAGAGACGG GGTTTCATCA 002240
002241 TGTTGGCCAG GCTGGTCTCA AATTCCTGAC CTCAAGTGAT CAGCCTGCCT TGGCCTCCTA AAATGCTGGG ATTACAAGTG 002320
002321 TGAGCCACCG CACCTGGCCA CACAACAAAC TTTCTACACA TTGATGTATG ATATTCCACT GAGGGGAGAG GCACCCTCCT 002400
002401 GGCTTAACTG AAGGGGTGTA CCACAGAAGG ACATGGTGGA CATCACACAC AGATTCTGTG GCATAGGATG AACTGCTAAA 002480
002481 CTAGGATATT TTAAAGTCCT CCAACATCTG TAATTGTTTT CCGTGGTGAA CAAGTGTCGT GTGTCTCAAG AAGACAGGCC 002560
002561 ACAGTGACCT CTTTGAAAGC TTTCTGTCAA GTCTCTTATC CAAGGGGAGC AAAATCACAA AGGTCCCAGG CGATTTTTTT 002640
002641 TCCTCTTTCC TCTTTTCTTC ATAAATCTTG GTTTTGCTTT TATTTGACGA AAACAATCTG ATGCCTGTTC TCCCTTCTAT 002720
002721 ACAATGGTAA ATTAGCATGC AAGTAGCTAT CCCTTTATTA TTGTTTGATA GATTTTTGCA GCGGCTATGT CCGTAAAAAT 002800
002801 TAGCCCACCT GAGATATATC ACTGGAGCCA ACAGAACCCT GCACCCAACA GGCACCTTGT GCAGACCTGG ACCCTTACAG 002880
002881 CTGTTGGCTC ATGTTCCTTT GGTTCTTCTA ATGAATATCA TGAGTAGAAA CTGAGCTCTT TGGCTTTTAC CCACTACCAT 002960
002961 GACTCTAGTA CATTTTCTCT CTCTCGTTTC TCTCATTTTT GTATCATGAT TTTCTGCCAT CAGGGGCATC AGTGTGGGTT 003040
003041 CCTGGTTTTG ATGGTATGGA GTGAACTTCT GGGATCGTTT TATGACTGTA TAACAGTTGG TATTCTTCTG TTGATGAAGA 003120
003121 TTTGTGTTGT ACCCAGTTTT TCCTTAGTAT AAAAAGGAGG TTAATAAGAA CATTTTTGCT AGAAGCCTTG TTGGATGTAT 003200
003201 TGTTCATTTT ACGGGGGTAA TCAATAAATG TAAGCGTGCT TCTTT

Labs working on this lncRNA

  • College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China.

References

  1. 1.0 1.1 1.2 Wang P, Guo Q, Gao Y, Zhi H, Zhang Y, Liu Y, Zhang J, Yue M, Guo M, Ning S, Zhang G, Li X. Improved method for prioritization of disease associated lncRNAs based on ceRNA theory and functional genomics data. Oncotarget. 2017 Jan 17;8(3):4642-4655.

Basic Information

Transcript ID

ENST00000457898.1

Source

Gencode19

Same with

lnc-NECAP2-1:1,NONHSAT001116

Classification

intergenic

Length

3245 nt

Genomic location

chr1+:16787443..16794976

Exon number

2

Exons

16787443..16789782,16794072..16794976

Genome context

Sequence
000001 TGTGGTGGTG GTGCATGCTT GGAATCCTAA CTACTTGCGA GGCTGAGGTG GGAGAATTGC TTGAGCCCAG GAGTTTGAGG 000080
000081 CTGCAGTGAG CTATGATCAC GGCCTCGCAC TCCAGTCTAG GCAGCAGTGA GACCCTAGAG AATTCTCTAA AAGAATTAAA 000160
000161 AATGAAAAAA AGCCACAACA CTTCTTTTGC CTGAGGATTC TGTAAGAAGC AGTTTTTATT GTTATGGAAA TAGCCACTCT 000240
000241 GATTAAGAAA CTGTAGAGAG GAGAAAGAAA ATGAAAATTG AAGATTCTCT TGCCCATTGA ATTAAGGGTA AGGAGATTGC 000320
000321 ATTAAAGGAT CTTCGGGAGC AGTAACTTTT TTATGTCGAT TTCACATAGC ATTACTTCAC ATCGAGTCAG TTTTAAGTAC 000400
000401 TTGGAGGGTG GAAATCAGAA AGCTTAGATG TAGAGGAAGC TTTATTAGAA GTGTGTACCA TGGCTGGGTG TGGTGGCTCA 000480
000481 TGCCCACAAT CCCAGCACTT TGGGAGGCCG AAGTGGGTGG CTTACTTGAG GTTAGGAGTT CGAGACCAGC CTGGCGAACA 000560
000561 TGGTGAAACC CCGTCTACTA AAAATACAAC AATTAGCCAG GCATGGTGGC GCACGCCTGT AATCCCAGCT ACTTGAGAAG 000640
000641 CTGAGGCATG AGAATTGCCT CCAGGAAGTG GAGGTTGCAG TGAGCTGAGA TCATGCCACT GTACTCTAGC CTGGGCAACA 000720
000721 GAGCAAGACT GTCTCAAAAG GAAAAAAAAA AAAAGTATGT ACCATTGAAG ATCAGCACTT GGATTGTGGA GACAGACCTG 000800
000801 GCCTTAGGAT CTAGGCTATC CCCTGGTTAG ACGTGTGGCC ACAGGCTGTT CCTTCACCTG AGCGTCACTC GGATGAGGCA 000880
000881 CTAGCAGATG CACATTGCAT TGTTTGACCT TAATGACCTT TCTCTGGAGT CAGGTAAGTA CCCAGAACAG TTCATCATGG 000960
000961 TAGGGAGGAG GAGGTGGCAC AGCTGATGAC ACAGCCCTCA GGAATCTAGC CTGAAAAGCA CCTTGTGGGG TCTGGGGCCA 001040
001041 GGAGCAAGGG ACAGCTTGAT GCTGTCCATC ACTAACTGGA ATCCCAGCTG GAAAAAACTC ATTACAGGAC CAGAATCTCC 001120
001121 AGGAAAACTC AATCCCAAAC CAGGCATCAC TTCAATTACA CCCCTGACTG GAGTTTACAA ACTGGTGGGT GGATGGTAGA 001200
001201 AGATGGCTGC TTTCTTGGGG GGTATGCTAG ACCTCACTTG CCCTCTGCAC AACAGAATTT GGATGCACTG GGAGGTGTGG 001280
001281 CAGATAGACT CCCAGGTGGT CACTGTGATC CCCACCTGCT GTTCACAGCC TTATGTACTC CCTGCCCCTT GAGATGGCCT 001360
001361 AGACCTGTGA CTGCTAACCA GTAGAGTGCC ACAAAGGTGA CAAGATGTTA TTTTCATGGT TGCCTTATGT AAGACTGCAA 001440
001441 CATCTGCCTT GCTGAGAAAT TCTCTTGCTG GCTTTGAAGA AGGAAGCTGT CATGTTGTGT GAGCTGCCCT TGGGAGAGGG 001520
001521 TCAGGTGGCT AGGAACTGAG GTAGCCTCTG ACAGCCAACA AGAAACTGAA GCTCAGTCCA GCAGTCTGCA AGAAAGCAAA 001600
001601 TGCTGCCAGC AACCACACAA GCTTGGAGGC TGATCACTCC CAGGTAAGCC TTCAGGTGAG ACCCCAGGCC TGACCAACAC 001680
001681 TGACTGCAGC CTTGCAGAGG ACCAGCTAAG CTGTGCCCAG ACTGTCCCAT AGGAACAGAT GGTAAATGTA TTGTGTTAAG 001760
001761 TCGCTAAGTT TCTGGTAAGG TTATGCAGCA ATAGATAACC AACACAAAGG TTAGCAAAGT GTGTTTGAGG TGGGAATCAG 001840
001841 TGTGGAAAAG AGAGAATCTG GATAGACTTA ATAGCTGTAG GGCCTTGGGA GGTCCCTTAT GATGCTCCTT TGGGCCTTCC 001920
001921 TGTTGCCTTT AAGTAATGCT TATTGTTTAT TGCTAAAGTA ATGCTTAACT CTCTGCTAGG CACTATTCTA TGCACTTATA 002000
002001 AAACTCATTG CATCATCTCT ACAACCCCAT GAGGTAAGGA CTTGTTTTTT TTGTGTTTTG AGACAGTCTC GCTCTGTGGC 002080
002081 CCAGGCTGGA ATGCAGTGGC ATGATCTCAG CTCACTGCAA CCTCTGCCTC CTGCGTTCAA GCAGTTCTTC TGCCTCAGCC 002160
002161 TCCGAGTAGT TGTGATTACA GACGTGTGCC AGTACTTCCG GCTAATTTTT TTGATTTTTA GTAGAGACGG GGTTTCATCA 002240
002241 TGTTGGCCAG GCTGGTCTCA AATTCCTGAC CTCAAGTGAT CAGCCTGCCT TGGCCTCCTA AAATGCTGGG ATTACAAGTG 002320
002321 TGAGCCACCG CACCTGGCCA CACAACAAAC TTTCTACACA TTGATGTATG ATATTCCACT GAGGGGAGAG GCACCCTCCT 002400
002401 GGCTTAACTG AAGGGGTGTA CCACAGAAGG ACATGGTGGA CATCACACAC AGATTCTGTG GCATAGGATG AACTGCTAAA 002480
002481 CTAGGATATT TTAAAGTCCT CCAACATCTG TAATTGTTTT CCGTGGTGAA CAAGTGTCGT GTGTCTCAAG AAGACAGGCC 002560
002561 ACAGTGACCT CTTTGAAAGC TTTCTGTCAA GTCTCTTATC CAAGGGGAGC AAAATCACAA AGGTCCCAGG CGATTTTTTT 002640
002641 TCCTCTTTCC TCTTTTCTTC ATAAATCTTG GTTTTGCTTT TATTTGACGA AAACAATCTG ATGCCTGTTC TCCCTTCTAT 002720
002721 ACAATGGTAA ATTAGCATGC AAGTAGCTAT CCCTTTATTA TTGTTTGATA GATTTTTGCA GCGGCTATGT CCGTAAAAAT 002800
002801 TAGCCCACCT GAGATATATC ACTGGAGCCA ACAGAACCCT GCACCCAACA GGCACCTTGT GCAGACCTGG ACCCTTACAG 002880
002881 CTGTTGGCTC ATGTTCCTTT GGTTCTTCTA ATGAATATCA TGAGTAGAAA CTGAGCTCTT TGGCTTTTAC CCACTACCAT 002960
002961 GACTCTAGTA CATTTTCTCT CTCTCGTTTC TCTCATTTTT GTATCATGAT TTTCTGCCAT CAGGGGCATC AGTGTGGGTT 003040
003041 CCTGGTTTTG ATGGTATGGA GTGAACTTCT GGGATCGTTT TATGACTGTA TAACAGTTGG TATTCTTCTG TTGATGAAGA 003120
003121 TTTGTGTTGT ACCCAGTTTT TCCTTAGTAT AAAAAGGAGG TTAATAAGAA CATTTTTGCT AGAAGCCTTG TTGGATGTAT 003200
003201 TGTTCATTTT ACGGGGGTAA TCAATAAATG TAAGCGTGCT TCTTT
[back to top]

Predicted Small Protein

Name ENST00000457898.1_smProtein_2090:2278
Length 63
Molecular weight 7261.47
Aromaticity 0.161290322581
Instability index 54.5225806452
Isoelectric point 7.73870849609
Runs 9
Runs residual 0.00847926267281
Runs probability 0.0483620875777
Amino acid sequence MQWHDLSSLQPLPPAFKQFFCLSLRVVVITDVCQYFRLIFLIFSRDGVSSCWPGWSQIPD
LK
Secondary structure LLLLLLLLLLLLLHHHHHHEEEELEEEEEELHHHHHHHEEEEEELLLLLLLLLLLLLLLL
LL
PRMN LLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLL
LL
PiMo oooooooooooooooooooooTTTTTTTTTTTTTTTTTTTTTiiiiiiiiiiiiiiiiii
ii