NONHSAT079029

From LncRNAWiki

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomenclature

Please input transcriptomic nomenclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT079029

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

5168 nt

Genomic location

chr20+:23030380..23035547

Exon number

1

Exons

23030380..23035547
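
The Length, Genomic location and Exons fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, though it is consistent with the reported length of 5168 nt). The function names `parse_location` and `span_length` are hypothetical helpers introduced for illustration only.

```python
import re


def parse_location(location):
    """Split a string such as 'chr20+:23030380..23035547' into its parts.

    The layout (chromosome, strand sign, colon, start..end) is inferred
    from this record, not taken from a published specification.
    """
    match = re.fullmatch(r"(chr\w+)([+-]):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, strand, start, end = match.groups()
    return chrom, strand, int(start), int(end)


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, strand, start, end = parse_location("chr20+:23030380..23035547")
print(chrom, strand, span_length(start, end))
```

Because the transcript has a single exon, the exon span and the transcript span coincide, so the same length applies to both.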

Genome context

Sequence
000001 AGTGCCTGGT GGGAAGGGCT GATGCCGCAT ACTCGGATTG CTGGGTTCTC TGGCCGCCCT TGCGCCCGCC CTCGCGCATG 000080
000081 GGATCACCTC GCCGGGATGA GTAAACCCTG CCCTGGCGCA GGGAGGTTCT CGGGCGGGGC CGACAGGGGC AGGCGCCAGG 000160
000161 GAAGGCCAGC ACCCCTGTAA CAAGACGACT GTCCCCGCCC ACCACTCGGG CCCCCACGCG TGCAGCCCTC TTTCATCTCT 000240
000241 TGGTCCTCCT TTCTTTCTTT TCATACATGT TACAGCCACT TCCAAGGAAA GCCTGGATTG CAAGAGCTCT GGGAACCGGA 000320
000321 GACTTCAGAG AAGAGGGCTT TGAATGGGGA GTGGGGGAGG TGGTGCACAG GACCTGCAAG ACGCTGGGAG GGGTGATCGG 000400
000401 CACCAAGGGC ACTTTGGGAG GACCTGCCTA GGACGTGGAC TTCCCCGAAG ACAGGATCGC AAGGAGAGAC AGCTGGATCC 000480
000481 TGTCCGCGGC CAAGGTGCCT GGCTCAGGAA ACCAGCGGAG CGCGCTTGGC CTCACAGGAC AGTGGGTGTG GCTGGGGTGA 000560
000561 CGGGGCAGGG TGGGGAAGAC TGGCCTAACA CCAGCGCCCT CTGCCCCATG GCTGGCCAGG GACCCGCGAG TCCCTGGACA 000640
000641 CGCACTGGCC AACGCCAGAC CCCATCTCAT CGGGTGGGGA AGTCGCGGGG ACACTGTCAG GGCGCCGAAG TCCGGACCCG 000720
000721 GCTCAGAGGC GGTGGCAGGT GAATTGCTGC GGCGCCGGGT AGGGGCGGGC GCGTGGGAGC GAGTCAGCCT GGCCAGTTTC 000800
000801 GGCCCAGCTT CCGAAGGATG GTGCTTCTTG CACCCCAACA GAGTGGCTGG CAACCCCCCA GGGGAGCGCG CAGGATCCCA 000880
000881 GCTGATCCCA CCCGGGTCGG CTAAGGAGGT TTCCATTTCG TCCAGAGTCC GAATTGATAC CCACGTGCAT AGAAACGCCA 000960
000961 CTTGCTCGGC AAAGGGCACT GAAGAGCCAC CGTCCTGTGG ATGGGCAGGG TGGGGGGGGG GCTGGAGGAG GACATGGGAA 001040
001041 TCCGTCACTT TCGACCTCTT CCGGTGGTTC ACTTACCGGG AATGCGGAAG AGTGGGTCTC CCCCTCGGGT CGCCCCCATA 001120
001121 ATGGTGAGAG GCAAACTGTT TAAAAACACC CTTGCCTCTC TCCTCTACTG TCCTCACAAC GAGCGCCAGG GGGCGGCGCT 001200
001201 GTCGAGCTCT AAACAAAGCC AAGGAAGTTG GAGAAGTTTC GGGCTAAAAA GGGTTAAGGT GTAGGAGCAC AGAGTCCTCC 001280
001281 TTCTGGGGTT GGAAGCTCCG TTCCCGGGCA GCTCAGCGTG GATTCCGCTG CGTTCACCTC TTGCCTCCAG GGCCCAGTAG 001360
001361 ATCCTGGGCT TTAAACAAGA ACAGAGAGTA TGGCGTCTGC CACGTGCGAC AGACACGCAC CGGTGGGGTG GGCCGGGCTG 001440
001441 GACTGGACTG ACCTGCAGTG ACCAAACGGG TGGGGCGTGG ACACTCTGAA AGTGAAAAAG GCAAGCACGA CTGTCCCGCC 001520
001521 GCACACTCCC CAGCGCCTTG GGGCAGAGAG CCTCCAAACG TCCCGCTGAG TTGAGCTCTT CGCTGGGAAG GCCGCCCCAG 001600
001601 CACCCAGGAA ACGAAGGAGC TGCTGGACAC GGGGGTCTGA AGCTCCCGGA ATGCACACAG CTGCGCCTCC CCTGACGCGG 001680
001681 ACCCCCTGGG AAGAGGAAGG GAAGAGCGGC AGATAGCACA GTAGGGCTTG TCCGGCGGGA GCAGTCAAGC TGTGGCTGCC 001760
001761 TAGGCTGAAT CAGGGGGTCG TGACTCCGGT GCACAGCCTG CCTACGCGGG ACGCAAAAGC AGGGCTGCCT GCGGTGCCGC 001840
001841 CATGGGGGTG AGGGTGGAGC TTCAGATAAG GGCAGGAACC TCGTTGGGGC ACTGCTGAAA GGTCTCAGAA TGTGGGGACA 001920
001921 ATTCCCAGAG TGAATGAGAC AAGAGAGGGG GAcaggatcg gttacagaat cagaaaccag tgcaaagtga aaatgtgggg 002000
002001 tccattgttc aattacaagg attcaagact gtgatagcaa agtgttaaac aaggaatgag gccctGGGGG GGTGATGAGA 002080
002081 GGGCCCTTCT GAACCTGCCA GCCCTAAAAG CCACTGAGGG ACATTCAAGG ATCAGATCTC ACCCCACACC AGGCTGGCTG 002160
002161 AGTTCTTGGA TGGATTCTAG CTGATTCACC AACACAATGT TTCACTGGAC GTCCTACACA GGTCCACTTG CACCCATTGC 002240
002241 TGATTCTTGC AAAGAAAAAA AAAGTGTGAG AACAGCCCCA ATAGCTGAGT CCACTAAGAA CTCAGTGATG TCTGTGAAAC 002320
002321 TTGGATCAAC CCTATATGAG GAGGTGCAGA TGTTGGATGT TTCTGTATTA GAAAGAAACT AGAGCAACAA CAGATAGAAG 002400
002401 CAGAGCATTG TCCCATGGGG GGTACCACAG TGTCTGGGTT GCCCTACACA AGGATGGGTG GACATGACCC ACTTGGCTTT 002480
002481 CCCACCTGTA AGGGCTTTAC CCCTGACCCT CAGGATACCT TTGCCTTGTG GGGCCTGCCC TCAGCCGCTT TGTTTTTCCT 002560
002561 CTCCAGCCAA TACCTTGGAG GCTTTGCATA TTTTAGACTA CCTTCTGGTG ACTGAAGGTC CCCTGTCTGT GTTCTTGCCC 002640
002641 CATCCCTATC TTCCTATCTG TGCCCCCTGC ACACCAGGGT GGGCCACTCC TTCCTATCCC TTTGCAGATA ACACATCCAC 002720
002721 TTCCCAATGC TTGGCCCAGT TCCTGGCCCA GCACCCTCAC TCTGGCCCAC CAGCAGACTT GCTTTACTCC TGAAACAGCT 002800
002801 CCCTTGAAGA ACACCCATCT GGTCCTAAAT ACTGTTGCCA AAATCTTGTT TTCAGCCATC CTTCTAGAAG CCTTGGTGCC 002880
002881 ATTTGCACTT GTGACCCTTC ACCATTCTCA AGGCTCCTCC TCCAATTACT TTTGAGACAC ATGTTTTCCT CCCTTCCTCT 002960
002961 CCTGTGAGCC TTGGGGAGGC CCCTGGGATC CTCTTGTACC ACCCACTAGA GTGGCCATTC TCAAGGTTTT GCTGGTGGTC 003040
003041 TTCATCATCT CTTTACAACC CACCTTTTGG TGAACTTACC CATTCCCCTG GCAGTGATAG CAGTGTCCAA GGAGATTGGG 003120
003121 GAGGGAGTAT TCTAGAAGGG ACCCTTACCC CATCTTTCCC AGTATCAATA GTTGGTCCAC TGAGGTTTTA AACCATGCAC 003200
003201 TTGCCTTTGA CTTTCTGCTC TCCTTCATCC ACTGCCTAGT TTTTGTGTCT TGTTTCCCCT GATCACTGCT CCTGCCTTCT 003280
003281 CAATTTATGC TGAGACAGTT GGGTCCCCAG CTCTTTTCCT TGCTGCTTAT CTCTCCAGCT CTCCACCGTG GCTGAGGCTA 003360
003361 CTCTGATCCT CATACTCCCT CAGTGCTGCT CCCACCCGCT GGGAGAGCCC ATACTCTTTC CCCTCCGCAC TGAAGATGCT 003440
003441 CGCGATAGTG CCTCTACTCT GTGGCCACAG ATGAGAAAAT TTGCACTCAA GGGGCTATGC TGGAATTTGG ACCCAGAGCC 003520
003521 TGCACCCCTT GCACCTCACA AGGCTTCAAG GACCAGTGTT CTGCTGGTCT CCATGAATGA GGAAACATCC TGGCAAGGGA 003600
003601 GGCTCCTGCT CACTGTCAGA GAAGCACCTT GTCCGCTCAT CTCCACCCTG TAGTTCATGT CTCTCCCTCT GCTAGCAACA 003680
003681 TTCTCGTCTG CCAGGCTCTT CACCCACTGA AATCTTTCCA TTCTTTCAAG TCCCAGTTCA AGCTTCCCCA TCAGCAAGTT 003760
003761 TTCTGAATAA CCCAACCAGA ACTGATCTGT CTGTGATTCC CATAGACAGG TCGTTCTCTC AGCTCTTTAG TACTCACCAC 003840
003841 TCCCTCCCTG CCCTGTACCT AATCATGCAT ATACCACACT TTTTGCTGCA CTTATACTCC AGATTCCCAC ACATTTTTGT 003920
003921 ATCCCCTTAC ACCTAACAGC ACGCTTTGTA ACTAGACAGT GCCCAGAGCT TATTGGAGCA CCTGTGGTAG ATGTTTCTTT 004000
004001 GTCCTCTCCA GGGTGATTTT CTCACATTCA CCTGCAAAGA CAAGGATGGG GGACCTCAGA GCCTCAGGGT GAAttaagag 004080
004081 tgtggacttt ggagccgact tggctgcact ggagacccag ctccggcatt cacaagtttt tgtttattgt taagatttga 004160
004161 ctaataaccc ctccccgtgt agctcagttt ccttgactgt gtaatgggaa tagtccctcc ttcaaagtgt taggatgaag 004240
004241 attaaatggg ttatacatgt aaagcattta gaacagagga gccgtagtca gctctcaata aattttagCA AAAAAGAAAA 004320
004321 CACTCAAGTC CCTAGAATTT GGATTCGGCA TCTGATTGTA TTCAATATCC AGCATGGTTT CTTTTCCATG GTAGGGACTC 004400
004401 ACCAAATGTC TGTTGGATTT AATTTGTTCC AGCCAAGCAT TTCTACCACT AATAACAATA AGCAATTGTG AAAAAAAATC 004480
004481 AGGGATGGAG GCATAGGGAA GAAATTGAAG CATGTCCTGT Cttaggcttc ccatctgccc tagagacatc catagtgact 004560
004561 gtccccattg cccaggtgtg gaagctgaag gccagagaga atcagtgact ctccaatgtc acacagctag taagtggtga 004640
004641 aacaaagatt caccggcctg TGGTAAGGAT GAAAGAATTA CAATGTGGAG TATgtctgag cccccatttt gccacatact 004720
004721 ggcctacttg gctaagttat ttcaatctct ggtgttcttt cccaaaaact ggagatataa tatctaatat gtaggattag 004800
004801 tgtaaaataa aaCTCTTTCC CAAaaactgg agatataata tctaatatgt aggattagtg taaaataaaa cagaaaatgt 004880
004881 acatacagtg actattactt ttcttggaag acagcagctg tttaacaaat atggttgtta ttactGCTAT ATATAAAAAA 004960
004961 AAAAGCTTTA TCAAAGACCC CTGTGGTCAC TGGTAGCCAT CCAGGAAGTC TTGCTTTTTA TTTATTTATT GACTGATTGA 005040
005041 TTGACttttg agacagagtc tcaaaatcgg tagcccagac tagagtgcag cagtgttatc atggctcact gcagccttga 005120
005121 cctcccagat caagtgatcc gcctatctca gcctccagag tagctggg
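
The sequence above is laid out in blocks of ten bases with a position counter at each end of the line. A minimal sketch for turning that layout back into one contiguous string is shown below. The helper name `parse_numbered_sequence` is hypothetical, and the sample uses only the first and last lines of this record. Case is preserved by default, on the assumption that lowercase stretches carry meaning (commonly soft-masked repeats, although this page does not say so).

```python
def parse_numbered_sequence(text, keep_case=True):
    """Drop position counters and whitespace from a numbered sequence block."""
    blocks = []
    for line in text.splitlines():
        for token in line.split():
            if token.isdigit():
                # Purely numeric tokens are the position counters.
                continue
            blocks.append(token if keep_case else token.upper())
    return "".join(blocks)


sample = """\
000001 AGTGCCTGGT GGGAAGGGCT GATGCCGCAT ACTCGGATTG CTGGGTTCTC TGGCCGCCCT TGCGCCCGCC CTCGCGCATG 000080
005121 cctcccagat caagtgatcc gcctatctca gcctccagag tagctggg
"""

sequence = parse_numbered_sequence(sample)
print(len(sequence))
```

Applied to the full block, the resulting string length can be compared with the Length field (5168 nt).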

Predicted Small Protein

Name NONHSAT079029_smProtein_2414:2614
Length 67
Molecular weight 7021.8713
Aromaticity 0.166666666667
Instability index 40.1045454545
Isoelectric point 5.45904541016
Runs 11
Runs residual 0.0160587915079
Runs probability 0.0208604326252
Amino acid sequence MGGTTVSGLPYTRMGGHDPLGFPTCKGFTPDPQDTFALWGLPSAALFFLSSQYLGGFAYFRLPSGD
Secondary structure LLLEEELLLLEELLLLLLLLLLLLLLLLLLLLLLLLEELLLLHHHHHHHHHHHLLLEEEEEELLLL
PRMN LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHLLLLLLLLLLL
PiMo oooooooooooooooooooooooooooooooooooooTTTTTTTTTTTTTTTTTTiiiiiiiiiii
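
The predicted small protein can be reproduced from the transcript sequence. The sketch below rests on two assumptions that this page does not state: that the `2414:2614` suffix in the name is a 0-based offset range (positions 2415 to 2615 in the 1-based numbering of the Sequence section, 201 nt including the stop codon), and that the standard genetic code applies. Under that reading the reported Length of 67 would count the stop codon, while the amino acid sequence itself has 66 residues. Aromaticity is computed here as the fraction of F, W and Y residues, which matches the reported value; whether the original pipeline used the same definition is not confirmed.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Bases 2415..2615 (1-based) copied from the Sequence section of this record.
ORF = "".join(
    """
    ATGGGG GGTACCACAG TGTCTGGGTT GCCCTACACA AGGATGGGTG GACATGACCC ACTTGGCTTT
    CCCACCTGTA AGGGCTTTAC CCCTGACCCT CAGGATACCT TTGCCTTGTG GGGCCTGCCC TCAGCCGCTT TGTTTTTCCT
    CTCCAGCCAA TACCTTGGAG GCTTTGCATA TTTTAGACTA CCTTCTGGTG ACTGA
    """.split()
)

REPORTED_PROTEIN = (
    "MGGTTVSGLPYTRMGGHDPLGFPTCKGFTPDPQDTFALWGLPSAALFFLSSQYLGGFAYF"
    "RLPSGD"
)


def translate(nucleotides):
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(nucleotides) - 2, 3):
        aa = CODON_TABLE[nucleotides[i:i + 3].upper()]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def aromaticity(protein):
    """Fraction of aromatic residues (F, W, Y) in the protein."""
    return sum(protein.count(aa) for aa in "FWY") / len(protein)


protein = translate(ORF)
print(protein == REPORTED_PROTEIN, len(protein), aromaticity(protein))
```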