NONHSAT003998

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT003998

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

2137 nt

Genomic location

chr1+:76252386..76260488

Exon number

10

Exons

76252386..76253289,76254844..76255041,76255637..76255742,76255837..76255959,76256131..76256202,76256970..76257022,76257146..76257256,76257866..76257991,76259769..76259918,76260195..76260488

Genome context

Sequence
000001 GACTGCAGTG TACTCTCAAT GAGGAAGATA TAGATACGAA AAGAACTAGA GCCGCATCAC ATGGGGACTT CTGCAAATAC 000080
000081 AGAGACTCGG ATTAAAGGTG GAGAAGATGG AGCTAAAGGA ACTGCTTATT TAATACATTT GAACAACTTT TGGGGTACTT 000160
000161 AGAAGGTGCT TTGAAACCTG CATTTGATTA AGCAAGAATT CGCTTGCAAG TTAAGGGTTA Gattttttta ttttttaaaa 000240
000241 atatGCTTTC TGCCACATGT AAACCGTAAG TTGTCCTGAA ATGATAAGCT TTGATATTCT ATGGTGTTTA TTTTTTACTT 000320
000321 ATCTTTTTGA ATGAAAAGTG AACAACAAGA AATGCTGGTG GTAATTTTTT GGGTCAATGA TGAGTTGGCA TGTATTCTGA 000400
000401 ATCTAAAGTT GATTATTACT ACTTTAGCTC TAGAATTACT CTGAGACCTG AAAATTACCT GAATCGTGAC TAAGACGAAG 000480
000481 CCTGAATGAT TTAAGTTCTT TTTTGTTGGA TACAATttgt tgttgttgtt gttgttTTGG ttttgttttt gagtcggtct 000560
000561 ccggtcgccc aggatggagt gcagtgacac aacctcagtt cactgcagcc ttaatttccc tggctcaagc gatcttctac 000640
000641 ctcagcctcc ctagtTAGTT ACAATTTTTA AATGGAACGT TTTCCTATTA CAAGATGAAA TCAAGTGAAA ATTGAGATGT 000720
000721 ATGTTGAATC TATGCTGTGG GGCAGAAGAA CGTTGTAGAG GTAAACATTG ATGAGATTAC CACTTTATTT TGAAAGGGCA 000800
000801 CTCCACAGAA GGATGTTATT ATCAAGTCAG ATGCACCGGA CACTTTGTTA TTGGAGAAAC ATGCAGATTA TATCGCATCC 000880
000881 TATGGCTCAA AGAAAGATGA TTATGAATAC TGTATGTCTG AGTATTTGAG AATGAGTGGC ATCTATTGGG GTCTGACAGT 000960
000961 AATGGATCTC ATGGGACAAC TTCATCGCAT GAATAGAGAA GAGATTCTGG CATTTATTAA GTCTTGCCAA CATGAATGTG 001040
001041 GTGGAATAAG TGCTAGTATC GGACATGATC CTCATCTTTT ATACACTCTT AGTGCTGTCC AGATTCTTAC GCTGTATGAC 001120
001121 AGTATTAATG TTATTGACGT AAATAAAGTT GTGGAATATG TTAAAGGTCT ACAGAAAGAA GATGGTTCTT TTGCTGGAGA 001200
001201 TATTTGGGGT CCCACTAAGC AGCTGGTCTA ACTGGAGTTA ATGGTCTGCT GGAACTTCCA GCATGCACTG TGGTAGTTAT 001280
001281 ATCAGCATAT ACAGGAGCAC TGGAACAGAG GGGGCCAGTC TAAAAGATAA GGGATGAACA CAGTGCAGGC CAGAGAGACC 001360
001361 AGCTGCTTCA TGGGATGTGA AAAATCTACT TTTATAAAGT AGGGAGAAAT TGACACAAGA TTCTCTTTTT GTGCGGTGGC 001440
001441 AACTTTGGCT TTGTTGGGGA AGCTTGATGC TATTAATGTG GAAAAGGCAA TCGAATTTGT TTTATCCTGT ATGAACTTTG 001520
001521 ACGGTGGATT TGGTTGCAGA CCAGGTTCTG AATCCCATGC TGGGCAGATC TATTGTTGCA CAGGATTTCT GGCAATTACA 001600
001601 AGTCAGTTGC ATCAAGTAAA TTCTGATTTA CTTGGCTGGT GGCTTTGTGA ACGACAATTA CCCTCAGGCG GGCTCAATGG 001680
001681 AAGGCCGGAG AAGTTACCAG ATGTATGCTA CTCATGGTGG GTCCTGGCTT CCCTAAAGAT AATTGGAAGA CTTCATTGGA 001760
001761 TTGATAGAGA GAAACTGCGT AATTTCATTT TAGCATGTCA AGATGAAGAA ACGGGGGGAT TTGCAGACAG GCCAGGAGAT 001840
001841 ATGGTGGATC CTTTTCATAC CTTATTTGGA ATTGCTGGAT TGTCACTTTT GGGAGAAGAA CAGATTAAAC CTGTTAATCC 001920
001921 TGTCTTTTGC ATGCCTGAAG AAGTGCTTCA GAGAGTGAAT GTTCAGCCTG AGCTAGTGAG CTAGATTCAT TGAATTGAAA 002000
002001 GTTGCATAGT ATAGTTTTGC CATTTTAACA TTTCTGTATT TGAAGTGCTT ATCGAATCTA AAAGTGACTA CTGTTAATAT 002080
002081 TTTGTATATT GTGttaaatt aattttaata aattatataa ttatacatat tgtaaaa
[back to top]

Predicted Small Protein

Name NONHSAT003998_smProtein_770:928
Length 53
Molecular weight 6407.7679
Aromaticity 0.0769230769231
Instability index 49.1384615385
Isoelectric point 11.295715332
Runs 6
Runs residual 0.0349847892221
Runs probability 0.00731613966908
Amino acid sequence MRLPLYFERALHRRMLLSSQMHRTLCYWRNMQIISHPMAQRKMIMNTVCLSI
Secondary structure LLLLHHHHHHHHHHHHHHHHHHHHHHHHHHHHEELLHHHHHHHHHHHHHHLL
PRMN -
PiMo -