NONHSAT103654

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT103654

Source

NONCODE4.0

Same with

,

Classification

antisense

Length

3622 nt

Genomic location

chr5-:127395133..127419482

Exon number

2

Exons

127395133..127396936,127417665..127419482

Genome context

Sequence
000001 GGCAGCCGAG CGAGCGCGCG AGTGTggcgg gcgcggggcg ggccccgcTG TTTAAAGCCT CGGCGGGGCT GCGGCGGCGG 000080
000081 CGGCTCCGGC AGCAGCACGC GGGAGGCGGG GCCGGAGGCT CCTCCGCCCC TCAGCCCGCC CACCCGCGGT CCCGCAGCGG 000160
000161 CGCGCGGCTG ACGCTCCCTC AGCGATCCGG CCCTCCGCGG CTCCTGCGCG ACCCCACGCA AAAGCGCCCC GTGCGAAGAG 000240
000241 CGCGGCCAGC CAGTCCGCAA AGGGCCTCGG GCAGTGCGGG CAGCACGAAC GGCGCGATCG GGGAGCAGGC GCGGCTACGC 000320
000321 TCCGCCTCAG TGATGTCGCC GGGAGGATCT CCGCGCGGCT ACCTCCTCTT CCGCCGCCTT CCTCCCACGC CTTCACCTCC 000400
000401 CTCCCCCGCC GCGCTCTGCG GCGAAAAGGA GCGGGCTCCC AGGCCACCTC CCCTCTCTCC ACCCAGGGGC CGCGCGTCAG 000480
000481 TGGACCTGCA GCCCAGGCAA CGGGGCTGCG GACGGCGCGC GTGCCCTGCC TCGCCTCACG CCGGTAGCTG ATGCTCCCGC 000560
000561 GGCAGCTGCC ACCCCAGCGC CCCAGCCACG GCTTCAGAGG CTCTCGCACG CTGGCTTTTT TTTTTTCCCC CCGCTTTGGT 000640
000641 GAACTCGCAG TACTTGGGCT GAGCCCGAAA TCTCGCGAGA TGCTGCAGGC GTCAGTGCGG CTTCCCACTA GGCGCGTGGT 000720
000721 ACCCGAGGCC GGGCCGGAGC CGCAGGCCAG CCGCGATTCC TTCGGGCGAT GCTGGGGGCT TTCCGGTCGG GGCCGCAGCC 000800
000801 GCTTCCGGAG CCGCGGGCGA GGTGCGTTCC CCAGCCTGGT TTGCTCTGGG CTCTGACCCG CCGCCGCGAA TCTCCGCTCG 000880
000881 TAACTCCGGG CCTGAACCTG GAGGAGGGAG GATTTCCTCT GTGCAGCTTT CGAGCGGTAG ACTCACCCTG CCAGGCAGAC 000960
000961 CTAGATTGGC ATAACCAGAG ATCAGTTCCA GTTGTGATGC AGTGGGTGTC TACACACTTC TGATGACACC TTTATTTTAA 001040
001041 AAGCTGATGT TCCAAGGTGA GAAGGGAGCT TCTTTGAGAC GTCATGTTGT TTCAAGACGA GTTTTAATTT AAGGCATTTC 001120
001121 CAAAAGTGAT ATGCAGCTCA TCTCTCAGAA TACAATGCCG GCAATAACTA CCCGTGGAGT CCTTACCAGG AGGTTTCTGC 001200
001201 GTGATTTGGG TTGTGCAGAA TTCCTCTGCT TATATAGAAA CAGAAAGGTA CTCCTCTGCA GGTCTAGTTT CTCTTCGGTA 001280
001281 AAATGTAAGG GAATATAACT GTGTTCCCAC TTTCTTAGCA GCAGAAATTT TTAAACCTAT AGATTATGCC CAAAGTCCTA 001360
001361 AGTAAAATCA ATGAATGGCG TCCTCCAATT ACTTGGTTTT TATTTATGTA Cttttttttt ttttttttga gacggagtct 001440
001441 cgctctgtcg cccaagctgt agtgcagggg tgcgatcttg gctcactgca aactccgcct cctgggttca cgccattctc 001520
001521 ctcctgcctc agcctcctga gtagttggga ctacaggcgc ccgccatcac gccctgctaa ttttttgtat ttttagtaga 001600
001601 gtcggggttt caccgtgtta gccaggatgg tctcgatctc ctgacctcgt gatccgccca cctccgcctc tccaagtgct 001680
001681 gggattacag gcatgagcca ccgcgcccgg ccCTTGTATT TTTAAACTTT GACTTTATGG AGACGAACAC CATCGTTTGC 001760
001761 TGAACACTTG AAACATGATG AAAGAGCCAC AGAGTTGGCA GAACTGTTTG AAAATGCTgt gcaagcggtc ttctctgtct 001840
001841 tctttatggc cagtaaaatt ctccagaaga gatttatggc agcctcactc ccagtagttt ctgcatttag tgagataagg 001920
001921 taagttctga gaaggctttt ttctgcatct gttgaatttc aaatgtcttt agaataatct ttatatcaaC TCTGGGGGTC 002000
002001 TCAGTGAGTT CCCACAGGTT GTTGTGAGAA TAACTAAGGT AATATGGCAA AACTGTTAAG TTAGCACTCC ATAAATATTT 002080
002081 GCTTTTATTA TTATTTGAAA ACTAGTCTTT CTGCCTTGAT GCCAGTAGCA GATGGGAAAT TATGGTGATT TTTATTTCTA 002160
002161 ACATTGATCC ATTTTACAGA TCAGCCAGTC TGCTTAATTC CTGGGTCAGC ATTTCCTATG CAGTTACCCA AGTGCTGATT 002240
002241 CATTGCTTTT TCTCATATTC AATTAAACTC CCCATCTAAC TTTTCCAAAT CAATTTCTTC CTGACTGTTG ACTTAGAATT 002320
002321 CATTGCAGTT GTCCTAAATA ATTTGAGCAA CTCAAGAAAA TTAACATCTG CTGAGATCTG TTTCTACCTT TCCTAAGTTT 002400
002401 CTCTTCATTC CCTTTCAGTA CCCTGGGATG TCACTCAAGT CTCTTTTTAA ACCTATATCC ACTTCACCTC ATTTGGCTTT 002480
002481 TTGCCTATCA GAGGTGAAAC AAAAGAGTGA AATCTTTGCT TGTGGAATCC TCTCATGTCA TCAATTGTTT ATAAATGTTT 002560
002561 AGTATTTAAA GGACCACTAA GGGCCAGTGG GGAAAATGAA TCTTATGGAT GGATATCATC ATTTTTTGCT ACTTAATAAT 002640
002641 GTTATGGACT TTCAAATGAG ATTTCTGTAA CTGGAATGAG AAAAATCCTA ATAAGTTTAG GATGGGTCAG AAGATTTGtg 002720
002721 caatggagtg aatgcttgtg tgcttcccaa atttcgtatg ttgaaactat aatcccaatg tgatagtagt tgatggaagg 002800
002801 gcctttggac agtggagccc tcatgaatgg gattagtgct cttaaaagaa gagaccagag agctagctag ctgtctttcc 002880
002881 accatatgag ggtgcaatgg gaagctggca gtctgcaacc agaagaggac cctcaccagt ccctcaccgt tctagcaccg 002960
002961 tgaactcaga cttcagtctc cagaactgtg agaaataaat ttctattgtt tatgtcaccc agtttatagt actttgttat 003040
003041 agaagctcaa attgactaag ataATTTGGA TATTAGGTTA TATCGCTTGC AGGAGGTATG TAACTTCTTG AAACATGTAA 003120
003121 ACTCTCTGTG TTGTGTTCTG CTGCCTAAAT CCACCAGATA AATATTTGGT TTACTTTGCT TGGGAAAACT GTTACTCAAA 003200
003201 TGTTCTCTGT ACAAATGCAG GAGAGTAAAA TTATTTCCCC ATTTTATATC AGTAATTGCT ATTTTATTTG TCAGTAAGTG 003280
003281 TTTCATTATG ATTTAAACAA TTGGTATAAG TATATTAATA TAATTATTTA CTAGTTGCTA AAAATATATG CTGCATCTTA 003360
003361 TGAAAATACA TTTAAAAAAC ACTCGAATGG GTCATTTGAC ATGAAAGCAT CATTCGATTT AGAATTCAGA GACTAAGTTG 003440
003441 CCAAAAGAGG AAAATATTTA ATACCATTCC CTGAAGGCTA GTAAGTTTAT AATTAGCAGG ATCACCAGAG GGAGTAAATT 003520
003521 TAGAGCTGAA AACCACTCAC TGATTCTGTA GTATACTTGA ATAAAATAAG GTCACTGTCC ATGTAGCTGG CCAAATAGCT 003600
003601 TGGCACAATA TGGAAGAAGG AA
[back to top]

Predicted Small Protein

Name NONHSAT103654_smProtein_332:550
Length 73
Molecular weight 7562.6281
Aromaticity 0.0277777777778
Instability index 108.920833333
Isoelectric point 11.5856323242
Runs 7
Runs residual 0.0436942416869
Runs probability 0.043568778863
Amino acid sequence MSPGGSPRGYLLFRRLPPTPSPPSPAALCGEKERAPRPPPLSPPRGRASVDLQPRQRGCG
RRACPASPHAGS
Secondary structure LLLLLLLLEEEEELLLLLLLLLLLHHHLLLLLLLLLLLLLLLLLLLLEEELLLLLLLLLL
LLLLLLLLLLLL
PRMN -
PiMo -