NONHSAT100620

From LncRNAWiki

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomenclature

Please input transcriptomic nomenclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT100620

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

3316 nt

Genomic location

chr5-:17130137..17217531

Exon number

3

Exons

17130137..17132009,17162578..17162674,17216186..17217531

Genome context

Sequence
000001 CTGCTCCCCT CCCCTCCTCC GGCGCAGACC CTCCCCTCTC CCCTCCAGCC TGGACACGCC CGCCTCCCCT TGACTCCCCC 000080
000081 CAGCTCTGGG CCCCCACCTC CCCTCCCCTC CAGCACAGTC ACCCCCATTT CTCTCCTATC CGCCATCCTG GTTCCTCCCT 000160
000161 TCCCCCCACC TCCCAACTCT GTGCCCCGCC AACGTTTCCT AAATGCCCTC TATTCAGATC CCCCCTCCGC CTCCCCTCTC 000240
000241 CTCTCCTCCA TTCCTGCGTC CCCCTTCCCC CGCCGCGCCG CCTGGGCTGT CCGTGGACTT CTCCCACTCT CTCACTCTCT 000320
000321 CACTCACTCT CTCTCTCTCT CTCTCTCTCT CACTCTCTCT CTCTCTCTCC CCCTCATTTA TTTGGAACCG TTGGATAAGA 000400
000401 AGTGCTCGGG CTCTCGCTCA GACTTAGGGA GCTGCCTCGA GGTGATGAAT GACACCCCCT GGCACCAGCT ACCCTTCTCA 000480
000481 GACCCCAGTC CAGCCCGCTC CCGACGTCGA CTACGATTCC GCTACCTCGG CTGGCAGCGA GGTTGGGGTG AGCCCCAGCT 000560
000561 GCAGGCGCGT CTGGGCTGCG CCGCTGCAAA CGAGTTGCGC ACCTTGGGCG GCTCCGCACC TGCACCCGCA CCCGCGGGGC 000640
000641 TCAGCCCCGA AGGCTGCAGC TTCGGGGGAG GCGCGGTCGC CGAGGTCCAG CTGGTGGGGC GAGAGACGTC GCCCCTCGGA 000720
000721 GGATGCTCTC GGAACTTGGG AGAGGAAGGA GGGAAGAGAA GAGGGGAAAG GGGCCGTCGA TGTTTTTGAT GTCTGTGCTT 000800
000801 TAATGGAGGC CACCAATATT GAGAAGACGG GGTTGGCCGA GGCAGCCCGC ACGCTGCTGC TTGCGAGCGC TCGAGTCAAA 000880
000881 GCTAGGGCCA ACCGCGGCTT GTCCGGGTGC CCTAAGGGGG CGGACACTTG GTTTAGCACC GGGACACAGA ATAGCCACCG 000960
000961 GGGTAGGAAG ATGCGTTCAC TTTGCTTACC TGTTGGCAAG AGGGACATAC AAAAATAACG TAACGTGACA TCGTTGACAA 001040
001041 CGGTAGCTCT TTGATTACAC AAAAGCCAAT TTTACCTTCC CGCAAAAGCC AGTTGACGCC TTTGGAACTT TTATTTGCGG 001120
001121 CATTTTGGCG CCCTCTGGCT GTGTTTGGAT CGCTTTCATG CTCGCCTGCG TCCCAGCCAA GAAAAAATCG ATGGAGCTGC 001200
001201 AGGTTGTCCT CAGGATGGTT CTGCCCTCAG GACCTGGGCG TGAATTCAGG GACAGGGTGG CCCTCCAGAA CCGGAGTGAC 001280
001281 AAACTGTAAC CATACTTAGG GAGGCAGACG TCAAAGGCAA GTACATCTGT ATTCAACTGG GTAAAGCCAT GTGAAGACGT 001360
001361 GCCTGCCTCC CTGTTACCTT CCACCATGAT TGTAAGCTTC CTGAGGCCTC CCCAGCCATG CCTGCTGTAT AGCCTGCAGA 001440
001441 ACAGTTTCAG TCGATTTGAT TATAAAGCAA AGGTTTGGGA AGAAAGACTC ATATTCCTCC TGAGATGCAA CCTCAAGTCT 001520
001521 TAGGAAAAGA AAGTTTTGAA CTCAGGTTCC GGGAATGTGG AAGAGGAAGC TCTTAAAGGG CAAGTAGACT TTGCAACATG 001600
001601 CTCGTTTTCA GGATTCTCCT CCTTTGTCCT CAACCCTGCC CAGTGTTCCC CTAACTCCAC CTAAGCCACT CCATAAAGTT 001680
001681 AGGTCCCATT CCTCTATTCC TTCACTACTA ATGACAGCTA ATAGAGCTAA ACAGAATACC ACAATAAGCC AGAAGCTGCC 001760
001761 TTTTTTTATT ATTATTATTA TTCATTGCCC AATTTGTAGA ACTCTCTGAA TTCAATTAAA GTTTGGCAAA GCCTGTGCAA 001840
001841 ATGAAACCAA GTACCTATTT TTTGCTCATC ATAAACCCAA AAGTTTTCTA GAGAACTAAC TGAAAGAGAT TTTCACCAAA 001920
001921 TCTTTTTATT TTTTTAATCT AGAGAATACA ATTGAGAACC AAATCAATAA ATATCCTCAA CTGTTACCTT TGTTATAAGG 002000
002001 GAACTACGAT GAACCGTGCT TGCCCCACAT TTACCCTAGC AGCAACTATG CTTTTTCTAT CTCTGGCCTT ACCCTGCCTT 002080
002081 CCTGCCTCCA GAGTCTGAGA TGGAGAAAGG CAAAGTCAGA TGGAGGATAG AGCTGGGCAG GGAGTTGCTG CCAGCAACAA 002160
002161 TTGGAGTTGC TGGTTTGCTT TCAACACTGA CCCCACTTTA TTGGCCATGA GTTAAGGCAG TCAAATGGGC ATCTAGGAAA 002240
002241 CATGACCAAG ATCTGCATTA GGGAAGCAAA GCAGATTAAA AGGCACAATT GCTGGCCAGG CATGGTGGCT CACACCTCTA 002320
002321 ATCCTAGTGA GAGGTGAAGC CAGTTGGACT TCCTGGGTCG AGTGGGGTGG GGTCTTGGAG AAATTTTCTA TCTAGCTAGA 002400
002401 GGATTATAAA TGCACCAATC AGCTCTGTGT CTAGCTAAAG GTTTGTAAAC GCACCAATCA GCACTCTGTA AAAACGCACC 002480
002481 AATCAATGCT CTGTGTCTAG CTGAAGGTTT GTAAATGCAC CAATCAGCAC TCCGTAAAAC GGACTGATCA GTGCTTTGTA 002560
002561 AAATGGACCA ATCAGCAGGA TGTAGGCAGG GCCAAATAAG GGAAAAAAAG CTGGCACCCA AGCCAACAGG GGCAACCTGC 002640
002641 TTGAGTCCCT TTCCACATTG TGGAAGCTTT GTTCCTTCAC TCTTCATGAT AAATCTTGCT GCTGCTCACT CTTTGGGTCC 002720
002721 GCACTACCTT TATGAGCTGT AACACTCACC ACGAGGGTCT GTGGCTTCAT TCCTTAAGTT AGCAAGACCA CGAACCCACA 002800
002801 GGGAGGAACA AACAACTCCG GGCACACCAC CTTTAAGAGC TATAACACTC ACTGCGAAGG TCTGTGGCTT CACTCCTGTA 002880
002881 GTCAGCAAAA CCACAAACCC ACCGGAAGGA AGAAACTCCA GACACATCTA AACATCTGAA GGAACAAACT CTGGACACAC 002960
002961 CATCTTTAAG AACTGTAACA CTCACCGTGA GGGTCCGTGG CTTCATTCTT GAAGTCAGGG AGACCAAGAA CCCACCGGAA 003040
003041 GGAACAAACT CCGGACACAC TAGCACTTAC AGGAGGCCCA GGTGGGAAGA TCACTTGAGG CCAGGAGTTT GAGACCAGCC 003120
003121 TGGGCAACGT AGTCAGACCC CATCTCTACA AAAAAATTTA AAAAATTAGC CGAGCATGGT GGCAAGTGCC AGTAATCCCA 003200
003201 GCTACTCTTG GAGGCTGAGG CAGGAAGAGC CCTTGAGCCC AGGAGCTGGA GGCTGCAGTG AACTATGATC AAACCACTGT 003280
003281 AGTCCAGCCT GAGTGACAGA GTGAGACCCT GCCTCT
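
The exon coordinates and the reported transcript length listed above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention); the function names `parse_exons`, `exon_lengths` and `clean_sequence_line` are hypothetical helpers introduced only for illustration.

```python
def parse_exons(exon_field: str) -> list[tuple[int, int]]:
    """Parse 'start..end,start..end' into (start, end) integer pairs."""
    exons = []
    for part in exon_field.split(","):
        start, end = part.split("..")
        exons.append((int(start), int(end)))
    return exons


def exon_lengths(exons: list[tuple[int, int]]) -> list[int]:
    """Length of each exon, assuming 1-based inclusive coordinates."""
    return [end - start + 1 for start, end in exons]


def clean_sequence_line(line: str) -> str:
    """Drop the position counters and spaces from one numbered sequence line."""
    return "".join(tok for tok in line.split() if not tok.isdigit())


# Exon field copied from the Basic Information listing.
EXONS = "17130137..17132009,17162578..17162674,17216186..17217531"

lengths = exon_lengths(parse_exons(EXONS))
print(lengths, sum(lengths))  # [1873, 97, 1346] 3316
```

Under that assumption the three exons sum to 3316 nt, matching the Length field. `clean_sequence_line` can be applied to each row of the Sequence block to recover the plain nucleotide string; the final row starts at position 3281 and holds 36 bases, which also gives 3316.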

Predicted Small Protein

Name NONHSAT100620_smProtein_1385:1522
Length 46
Molecular weight 5451.4353
Aromaticity 0.155555555556
Instability index 53.9133333333
Isoelectric point 9.58856201172
Runs 7
Runs residual 0.00669191919192
Runs probability 0.0283644989528
Amino acid sequence MIVSFLRPPQPCLLYSLQNSFSRFDYKAKVWEERLIFLLRCNLKS
Secondary structure LEEEELLLLLHHHHHHHHLLLHHHHHHHHHHHHHHHHHHHHHLLL
PRMN -
PiMo -
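
The predicted small protein can be reproduced from the transcript sequence. The sketch below assumes that the `1385:1522` suffix in the protein name is a 0-based, inclusive range on the transcript (an assumption, but one consistent with the ATG found at 1-based position 1386 of the sequence above), and that the standard genetic code applies. The `ORF` string is nucleotides 1386..1523 (1-based) copied from the Sequence block; `translate` and `aromaticity` are hypothetical helper names, and aromaticity is taken here as the fraction of F, W and Y residues.

```python
# Standard genetic code (NCBI translation table 1), built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Transcript positions 1386..1523 (1-based): the ORF including its stop codon.
ORF = (
    "ATGATTGTAAGCTTCCTGAGGCCTCCCCAGCCATGCCTGCTGTATAGCCTGCAGAACAGTTTCAGTCGA"
    "TTTGATTATAAAGCAAAGGTTTGGGAAGAAAGACTCATATTCCTCCTGAGATGCAACCTCAAGTCTTAG"
)


def translate(orf: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(orf) - 2, 3):
        aa = CODON_TABLE[orf[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def aromaticity(protein: str) -> float:
    """Fraction of residues that are phenylalanine, tryptophan or tyrosine."""
    return sum(protein.count(x) for x in "FWY") / len(protein)


protein = translate(ORF)
print(protein)
print(len(ORF) // 3, len(protein))      # 46 45
print(round(aromaticity(protein), 6))   # 0.155556
```

The translation yields the 45-residue amino acid sequence listed above, so the reported Length of 46 appears to count the stop codon as well. The aromaticity of 7/45 agrees with the tabulated value.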