NONHSAT100137

From LncRNAWiki

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomenclature

Please input transcriptomic nomenclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT100137

Source

NONCODE4.0

Same with

Classification

intergenic

Length

4986 nt

Genomic location

chr5-:4520770..4525738

Exon number

1

Exons

4520770..4525738

Genome context

Sequence
000001 AGAAATATTA AAAATGATAT ATAAATAAAT AGCAATTGAA GGTCATTCAG CTCTCCAGGC ATTCTGCTCA GTAAACAGCC 000080
000081 TCTGTGTACC TTCTGAAACA TTGCATTGCA ATGTTGAATG AAGGGCTCAG ATAATCTCAA CATTTGTCTC CAAAATATTC 000160
000161 CTTTTTAAAT CCGTCATGCA TGGATTTACG GAATGAATCA CTCACAGCCT TTGCTGGTCC TGAAGGTCTG GAGCCCTCAC 000240
000241 CATGTCTTGG GGAGAATGGA GTTTCACCTC CCAAATGCCT GCTGTCATGG GGGAGCTGCT CCTCATTTAG TGGAAAACAG 000320
000321 GTCTTTCCCA CCTCCAGGGC TGCCTTTGCC TTCACCATTC CCTTATTCTT TTACTTCTTG GCTTGGTCAT TGTGTAAGCT 000400
000401 TTGATTATAT CATTGTTCAG TGTTTAAATT TAATAGGAAA GGTTTCTAAT ATCTTTAGCC TGGTCTCAGA CATAGCCTCA 000480
000481 ACTTTCTGAT TCTCCGGTGG AACTGCAGTT TTATGAGCCA GGTACTGCCA CTGTCTCTCA TTCACAACGG AAGGCTTGAG 000560
000561 GCTGAGTGGA ACGAATATGG TCATACCACT TAAGAAGTGG AGCCAGGAAG CAAGCCCCAT CAGCAGGACC CAGGAGTCAA 000640
000641 TATGTTCTGC ATTGCTGAAG TTGAGAGCAA TATTGTGCCC ATCTCTGAAA CAGAGATAAA ATAATGCTAC AGTTGGATCA 000720
000721 GAAAAGAAAA ATGAAAAAAA TAAAAGCTTT GAGGAAGAAC TTTGATGGAA GAAACTATTT CCATGACCCT TTTCCTCTCT 000800
000801 GTCTACAATG GCATCACCTT CTTGGTGAAG GGACATTGAG ATGCTACGGC TGAGAATGCA GAGTGAAGAG CCCTTGGAGG 000880
000881 CCCTACCAGC TGCTACCAGC CTCCTGTCTT CCCTGCCTGG GTCCATGGCT GGCCTGCCCT GGCATCCATG TGGCTTCTGG 000960
000961 ACTGAGTTCC AGCTGGTGAG ATGCGAGGGG AAGCTGCACC TGCTGCACCC AGACCTGGCT CATACAACCT TGCACAGGCG 001040
001041 ACCTCGGGGG GTTTTCCTTC TGTGGACGGA CAGCCCAACA AGCTTGGAGC CCATAAGCTG AGGGTGAAGA GTCTCTGAAT 001120
001121 TGCTGCTTGG AGAACAGCCT GCTGCCAGCT AGGATTGCCT GGTTTGGATT TTATCTGAGA GAGAAATAAA CTGCTTTTAT 001200
001201 TGTGTTGGAC CCATTGGGCA TTTTGAGCTG ATCAACTAAT ATTACTAACT CCTTTAGTTA TATGTCCTTC CTTTTAAAAG 001280
001281 GTCTATGAAA GATGATTTAA AAATACATTC ATGTTGCATG TTAAAGTATA TTTCAAATAA TTGAGAAACT CTAAGTAGGA 001360
001361 TAAGAATAAA GTTAGGAAAT GAGGATGGAG CCAGGAATGA AGCTGTTCTG AAATGGGGGG AACATGAAGC CCCATTCTCT 001440
001441 GGCTTAGGGG ACAAGTTTGA ATCCAAACTA GAAACGGAGG ACAGGAAAAG GGAAATAGAA TCAATGGCAG AAGAAAAACA 001520
001521 CAAGTAGAAA TAATATCAGT GATGGCATTG CATCTGTCAT CTGGACTGAA GCATTTATTT ACTGTGAGAA ACCCTTTAGG 001600
001601 AATCTATTTT CATCCTTTTA TGTAGATAAA GACATCGATA TAGTTGACAT CAATACAGAT CAAACATAGA CAGTGATCTA 001680
001681 GTTATAGACA CAGACCTAGA CTTAGACATG GGTTTAGATA CAGCTTAGAA ATGCCTTTAT TTCCCACTCT CCTCTCAGCC 001760
001761 TCCACCGGTG TGTGAAATTG TCCAATCAGA GTCTCCTTTT GAGTCATCCA TGGACGACTA CTCATTAGCT AACTCATCTG 001840
001841 GCCTCTTTCT TCCTCTAACC TTTTCTTCCC TTTTCTCTGT GCCTCTTCGC CTGCCCCAAC TTAGTCCTGA GCATCTGTGA 001920
001921 CCTCCTGCTC ACTGTGCATG CCTGCAGCTC TGACTCCTGT CCCGCAAAGT TTCTTGTTCT TCAAATCTTA TTCTGGGTCA 002000
002001 CTCCTGTTTT TCTCAAGGAT GCCTGTGGCT TTATCTCCCT TTCCTTAATG ATGCACCTCA AACCTCCGTC TCCCCCTGAA 002080
002081 ATGCCTCCTC ATTTACCTAA CAGGCTGGTG ACTGTCCCCT CTGGTGCCTG ACCTAGGAAA CTGACACATC CCACATTCGG 002160
002161 AACAAAGGCT GTGCCCTCCT CGACAGTCCT CTACCTGGGG TCTCACTGCA TTGCCATTGT TTCAGGGCTG CTTCCCAAGC 002240
002241 CAGAGAAGTA GCAGCCATCC TGGATCGATT CCTTCCTCTC CCCACAGATT CCATCAGTCC ACAGTGCTGC TGACTTTGCT 002320
002321 TTCTGCACGT TTCTTAGATT TCCCACTTCT CGCCATCACC ACCCTGGGAG CCACCTGCAA AGTACTGGCC TGGTCACCAT 002400
002401 CGCCTCTCGC CAGCTTACAG CAGGTGTCCC CTATCCAGCC CACCTTCCTC CAGTCTTGCT CCAGTTCAGT AACTTATTCT 002480
002481 AGCATCTTTA GTAAGTCATG TGCTGGGGCT GACGTGCCTC TACATGGCTC CTCCTTCTTG TCTTCCTCTC CTTTGCTAGC 002560
002561 ATTACTGACA TAGCCTGCAT GTCCCTTTGT GCCAGGAACC GTTTAAGGTG ATTTTCTGCT ATTATCCCAT TTAATCCTCA 002640
002641 ATATAGCTCT CTGAAGAAGG TAGCCTAGTC ACCCTCCTAG GATGGCGAGG CAACCAAGAC TGTGTAGTGC CTTCCCCTTG 002720
002721 CACGCATAAG GTCACCTGCA CATTGTTCTC TGCTTGTGGC ATTATTTCCT CCAGCTTCAC AGGGAGAATC TCAACCCATC 002800
002801 TTTCATCCCT AATTATGGAC ACCATCTGCT CAACAAAGGC ATATGGTGCT CCTCCATCTG CCCAGAGAGT TGCTGAGCCT 002880
002881 CTGTACATCA ACAGCCTTTA GCTGCCTCTA CGCCGTGATT GGGAGGAAGC TCCCTGAGGA CAGGGCCATA TCTGACATTT 002960
002961 CCTCAGTTAC ATTTTCCACA CTGAGCATAA TATGGTCACA TAGGGGGCAC TTATTAAATA CTTCCCTAAA ACATGAGTAA 003040
003041 ACATGACATC AAAAAGGTAT CACTCACTGC AAGGAAAATA TAGCTTATGA CATTTTACAA ATGTATGAAA ATTTAAATGT 003120
003121 ATTTTTTGTT CATGAATAGT CACTGTGGTG CACGTAATAT ATTTAACCAG GAGTATGTGG CTTAGTGAGT TAATGCAAGT 003200
003201 GAGAAATATG TACCCACATG AAACACCTCA TATATGAAGT TTTTTTTTTT TTGACAGAGT TTTGCTCGGT CACTCACGGC 003280
003281 AACCTCCGCC TCCTGGGTTC AAGCAGTTCT CCTGCCTCAG TCTCCCGAGT AGCTGGGACT CTACAGGCAT CCCCCATCAG 003360
003361 GTATGGCTAG TTTTTTCTTT TTTTGTATTT TTAGTAGAGA TAGGGCTCTG TCATGTTGAC CAAGCTGTTC TTGAACTCTT 003440
003441 GATTTCAGTT GATCTGTCTG CCTTAGCCTC CCAAAGTGCT GGGATTATGA GTGTAAGCCA CCATGCTCAG CCCTCATATA 003520
003521 TGAAGCTTTG AGTGCTATGT ATAGGGATTC AGACTTTATA GGCTGCCACT TTCATGATAG ATTTACATTG CACCCTACTG 003600
003601 TTAATCTCTA CATTTTTAAT TTTAAATTCT TAAGGATCAT CTTGTCTTAT CTTCATTCTA GTTAGTTTTT TCTTAAAATA 003680
003681 AAAACTAACA AATAAAAAAC AAACAAATAA AAAACAGGCT AGACAAGTTA AATGATCTGC TTTAGCCCGA ATACTGGCAT 003760
003761 AGAAGTAACC AGAAATGCTC TTACTCCTAA TACATACATA CATATGTCCA TATACATATG TTTCTTATTA TCCATATAAG 003840
003841 TGCATTTTGA TTTTGGACAT AAAACAAAGG ACTACATGAA CGAATATATT TGTATTATTA TATAAGCAAT ATGAGCTTAC 003920
003921 TGAAGAATAT CTAAAAATAT TGAAACAGTT TAACAAAATA ATATTCATAA TTCTATTACC TAGCAGTAAC TCCTGTAAAT 004000
004001 ATTTTAGGAT ATTTCATGCT TGTTCTTTTA TTATCAATAT ATTTTCCTAT TTGATATTTT TGCAACTTGA GTGCAAAATT 004080
004081 TTCACTTAAT ATTATAATTA TTATTTCCCA GTTTATTAAC TATTATTTTA AACATACTTT TATGTTTGTG TACTCTTCTA 004160
004161 TCACATGTGG TTATTTTCAT TAACAAACCA TCTATAATTT TAGATGTCTA TCAGCACTAA CATGGGTTAT TGTTGCTGTT 004240
004241 ATATTTAATA TAATTTCCTC AGGATCAATT TATATATTGT AAGCACTATG TCAAATTCCC AAATATCTTG AATTTCTTTG 004320
004321 TGTGAGTTGA TTTTTCAAAT GATTGTACCG GTTTACAAAA GCAGAGTTTA AGAGGATTCA TCCCATATCA TAAATAAATA 004400
004401 GTGGTTTCAT GATATGTTCT ATAAATGTAT TTTGGGAAAG TTTAAAGTCG TTTAACTATG GTCATATCCT ACATCCTAGA 004480
004481 TGTTGTTCTG TTGGGTTTCA TTTAACCAGT CTCAACAAAT GATACAGAAT ATTTCCTAAA GAACTTATAA TCCAATAAAT 004560
004561 GACTTTTTGA TGCCTCATGG AGAAAATTGG CTTTGAGACC ACACTTCTTC CCACTTCTTC AATATAAAAG AAAAACAAGT 004640
004641 TCTTATTAAC ACAATTTTGG ACACTCAAGC ACATTCTATG GTTCTGTATT GGTGATGAAA ACTGCCTATG AGCGACTACC 004720
004721 AACCTCTTTT ATGCTGCCTA TTTTTGGAGA TGTCCAAATT TAAGATTAAG GATTCTGGAA ATTTGTTTCA CATATTCTCT 004800
004801 TTCTCTTTCT CTCAAGGTTA GGGTACCCAG TGTCTTCCAA ACTAAAGTAT TGTAATATTA CCCTGCCACC TAGGGGTATT 004880
004881 GCCAAAGAAA TTTATGCATG TATATAGACA TTTAGTGTAT TGGTCAAAAT GTGACAATTA TGTTATTAAA GTATGTAATT 004960
004961 AACAGTCAAA AAAAAAAAAA AAAAAA
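The numbered layout above is convenient to read but awkward to feed into other tools. The following is a minimal sketch of how the block could be flattened into a plain string and how the open reading frame behind the predicted small protein on this page could be pulled out of it. It assumes that the suffix `179:403` in the protein name gives zero-based, inclusive offsets into the transcript; the page does not state the convention, but this reading reproduces the listed peptide. Only the first six lines of the sequence are embedded, and the helper names (`parse_numbered_block`, `translate`) are illustrative, not part of any LncRNAWiki tool.

```python
import re

# First six lines of the Sequence block, copied verbatim
# (position counters included); enough to cover the ORF.
RAW = """\
000001 AGAAATATTA AAAATGATAT ATAAATAAAT AGCAATTGAA GGTCATTCAG CTCTCCAGGC ATTCTGCTCA GTAAACAGCC 000080
000081 TCTGTGTACC TTCTGAAACA TTGCATTGCA ATGTTGAATG AAGGGCTCAG ATAATCTCAA CATTTGTCTC CAAAATATTC 000160
000161 CTTTTTAAAT CCGTCATGCA TGGATTTACG GAATGAATCA CTCACAGCCT TTGCTGGTCC TGAAGGTCTG GAGCCCTCAC 000240
000241 CATGTCTTGG GGAGAATGGA GTTTCACCTC CCAAATGCCT GCTGTCATGG GGGAGCTGCT CCTCATTTAG TGGAAAACAG 000320
000321 GTCTTTCCCA CCTCCAGGGC TGCCTTTGCC TTCACCATTC CCTTATTCTT TTACTTCTTG GCTTGGTCAT TGTGTAAGCT 000400
000401 TTGATTATAT CATTGTTCAG TGTTTAAATT TAATAGGAAA GGTTTCTAAT ATCTTTAGCC TGGTCTCAGA CATAGCCTCA 000480
"""


def parse_numbered_block(text):
    """Drop position counters and whitespace, keep only nucleotide letters."""
    return "".join(re.findall(r"[ACGTN]+", text.upper()))


# Standard genetic code, built from the usual compact TCAG ordering.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(orf):
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(orf) - 2, 3):
        aa = CODON_TABLE[orf[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


seq = parse_numbered_block(RAW)
# Assumption: 179:403 is zero-based and inclusive, so the slice ends at 404.
orf = seq[179:404]
peptide = translate(orf)
print(len(seq), len(orf), peptide)
```

Under that assumption the slice is 225 nt long: 74 sense codons followed by a TGA stop codon.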

Predicted Small Protein

Name NONHSAT100137_smProtein_179:403
Length 75
Molecular weight 8025.1943
Aromaticity 0.148648648649
Instability index 54.6163513514
Isoelectric point 6.06121826172
Runs 7
Runs residual 0.0480553724456
Runs probability 0.0112044817927
Amino acid sequence
MDLRNESLTAFAGPEGLEPSPCLGENGVSPPKCLLSWGSCSSFSGKQVFPTSRAAFAFTI
PLFFYFLAWSLCKL
Secondary structure
LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLEEEELLLLLLLLLLEELLLHHHHHHHHH
HHHHHHHHHHHHHL
PRMN
LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHH
HHHHHHHHHHHLLL
PiMo
oooooooooooooooooooooooooooooooooooooooooooooooooooooTTTTTTT
TTTTTTTTTTTiii
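Some of the values in this table can be recomputed from the amino acid sequence alone. The sketch below assumes that Aromaticity means the relative frequency of Phe, Trp and Tyr residues, which is the definition used by common tools such as Biopython's ProtParam module. The page does not say which tool produced the numbers, so treat this as a plausibility check rather than a description of the original pipeline. The function name `aromaticity` is illustrative.

```python
# Amino acid sequence as listed on this page, with the line wrap removed.
PEPTIDE = (
    "MDLRNESLTAFAGPEGLEPSPCLGENGVSPPKCLLSWGSCSSFSGKQVFPTSRAAFAFTI"
    "PLFFYFLAWSLCKL"
)


def aromaticity(protein):
    """Fraction of residues that are Phe (F), Trp (W) or Tyr (Y)."""
    return sum(protein.count(aa) for aa in "FWY") / len(protein)


n_residues = len(PEPTIDE)
value = aromaticity(PEPTIDE)
print(n_residues)        # number of residue letters in the listed sequence
print(round(value, 12))  # compare with the Aromaticity row above
```

With this definition the sequence gives 11 aromatic residues out of 74, which agrees with the listed Aromaticity. The listed Length of 75 is one more than the 74 residue letters shown. The sketch only reports the count and does not settle which convention the page uses.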