NONHSAT104312

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT104312

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

4825 nt

Genomic location

chr5+:141785944..141790752

Exon number

1

Exons

141785944..141790752

Genome context

Sequence
000001 CATTTGAAGT AGTGAGGCAG GGATTTGAGC CCAGGTCTGG TCCCTGAGCC CATGCTCTCT GTTCTGCTGT GCACCTTGGG 000080
000081 TGATCAGGGA AGCAGACCCA AGCTAGGGAA GAAGTCATGT GGGCAGGGCT GAGGCACTGG AACAGAGTGG GTAACGCAGG 000160
000161 ATCTTGGGAA TCTGACACAC TGGGGTTCAA ATCCTGGCTC TGCCACTTCG TAGCTCTGTG AGCTTGGAAG TGACTTAATT 000240
000241 TCTCTGATCC TTGGGTCCCC ATCTATAAAA GAGGAGAGCC ATAACTACCC AGAGAGTGTA AGGATGACAT GAGAATGTAT 000320
000321 ATAGAGCCTT GGCATGGGCC CTGGGGTGTG ATGAGCATTC AGTAAATGCT GGCTGCTGTG GGCACATGGG AATTGCTGGG 000400
000401 CACTTTCTCA TCCACTTGAT CTCCTTGGCT CTTAGGGAAG GCAGACGCTG GACGGCAGGT CTCTCAGACA CAGGAGAGAC 000480
000481 CCTTAGCTTT ATTTATCCAC CCTGGGATGT ATCCCTGGGA GCTGTGTATC AAAATTATAG CCACTGAAGG TTGGAAGGAC 000560
000561 AAAGGGGAGG CTAGGCCTGG ATATAGCATC TTTACAGTGG TCCAACTTCC CTCTTGCTGG TGATGTCCAA AGAGAAAGGC 000640
000641 CAGCACAGCG GCTCAGATGG GCGTAGCGGG ACTGCCTGGA GGAGCCTATG CAGTGGGCTT GGGCAGGGTC CAGAGTTACT 000720
000721 GGTGGCGGGG GTGGACGCTG TCAGTGTCTC CCCCAGACTC CACTGAACCT TTTATATCAG GTGCCCTAGA GTAGCACCTG 000800
000801 TTTACCTAGA CCTGCCCTCC AGAGACACTG TCTCCCCACC TTGTGAGCTC CTGGCTGCCA TGACCTCCGC ACTTGCCCTG 000880
000881 AGAGGCTGTG AGGATGAAAT GCTGATGTAC ATGAAGCTCT TAGCCTAATA CTTCACATTC AGCGAGCGCC CACCTTCTGA 000960
000961 CTGTGGCCAT TAATTTGTGG ATTCAAGTAA TGTTTTAGGT TGAGAGGTCA TAGTTATGAA GAAAACAGTC CCCACCCTCA 001040
001041 TGGAGGTCAC ACGCTAGTAG GGAAGATGGA TGAGAAACAT GTAAGCAAGT AGATTTGATT GTAATATTGG AGTGAAGAAC 001120
001121 AGTAAAGCAG GGAGAGGGAA CAAAGACATC GTTGGTGTGG CTGGGAAGGA AGGCTTTTAT TGGGAGATAA CAAGTGAGCA 001200
001201 GAGACCCTGG GGATATGGAG GTGCCAATCA TGAGAATGTT GGGGGAGGCC AGGTGCGGTG GCTCATGCCT GTAATCCCAG 001280
001281 CACTTTGGGA GGCCGAGGCG GGTGGATCAC TTGAGGTCAG GAGTTCGAGA CCAGCCTGGC CAACATGGTG AAACCCTGTT 001360
001361 TCTACTAAAA ATACAAAAAT TAGCCAGGTG TGGTGGTATG CCTGTAATCC CAGCTACTGG GGAGACTGAG GCAGGAGAAT 001440
001441 CACTTGAACC CAGGAGGCAG AGGCTGCAGT GAGCGGATAT CATACCACTG CACTCCAGCC TGGGCGACAG AGCAGGACTC 001520
001521 TATCTAGACA GACAGACAGG TAGGTAGATA GAATTAATGT TGGGGAGCAT CCCAGCAGAG CCTCCAGGTG GGACTGAATC 001600
001601 TGGTACAACA AGACAGAGAG GAGGGATAAC TAGGAAATGA AGCCAGGGTG GAGGCTACCA GGTCACATAG GACCTTATTG 001680
001681 TTTTCAACAA TCATTATTTC TAGATGTCGC TGGGCAGGAT CAGCTGTTCC TTCTCTTGTG CTATTCTGCG TGGTATGCAC 001760
001761 ACAGCCATTG TAGCAAGTCA CCCTTTGTTA GGACTGTGTA TTTTACCACC ACACTGGGAG CTCTGGAGGC CAGGGACTAT 001840
001841 GGCTGTTTTG CATATACCCA GCACCTAGCA TGGGAACTGT TTATCGAAAT TATAGCCACT GAAGGTTGGA AGGACGAGGG 001920
001921 GGCAACCAGG CCTGGGTATG GTATCCTTAG AGTGGTCCAA CTTCCCTCTT CCTGGTGATG TCCAAAGAGG AAGGAGCATG 002000
002001 GAGTCTGTGC TTCACACATT TAAAGATAAT GAATCTTGGT GGCAATGGAT GCTGTCTATT GGGGGTTTCT GTGTGCTATC 002080
002081 CACAGCAGTT AGCATTTTAT ATTTCATTGT CTCATTGAAT CAATACAATA GGCTTGGCCC CAAAGTGTCA TCTCCATTTT 002160
002161 ACGGATGAAG AAATAAGCTG TAGAGGCCGT CCCATGCCCC AAGCACACAC AGCTAGGAAC TTGCTGAGCT GAGATTTGAA 002240
002241 GCCAGGACTC TAAGGACCAC ACTTTCAACT CTGACCAAGC TGGAAGGTCG ACTGCCTCCG GCAGGGGCTT GGCCCAGGCC 002320
002321 TGCCCTTCCA AGGGCTGTGG TGTTTACTGC CGAGGGTTTT GGCATCCTTC GGCCTTCTTG GTCCCTCAGG GGTCACCTTG 002400
002401 CACCCCAGCC TTCCCTGTAG TATCCCACGG AGCCTGAGGG GCTGTCCCTC AGACTTCCTC ACAGCTTTGT AATCCACAAA 002480
002481 TGGAAAAGTG GCACTTAAAA TGAATTTATT CAGGTGGGAG CTGTGTAAGA CATGAAAATA AGGCTTTTGA GCTCCTCCAC 002560
002561 ATAAACTTGA GTGTAAATGA AAAGCTTTTT ATTTGGTTTC TGAGCCAGAC TTTGAGAAGC CAGTGGTCCG TGCGGTCTGG 002640
002641 AAGCGCATGT TATGGGCGCT GGGTCGGAAC AGGGGCCACT TTGAATGGCA AGGAGGGAGA ATCGCTCCAG CGAAGCTGGA 002720
002721 ACGGCCAAGA AGCTGGGATG TCAATAAACA CAGGCAGGAC ACAGAAAGCC TTTTCCGAAG AGATGGAAGG CGGGGAAGCG 002800
002801 AAGGAAAGGC AGTGCCTCAC TCCCGGTGCA TTTGAGAAGG AAAATGCTTG AGGTAACCCG TGTTGGCAGC TTTTATAGCT 002880
002881 TCCTCTCGAT AATGTGCTGG GCAGACCCTG AACAGGCTGG TGTTTTATTT TCGTGTAACG AAGCCGAGTG GACCAGGGGC 002960
002961 AGCTCTCCAG CAATGGCTGG CCTGGGGCTG TGCTCACGGA GAGAGGCAGG GCTACCTGAG ACCTGGGGCA GGGGCCAGCC 003040
003041 TCCTGCCTGC TGGGCAAACT GTGGGGAAGG CGGGGACCAG GATTTCTGAC TTCCTCTGGA TGATGTTTGG TTTGTCACAA 003120
003121 TCCTTAGACG TAAGCCCCCT TTATGTCCAG TGCCAGTGAG GGAGGTGGCA GGCAGCCGTG GGAAGGCAAC TGCTCTGAGA 003200
003201 AGGAGGGCAC AGGTCCACGC TGTCTAAATG TTACTGGTTG TGAACTACAC AGGAACCTTA GCATGGGGAA CCCACAGAAA 003280
003281 GGTCTTTGTG CCCTTTGCTG CCTTTTGCCA AGGTACTTTC TGTGACTAGA TTTACCTTGC ATTTATATTC TCTTCTCTTT 003360
003361 GGTAATATGT ACATATGTGT GTGAGGTATA TTTGGACCTG GGATTATGTG TGTGGTTAAA AATATACATC AGTACGTTAG 003440
003441 TGACCTGCAT AGGTGCATTA AGCAATGTAT TTTGAGAGCC ATGGGCTGGA AGGAAACACC CAACTAGCTA CTTTGAGACA 003520
003521 CTAGCAAAGC CCAAAGTAAT TCTGTTAGAT GAGCGTATGA GCTACTGTTT TATATGTTTG TTACAGATTT TTTCTTCCTG 003600
003601 TTTTCTCCTG GATATTATGT GAAAACAATG TATGTATTTA AACAAAATTT TAAATTTCAA TCTTAAGAAT TTCAGGGTTG 003680
003681 GTTTGTTTGT TTGCTTTGAG ATGGAGTCTC ATTCTGTCAC CCAGGCTGGA GTGCAGTGGT GCGATCTCGG CTCACTGAAA 003760
003761 CCTTTGCCTC CCAGGTTCAA GTGATTCTCC TGCCTCAGCC TCCCAAGTAG CTGGGACTAC AGGCGCGCTC CATCACGCCC 003840
003841 AACTAATTTT TTGTATTTTT AGTAGAGATG GGGTTCCACT GTGTTAGCCA GGCTGGTCTT GATCTCCTGA CCTTGTGATC 003920
003921 CGCCTGCCTT GGCCTCCCAA AGTGCTGGAA TTACAGTCGT GAGCCACCGC ACCCAGCAAA TTTCAGGGTT TTTTTCAACC 004000
004001 CATGTTTATT TTGAAAAATT GAAAATATCA GGAAAAATAA GAAGATAAAA GTCTTCTGCA ATGCTATCAT TCAAATAATT 004080
004081 GCTGCTGTTC TTTAGGTATT TGTATGCTCT TCCAGACATG TTCCTAATGA GTATATATTA ATGTAAAATA AAATATCAAT 004160
004161 GGCGAACATT TGAGTGTTAC CATTTGCCAG CATTTTGCAT GTAGGGACTC ACTTCATCCT CAACAATTCA ATGATTGGTT 004240
004241 TTATTATTAA ACTCATTGTA CAGATGAGAA AGCCGAAGCC CAGAATGATT ACCTTTCTTA AGATCACATA GGTAAGTAGC 004320
004321 CAAGCCAGGA TTTGAACACA GGCAGTCTGG TTCCAGAGCC TGTGCTCTTA AGGCACTTGC TATACTAACT TGCAAAAGAA 004400
004401 TTCGGAGAAT ACTATCCATA CCAATTTGTA ACCCGTTGTT TTTCCCACTC AAAATATTAT GAACTCCCTT CCATGCCAAT 004480
004481 AACCTTATAT CTAGAACATC ATTTAAAATG GCTCCGTTGT GATGCATACA TCCCCAAAAT GCAATGCTGC AATGCTGTTT 004560
004561 CCCTTTGTGG TTGTTTTGGG TAATAATAAC AATTAAGCTT AAAAATAGCA AATATTGAGG TTCTGTGTCT GGTGCTGTCC 004640
004641 TAAGACCTGT TTACATCTCA TCTAATTTAA TCCTTGTGAT AACTCAGGCT TCTTACAGGG GAAGAAGTTA GGGAAGAACA 004720
004721 GGAAGAAATT TGCCATGACA TCTTCCTTAC ATAAAACCTC AAGGTTAGAT TTTTGACATC GTCTTCAAAT AAAATTATAT 004800
004801 ATGTAATAAA AAAAAAAAAA AAAAA
[back to top]

Predicted Small Protein

Name NONHSAT104312_smProtein_2843:3088
Length 82
Molecular weight 8750.7198
Aromaticity 0.111111111111
Instability index 52.4161728395
Isoelectric point 4.35638427734
Runs 12
Runs residual 0.00596844084703
Runs probability 0.0410336807397
Amino acid sequence MLEVTRVGSFYSFLSIMCWADPEQAGVLFSCNEAEWTRGSSPAMAGLGLCSRREAGLPET
WGRGQPPACWANCGEGGDQDF
Secondary structure LEEEEEELLHHHHHEEEEELLLLLLEEEEELLLLEEELLLLHHHLLLLLLLLLLLLLLLL
LLLLLLLLLLLLLLLLLLLLL
PRMN -
PiMo -