NONHSAT104343

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT104343

Source

NONCODE4.0

Same with

,

Classification

sense

Length

4752 nt

Genomic location

chr5+:142544478..142606050

Exon number

3

Exons

142544478..142545009,142593562..142593653,142601923..142606050

Genome context

Sequence
000001 AATCAGTGTG TCGGGCCTCA AAAGAGAAGA GCAATGTGAT TTCATGTAGA ACCCTATTTG TATGCTGCCA CAGCCCTATG 000080
000081 CTTTGCTTTA CTTCGGATTC ATATAGAAAT CCAAGAATTT GTAGTAAGTA AGAGAAAGAA AACACCATCT CTGCTCAGCT 000160
000161 GAGCTGGCCC GTGTGCCGAA CTAGGAAGAT GGCTCTTTTC CTCTGGAGAG GATATGATCT TTTCAAGGTT GCTTTTTACA 000240
000241 CATCACCATG GGGCTGGCCT GGTTGCATGC TGAAGCTGTG AAACCTGAGT GTGTATCTGT GCAAAGCTGG GGTCGAATCA 000320
000321 GGGCCCCTAG GGGCCCTGCA CTGGTTTTGT TGAGTGGAAC ATACAGGTTG GGGTTTTTGG AGGCTGGGTG GGGAAGAGGG 000400
000401 AGGAGAGATG AGGGTGTTGT CAGTGAGGGA GGAACTTTGG GGAACAGAAA TATCTAAAAG GAAGTTTAAT GGACTTTCTT 000480
000481 GTACACGGTG CATAATGTGA ATCCTCGTCC ATGTCACCTC ATGCGGGCTT CGCACACCGT TCCGGAAGGC AAAAGCCTTG 000560
000561 TATGCCTGCA AAGCTGAACA TGACTCAGAA CTTTCGTTCA CAGCAGGCAC GGTCTTCGAT AACGTTCACC CATCTCAGGA 000640
000641 GCCTGGCTGG TTGGAGGGGA CTCTGAACGG AAAGACTGGC CTCATCCCTG AGAATTACGT GGAGTTCCTC TAACCGTGGG 000720
000721 CCCCAGCAGA ACTGCTGAGC TTTACATGGT ATCCATGACA ACTGCTGATT CCAGTGTCGA GGCCATTTCT CTTTGCCACT 000800
000801 GAGAAATGCA GCGTGACTGA CTCTGTTGCT ACCTGTCAAC ATGAATGTTT CTGTGAGCTC TGGTGTCACT CATCTCCATG 000880
000881 ATCATCTCAG CCAACATGCA TCAGTACTGC AAGAAAAGAA GTCAATCAGC AGAGGAGAGC ATTTGATAAC TAAGAGGAAG 000960
000961 ACTTGCAAAG CCGTTTTCTC ATGAGTACCC TGAATAGGGG GCACTCATTT TGTTTCAACG GTCCAAACGC CCAACCTTCA 001040
001041 GAAAGAGGAA GTCAGATAGA AATAGTCCCT GAGAGCACAC TGTGTAGCTA AGCCTGCTGG GGCTGGGTGA AGAAATTGGC 001120
001121 GCTGAGATCC AGGCTGGATC CATTGCTTTT GTTTACAATA GGCACTCTCT CTACCCCACC TCTCAGTACT TGAGACTTAA 001200
001201 AGTGCTACAG GCAGCTGGAT CTGTTTGCAT GCAGGATGAA GAGGGTTAAA ACACTGTTTA TATAAGATCC AATCTCTCAC 001280
001281 CATCTCTAAA GCAGCCGTTG GCCTGTCATC AGTGAGATAC AATCCAGTCT TCTCATGCAC GGGAACACAC ACACCCTGCG 001360
001361 TTTCTCCCTC CCAGGCTAGG AACCTCTCTG CCACCAAGGG CTGCCATCCA TCGCCTAGTA ACCACGGCAA CCCAACCTAC 001440
001441 TCTAAAACCA AACCAAAAAA ATAAAATAAC ACATCCTCTT TGCATGACAC ATTTTTTTTC TCCCCTTTTT GGTACACTTT 001520
001521 TTTTGAATGG TTTTCTAACA ACTTGAAGCA CAGGATCAAG GAATTAGGGT GGTCTACTTG AGGCAGATGG GATAGTAGCT 001600
001601 GGGAACTGTT CCCTTTCTGA TTAATTTCAG CAGCATCGGA ATATATTTGG AGCACACCCT AGTAACCTCT TGAGATTAAA 001680
001681 TTACATAGTC TTAATATTTC TGTTCCTCCA TGCAACTGAT GTTTGTTTTT TAAAGGGTAA GATGCTGCCT CCCAATGGGT 001760
001761 GATGCCATCT GACTGGTTTC CCCATGTCCT CCCATTCACC CATCTCTGCT CCCACCCTTG CCTGCCTCTA ACCCACCACT 001840
001841 GGCCAGCCCC CTTGCCCTAC TCTGGGCTGC TGAACACTGG TGCTGTGGTG GTTTTCAAGG TTAATTCCTA GGCTAACCGT 001920
001921 ATGGCCTATA GTTTAAAAGC ACATCTATGT TCACTGCCAC TCTGAAAAAG GGAATTATTT CTCAGTCTTT CAAGGCTTGA 002000
002001 GACTAATATA GGCCATTGTG ATTCAGGAAG AAACCCAAGG TTGGAGGGTG GGATGAGTAC CCTCTGAAAA AGGGAATTTG 002080
002081 CTGGTGAAAA GAGGCTGGAT CTTGTGGAAG ACTGTCTTGG ATGGGGAAGT ACTACCTGGA GATTTCAAAT TCACTTGGCC 002160
002161 TGCAAACAAC AGAGTTATCC GTATCTTCCA CATGTGAATG TCATTGCAAG GGTGACTCTA GACAAACTAC AAACCGATGG 002240
002241 ACCGTCAAGC TCCCCAGGAG CCCCTTGGAT GGCAGCGTTG CTTCAGAGTG TTTCCTGTTT CTGGAATTCC TTGTTAGGGA 002320
002321 ACTTTAAAGA AGAAAAGAAA AACTTGAATT GTGTTGAATT ACTGTATCTT TTACTTTTTT TTTTTTGAAA AGATAAACTT 002400
002401 GTAAATAGAG TGATTTGAAA TACTATATGG CAAAGTTTTA TATTTGATAT TCTTTAAGTT AGTTGCTCAC ACACTTAGGC 002480
002481 TTTGATTGCT GAAGAAGTAT GTTTAAGAGG GAGAGAGGGG AGGCAAAGCT GAAGAGAGTC AAGGTCACTG TCCCCGCTTC 002560
002561 GGCCTGAAGG AAAGAGAAGA CATTTCTATG GCCTTGCTCT CTGCTGTCCT GTTGGTGGGC ACGACACATC AGTGGTGTTC 002640
002641 AGTCTTTATG TGTTTTTAAG CATCCCTTGG GCTTTGGATT TGGAGATGGG AAGAGCATCT CCAGGCAATG AGTTTTTCAA 002720
002721 AGAATGCCTA CTTAGTAGTA AGATGAAGCT CAGGATTTAA ATAAGTGGGG TCAGGCATTC GAGTTTTTGT CTTTCTTCTC 002800
002801 AGGTGTATTT CTTGGTACCC CCAAGATATC AGGCCAGAAA GAGATGAGTC AGTTGCTGTG CTCTTTACTT CTTTTTCTCC 002880
002881 ACATCTTCTG AGGCTTTAGA AATGTGGACA AGCTAGTTTT CAAATTTTGT GTGCGTCTGT AAGTTCTTAA AGAACCAGCT 002960
002961 TCTTAGAATG TTCAGTTCTC AATGTGCTGC TGCTTTCCCT TCTCCTAAAC ATTTTAAAAC TCTTCCCTTT CACCTCCAAT 003040
003041 TCCCGTGATC CCAAAAGAAG AGGAAGACTC CAGGAGGGGT ATAGATTGTG CCGTCATAGC TTTACAGGTG GTTTTAAAGT 003120
003121 TAACAGGGGT TTGTCATGGT GATTCACTAC TCAGTTTATC AGCTCAAGGA TTATACAGCT CTTTTCCGGG AACTCACCCA 003200
003201 GGAGCAAGCG AGACACTACC ATTGAATCAG GGAATGAGAA TTAAGAATGG ACAGGACCAA GACAGAACTC AAGAAAGCCA 003280
003281 CTGGGGAAAA CTCGAGAAGA AAGGGAGTAT ACTAGTAGGT TAGATCTGTG AACCTGAGGA CAAGAAGACC TTGGGAAATG 003360
003361 GAGGCCTCAG GGGATGTGCA TTCACATACT ATTACGCTTC TCAAAGAGAG ACCAACATCA TGCTTTTAAC ACATTTGATG 003440
003441 AGGtttttta tttgtgtttt tgtttgtttt ttgagatgga gtctcactct gtggcccagg ctggagtgca gtggcgcaat 003520
003521 cttggctcac tgcaacctcc acctcccagg ttcaagtgat tctcctgtct cagcctccca agtagctggg actacaggca 003600
003601 tgagccatca cacccagcta gttttttgta tttttagtaa agatggggtt ttgccatgtt tgccaggctg atctcgaact 003680
003681 cctgacctca agtgatctgc ccacttcaga cccccaaagt gctgggattc caggtgtgag ccgctgcggc cgaccACATT 003760
003761 TGATGTTTGA AGTTGTAATC TGTCCCATCA TAAACTTACC TGGAGCTCAT GTGGAGGAAC AGAAGGCCAA GATCCTTGCT 003840
003841 TTGGGGGTGC CTCACGAAGC ATCCCTGTAG ACATTTGGCC CCAGCTTCAC TGCTTGGAAG CATGTCCCTC CCTCTTGAGT 003920
003921 TGGCTCTGAT TTGAAATCGG GAGAAACAGA GCTGCTGCCA ATGGGATCTT TTAGGTAACT CCCTCCCTAG CTTCCGTGTG 004000
004001 TCTGTGCAGT GCCCATGAGC TGCTGCCAAT GGGATCTTTC AGGTACCCCC TCCCCAGCTT CCCTGTGGCT GTGCGGTGCC 004080
004081 CTTGACAGAT GGCTTCTCTG TTTCCCTTTG CCCAGCCAGG CTCCCCTCCT TCCTATTAGC TACAAAACTG GATAAACTTC 004160
004161 AGAATATGAG CCAATGAGTA GGAAGGAACT TGAAGACTAA AGATTTTACT CTCTCCCCTA TCCATGCCCC CTACCTCTGA 004240
004241 CTCTCTCTGT GTGAACAGGA AACTTTAGGG CAGATGAGGA GAATGAATTG GTTATCAGAG TGGAAGACCA TGGCCCAGGA 004320
004321 TCCCTGAGCT TTCCCAGTAG CCTCCAGTTT CCTTTGTAAG ACCCAGGGAT CACTTAGCCA TAGCCTGAAT CTTTTAGGGG 004400
004401 TATTAAGGTC AGCCTCTCAC TCTTCCTTCA GGTTACTAAC AAAATTTCGT AGCTAAAGAA TGCCATggcc gggtgcagtg 004480
004481 gctcacgcct ataatcccag cactttggga ggccgaggcg ggcggatcac gaggtcagga gattgagacc atcctggcta 004560
004561 cgacggtgaa accccgtctc tactaaaaat acaaaaaatt agccgggtgt ggtggcgggc gcctgtagtc ccagctactc 004640
004641 tggaggctga ggcaggagaa tggcatgaac ccaggaggca gagattgcag tgagccaaga tcacgcccct gcactccagc 004720
004721 ctgggtgaca gagccagact ccgtctcaaa gg
[back to top]

Predicted Small Protein

Name NONHSAT104343_smProtein_3419:3670
Length 84
Molecular weight 9104.7172
Aromaticity 0.0963855421687
Instability index 48.2698795181
Isoelectric point 7.67376708984
Runs 12
Runs residual 0.0045515394913
Runs probability 0.045751633987
Amino acid sequence MLLTHLMRFFICVFVCFLRWSLTLWPRLECSGAILAHCNLHLPGSSDSPVSASQVAGTTG
MSHHTQLVFCIFSKDGVLPCLPG
Secondary structure LHHHHHHHHEEEEEEEEEEEELLLLLEEELLHHHHHHHHLLLLLLLLLLLLHHHHLLLLL
LLLLLEEEEEEEELLLLLLLLLL
PRMN LLLLLLHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL
LLLLLLLLLLLLLLLLLLLLLLL
PiMo ooooooTTTTTTTTTTTTTTTTTTiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiii