NONHSAT028143

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT028143

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

3084 nt

Genomic location

chr12+:50222326..50234937

Exon number

3

Exons

50222326..50222527,50222718..50222866,50232205..50234937

Genome context

Sequence
000001 AGATGTTCCC AGGACGCGAG CTAAGTCTGC CCGGCCGGCG CGCGCCGCTT CCGCTTTGGT GGGAAGGACC TCCCTTCCCC 000080
000081 CTTATAGAAA CACTTTCTCC AGGCCGGGCC GGGCTTGGTA AACCTCAGCC GTGCCTGGGT AGAGGACGCC GGGCTGGGCC 000160
000161 GAGCGGGGGT GGAATGAGGG CGTCTAGAAC AGAGAAAACT AAGATGGGGT CGTCCCCCCA GTAATCTGGA AAACAGTTAC 000240
000241 AGCAACAGGC TCCTCTCATC AGACGGCAGT GCTGGTCGTT TCCTCACTGG TTTGCGGGGA AGGCTGGGCA GAAAAAGGAT 000320
000321 GCAACCTTGT TCTCAACCAC TTGCTATCTC GTGATTCAGG GATTGGATGA GTCTCTATGG TTTGTTTTGC CCTGAAGAGC 000400
000401 AGAAGGCTTC TGTCCCAACT GGTGTTGCCA AAGCAACATA TTAATTCCAT GCCATGATCC TGGGTCAAGA TCTGCACAAT 000480
000481 CTGATTGGGC ATGTCACCTC GGATGGCAAG GGAGTGGAAG TGGTCAAAAT CATGGAGTCC CAGCTTTCGG AGACGCCTTG 000560
000561 CAGCTGCCCG GTAACACTTC CAGGGTTGGG GCTCCACAAG GAGGTAGTGG CAGAGGGAGG AAAGATGGGC CAGGAACTCC 000640
000641 CATAGGCCAT GGTCTCCATG ATTCAGATGA ATCCACATGG TTATTGACAT GCAGAAGCCA ATGTCAAAAA CTGAACGTCC 000720
000721 AAATTGGCTT AAGAAAGAGC TCAAGAGAAC CTTCCGGGTC CTTTGATTCA TGAAGTCCAG GGTGATAAAA GTCAAGGCAT 000800
000801 CAGGAAAAGG ACATTCTTTT TCGGCTCGCT TCACCAGGAC TGGATCTATG TCGCAGCAGA GGAGACGGAA TTCTCTTGAG 000880
000881 GCATCTGAGC AGGTTTCCCC GTCAGGTAGG GAGAGGAAGT GTTTGTATAG AGCCACACTC AGATCCTAGA GGAATCCAAA 000960
000961 ACAGCTTTGT CACTGCAGTT CTCAATCAGT GAATGGAAGA GAAATCTGTA TCATTCTCCC ACTGAGAATG CCACTTACCT 001040
001041 CTGGCTGTAT AGGATGTGAG GTTCCTAAAC CATAACCTAA AGAGGTTGTG CAAAAATTTT CTACAGAGTA GGCTCTCAGT 001120
001121 GCCCTCCAGG GGTCACATGC TGTAATGCAT CATCCCCATA TTTTCCTGAC ACTGATGATA GATTGGGTTG TATAGCTGAA 001200
001201 AGTTACAGCC AAATGGATGC AGCAAAGGAA CCTACAGCAG TATTTCCCAG CTTAAGTAAT ATGCTGATAA ACTCACTCTT 001280
001281 TTTCTGGGTA TCTCAAGACC CCAGGGCGGA AGAAAGGGGT CCAGAGGCAT AACCAGTGTC TCTTACCAGG AAAGTGAGGA 001360
001361 ATACAGCAAG CACAATCTTA CAAAAACCTG CTTCTCAGAC CCGCGGTGGG GGAAGTCTGG ATTCCAAAGT CCTCATTCAG 001440
001441 CCTTCACTAA AGCTTCAATT TTACTGTCTT AGGATTCACC ACCTCCTTCT GAGACATTCT GCTTCTATAA CCATCATTGC 001520
001521 TAAAAGTCAC TGCTAACAAC CTTCTAGACA AGTCCAATGA TTTGTGATTT TTACCACATT AGCACCTTTT TTACTCTTAA 001600
001601 GTCACCATCT CCTTTCTCTA ACTTTGAAtt cttttttttt ttgagacaaa gtctggttct gtcggccagg ctggagtgca 001680
001681 atggcgtgat ctcagcaacc tctgcctccc aggttcaagc gattctcctg cctcagcctc ctaagtagct gggattacag 001760
001761 gcatgcatca ccacactcag ctaatttttg tatttttatt agagacaggg tttcaccatg ttggccaggc tggtctttaa 001840
001841 ctgacctcaa gtgatccacc tgcctcggcc tcccaaagtg ctgggactat aggaatgagc cactgcacct ggccTCTTTG 001920
001921 ATTTCTATAA CCATTATAAC TAACTCAAAT TTGTGTTTTT CCACATAGTA TTTTCAAGGC ATTTTTCTGT TTACTACTTT 002000
002001 GTTTGCTTGG TGTAACAAGC ATGTGAGGAA AATATTTATT TATGCCTTTT CTGCAGGTGA AGAAAATGCA CAAtggaaac 002080
002081 cacccagatg tccagtgata gtggcatgga caaataaatg tggtataatc atgaagatag aatactaaat agcagttaaa 002160
002161 aaatgaaggc acccagcttg gtgcacaaca tggatgattc tcacaaatac ataatgttga gaaaaaaagc aagccataac 002240
002241 agaaaatata tgaaaagcat ccatttacat gaagctcaaa aatgggtgaa attaaagtac atcattcagg gaagaaaaac 002320
002321 agtacactat aaagaaaagc aaggaaacaa cagaaggcta gttgttacct ctgggtagag atagaggttt atgaacagga 002400
002401 aaaggcatat aggggaagtt gtggggtgct gaaaatgttT Attatttatt tatttattta tttttgagac aggatcttac 002480
002481 tctgtttccc aggctggagt acagtggtgt gatcactgca accttgactt cccaggctcc agagatcctc ctacttcagc 002560
002561 cttatgagta gctggaacca caggtgcaca ccaccatgcc cagctaattt ttgtattttt ggtagggatg agattttgcc 002640
002641 atgttgccca ggctgatctc aaactcccga gctcaagcaa tccaccagcg tcagtctctg aaaatgctgg gattacagcg 002720
002721 tcagccactg cacccagcAG ttttttgggt tttttgttgt tgttttattt gagacagggt ctcacttcgt cacccaggct 002800
002801 ggggtatgtg gcacaatcaa ggctcactgc agcctcgacc tcctgggttc aagcgatcct cctccctcag ctctctaagt 002880
002881 agctgggact acagatgccc accaccacac ctggctaatt ttttttagag atggggtttt gccatgttgc ccaggctggG 002960
002961 AAAGGAGTTT TTAAAATTGA AAAAAAAAAG CACCCataaa ttatggtaca tctatccaat gaaacattag gaagctgcta 003040
003041 aaaaaagaat gtggTGATAT AAATAATCTC AGAATTAAAA GCTA
[back to top]

Predicted Small Protein

Name NONHSAT028143_smProtein_2627:2881
Length 85
Molecular weight 8896.128
Aromaticity 0.0833333333333
Instability index 58.8810714286
Isoelectric point 10.020324707
Runs 11
Runs residual 0.0030330603579
Runs probability 0.0344140932376
Amino acid sequence MRFCHVAQADLKLPSSSNPPASVSENAGITASATAPSSFLGFLLLFYLRQGLTSSPRLGY
VAQSRLTAASTSWVQAILLPQLSK
Secondary structure LLEEEEEELLLLLLLLLLLLLLLLLLLLLEEEELLLHHHHHHHHHHHHHHLLLLLLLLLH
HHHHHHHHHLHHHHHHHHHHHHLL
PRMN LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHLLLLLLLLLL
LLLLLLLLLLLLLLLLLLLLLLLL
PiMo ooooooooooooooooooooooooooooooooTTTTTTTTTTTTTTTTTTiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiii