NONHSAT022111

From LncRNAWiki
Revision as of 11:58, 6 October 2014 by 111.204.253.116 (talk)
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT022111

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

3756 nt

Genomic location

chr11+:65190269..65194003

Exon number

1

Exons

65190269..65194003

Genome context

Sequence
000001 GGAGTTAGCG ACAGGGAGGG ATGCGCGCCT GGGTGTAGTT GTGGGGGAGG AAGTGGCTAG CTCAGGGCTT CAGGGGACAG 000080
000081 ACAGGGAGAG ATGACTGAGT TAGATGAGAC GAGGGGGCGG GCTGGGGGTG CGAGAAGGAA GCTTGGCAAG GAGACTAGGT 000160
000161 CTAGGGGGAC CACAGTGGGG CAGGCTGCAT GGAAAATATC CGCAGGGTCC CCCAGGCAGA ACAGCCACGC TCCAGGCCAG 000240
000241 GCTGTCCCTA CTGCCTGGTG GAGGGGGAAC TTGACCTCTG GGAGGGCGCC GCTCTTGCAT AGCTGAGCGA GCCCGGGTGC 000320
000321 GCTGGTCTGT GTGGAAGGAG GAAGGCAGGG AGAGGTAGAA GGGGTGGAGG AGTCAGGAGG AATAGGCCGC AGCAGCCCTG 000400
000401 GAAATGATCA GGAAGGCAGG CAGTGGGTGC AGGGCTGCAG GAGGGCCGGG AGGGCTAATC TTCAACTTGT CCATGCCAGC 000480
000481 AGCCCCTTTT TTTCCAGACC AAGGGCTGTG AACCCGCCTG GGGATGAGGC CTGGTCTTGT GGAACTGAAC TTAGCTCGAC 000560
000561 GGGGCTGACC GCTCTGGCCC AGGGTGGTAT GTAATTTTCG CTCGGCCTGG GACGGGGCCC AGGCCGGGCC CAGCCTGGTG 000640
000641 GAGCGTCCAG GTCTGGGTGC GAAGCCAGGC CCCTGGGCGG AGGTGAGGGG TGGTCTGAGG AGTGATGTGG AGTTAAGGCG 000720
000721 CCATCCTCAC CGGTGACTGG TGCGGCACCT AGCATGTTTG ACAGGCGGGG ACTGCGAGGC ACGCTGCTCG GGTGTTGGGG 000800
000801 ACAACATTGA CCAACGCTTT ATTTTCCAGG TGGCAGTGCT CCTTTTGGAC TTTTCTCTAG GTTTGGCGCT AAACTCTTCT 000880
000881 TGTGAGCTCA CTCCACCCCT TCTTCCTCCC TTTAACTTAT CCATTCACTT AAAACATTAC CTGGTCATCT GGTAAGCCCG 000960
000961 GGACAGTAAG CCGAGTGGCT GTTGGAGTCG GTATTGTTGG TAATGGTGGA GGAAGAGAGG CCTTCCCGCT GAGGCTGGGG 001040
001041 TGGGGCGGAT CGGTGTTGCT TGCCTGCAGA GAGGGTGGGG AGTGAATGTG CACCCTTGGG TGGGCCTGCA GCCATCCAGC 001120
001121 TGAAAGTTAC AAAAATGCTT CATGGACCGT GGTTTGTTAC TATAGTGTTC CTCATGGCGA GCAGATGGAA CCGGGAGACA 001200
001201 TGGAGTCCCT GGCCAGTGTG AGTCCTAGCA TTGCAGGAGG GGAGACCCTG GAGGAGAGAG CCCGCCTCAA TTGATGCCTG 001280
001281 CAGATTGAAT TTCCAGAGGC TTAGGAGGAG GAAGTTCTCC AATGTTCTGT TTCCAGGCCT TGCTCAGGAA GCCCTGTATT 001360
001361 CAGGAGGCTA CCATTTAAAG TTTGCAGATG AGCTTATGGG GGGCAATCTT AAAAAGTCCA CAGCAGATGC ATCCGGCTCG 001440
001441 AGGGGCCATC AGCTTTGAAT AAATGCTTGT TCCAGAGCCC ATGAATGCCA GCAGGCACCC CTCCTTTCCT GGGGTAAAGG 001520
001521 TTTTCAGATG CTGCATCTTC TAAATTGAGC CTCCGGTCAT ACTAGTTTTG TGCTTGGAAC CTTGCTTCAA GAAGATCCCT 001600
001601 AAGCTGTAGA ACATTTTAAC GTTGATGCCA CAACGCAGAT TGATGCCTTG TAGATGGAGC TTGCAGATGG AGCCCCGTGA 001680
001681 CCTCTCACCT ACCCACCTGT TTGCCTGCCT TCTTGTGCGT TTCTCGGAGA AGTTCTTAGC CTGATGAAAT AACTTGGGGC 001760
001761 GTTGAAGAGC TGTTTAATTT TAAATGCCTT AGACTGGGGA TATATTAGAG GAAGCAGATT GTCAAATTAA GGGTGTCATT 001840
001841 GTGTTGTGCT AAACGCTGGG AGGGTACAAG TTGGTCATTC CTAAATCTGT GTGTGAGAAA TGGCAGGTCT AGTTTGGGCA 001920
001921 TTGTGATTGC ATTGCAGATT ACTAGGAGAA GGGAATGGTG GGTACACCGG TAGTGCTCTT TTGTTCTTGC TTCGTTTTTT 002000
002001 TAAACTTGAA CTTTACTTCG TTAGATTTCA TAATACTTTC TTGGCATTCT AGTAAGAGGA CCCTGAGGTG GGAGTTGTGG 002080
002081 GGGACGGGGA GAAGGGGACA GCTTGGCACC GGTCCCGTGG GCGTTGCAGT GTGGGGGATG GGGGTATGCA GCTTGGCACT 002160
002161 GGTACTGGGA GGGATGAGGG TGAAGAAGGG GAGAGGGTTG GTTAGAGATA CAGTGTGGGT GGTGGGGGTG GTAGGAAATG 002240
002241 CAGGTTGAAG GGAATTCTCT GGGGCTTTGG GGAATTTAGT GCGTGGGTGA GCCAAGAAAA TACTAATTAA TAATAGTAAG 002320
002321 TTGTTAGTGT TGGTTAAGTT GTTGCTTGGA AGTGAGAAGT TGCTTAGAAA CTTTCCAAAG TGCTTAGAAC TTTAAGTGCA 002400
002401 AACAGACAAA CTAACAAACA AAAATTGTTT TGCTTTGCTA CAAGGTGGGG AAGACTGAAG AAGTGTTAAC TGAAAACAGG 002480
002481 TGACACAGAG TCACCAGTTT TCCGAGAACC AAAGGGAGGG GTGTGTGATG CCATCTCACA GGCAGGGGAA ATGTCTTTAC 002560
002561 CAGCTTCCTC CTGGTGGCCA AGACAGCCTG TTTCAGAGGG TTGTTTTGTT TGGGGTGTGG GTGTTATCAA GTGAATTAGT 002640
002641 CACTTGAAAG ATGGGCGTCA GACTTGCATA CGCAGCAGAT CAGCATCCTT CGCTGCCCCT TAGCAACTTA GGTGGTTGAT 002720
002721 TTGAAACTGT GAAGGTGTGA TTTTTTCAGG AGCTGGAAGT CTTAGAAAAG CCTTGTAAAT GCCTATATTG TGGGCTTTTA 002800
002801 ACGTATTTAA GGGACCACTT AAGACGAGAT TAGATGGGCT CTTCTGGATT TGTTCCTCAT TTGTCACAGG TGTCTTGTGA 002880
002881 TTGAAAATCA TGAGCGAAGT GAAATTGCAT TGAATTTCAA GGGAATTTAG TATGTAAATC GTGCCTTAGA AACACATCTG 002960
002961 TTGTCTTTTC TGTGTTTGGT CGATATTAAT AATGGCAAAA TTTTTGCCTA TCTAGTATCT TCAAATTGTA GTCTTTGTAA 003040
003041 CAACCAAATA ACCTTTTGTG GTCACTGTAA AATTAATATT TGGTAGACAG AATCCATGTA CCTTTGCTAA GGTTAGAATG 003120
003121 AATAATTTAT TGTATTTTTA ATTTGAATGT TTGTGCTTTT TAAATGAGCC AAGACTAGAG GGGAAACTAT CACCTAAAAT 003200
003201 CAGTTTGGAA AACAAGACCT AAAAAGGGAA GGGGATGGGG ATTGTGGGGA GAGAGTGGGC GAGGTGCCTT TACTACATGT 003280
003281 GTGATCTGAA AACCCTGCTT GGTTCTGAGC TGCGTCTATT GAATTGGTAA AGTAATACCA ATGGCTTTTT ATCATTTCCT 003360
003361 TCTTCCCTTT AAGTTTCACT TGAAATTTTA AAAATCATGG TTATTTTTAT CGTTGGGATC TTTCTGTCTT CTGGGTTCCA 003440
003441 TTTTTTAAAT GTTTAAAAAT ATGTTGACAT GGTAGTTCAG TTCTTAACCA ATGACTTGGG GATGATGCAA ACAATTACTG 003520
003521 TCGTTGGGAT TTAGAGTGTA TTAGTCACGC ATGTATGGGG AAGTAGTCTC GGGTATGCTG TTGTGAAATT GAAACTGTAA 003600
003601 AAGTAGATGG TTGAAAGTAC TGGTATGTTG CTCTGTATGG TAAGAACTAA TTCTGTTACG TCATGTACAT AATTACTAAT 003680
003681 CACTTTTCTT CCCCTTTACA GCACAAATAA AGTTTGAGTT CTAAACTCAT TAGAAAAAAA AAAAAAAAAA AAAAAA
[back to top]


Annotation (From lncRNAdb)

[NONHSAT022111]
Section Description
ID NEAT1
Characteristics Two isoforms: ~3.7 (NEAT1 v1/ MEN epsilon) and ~23 kb (NEAT1 v2/ MEN beta) transcribed from the MEN1 locus (Guru (1997), Saha (2006), Hutchinson (2007), Sunwoo (2009)). Intergenic, single exon transcript. One transcriptional start site, long isoform is a continuation of the short isoform (Hutchinson (2007)).
Expression Expressed in a wide range of human and mouse tissues (Peyman (1999), Saha (2006), Hutchinson (2007), Sunwoo (2009)). Although expression of the long isoform (Neat1 v2/ Men beta) is much lower than Neat1 v1/ Men epsilon in many tissues (Nakagawa (2011)). Up-regulation of Neat1 (and subsequent formation of/increase in paraspeckles) may be a general feature of differentiation. Neat1 is up-regulated upon human ESC differentiation (Chen (2009)), muscle differentiation (Sunwoo (2009)) and in-vitro neuronal differentiation (Mercer (2010)). Up-regulated upon infection of mice with Japanese encephalitis virus or Rabies virus (Saha (2006)). Expressed in the nucleus accumbens of normal human brains and upregulated in this brain region in heroin abusers (Michelhaugh (2010)). In addition, expression levels are elevated in the caudate nucleus of patients suffering from the neurodegenerative condition, Huntington's disease (Johnson (2012)). Localised to paraspeckles within the nucleus (Hutchinson (2007), Sunwoo (2009)). Mouse Neat1 found to be highly unstable (half-life <2 hr) in N2A (neuroblastoma) and 3T3 cells, although low stability did not prevent paraspeckle formation (Friedel (2009), Clark (2012)), while human NEAT1 appears to be more stable (Friedel (2009), Sasaki (2009), Sunwoo (2009)). The two Neat1 isoforms also have differing stability, with the long (v2/Men beta) isoform more stable than than the shorter (v1/Men epsilon) isoform in N2A and 3T3 cells (Clark (2012)).
Function Transcription of Neat1 is essential for the assembly, maintenance and structural integrity of paraspeckles (Clemson (2009), Sasaki (2009), Sunwoo (2009), Mao (2011)). Newly transcribed Neat1 RNA nucleates the formation of paraspeckles at its transcription site, recruiting paraspeckle proteins. Continued expression is required to maintain paraspeckles, identifying the dynamic nature of this nuclear body (Mao (2011)). Literature currently provides conflicting evidence about whether Neat1 v1 is sufficient for the formation of paraspeckles, or if this requires full length Neat1 v2 (Sasaki (2009), Shevtsov (2011)). In vivo however, significant expression of Neat1 v2 appears to be required for cells to form more than 1 or 2 paraspeckles (Nakagawa (2011)). Neat1 appears to constrain the size and mediate the cylindrical geometry of paraspeckles. Neat1 short and the identical 5' end of Neat1 long plus the 3' of Neat1 long all localise to the periphery of paraspeckles, while the central region of Neat1 long is found in the interior of the nuclear body (Souquere (2010)). Interacts with paraspeckles proteins PSPC1, PSF/SFPQ and NONO/P54NRB either directly or in a complex (Clemson (2009), Sasaki (2009), Sunwoo (2009)). Interaction of short isoform with p54 requires protein interaction regions near the 5' and 3' of the transcript (Murthy (2010)). Neat1 knockout lead to loss of paraspeckles, however mice were viable and fertile with normal morphology and populations of differentiated cells in one Neat1 v1 and v2 expressing tissue tested (Nakagawa (2011)). Neat1 has also been reported to interact with a number of chromatin binding protein/complexes in mouse embryonic stem cells including PRC1, PRC2, JARID1B, ESET and SUV39H1, with the general pattern being interaction with repressors of gene expression (Guttman (2011))
Conservation Placental mammals. Dog Neat1 is found in several genomic locations, version that is found near single dog Malat1 gene is given below (Hutchinson (2007)).
Misc A conserved tRNA like sequence at the 3' end is cleaved off and processed to generate a short tRNA-like ncRNA from the long (Men beta) transcript, similar to Malat1 (Sunwoo (2009)). The last ~500bp of the short (Men epsilon) isoform has been shown to be expressed in trophoblast cells with a function potentially independent from the Neat1 transcript. This transcript, TncRNA has its own entry.
Name Neat1: Nuclear enriched abundant transcript 1. Short isoform - Neat1 v1. Long isoform - Neat1 v2. Multiple endocrine neoplasia 1 (MEN1). Short isoform - MEN epsilon. Long isoform - MEN beta. VINC: Virus-inducible ncRNA

[back to top]