Main Page

From LncRNAWiki
Revision as of 03:46, 29 June 2016 by Lina Ma (talk | contribs)

Welcome to LncRNAWiki
105,257 human long non-coding RNAs inside

LncRNAWiki is a wiki-based, publicly editable and open-content platform for community curation of human long non-coding RNAs (lncRNAs); in other words, a community-curated lncRNA knowledgebase. Unlike conventional biological databases based on expert curation, LncRNAWiki harnesses collective intelligence to collect, edit and annotate information about lncRNAs, quantifies users' contributions to each annotated lncRNA, and provides explicit authorship to encourage broader participation from the scientific community.

Imagine a world in which every single person on the planet has free access to the sum of all human knowledge. — Jimmy Wales, Founder of Wikipedia

  • Small proteins: Considering the potential functional significance of lncRNA-encoded small proteins (Anderson, D.M. et al. Cell, 2015; Nelson, B.R. et al. Science, 2016), we developed computational approaches to automatically identify small proteins in 105,255 human lncRNAs. We obtained 9,304 small proteins based on protein instability, secondary structure and transmembrane helix. Among these, 2,213 transmembrane proteins are considered high-confidence. Accordingly, LncRNAWiki integrates all putative small proteins together with their physical and chemical parameters, secondary structures, transmembrane helices and topology structures (lncRNAs and small proteins, help). All information on the predicted small proteins is also available for download (see Downloads).
  • BLAST lncRNAs: If a lncRNA is newly reported in a published paper but a BLAST search does not find it in LncRNAWiki, please consider creating a new page for this lncRNA to obtain its authorship in LncRNAWiki. See here for help.
  • Community curation: Community curation efforts are quantified and rewarded with explicit authorship. Please share your expertise and curate the lncRNAs of your interest to earn authorship for them.
Data sources
In version 1, we integrated lncRNA sequences and annotation information from three data sources: GENCODE (version 19; 23,898 human lncRNA transcripts), NONCODE (version 4.0; 95,135 human lncRNA transcripts), and LNCipedia (version 2.1; 32,181 human lncRNA transcripts). After eliminating errors and redundancy, we obtained 105,255 non-redundant lncRNA transcripts. To examine how many lncRNAs have been functionally annotated, we ran BLAST searches of the 105,255 lncRNAs against the lncRNA sequences in lncRNAdb (indicated by the purple circle; downloaded on 21 July 2014; 223 lncRNAs in total) and found that only about 100 human lncRNAs have been functionally annotated to date, indicating that human lncRNA annotation will be a long-term process.
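
The redundancy elimination step can be illustrated with a minimal sketch. The page does not state how redundancy was defined, so this example assumes the simplest criterion, exact sequence identity after normalization; the function names and the toy records are hypothetical and are not taken from the LncRNAWiki pipeline.

```python
from typing import Dict, List, Tuple

Record = Tuple[str, str]  # (transcript ID, sequence)


def normalize(seq: str) -> str:
    """Remove whitespace, uppercase, and write RNA 'U' as DNA 'T'."""
    return "".join(seq.split()).upper().replace("U", "T")


def merge_non_redundant(sources: Dict[str, List[Record]]) -> Dict[str, List[str]]:
    """Merge transcripts from several sources, keeping one entry per sequence.

    Assumption: two transcripts are redundant only if their normalized
    sequences are identical. Returns a mapping from each unique sequence
    to the list of 'source:id' labels that share it.
    """
    merged: Dict[str, List[str]] = {}
    for source_name, records in sources.items():
        for transcript_id, seq in records:
            key = normalize(seq)
            if not key:  # skip empty (erroneous) records
                continue
            merged.setdefault(key, []).append(f"{source_name}:{transcript_id}")
    return merged


# Toy data only; real inputs would be parsed from each source's FASTA file.
toy_sources = {
    "GENCODE": [("tx1", "ACGUACGU"), ("tx2", "GGGCCC")],
    "NONCODE": [("n1", "acgtacgt"), ("n2", "TTTAAA")],
    "LNCipedia": [("l1", "GGG CCC"), ("l2", "")],
}
merged = merge_non_redundant(toy_sources)
for sequence, labels in merged.items():
    print(sequence, labels)
```

Keeping the list of source labels for each unique sequence preserves the provenance of every merged transcript. A real pipeline would likely also need to handle near-identical sequences, which this sketch does not attempt.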

The majority of the lncRNA information was initially seeded with a subset of information from GENCODE, NONCODE, LNCipedia, and lncRNAdb.

In the newly updated version, 633 lncRNAs with publications were added and curated. These lncRNAs were mainly obtained from HGNC. In total, the database currently contains 719 such lncRNAs.


Because the genomic context of lncRNAs may offer insights into their function, classifying lncRNAs by genomic location is of great biological significance for in-depth mining and analysis (Ma et al. 2013, RNA Biology).
We classified lncRNAs into seven groups (Intergenic, Intronic (S), Intronic (AS), Overlapping (S), Overlapping (AS), Sense and Antisense) based on their genomic location with respect to protein-coding genes. Help

About Us
Our group works in the field of Computational Biology and Bioinformatics (CBB), currently with a particular focus on building biological knowledge wikis in aid of community curation of massive biological knowledge.