Using the NONCODE Database Resource

Li Xiyuan1, Bu Dechao2, Sun Liang3, Wu Yang4, Fang Shuangsang5, Li Hui5, Luo Haitao5, Luo Chunlong5, Fang Wenzheng5, Chen Runsheng6, Zhao Yi2

1 Beijing Zhongke Jingyun Technology Company Ltd., Medicine, Beijing; 2 Chinese Academy of Sciences, LuoYang Branch of Institute of Computing Technology, Beijing; 3 Wenzhou Medical University, College of Laboratory Medicine and Life Sciences, Department of Laboratory Medicine, Beijing; 4 Chinese Academy of Sciences, Institute of Computing Technology, Bioinformatics Research Group, Advanced Computing Research Laboratory, Beijing; 5 Institute of Computing Technology, Chinese Academy of Sciences, Bioinformatics Research Group, Advanced Computing Research Laboratory, Beijing; 6 Institute of Biophysics, Chinese Academy of Sciences, CAS Key Laboratory of RNA Biology, Beijing
Publication Name:  Current Protocols in Bioinformatics
Unit Number:  Unit 12.16
DOI:  10.1002/cpbi.25
Online Posting Date:  June, 2017
GO TO THE FULL TEXT: PDF or HTML at Wiley Online Library


NONCODE is a comprehensive database that aims to present the most complete collection and annotation of non‐coding RNAs, especially long non‐coding RNAs (lncRNAs), which makes it essential to modern biological and medical research. New lncRNA genes and lncRNA‐disease relationships are continually being identified from the flood of data that scientists produce. NONCODE assimilates such information from a wide variety of sources, including published articles, RNA‐seq data, microarray data, and databases on genetic variation (dbSNP) and genome‐wide association studies (GWAS). NONCODE organizes all this information and makes it freely available to the public via the Internet. This protocol provides step‐by‐step instructions on how to browse and search lncRNA information such as sequence, expression, and disease relationships; how to use the tools for functional prediction, species conservation assays, BLAST analysis, and identifier conversion; and, finally, how to submit sequences to identify lncRNA genes. As of December 2016, NONCODE has cataloged 487,851 lncRNA genes sequenced from 16 species. © 2017 by John Wiley & Sons, Inc.

Keywords: long non‐coding RNA; NONCODE database; information retrieval; species conservation; bioinformatics; functional annotation
