alexa Knowledge Mining of Disease Network can Provide New Insights in Cancer Research through Analysis of Other Diseases | OMICS International
ISSN: 2157-2518
Journal of Carcinogenesis & Mutagenesis

Like us on:

Make the best use of Scientific Research and information from our 700+ peer reviewed, Open Access Journals that operates with the help of 50,000+ Editorial Board Members and esteemed reviewers and 1000+ Scientific associations in Medical, Clinical, Pharmaceutical, Engineering, Technology and Management Fields.
Meet Inspiring Speakers and Experts at our 3000+ Global Conferenceseries Events with over 600+ Conferences, 1200+ Symposiums and 1200+ Workshops on
Medical, Pharma, Engineering, Science, Technology and Business

Knowledge Mining of Disease Network can Provide New Insights in Cancer Research through Analysis of Other Diseases

Matthew B. Carson, Cong Liu and Hui Lu*

Bioinformatics Program, Department of Bioengineering, University of Illinois at Chicago, Chicago, IL, 60612-7340, USA

*Corresponding Author:
Dr. Hui Lu
Bioinformatics Program, Department of Bioengineering
University of Illinois at Chicago, Chicago, IL, 60612-7340, USA
E-mail: [email protected]

Received date: March 28, 2012; Accepted date: March 29, 2012; Published date: March 31, 2012

Citation: Carson MB, Liu C, Lu H (2012) Knowledge Mining of Disease Network can Provide New Insights in Cancer Research through Analysis of Other Diseases. J Carcinogene Mutagene 3:e103. doi: 10.4172/2157-2518.1000e103

Copyright: © 2012 Carson MB, et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Visit for more related articles at Journal of Carcinogenesis & Mutagenesis


The word itself seems to hold a dark power over modern humankind. Nearly every one of us has come in close proximity to this life-threatening disease. In recent years, researchers have produced a body of work that has given us a clearer (albeit more complicated) picture of how cancer comes to be, how it develops, and how it can be treated. The roles of genetics (in the form of single nucleotide polymorphisms or SNPs) [1], epigenetics [2], miRNA [3], copy number variation [4], chromatin structure [5], and protein biomarkers [6] in cancer have been shown. While great scientific advances have been made in the understanding and treatment of this disease in the last 50 years, we still do not have a clear understanding of the ‘how’ and ‘why’. Given a set of initial conditions in the body defined by genetics, lifestyle, environmental exposure, etc., cancer begins and proceeds to develop through an evolutionary process. This results in all cancers having unique characteristics [7]. Clearly, cancer is a multidimensional problem for which we have an enormous amount of data now. Gaining knowledge from the existing data, however, is a nontrivial task.

In recent years, bioinformatics and computational biology have made a variety of contributions to disease analysis using existing data in an attempt to increase our understanding of many diseases. Popular topics include the discovery, prediction, and analysis of genes related to disease [8], statistical analysis of SNPs and disease [9], the prediction and discovery of new drug targets [10], the development of the disease ontology and its application to the human genome [11,12], the analysis of protein-protein interaction networks as they relate to disease [13], and many others. Of particular interest is the development of ‘disease networks’ [14,15], which are in most cases bipartite graphs describing disease-disease as well as disease-gene relationships. In the projection of the disease-gene network that describes disease-disease relationships (Figure 1), nodes indicate diseases and the edge between two nodes represents how these diseases are related. These edges may signify one or more shared genes, metabolic pathways, miRNAs, or a number of other data types. The disease network reveals the interconnected nature of various diseases, which begs the question; can we gain new knowledge of a disease such as cancer by studying ‘connected’, noncancer diseases? Many diseases including obesity [16,17], various infections [18], diabetes [19], and possibly even psychological stress [20] have been reported some relationship to cancer. Often the relationship type is unknown or partially known, which indicates that a deeper understanding of these relationships is needed. However, those relationships have not been explored as a whole, but rather as individual links.


Figure 1: A small example of a projection of the disease-gene bipartite graph that describes disease-disease relationships. Nodes indicate diseases; edges between nodes represent disease relationships. Edges may signify one or more genes, metabolic pathways, miRNAs, or a number of other data types.

Due to the complicated nature of many diseases, which may involve the failure of multiple levels of biological function including DNA repair, gene regulation, epigenetic and histone modifications, metabolic pathways etc., elucidation of disease relationships requires a systematic and computational solution. Though there may be a plethora of data available to quantify this disease problem, the data itself does nothing for us if we cannot turn that data into knowledge (a similar problem arose after the sequencing of the human genome). Merely combining sources of data is not sufficient. We must identify patterns within the data, which is manually infeasible when the number of data points and characteristics to be compared is large. Clearer understanding could be gained by finding, among all attributes of a relationship, those that characterize it most accurately. Several existing machine learning algorithms can help achieve this including multiple instance learning [21], positive/unlabeled (PU) learning [22], Bayesian inference [23], the alternating decision tree, or ADTree [24], and others. In the past we have used the ADTree algorithm to analyze methylation patterns on DNA [25] and to predict DNA-binding proteins [26]. In both cases, this algorithm helped us to understand what characteristics have the most influence on determining the class to which the examples belonged. A similar method of ‘rule discovery’ is needed in the case of the disease

network. Of course, the rules may be heavily dependent upon the types of disease in question (i.e. metabolic, infectious, autoimmune and genetic). By analyzing a combination of available genetic, epigenetic, and proteomic data, one will be able to use these algorithms to enrich the edges between cancer and other diseases in the disease network, as well as to predict new edges within disease clusters.

The key to understanding the disease network is to enrich the value of existing edges and to infer new ones based on this enriched value. There is a wealth of information concerning diseases, metabolism, gene ontology, drug targets, miRNA, protein-protein interaction, gene regulation, and gene expression. Unfortunately, there are large areas of missing and overlapping data as well as many false positives and even more false negatives. This makes it difficult to assemble the puzzle and gain knowledge. One can use algorithms such as ADTree which can filter through noisy data to find the most informative and conserved characteristics of a disease-disease relationship. Cancer A and noncancer disease B, though they may not share a causal gene(s) according to OMIM, but may be related at some distance through a common metabolic pathway, co-regulating transcription factor, or negative regulation by one or more miRNAs. Any of these three could be a false positive association. When analyzed together along with other available data, however, a more complete biological process comes into focus and the noise problem can be mitigated. The ADTree allows us to easily visualize which biological processes contribute most to the disease relationship, eliminating the ‘black box’ effect of many machine learning algorithms.

Overall, we believe cancer is both unique and related to other diseases. Study of all diseases as a network system can generate many interesting results. For example; drug of related non-cancer diseases may help treat the side effects of cancer drugs; the complex relationship between bacteria and cancer: bacteria can be both beneficial and cancer-causing, can provide new ideas about cancer treatment; mechanisms and tissue-specificity of non-cancer diseases may prime the cellular environment for metastasis. We expect in the near future, with enormous genotype and phenotype data available for all diseases, there will be a novel view point for cancer research that will emerge from the disease network study.


Select your language of interest to view the total content in your interested language
Post your comment

Share This Article

Article Usage

  • Total views: 11978
  • [From(publication date):
    June-2012 - Jun 22, 2018]
  • Breakdown by view type
  • HTML page views : 8188
  • PDF downloads : 3790

Post your comment

captcha   Reload  Can't read the image? click here to refresh

Peer Reviewed Journals
Make the best use of Scientific Research and information from our 700 + peer reviewed, Open Access Journals
International Conferences 2018-19
Meet Inspiring Speakers and Experts at our 3000+ Global Annual Meetings

Contact Us

Agri & Aquaculture Journals

Dr. Krish

[email protected]

+1-702-714-7001Extn: 9040

Biochemistry Journals

Datta A

[email protected]

1-702-714-7001Extn: 9037

Business & Management Journals


[email protected]

1-702-714-7001Extn: 9042

Chemistry Journals

Gabriel Shaw

[email protected]

1-702-714-7001Extn: 9040

Clinical Journals

Datta A

[email protected]

1-702-714-7001Extn: 9037

Engineering Journals

James Franklin

[email protected]

1-702-714-7001Extn: 9042

Food & Nutrition Journals

Katie Wilson

[email protected]

1-702-714-7001Extn: 9042

General Science

Andrea Jason

[email protected]

1-702-714-7001Extn: 9043

Genetics & Molecular Biology Journals

Anna Melissa

[email protected]

1-702-714-7001Extn: 9006

Immunology & Microbiology Journals

David Gorantl

[email protected]

1-702-714-7001Extn: 9014

Materials Science Journals

Rachle Green

[email protected]

1-702-714-7001Extn: 9039

Nursing & Health Care Journals

Stephanie Skinner

[email protected]

1-702-714-7001Extn: 9039

Medical Journals

Nimmi Anna

[email protected]

1-702-714-7001Extn: 9038

Neuroscience & Psychology Journals

Nathan T

[email protected]

1-702-714-7001Extn: 9041

Pharmaceutical Sciences Journals

Ann Jose

[email protected]

1-702-714-7001Extn: 9007

Social & Political Science Journals

Steve Harry

[email protected]

1-702-714-7001Extn: 9042

© 2008- 2018 OMICS International - Open Access Publisher. Best viewed in Mozilla Firefox | Google Chrome | Above IE 7.0 version
Leave Your Message 24x7