ISSN: 0974-276X
Journal of Proteomics & Bioinformatics
Like us on:
Make the best use of Scientific Research and information from our 700+ peer reviewed, Open Access Journals that operates with the help of 50,000+ Editorial Board Members and esteemed reviewers and 1000+ Scientific associations in Medical, Clinical, Pharmaceutical, Engineering, Technology and Management Fields.
 
Meet Inspiring Speakers and Experts at our 3000+ Global Conferenceseries Events with over 600+ Conferences, 1200+ Symposiums and 1200+ Workshops on
Medical, Pharma, Engineering, Science, Technology and Business

DataBiNS-Viz: A Web-Based Tool for Visualization of Non-Synonymous SNP Data

Fong Chun Chan1, Edward A. Kawas1, Mark D. Wilkinson1,2 and Scott J. Tebbutt1,3*
1The James Hogg iCAPTURE Centre for Cardiovascular and Pulmonary Research
2Department of Medical Genetics
3Department of Medicine, Division of Respiratory Medicine; University of British Columbia, Providence Heart + Lung Institute, St. Paul’s Hospital, Vancouver, BC, V6Z 1Y6, Canada
Corresponding Author : Dr. Scott J. Tebbutt
The James Hogg iCAPTURE Centre for Cardiovascular and Pulmonary Research,
Phone: 604-682-2344 ext. 63051
Fax: 604-806-9274
E-mail : stebbutt@mrl.ubc.ca
Received June 26, 2008; Accepted July 16, 2008; Published July 17, 2008
Citation: Fong CC, Edward AK, Mark DW, Scott JT (2008). DataBiNS-Viz: A Web-Based Tool for Visualization of Non-Synonymous SNP Data. J Proteomics Bioinform 1:233-236. doi: 10.4172/jpb.1000029
Copyright: © 2008 Fong CC, et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Related article at
DownloadPubmed DownloadScholar Google

Visit for more related articles at Journal of Proteomics & Bioinformatics

Abstract

Here we describe DataBiNS-Viz – a visualization and exploration environment for non-synonymous coding single nucleotide polymorphisms (nsSNPs) data gathered by the BioMoby-based DataBiNS workflow. DataBiNSViz enables execution of the DataBiNS workflow on proteins described by KEGG, PubMed, or OMIM identifiers, followed by manual exploration of the integrated structure/function and pathway data for those proteins, with a particular focus on nsSNP data in-context. The tool can be freely accessed at http://bioinfo.icapture.ubc.ca:8090/ DataBiNS (please use the Firefox or Safari web browsers). Examples of the retrieved data are given under the “Help on inputs” option. Detailed documentation can be accessed at http://bioinfo.icapture.ubc.ca/mywiki/ DataBiNS.

Keywords
Bioinformatics; Web services; Data mining; Visualization; Genomics; Single nucleotide polymorphisms
Introduction
Single nucleotide polymorphisms (SNPs) are single base mutations in a genomic sequence that occur at a frequency greater than 1% in a defined population. Codons are sets of three DNA bases in a gene sequence that code for a particular amino acid. Non-synonymous SNPs (nsSNPs) are SNPs that occur within codons and that change the encoded amino acid, sometimes ultimately affecting the protein that is constructed from the gene blueprint. nsSNPs are of great interest to researchers as they may be key to identifying and understanding various human disease susceptibilities, as well as disease and non-disease phenotypes in many other species.
In silico analysis of the potential biological impact of nsSNPs requires integration of data and knowledge from various Web-based resources, both databases and analytical tools. Manual retrieval and integration of this information is error-prone and tedious. This provided the motivation for the original DataBiNS - data-mining workflow (Song et al., 2007) for the BioMOBY (Wilkinson and Links, 2002) and Taverna (Oinn et al., 2004) environments which retrieved and integrated data relating to nsSNPs and the biological pathways affected by them. DataBiNS consumes Kyoto Encyclopedia of Genes and Genomes [KEGG] Pathway Identifiers (Kanehisa et al., 2006), and retrieves a list of publications, gene ontology annotations and nsSNP information for each gene involved in the pathway. Although the public DataBiNS workflow successfully retrieved and integrated these data, lack of a visualization tool for the output significantly limited its utility. We report here important extensions to the original DataBiNS workflow and environment, including retrieval of additional nsSNP data such as mapping of SNPs to their altered amino acids on a 3D protein structure, as well as easy to navigate web-based visualizations of the global DataBiNS output.
Workflow Initialization
To facilitate interoperability between the various Web resources, the workflow extensions we report here continue to be provided through the BioMoby Web Services framework. Rather than being limited to a single KEGG identifier, the new services allow for different types of identifiers to be used to initialize the workflow, including:
1. KEGG gene (http://www.genome.jp/kegg/)
2. PubMed (http://www.ncbi.nlm.nih.gov/pubmed/)
3.OMIM - Online Mendelian Inheritance in Man (http://www.ncbi.nlm.nih.gov/sites/entrez?db=omim)
4.UniGene(http://www.ncbi.nlm.nih.gov/sites/ entrez?db=unigene)
5. UniProt (http://www.pir.uniprot.org/)
6. GenBank (http://www.ncbi.nlm.nih.gov/Genbank/)
7. NCBI-GI
To initiate searches on multiple KEGG genes simultaneously, a comma can be placed between the different identifiers (e.g., hsa:7097, hsa:7098)
Extensions to Retrieved Data
Once the workflow has been initialized, the workflow first visits KEGG, PDB (http://www.rcsb.org/pdb/home/ home.do), SwissProt (http://www.expasy.ch/sprot/), and Entrez (http://www.ncbi.nlm.nih.gov/Entrez/), to find the corresponding gene id(s) corresponding to the input identifier. Once retrieved, the LS-SNP database (Karchin et al., 2005) is initialized with the corresponding SwissProt id to find all the nsSNPs for the gene. The PDB id is then used on the coliSNP (http://yayoi.kansai.jaea.go.jp/colisnp/) database (Kono et al., 2008) to retrieve the 3D structure of the protein (if available). The various SNPs associated with this gene are already mapped onto this protein structure (within coliSNP), providing an efficient technique to analyze the location of SNPs on the protein. Supplementing the SNP information are frequency pie-charts of each SNP id from the HapMap (http://www.hapmap.org/) database (Thorisson et al., 2005). Detailed annotations about the gene are retrieved from the Gene Ontology (http://www.geneontology.org/) website, and finally the most recently relevant publications to the gene are retrieved from PubMed.
Web-Based Visualization
Rather than being limited to the default Taverna nestedfolder browsing, or export of the data from Taverna as an Excel spreadsheet, both of which are problematic for manual exploration of these complex data networks, we have created a task-specific Web-based visualization and exploration environment for DataBiNS. The application is built using the Java Platform, Enterprise (J2EE) and is accessed by end-users through an intuitive Web page. The user simply enters an identifier of interest (i.e., KEGG PATHWAY, OMIM, etc.), and then presses the “Execute Workflow” button. In the backend, the Taverna workflow execution engine is triggered to execute the modified DataBiNS workflow. The results of the workflow are then cached on the server to allow rapid browsing of results, and are browsable via the Web interface (Figure 1). There is an option on the front page to re-execute the workflow, where the tool will ignore any saved results and retrieve new, possibly updated data.
In addition to displaying all the results using standard Web technologies, two navigation tools/methods have been added to the web-application to help with the study of the data. First, a “search publication abstract” option allows for users to quickly search the retrieved publications for keywords. If a retrieved publication has the keyword, the publication will be highlighted allowing the user to focus on that publication. The PubCloud application has also been integrated into the web-application. The user can select a group of the retrieved publications and quickly use the PubCloud keyword tag-cloud visualization system to find possible correlations between the publications.
In a significant advance over prior exploration/browsing environments, the Web interface intuitively associates multiple inputs with their respective outputs. Thus the Webapplication displays all data about each gene in a discrete section of the browser window; on a given results page there can be several genes and each gene will have its associated information clearly and intuitively organized and displayed. This approach eliminates the user’s need to backtrack through the results to correlate inputs to outputs, as was required in earlier versions of DataBiNS, thus allowing them to quickly analyze and use the retrieved data.
Future
Though the framework we have developed to display the data is specific for the DataBiNS workflow, it can be generalized to accept any Taverna-based workflow, displaying the results as a browsable Web page and facilitating exploration of results from Taverna-based workflows. The modularized nature of workflows allows one to develop new BioMoby services to add to DataBiNS in order to expand the information retrieved and displayed.
One lingering question is always the validity of the data being retrieved. The workflow is designed to retrieve information from a specified group of web resources. The validity of the information obtained from the web resources is not checked by the workflow and thus there is currently no way to verify the integrity of the information across the different resources, without a great deal of manual data inspection. Future developments may lead to automation of such processes with electronic flags highlighting inconsistencies in data between different web resources.
Acknowledgements
This research was supported by the National Sanitarium Association (Canada), AllerGen NCE, and the Michael Smith Foundation for Health Research. EK is supported by an award to MDW from Genome Alberta, in part through Genome Canada.
 
References








Select your language of interest to view the total content in your interested language
Post your comment

Share This Article

Relevant Topics

Recommended Conferences

Article Usage

  • Total views: 11378
  • [From(publication date):
    July-2008 - Sep 27, 2016]
  • Breakdown by view type
  • HTML page views : 7635
  • PDF downloads :3743
 
 

Post your comment

captcha   Reload  Can't read the image? click here to refresh

OMICS International Journals
 
Make the best use of Scientific Research and information from our 700 + peer reviewed, Open Access Journals
 
 
OMICS International Conferences 2016-17
 
Meet Inspiring Speakers and Experts at our 3000+ Global Annual Meetings
 
 

Contact Us

Agri, Food, Aqua and Veterinary Science Journals

Dr. Krish

agrifoodaquavet@omicsinc.com

1-702-714-7001 Extn: 9040

Clinical and Biochemistry Journals

Datta A

clinical_biochem@omicsinc.com

1-702-714-7001Extn: 9037

Business & Management Journals

Ronald

business@omicsinc.com

1-702-714-7001Extn: 9042

Chemical Engineering and Chemistry Journals

Gabriel Shaw

chemicaleng_chemistry@omicsinc.com

1-702-714-7001 Extn: 9040

Earth & Environmental Sciences

Katie Wilson

environmentalsci@omicsinc.com

1-702-714-7001Extn: 9042

Engineering Journals

James Franklin

engineering@omicsinc.com

1-702-714-7001Extn: 9042

General Science and Health care Journals

Andrea Jason

generalsci_healthcare@omicsinc.com

1-702-714-7001Extn: 9043

Genetics and Molecular Biology Journals

Anna Melissa

genetics_molbio@omicsinc.com

1-702-714-7001 Extn: 9006

Immunology & Microbiology Journals

David Gorantl

immuno_microbio@omicsinc.com

1-702-714-7001Extn: 9014

Informatics Journals

Stephanie Skinner

omics@omicsinc.com

1-702-714-7001Extn: 9039

Material Sciences Journals

Rachle Green

materialsci@omicsinc.com

1-702-714-7001Extn: 9039

Mathematics and Physics Journals

Jim Willison

mathematics_physics@omicsinc.com

1-702-714-7001 Extn: 9042

Medical Journals

Nimmi Anna

medical@omicsinc.com

1-702-714-7001 Extn: 9038

Neuroscience & Psychology Journals

Nathan T

neuro_psychology@omicsinc.com

1-702-714-7001Extn: 9041

Pharmaceutical Sciences Journals

John Behannon

pharma@omicsinc.com

1-702-714-7001Extn: 9007

Social & Political Science Journals

Steve Harry

social_politicalsci@omicsinc.com

1-702-714-7001 Extn: 9042

 
© 2008-2016 OMICS International - Open Access Publisher. Best viewed in Mozilla Firefox | Google Chrome | Above IE 7.0 version