Functional Insights from Computational Modeling of Orphan Proteins Expressed in a Microbial Community
- *Corresponding Author:
- Dr. Michael P. Thelen
Physical and Life Sciences
L452, Livermore, CA
E-mail: [email protected]
Received Date: September 02, 2010; Accepted Date: September 20, 2010; Published Date: September 22, 2010
Citation: Wheeler KE, Zemla A, Jiao Y, Aliaga Goltsman DS, Singer SW, et al.(2010) Functional Insights from Computational Modeling of Orphan Proteins Expressed in a Microbial Community. J Proteomics Bioinform 3: 266-274. doi: 10.4172/jpb.1000150
Copyright: © 2010 Wheeler KE, et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Environmental genomics and proteomics data are heavily populated with proteins that are not homologous to experimentally characterized proteins. We approached this problematic area by investigating a natural microbial community from a highly constrained niche in which critical roles are likely carried out by proteins of unknown function (ORFans). Based on several criteria, these proteins were not statistically similar to any protein sequences in the SwissProt database. We selected a target set of 545 ORFans and weakly annotated proteins expressed by the dominant bacterial member of the community, Leptospirillum Group II, and used an automated modeling system (AS2TS) incorporated with other computational tools to predict structures. This generated 484 models, 89% of the target set. Structure-based superfamilies, general functional categorizations, and specifi c gene ontology (GO) functions were predicted for 424, 386, and 117 ORFans, respectively. Structural predictions and classifi cations were integrated into a manually curated database, outlining in silico calculations and available proteomic data for each protein. This analysis facilitated the development of experimentally testable hypotheses for several enigmatic proteins, including confi dent predictions of copper transport proteins and cyclic diguanylate signaling proteins. As DNA sequencing of natural organisms rapidly expands, this computational structure-function approach can be applied to guide experimental testing of the structure and function of challenging ORFans.