Dr. Patrick Xuechun Zhao is an Associate Professor of Bioinformatics and Computational Biology at the Samuel Roberts Noble Foundation. Dr. Zhao received his Ph.D. in Communication and Information Systems from Shanghai Jiao Tong University in 2000. After graduating, he joined the Department of Biochemistry and Molecular Biology, School of Medicine, University of Louisville, as a research associate, developing bioinformatics technologies to discover genes and biological networks that are essential to the normal aging process, in particular those related to successful aging in centenarians. In 2004, he joined the Samuel Roberts Noble Foundation as an Assistant Professor and Faculty Principal Investigator. Dr. Zhao’s current research centers on bioinformatics and computational biology, plant and soil microbe functional genomics, and statistical machine learning for ‘omics’ data mining and biological knowledge discovery. Dr. Zhao has published over 60 papers in refereed scientific journals. He has also released a series of software tools and bioinformatics resources, including a dozen databases and web servers for plant gene function and gene regulatory network analysis, which are publicly available and used by life scientists around the world.


Legumes are among the most evolutionarily well-adapted plants and play vital roles in ecology and agriculture. They have the unique ability to carry out symbiotic nitrogen fixation (SNF) through endosymbiotic interactions with bacteria. Aside from root nodulation and nitrogen-fixing symbiosis with rhizobia, legumes possess many unique features that are not found in the model plant Arabidopsis thaliana, such as mycorrhization, compound leaf development, protein-rich physiology, a profuse secondary metabolism, glandular trichome development and border cells in roots. We present LegumeIP, a web-based comparative genomics and gene expression atlas platform that integrates large-scale genomic and transcriptomic data with bioinformatics tools to study gene function and genome evolution in legumes. LegumeIP (2015 release) hosts 1) genomic data, such as genomic sequences, Sanger sequencing- and RNA-seq-based expressed sequence tag (EST) sequences, gene models, and annotations that are cross-referenced to the UniProt TrEMBL, InterProScan, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) reference databases, for six model and crop legumes: Medicago truncatula, Glycine max (soybean), Lotus japonicus, Phaseolus vulgaris (common bean), Cicer arietinum (chickpea) and Cajanus cajan (pigeon pea), and two reference model plants: A. thaliana and Populus trichocarpa; 2) gene expression atlases that include 276 array hybridizations from L. japonicus, 1,915 array hybridizations from M. truncatula, 1,379 gene expression profiles of G. max, and 4,164 array hybridizations of A. thaliana; and 3) gene families, orthologous groups, gene family phylogenetic trees and syntenic regions across hosted species.
LegumeIP also integrates a suite of bioinformatics tools for exploring and analyzing 1) whole-genome synteny and detailed syntenic gene alignments; 2) gene annotations, gene families and orthologous groups; and 3) gene expression patterns, clusters and co-expression networks.