alexa Mining for putative regulatory elements in the yeast genome using gene expression data.


Biochemistry & Physiology: Open Access

Author(s): Vilo J, Brazma A, Jonassen I, Robinson A, Ukkonen E

Abstract Share this page

Abstract We have developed a set of methods and tools for automatic discovery of putative regulatory signals in genome sequences. The analysis pipeline consists of gene expression data clustering, sequence pattern discovery from upstream sequences of genes, a control experiment for pattern significance threshold limit detection, selection of interesting patterns, grouping of these patterns, representing the pattern groups in a concise form and evaluating the discovered putative signals against existing databases of regulatory signals. The pattern discovery is computationally the most expensive and crucial step. Our tool performs a rapid exhaustive search for a priori unknown statistically significant sequence patterns of unrestricted length. The statistical significance is determined for a set of sequences in each cluster with respect to a set of background sequences allowing the detection of subtle regulatory signals specific for each cluster. The potentially large number of significant patterns is reduced to a small number of groups by clustering them by mutual similarity. Automatically derived consensus patterns of these groups represent the results in a comprehensive way for a human investigator. We have performed a systematic analysis for the yeast Saccharomyces cerevisiae. We created a large number of independent clusterings of expression data simultaneously assessing the "goodness" of each cluster. For each of the over 52,000 clusters acquired in this way we discovered significant patterns in the upstream sequences of respective genes. We selected nearly 1,500 significant patterns by formal criteria and matched them against the experimentally mapped transcription factor binding sites in the SCPD database. We clustered the 1,500 patterns to 62 groups for which we derived automatically alignments and consensus patterns. Of these 62 groups 48 had patterns that have matching sites in SCPD database.
This article was published in Proc Int Conf Intell Syst Mol Biol and referenced in Biochemistry & Physiology: Open Access

Relevant Expert PPTs

Relevant Speaker PPTs

Recommended Conferences

  • 9th International Conference on Structural Biology
    September 18-20, 2017 Zurich, Switzerland

  •  2nd International Conference on Biochemistry
    Sep 21-22, 2017, Macau, Hong Kong
  • 2nd International Conference on Biochemistry
    September 28-29, 2017 Dubai, UAE

  • 3rd Annual Congress on Bioscience
    October 16-17, 2017 Dubai,UAE

  • 3rd International Conference on Transcriptomics
    October 30 - November 01, 2017 Bangkok, Thailand

  • 3rd International Conference on Genetic and Protein Engineering (10 Plenary Forums - 1 Event)
    November 08-09, 2017 Las Vegas, USA

Relevant Topics

Peer Reviewed Journals
Make the best use of Scientific Research and information from our 700 + peer reviewed, Open Access Journals
International Conferences 2017-18
Meet Inspiring Speakers and Experts at our 3000+ Global Annual Meetings

Contact Us

© 2008-2017 OMICS International - Open Access Publisher. Best viewed in Mozilla Firefox | Google Chrome | Above IE 7.0 version