Author(s): Moriya Y, Itoh M, Okuda S, Yoshizawa AC, Kanehisa M, Moriya Y, Itoh M, Okuda S, Yoshizawa AC, Kanehisa M, Moriya Y, Itoh M, Okuda S, Yoshizawa AC, Kanehisa M, Moriya Y, Itoh M, Okuda S, Yoshizawa AC, Kanehisa M
Abstract Share this page
Abstract The number of complete and draft genomes is rapidly growing in recent years, and it has become increasingly important to automate the identification of functional properties and biological roles of genes in these genomes. In the KEGG database, genes in complete genomes are annotated with the KEGG orthology (KO) identifiers, or the K numbers, based on the best hit information using Smith-Waterman scores as well as by the manual curation. Each K number represents an ortholog group of genes, and it is directly linked to an object in the KEGG pathway map or the BRITE functional hierarchy. Here, we have developed a web-based server called KAAS (KEGG Automatic Annotation Server: http://www.genome.jp/kegg/kaas/) i.e. an implementation of a rapid method to automatically assign K numbers to genes in the genome, enabling reconstruction of KEGG pathways and BRITE hierarchies. The method is based on sequence similarities, bi-directional best hit information and some heuristics, and has achieved a high degree of accuracy when compared with the manually curated KEGG GENES database.
This article was published in Nucleic Acids Res
and referenced in Journal of Data Mining in Genomics & Proteomics