ICBO_2018_63: GO-MAP Implements CAFA Tools: Improved Automated Gene Function Annotation for Plants


Maize is both a crop species and a model for genetics and genomics research. As such, maize GO annotations produced by the community data projects Gramene and Phytozome are widely used to derive hypotheses for both crop improvement and basic science. Our maize-GAMER project assessed existing maize GO annotations and to implement and test the performance of some of the most commonly used GO prediction tools (i.e., Reciprocal Best Hits and domain presence) alongside three of the top performing tools submitted for evaluation in the CAFA1 (Critical Assessment of protein Function Annotation) competition. All datasets were compared based on F-score using an independent gold-standard dataset (2002 GO annotations for 1,619 genes) provided by MaizeGDB. In addition to producing and comparing these individual GO annotation sets, we also combined the datasets we generated to produce a maize-GAMER aggregate annotation set. Compared to Gramene and Phytozome, the maize-GAMER aggregate set annotates more genes in the maize genome and assigns more GO terms per gene. In addition, the maize-GAMER dataset’s functional assignments are comparable to Gramene and Phytozome overall (based on F-score). These findings have been published, and the maize-GAMER GO annotations are available via CyVerse and MaizeGDB. Here we review the methods and describe GO-MAP, the pipeline used to generate these datasets. GO-MAP has been containerized to facilitate gene function annotation for other plant proteomes and will be released via CyVerse in the very near future.

Year of Publication
Conference Name
International Conference on Biomedical Ontology (ICBO 2018)
Date Published
International Conference on Biological Ontology
Download citation