ICBO_2018_35: Can a Convolutional Neural Network Support Auditing of NCI Thesaurus Neoplasm Concepts?

TitleICBO_2018_35: Can a Convolutional Neural Network Support Auditing of NCI Thesaurus Neoplasm Concepts?
Publication TypeConference Paper
Year of Publication2018
AuthorsLiu, H, Zheng, L, Perl, Y, Geller, J, Elhanan, G
Conference NameInternational Conference on Biomedical Ontology (ICBO 2018)
Date Published08/06/2018
PublisherInternational Conference on Biological Ontology
KeywordsAbstraction Network, CNN, deep learning, machine learning, National Cancer Institute Thesaurus, Neoplasm Hierarchy, quality assurance
Abstract

We present a Machine Learning methodology using a Convolutional Neural Network to perform a specific case of an ontology Quality Assurance, namely discovery of missing IS-A relationships for Neoplasm concepts in the National Cancer Institute Thesaurus (NCIt). The training step checking all “uncles” of a concept is computationally intensive. To shorten the time and to improve the accuracy, we define a restricted methodology to check only uncles that are similar to each current concept. The restricted technique yields higher classification recall (compared to the unrestricted one) when testing against known errors found by domain experts who manually reviewed Neoplasm concepts in a prior study. The results are encouraging and provide impetus for further improvements to our technique.

URLhttp://icbo2018.cgrb.oregonstate.edu/