{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,28]],"date-time":"2025-10-28T03:14:24Z","timestamp":1761621264447,"version":"3.41.0"},"reference-count":38,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2016,5,24]],"date-time":"2016-05-24T00:00:00Z","timestamp":1464048000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Case Western Reserve University CTSA","award":["UL1TR000439"],"award-info":[{"award-number":["UL1TR000439"]}]},{"DOI":"10.13039\/100013844","name":"University of Kentucky Center for Clinical and Translational Science","doi-asserted-by":"crossref","award":["UL1TR000117"],"award-info":[{"award-number":["UL1TR000117"]}],"id":[{"id":"10.13039\/100013844","id-type":"DOI","asserted-by":"crossref"}]},{"name":"High Performance Computing Resource in the Core Facility for Advanced Research Computing"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Knowl. Discov. Data"],"published-print":{"date-parts":[[2016,7,27]]},"abstract":"<jats:p>This article presents recent progresses made in using scalable cloud computing environment, Hadoop and MapReduce, to perform ontology quality assurance (OQA), and points to areas of future opportunity. The standard sequential approach used for implementing OQA methods can take weeks if not months for exhaustive analyses for large biomedical ontological systems. With OQA methods newly implemented using massively parallel algorithms in the MapReduce framework, several orders of magnitude in speed-up can be achieved (e.g., from three months to three hours). 
Such a dramatic reduction in time makes it feasible not only to perform exhaustive structural analysis of large ontological hierarchies, but also to systematically track structural changes between versions for evolutional analysis. As an exemplar, progress is reported in using MapReduce to perform evolutional analysis and visualization on the Systematized Nomenclature of Medicine\u2014Clinical Terms (SNOMED CT), a prominent clinical terminology system. Future opportunities in three areas are described. The first is to extend the scope of the MapReduce-based approach to existing OQA methods, especially for automated exhaustive structural analysis. The second is to apply our proposed MapReduce Pipeline for Lattice-based Evaluation (MaPLE) approach, demonstrated as an exemplar method for SNOMED CT, to other biomedical ontologies. The third is to develop interfaces for reviewing results obtained by OQA methods and for visualizing ontological alignment and evolution; these interfaces can also take advantage of cloud computing technology to systematically pre-compute computationally intensive jobs, improving performance during user interactions with the visualization interface. 
Advances in these directions are expected to better support the ontological engineering lifecycle.<\/jats:p>","DOI":"10.1145\/2768830","type":"journal-article","created":{"date-parts":[[2016,5,25]],"date-time":"2016-05-25T18:07:06Z","timestamp":1464199626000},"page":"1-28","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":14,"title":["Biomedical Ontology Quality Assurance Using a Big Data Approach"],"prefix":"10.1145","volume":"10","author":[{"given":"Licong","family":"Cui","sequence":"first","affiliation":[{"name":"University of Kentucky"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shiqiang","family":"Tao","sequence":"additional","affiliation":[{"name":"University of Kentucky"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Guo-Qiang","family":"Zhang","sequence":"additional","affiliation":[{"name":"University of Kentucky"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2016,5,24]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1038\/75556"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gkh061"},{"key":"e_1_2_1_3_1","volume-title":"Biomedical ontologies in action: Role in knowledge management, data integration and decision support. Yearbook of Medical Informatics","author":"Bodenreider Olivier","year":"2008","unstructured":"Olivier Bodenreider . 2008. Biomedical ontologies in action: Role in knowledge management, data integration and decision support. Yearbook of Medical Informatics ( 2008 ), 67--79. Olivier Bodenreider. 2008. Biomedical ontologies in action: Role in knowledge management, data integration and decision support. 
Yearbook of Medical Informatics (2008), 67--79."},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1038\/npre.2009.3536.1"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.artmed.2006.12.003"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbi.2008.12.008"},{"key":"e_1_2_1_7_1","volume-title":"AMIA Annual Symposium Proceedings","volume":"2010","author":"Ceusters Werner","year":"2010","unstructured":"Werner Ceusters . 2010 . Applying evolutionary terminology auditing to SNOMED CT . In AMIA Annual Symposium Proceedings , Vol. 2010 . American Medical Informatics Association, Bethesda, MD, 96. Werner Ceusters. 2010. Applying evolutionary terminology auditing to SNOMED CT. In AMIA Annual Symposium Proceedings, Vol. 2010. American Medical Informatics Association, Bethesda, MD, 96."},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gkh036"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ijmedinf.2007.06.008"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1327452.1327492"},{"volume-title":"Formal Concept Analysis","author":"Ganter Bernhard","key":"e_1_2_1_12_1","unstructured":"Bernhard Ganter and Rudolf Wille . 1999. Formal Concept Analysis . Vol. 284 . Springer , Berlin . Bernhard Ganter and Rudolf Wille. 1999. Formal Concept Analysis. Vol. 284. Springer, Berlin."},{"volume-title":"Continuous Lattices and Domains. Number 93","author":"Gierz Gerhard","key":"e_1_2_1_13_1","unstructured":"Gerhard Gierz . 2003. Continuous Lattices and Domains. Number 93 . Cambridge University Press , Cambridge . Gerhard Gierz. 2003. Continuous Lattices and Domains. Number 93. 
Cambridge University Press, Cambridge."},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1006\/knac.1993.1008"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbi.2012.04.009"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-10-250"},{"key":"e_1_2_1_17_1","volume-title":"Sahoo","author":"Jayapandian Catherine","year":"2014","unstructured":"Catherine Jayapandian , Chien-Hung Chen , Aman Dabir , Samden Lhatoo , Guo-Qiang Zhang , and Satya S . Sahoo . 2014 . Domain ontology as conceptual model for big data management: Application in biomedical informatics. In Conceptual Modeling. Springer , Berlin, 144--157. Catherine Jayapandian, Chien-Hung Chen, Aman Dabir, Samden Lhatoo, Guo-Qiang Zhang, and Satya S. Sahoo. 2014. Domain ontology as conceptual model for big data management: Application in biomedical informatics. In Conceptual Modeling. Springer, Berlin, 144--157."},{"key":"e_1_2_1_18_1","unstructured":"Catherine Praveena Jayapandian. 2014. Cloudwave: A Cloud Computing Framework for Multimodal Electrophysiological Big Data. Ph.D. Dissertation. Case Western Reserve University.  Catherine Praveena Jayapandian. 2014. Cloudwave: A Cloud Computing Framework for Multimodal Electrophysiological Big Data. Ph.D. Dissertation. Case Western Reserve University."},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1197\/jamia.M2541"},{"volume-title":"Conceptual Structures at Work","author":"Joslyn Cliff","key":"e_1_2_1_20_1","unstructured":"Cliff Joslyn . 2004. Poset ontologies and concept lattices as semantic hierarchies . In Conceptual Structures at Work . Springer , Berlin , 287--302. Cliff Joslyn. 2004. Poset ontologies and concept lattices as semantic hierarchies. In Conceptual Structures at Work. 
Springer, Berlin, 287--302."},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1186\/2041-1480-2-6"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbi.2013.03.005"},{"key":"e_1_2_1_23_1","volume-title":"AMIA Annual Symposium Proceedings","volume":"2006","author":"Murphy Shawn N.","unstructured":"Shawn N. Murphy , Michael E. Mendis , David A. Berkowitz , Isaac Kohane , and Henry C. Chueh . 2006. Integration of clinical and genetic data in the i2b2 architecture . In AMIA Annual Symposium Proceedings , Vol. 2006 . American Medical Informatics Association, Bethesda, MD, 1040. Shawn N. Murphy, Michael E. Mendis, David A. Berkowitz, Isaac Kohane, and Henry C. Chueh. 2006. Integration of clinical and genetic data in the i2b2 architecture. In AMIA Annual Symposium Proceedings, Vol. 2006. American Medical Informatics Association, Bethesda, MD, 1040."},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btl334"},{"key":"e_1_2_1_25_1","first-page":"145","article-title":"Mistakes in medical ontologies: Where do they come from and how can they be detected","volume":"102","author":"Pisanelli D. M.","year":"2004","unstructured":"D. M. Pisanelli . 2004 . Mistakes in medical ontologies: Where do they come from and how can they be detected ? Ontologies in Medicine 102 (2004), 145 . D. M. Pisanelli. 2004. Mistakes in medical ontologies: Where do they come from and how can they be detected? Ontologies in Medicine 102 (2004), 145.","journal-title":"Ontologies in Medicine"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1136\/amiajnl-2010-000045"},{"key":"e_1_2_1_27_1","unstructured":"J. Rogers and A. Rector. 1996. The GALEN ontology. Medical Informatics Europe (MIE\u201996). IOS Press Copenhagen 174--178.  J. Rogers and A. Rector. 1996. The GALEN ontology. Medical Informatics Europe (MIE\u201996). 
IOS Press Copenhagen 174--178."},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbi.2003.11.007"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-05151-7_18"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbi.2006.12.003"},{"volume-title":"Logic of domains","author":"Zhang Guo-Qiang","key":"e_1_2_1_31_1","unstructured":"Guo-Qiang Zhang . 2012. Logic of domains . Springer Science & Business Media. Springer , Berlin. Guo-Qiang Zhang. 2012. Logic of domains. Springer Science & Business Media. Springer, Berlin."},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-15280-1_61"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.5555\/1940334.1940353"},{"key":"e_1_2_1_34_1","volume-title":"AMIA Annual Symposium Proceedings. 1248--1257","author":"Zhang Guo-Qiang","year":"2014","unstructured":"Guo-Qiang Zhang , Licong Cui , Samden Lhatoo , Stephan U. Schuele , and Satya Sahoo . 2014 a. MEDCIS: Multi-modality epilepsy data capture and integration system . AMIA Annual Symposium Proceedings. 1248--1257 . Guo-Qiang Zhang, Licong Cui, Samden Lhatoo, Stephan U. Schuele, and Satya Sahoo. 2014a. MEDCIS: Multi-modality epilepsy data capture and integration system. AMIA Annual Symposium Proceedings. 1248--1257."},{"key":"e_1_2_1_35_1","volume-title":"AMIA Summits on Translational Science Proceedings","volume":"2010","author":"Zhang Guo-Qiang","year":"2010","unstructured":"Guo-Qiang Zhang , Trish Siegler , Paul Saxman , Neil Sandberg , Remo Mueller , Nathan Johnson , Dale Hunscher , and Sivaram Arabandi . 2010 . VISAGE: A query interface for clinical research . In AMIA Summits on Translational Science Proceedings , Vol. 2010 . American Medical Informatics Association, Bethesda, MD 76--80. Guo-Qiang Zhang, Trish Siegler, Paul Saxman, Neil Sandberg, Remo Mueller, Nathan Johnson, Dale Hunscher, and Sivaram Arabandi. 2010. VISAGE: A query interface for clinical research. 
In AMIA Summits on Translational Science Proceedings, Vol. 2010. American Medical Informatics Association, Bethesda, MD 76--80."},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/BigData.2014.7004301"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.compbiomed.2005.04.007"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbi.2009.03.003"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1055\/s-0038-1634577"}],"container-title":["ACM Transactions on Knowledge Discovery from Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2768830","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2768830","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T05:07:37Z","timestamp":1750223257000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2768830"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,5,24]]},"references-count":38,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2016,7,27]]}},"alternative-id":["10.1145\/2768830"],"URL":"https:\/\/doi.org\/10.1145\/2768830","relation":{},"ISSN":["1556-4681","1556-472X"],"issn-type":[{"type":"print","value":"1556-4681"},{"type":"electronic","value":"1556-472X"}],"subject":[],"published":{"date-parts":[[2016,5,24]]},"assertion":[{"value":"2014-10-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2015-04-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2016-05-24","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}