{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,18]],"date-time":"2026-01-18T13:51:23Z","timestamp":1768744283671,"version":"3.49.0"},"reference-count":21,"publisher":"Association for Computing Machinery (ACM)","issue":"3","license":[{"start":{"date-parts":[[2014,12,4]],"date-time":"2014-12-04T00:00:00Z","timestamp":1417651200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["SIGMOD Rec."],"published-print":{"date-parts":[[2014,12,4]]},"abstract":"<jats:p>Wikipedia's InfoBoxes play a crucial role in advanced applications and provide the main knowledge source for DBpedia and the powerful structured queries it supports. However, InfoBoxes, which were created by crowdsourcing for human rather than computer consumption, suffer from incompleteness, inconsistencies, and inaccuracies. To overcome these problems, we have developed (i) the IBminer system that extracts InfoBox information by text-mining Wikipedia pages, (ii) the IKBStore system that integrates the information derived by IBminer with that of DBpedia, YAGO2, WikiData, WordNet, and other sources, and (iii) SWiPE and InfoBox Editor (IBE) that provide user-friendly interfaces for querying and revising the knowledge base. Thus, IBminer uses a deep NLP-based approach to extract from text a semantic representation structure called TextGraph, from which the system detects patterns and derives subject-attribute-value relations, as well as domain-specific synonyms for the knowledge base. 
IKBStore and IBE complement the powerful, user-friendly, by-example structured queries of SWiPE by supporting the validation and provenance history for the information contained in the knowledge base, along with the ability to upgrade its knowledge when it is found to be incomplete, incorrect, or outdated.<\/jats:p>","DOI":"10.1145\/2694428.2694437","type":"journal-article","created":{"date-parts":[[2014,12,8]],"date-time":"2014-12-08T16:17:14Z","timestamp":1418055434000},"page":"48-54","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":6,"title":["Text-Mining, Structured Queries, and Knowledge Management on Web Document Corpora"],"prefix":"10.1145","volume":"43","author":[{"given":"Hamid","family":"Mousavi","sequence":"first","affiliation":[{"name":"CSD, UCLA, Los Angeles, CA"}]},{"given":"Maurizio","family":"Atzori","sequence":"additional","affiliation":[{"name":"University of Cagliari, Cagliari, Italy"}]},{"given":"Shi","family":"Gao","sequence":"additional","affiliation":[{"name":"CSD, UCLA, Los Angeles, CA"}]},{"given":"Carlo","family":"Zaniolo","sequence":"additional","affiliation":[{"name":"CSD, UCLA, Los Angeles, CA"}]}],"member":"320","published-online":{"date-parts":[[2014,12,4]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"Apache Jena. http:\/\/jena.apache.org\/."},{"key":"e_1_2_1_2_1","unstructured":"Geonames. http:\/\/www.geonames.org\/."},{"key":"e_1_2_1_3_1","unstructured":"Hoffman2 Cluster UCLA. http:\/\/hpc.ucla.edu\/hoffman2\/."},{"key":"e_1_2_1_4_1","unstructured":"Musicbrainz. http:\/\/musicbrainz.org\/."},{"key":"e_1_2_1_5_1","unstructured":"Opencyc. http:\/\/www.cyc.com\/platform\/opencyc"},
{"key":"e_1_2_1_6_1","unstructured":"Semantic web information management system (swims). http:\/\/semscape.cs.ucla.edu\/."},{"key":"e_1_2_1_7_1","unstructured":"Wikidata. http:\/\/www.wikidata.org."},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/2187980.2188036"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.websem.2009.07.002"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1376616.1376746"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.5555\/2898607.2898816"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.artint.2012.06.001"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/2463676.2463725"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/2588555.2610525"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/2509908.2509912"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.14778\/2536274.2536308"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICSC.2014.12"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICSC.2014.31"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.5555\/646748.701499"},{"key":"e_1_2_1_24_1","volume-title":"Proceedings of 11th Eurographics Workshop on Rendering. MIT Press","author":"Stark M. M.","year":"1998","unstructured":"M. M. Stark and R. F. Riesenfeld. Wordnet: An electronic lexical database. In Proceedings of 11th Eurographics Workshop on Rendering. MIT Press, 1998."},
{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/2213836.2213891"}],"container-title":["ACM SIGMOD Record"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2694428.2694437","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2694428.2694437","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T06:12:20Z","timestamp":1750227140000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2694428.2694437"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,12,4]]},"references-count":21,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2014,12,4]]}},"alternative-id":["10.1145\/2694428.2694437"],"URL":"https:\/\/doi.org\/10.1145\/2694428.2694437","relation":{},"ISSN":["0163-5808"],"issn-type":[{"value":"0163-5808","type":"print"}],"subject":[],"published":{"date-parts":[[2014,12,4]]},"assertion":[{"value":"2014-12-04","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}