{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,22]],"date-time":"2026-03-22T06:10:44Z","timestamp":1774159844098,"version":"3.50.1"},"publisher-location":"New York, New York, USA","reference-count":13,"publisher":"ACM Press","license":[{"start":{"date-parts":[[2017,1,1]],"date-time":"2017-01-01T00:00:00Z","timestamp":1483228800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"We Acknowledge the Data to Decisions CRC (D2D CRC) and the Cooperative Research Centers Programme for funding this research."}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2017]]},"DOI":"10.1145\/3041021.3054726","type":"proceedings-article","created":{"date-parts":[[2018,1,11]],"date-time":"2018-01-11T18:39:25Z","timestamp":1515695965000},"page":"165-169","source":"Crossref","is-referenced-by-count":24,"title":["On Automating Basic Data Curation Tasks"],"prefix":"10.1145","author":[{"given":"Seyed-Mehdi-Reza","family":"Beheshti","sequence":"first","affiliation":[{"name":"University of New South Wales, Sydney, Australia"}]},{"given":"Alireza","family":"Tabebordbar","sequence":"additional","affiliation":[{"name":"University of New South Wales, Sydney, Australia"}]},{"given":"Boualem","family":"Benatallah","sequence":"additional","affiliation":[{"name":"University of New South Wales, Sydney, Australia"}]},{"given":"Reza","family":"Nouri","sequence":"additional","affiliation":[{"name":"University of New South Wales, Sydney, Australia"}]}],"member":"320","reference":[{"key":"key-10.1145\/3041021.3054726-1","unstructured":"Michael R. Anderson, Dolan Antenucci, Victor Bittorf, Matthew Burgess, Michael J. Cafarella, Arun Kumar, Feng Niu, Yongjoo Park, Christopher R&#233; and Ce Zhang. 2013. Brainwash: A Data System for Feature Engineering.. In CIDR."},{"key":"key-10.1145\/3041021.3054726-2","doi-asserted-by":"crossref","unstructured":"Seyed-Mehdi-Reza Beheshti, Boualem Benatallah, and Hamid Reza Motahari-Nezhad. 2016a. Scalable graph-based OLAP analytics over process execution data. Distributed and Parallel Databases 34, 3 (2016), 379--423.","DOI":"10.1007\/s10619-014-7171-9"},{"key":"key-10.1145\/3041021.3054726-3","doi-asserted-by":"crossref","unstructured":"Seyed-Mehdi-Reza Beheshti, Boualem Benatallah, Sherif Sakr, Daniela Grigori, Hamid Reza Motahari-Nezhad, Moshe Chai Barukh, Ahmed Gater, and Seung Hwan Ryu. 2016b. Process Analytics - Concepts and Techniques for Querying and Analyzing Process Data. Springer.","DOI":"10.1007\/978-3-319-25037-3"},{"key":"key-10.1145\/3041021.3054726-4","unstructured":"Seyed-Mehdi-Reza Beheshti, Alireza Tabebordbar, Boualem Benatallah, and Reza Nouri. 2016d. Data Curation APIs. CoRR abs\/1612.03277 (2016). http:\/\/arxiv.org\/abs\/1612.03277"},{"key":"key-10.1145\/3041021.3054726-5","unstructured":"Seyed-Mehdi-Reza Beheshti, Boualem Benatallah, Srikumar Venugopal, Seung Hwan Ryu, Hamid Reza Motahari-Nezhad, and Wei Wang. 2016c. A systematic review and comparative analysis of cross-document coreference resolution methods and tools. Computing (2016), 1--37."},{"key":"key-10.1145\/3041021.3054726-6","unstructured":"Hsinchun Chen, Roger HL Chiang, and Veda C Storey. 2012. Business intelligence and analytics: From big data to big impact. MIS quarterly 36, 4 (2012), 1165--1188."},{"key":"key-10.1145\/3041021.3054726-7","doi-asserted-by":"crossref","unstructured":"Abhishek Gattani, Digvijay S. Lamba, Nikesh Garera, Mitul Tiwari, Xiaoyong Chai, Sanjib Das, Sri Subramaniam, Anand Rajaraman, Venky Harinarayan, and AnHai Doan. 2013. Entity Extraction, Linking, Classification, and Tagging for Social Media: A Wikipedia-Based Approach. PVLDB 6, 11 (2013), 1126--1137. http:\/\/www.vldb.org\/pvldb\/vol6\/p1126-gattani.pdf","DOI":"10.14778\/2536222.2536237"},{"key":"key-10.1145\/3041021.3054726-8","unstructured":"Clinton Gormley and Zachary Tong. 2015. Elasticsearch: The Definitive Guide. -- O'Reilly Media, Inc."},{"key":"key-10.1145\/3041021.3054726-9","unstructured":"Krzystof Jajuga, Andrzej Sokolowski, and Hans-Hermann Bock. 2012. Classification, clustering, and data analysis: recent advances and applications. Springer Science &#38; Business Media."},{"key":"key-10.1145\/3041021.3054726-10","doi-asserted-by":"crossref","unstructured":"Haewoon Kwak, Changhyun Lee, Hosung Park, and Sue B. Moon. 2010. What is Twitter, a social network or a news media?. In Proceedings of the 19th International Conference on World Wide Web, WWW 2010, Raleigh, North Carolina, USA, April 26-30, 2010. 591--600.","DOI":"10.1145\/1772690.1772751"},{"key":"key-10.1145\/3041021.3054726-11","doi-asserted-by":"crossref","unstructured":"Christopher D. Manning, Mihai Surdeanu, John Bauer, Jenny Rose Finkel, Steven Bethard, and David McClosky. 2014. The Stanford CoreNLP Natural Language Processing Toolkit. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, ACL 2014, June 22-27, 2014, Baltimore, MD, USA, System Demonstrations. 55--60. http:\/\/aclweb.org\/anthology\/P\/P14\/P14--5010.pdf","DOI":"10.3115\/v1\/P14-5010"},{"key":"key-10.1145\/3041021.3054726-12","unstructured":"James H Martin and Daniel Jurafsky. 2000. Speech and language processing. International Edition 710 (2000)."},{"key":"key-10.1145\/3041021.3054726-13","unstructured":"Omer Tene and Jules Polonetsky. 2012. Big data for all: Privacy and user control in the age of analytics. Nw. J. Tech. &#38; Intell. Prop. 11 (2012), xxvii."}],"event":{"name":"the 26th International Conference","location":"Perth, Australia","acronym":"WWW '17 Companion","number":"26","sponsor":["SIGWEB, ACM Special Interest Group on Hypertext, Hypermedia, and Web","IW3C2, International World Wide Web Conference Committee"],"start":{"date-parts":[[2017,4,3]]},"end":{"date-parts":[[2017,4,7]]}},"container-title":["Proceedings of the 26th International Conference on World Wide Web Companion - WWW '17 Companion"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3041021.3054726","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/dl.acm.org\/ft_gateway.cfm?id=3054726&ftid=1865119&dwn=1","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T03:03:30Z","timestamp":1750215810000},"score":1,"resource":{"primary":{"URL":"http:\/\/dl.acm.org\/citation.cfm?doid=3041021.3054726"}},"subtitle":[],"proceedings-subject":"World Wide Web Companion","short-title":[],"issued":{"date-parts":[[2017]]},"references-count":13,"URL":"https:\/\/doi.org\/10.1145\/3041021.3054726","relation":{},"subject":[],"published":{"date-parts":[[2017]]}}}