{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,7,24]],"date-time":"2023-07-24T22:54:22Z","timestamp":1690239262928},"reference-count":9,"publisher":"Association for Computing Machinery (ACM)","issue":"12","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2018,8]]},"abstract":"<jats:p>As data science applications proliferate, more and more lay users must perform data integration (DI) tasks, which used to be done by sophisticated CS developers. Thus, it is increasingly critical that we develop hands-off DI services, which lay users can use to perform such tasks without asking for help from developers. We propose to demonstrate such a service. Specifically, we will demonstrate CloudMatcher, a hands-off cloud\/crowd service for entity matching (EM). To use CloudMatcher to match two tables, a lay user only needs to upload them to the CloudMatcher's Web page then iteratively label a set of tuple pairs as match\/no-match. Alternatively, the user can enlist a crowd of workers to label the pairs. In either case, the lay user can easily perform EM end-to-end without having to involve any developers. Cloud-Matcher has been used in several domain science projects at UW-Madison and at several organizations, and is scheduled to be deployed in a large company in Summer 2018. In the demonstration we will show how easy it is for lay users to perform EM (either via interactive labeling or crowdsourcing), how users can easily create and experiment with a range of EM workflows, and how CloudMatcher can scale to many concurrent users and large datasets.<\/jats:p>","DOI":"10.14778\/3229863.3236255","type":"journal-article","created":{"date-parts":[[2018,9,10]],"date-time":"2018-09-10T12:12:28Z","timestamp":1536581548000},"page":"2042-2045","source":"Crossref","is-referenced-by-count":8,"title":["Cloudmatcher"],"prefix":"10.14778","volume":"11","author":[{"given":"Yash","family":"Govind","sequence":"first","affiliation":[{"name":"University of Wisconsin-Madison"}]},{"given":"Erik","family":"Paulson","sequence":"additional","affiliation":[{"name":"University of Wisconsin-Madison and Johnson Controls"}]},{"given":"Palaniappan","family":"Nagarajan","sequence":"additional","affiliation":[{"name":"University of Wisconsin-Madison"}]},{"given":"Paul Suganthan G.","family":"C.","sequence":"additional","affiliation":[{"name":"University of Wisconsin-Madison"}]},{"given":"AnHai","family":"Doan","sequence":"additional","affiliation":[{"name":"University of Wisconsin-Madison"}]},{"given":"Youngchoon","family":"Park","sequence":"additional","affiliation":[{"name":"Johnson Controls"}]},{"given":"Glenn M.","family":"Fung","sequence":"additional","affiliation":[{"name":"American Family Insurance"}]},{"given":"Devin","family":"Conathan","sequence":"additional","affiliation":[{"name":"American Family Insurance"}]},{"given":"Marshall","family":"Carter","sequence":"additional","affiliation":[{"name":"American Family Insurance"}]},{"given":"Mingju","family":"Sun","sequence":"additional","affiliation":[{"name":"American Family Insurance"}]}],"member":"320","published-online":{"date-parts":[[2018,8]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"Dedupe https:\/\/dedupe.io\/.  Dedupe https:\/\/dedupe.io\/."},{"key":"e_1_2_1_2_1","volume-title":"Entity Resolution, and Duplicate Detection","author":"Christen P.","year":"2012","unstructured":"P. Christen . Data Matching: Concepts and Techniques for Record Linkage , Entity Resolution, and Duplicate Detection . Springer Publishing Company, Inc orporated, 2012 . P. Christen. Data Matching: Concepts and Techniques for Record Linkage, Entity Resolution, and Duplicate Detection. Springer Publishing Company, Incorporated, 2012."},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/3035918.3035960"},{"key":"e_1_2_1_4_1","volume-title":"Principles of Data Integration. Morgan Kaufmann","author":"Doan A.","year":"2012","unstructured":"A. Doan , A. Halevy , and Z. Ives . Principles of Data Integration. Morgan Kaufmann , 1 st edition, 2012 . A. Doan, A. Halevy, and Z. Ives. Principles of Data Integration. Morgan Kaufmann, 1st edition, 2012.","edition":"1"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2007.250581"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/2588555.2588576"},{"key":"e_1_2_1_7_1","volume-title":"BIGDAS","author":"Govind Y.","year":"2017","unstructured":"Y. Govind : A cloud\/crowd service for entity matching . In BIGDAS , 2017 . Y. Govind et al. Cloudmatcher: A cloud\/crowd service for entity matching. In BIGDAS, 2017."},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.14778\/2994509.2994535"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.14778\/2350229.2350263"}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/3229863.3236255","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,28]],"date-time":"2022-12-28T10:12:44Z","timestamp":1672222364000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/3229863.3236255"}},"subtitle":["a hands-off cloud\/crowd service for entity matching"],"short-title":[],"issued":{"date-parts":[[2018,8]]},"references-count":9,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2018,8]]}},"alternative-id":["10.14778\/3229863.3236255"],"URL":"https:\/\/doi.org\/10.14778\/3229863.3236255","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2018,8]]}}}