{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,8]],"date-time":"2026-04-08T08:55:19Z","timestamp":1775638519421,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":86,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,6,9]],"date-time":"2021-06-09T00:00:00Z","timestamp":1623196800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"DARPA","award":["D3M"],"award-info":[{"award-number":["D3M"]}]},{"DOI":"10.13039\/100000001","name":"NSF (National Science Foundation)","doi-asserted-by":"publisher","award":["OAC-1640864"],"award-info":[{"award-number":["OAC-1640864"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,6,9]]},"DOI":"10.1145\/3448016.3458456","type":"proceedings-article","created":{"date-parts":[[2021,6,18]],"date-time":"2021-06-18T17:22:39Z","timestamp":1624036959000},"page":"1531-1544","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":38,"title":["Correlation Sketches for Approximate Join-Correlation Queries"],"prefix":"10.1145","author":[{"given":"A\u00e9cio","family":"Santos","sequence":"first","affiliation":[{"name":"New York University, New York, NY, USA"}]},{"given":"Aline","family":"Bessa","sequence":"additional","affiliation":[{"name":"New York University, New York, NY, USA"}]},{"given":"Fernando","family":"Chirigati","sequence":"additional","affiliation":[{"name":"Springer Nature, New York, NY, USA"}]},{"given":"Christopher","family":"Musco","sequence":"additional","affiliation":[{"name":"New York University, New York, NY, USA"}]},{"given":"Juliana","family":"Freire","sequence":"additional","affiliation":[{"name":"New York University, New York, NY, USA"}]}],"member":"320","published-online":{"date-parts":[[2021,6,18]]},"reference":[{"key":"e_1_3_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/304181.304207"},{"key":"e_1_3_2_2_2_1","unstructured":"Apache lucene. https:\/\/lucene.apache.org\/index.html.  Apache lucene. https:\/\/lucene.apache.org\/index.html."},{"key":"e_1_3_2_2_3_1","volume-title":"A new correlation coefficient between categorical, ordinal and interval variables with pearson characteristics. Computational Statistics & Data Analysis, page 107043","author":"Baak M.","year":"2020","unstructured":"M. Baak , R. Koopman , H. Snoek , and S. Klous . A new correlation coefficient between categorical, ordinal and interval variables with pearson characteristics. Computational Statistics & Data Analysis, page 107043 , 2020 . M. Baak, R. Koopman, H. Snoek, and S. Klous. A new correlation coefficient between categorical, ordinal and interval variables with pearson characteristics. Computational Statistics & Data Analysis, page 107043, 2020."},{"key":"e_1_3_2_2_4_1","volume-title":"understand and manage your data with Data Catalog, now GA. https:\/\/cloud.google.com\/blog\/products\/data-analytics\/data-catalog-metadata-management-now-generally-available","author":"Bapat S.","year":"2020","unstructured":"S. Bapat . Discover , understand and manage your data with Data Catalog, now GA. https:\/\/cloud.google.com\/blog\/products\/data-analytics\/data-catalog-metadata-management-now-generally-available , 2020 . [Online; accessed 22-June-2020]. S. Bapat. Discover, understand and manage your data with Data Catalog, now GA. https:\/\/cloud.google.com\/blog\/products\/data-analytics\/data-catalog-metadata-management-now-generally-available, 2020. [Online; accessed 22-June-2020]."},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.5555\/646978.711822"},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.3150\/14-BEJ605"},{"key":"e_1_3_2_2_7_1","volume-title":"A monte carlo investigation of the fisher z transformation for normal and nonnormal distributions. Psychological Reports, 87(3_suppl):1101--1114","author":"Berry K. J.","year":"2000","unstructured":"K. J. Berry and P. W. Mielke Jr . A monte carlo investigation of the fisher z transformation for normal and nonnormal distributions. Psychological Reports, 87(3_suppl):1101--1114 , 2000 . K. J. Berry and P. W. Mielke Jr. A monte carlo investigation of the fisher z transformation for normal and nonnormal distributions. Psychological Reports, 87(3_suppl):1101--1114, 2000."},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/1562764.1562787"},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/1247480.1247504"},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1037\/a0028087"},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1177\/0013164414557639"},{"key":"e_1_3_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.3758\/s13428-016-0702-8"},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1111\/bmsp.12113"},{"key":"e_1_3_2_2_14_1","volume-title":"Statistics in biology. statistical methods for research in the natural sciences. Statistics in biology. Statistical methods for research in the natural sciences","author":"Bliss C. I.","year":"1967","unstructured":"C. I. Bliss Statistics in biology. statistical methods for research in the natural sciences. Statistics in biology. Statistical methods for research in the natural sciences ., 1967 . C. I. Bliss et al. Statistics in biology. statistical methods for research in the natural sciences. Statistics in biology. Statistical methods for research in the natural sciences., 1967."},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF02294183"},{"key":"e_1_3_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1080\/01621459.1928.10502991"},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/3308558.3313685"},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.5555\/2167714.2167726"},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.14778\/1453856.1453916"},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2019.00109"},{"key":"e_1_3_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/335168.335230"},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/3035918.3035921"},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.14778\/3397230.3397235"},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.14778\/1453856.1453884"},{"key":"e_1_3_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/3097983.3097999"},{"key":"e_1_3_2_2_26_1","volume-title":"Synopses for massive data: Samples, histograms, wavelets, sketches. Foundations and Trends in Databases, 4(1--3):1--294","author":"Cormode G.","year":"2012","unstructured":"G. Cormode , M. N. Garofalakis , P. J. Haas , and C. Jermaine . Synopses for massive data: Samples, histograms, wavelets, sketches. Foundations and Trends in Databases, 4(1--3):1--294 , 2012 . G. Cormode, M. N. Garofalakis, P. J. Haas, and C. Jermaine. Synopses for massive data: Samples, histograms, wavelets, sketches. Foundations and Trends in Databases, 4(1--3):1--294, 2012."},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.5555\/3295222.3295407"},{"key":"e_1_3_2_2_28_1","first-page":"1","volume-title":"19th International Conference on Database Theory (ICDT 2016), volume 48 of Leibniz International Proceedings in Informatics (LIPIcs)","author":"Dasgupta A.","year":"2016","unstructured":"A. Dasgupta , K. J. Lang , L. Rhodes , and J. Thaler . A Framework for Estimating Stream Expression Cardinalities. In W. Martens and T. Zeume, editors , 19th International Conference on Database Theory (ICDT 2016), volume 48 of Leibniz International Proceedings in Informatics (LIPIcs) , pages 6: 1 -- 6 :17, Dagstuhl, Germany , 2016 . Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik. A. Dasgupta, K. J. Lang, L. Rhodes, and J. Thaler. A Framework for Estimating Stream Expression Cardinalities. In W. Martens and T. Zeume, editors, 19th International Conference on Database Theory (ICDT 2016), volume 48 of Leibniz International Proceedings in Informatics (LIPIcs), pages 6:1--6:17, Dagstuhl, Germany, 2016. Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik."},{"key":"e_1_3_2_2_29_1","first-page":"1","volume-title":"19th International Conference on Database Theory (ICDT 2016), volume 48 of Leibniz International Proceedings in Informatics (LIPIcs)","author":"Dasgupta A.","year":"2016","unstructured":"A. Dasgupta , K. J. Lang , L. Rhodes , and J. Thaler . A Framework for Estimating Stream Expression Cardinalities. In W. Martens and T. Zeume, editors , 19th International Conference on Database Theory (ICDT 2016), volume 48 of Leibniz International Proceedings in Informatics (LIPIcs) , pages 6: 1 -- 6 :17, Dagstuhl, Germany , 2016 . Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik. A. Dasgupta, K. J. Lang, L. Rhodes, and J. Thaler. A Framework for Estimating Stream Expression Cardinalities. In W. Martens and T. Zeume, editors, 19th International Conference on Database Theory (ICDT 2016), volume 48 of Leibniz International Proceedings in Informatics (LIPIcs), pages 6:1--6:17, Dagstuhl, Germany, 2016. Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik."},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1037\/met0000079"},{"key":"e_1_3_2_2_31_1","volume-title":"Cidr","author":"Deng D.","year":"2017","unstructured":"D. Deng , R. C. Fernandez , Z. Abedjan , S. Wang , M. Stonebraker , A. K. Elmagarmid , I. F. Ilyas , S. Madden , M. Ouzzani , and N. Tang . The data civilizer system . In Cidr , 2017 . D. Deng, R. C. Fernandez, Z. Abedjan, S. Wang, M. Stonebraker, A. K. Elmagarmid, I. F. Ilyas, S. Madden, M. Ouzzani, and N. Tang. The data civilizer system. In Cidr, 2017."},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1093\/biomet\/62.3.531"},{"key":"e_1_3_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/1314690.1314696"},{"key":"e_1_3_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1201\/9780429246593"},{"key":"e_1_3_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2006.61"},{"key":"e_1_3_2_2_36_1","first-page":"1001","volume-title":"Aurum: A Data Discovery System. In ICDE '18","author":"Fernandez R. C.","year":"2018","unstructured":"R. C. Fernandez , Z. Abedjan , F. Koko , G. Yuan , S. Madden , and M. Stonebraker . Aurum: A Data Discovery System. In ICDE '18 , pages 1001 -- 1012 , 2018 . R. C. Fernandez, Z. Abedjan, F. Koko, G. Yuan, S. Madden, and M. Stonebraker. Aurum: A Data Discovery System. In ICDE '18, pages 1001--1012, 2018."},{"key":"e_1_3_2_2_37_1","first-page":"137","volume-title":"AH, 2007 Conference on Analysis of Algorithms (AofA 07) of DMTCS Proceedings","author":"Flajolet P.","year":"2007","unstructured":"P. Flajolet , \u00c9. Fusy, O. Gandouet , and F. Meunier . HyperLogLog: the analysis of a near-optimal cardinality estimation algorithm. In P. Jacquet, editor, AofA: Analysis of Algorithms, volume DMTCS Proceedings vol . AH, 2007 Conference on Analysis of Algorithms (AofA 07) of DMTCS Proceedings , pages 137 -- 156 , Juan les Pins, France , June 2007 . Discrete Mathematics and Theoretical Computer Science. P. Flajolet, \u00c9. Fusy, O. Gandouet, and F. Meunier. HyperLogLog: the analysis of a near-optimal cardinality estimation algorithm. In P. Jacquet, editor, AofA: Analysis of Algorithms, volume DMTCS Proceedings vol. AH, 2007 Conference on Analysis of Algorithms (AofA 07) of DMTCS Proceedings, pages 137--156, Juan les Pins, France, June 2007. Discrete Mathematics and Theoretical Computer Science."},{"key":"e_1_3_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/235968.233340"},{"key":"e_1_3_2_2_39_1","volume-title":"https:\/\/eng.lyft.com\/amundsen-lyfts-data-discovery-metadata-engine-62d27254fbb9","author":"Grover M.","year":"2019","unstructured":"M. Grover . Amundsen - Lyft's data discovery & metadata engine. https:\/\/eng.lyft.com\/amundsen-lyfts-data-discovery-metadata-engine-62d27254fbb9 , 2019 . [Online; accessed 20-October-2019]. M. Grover. Amundsen - Lyft's data discovery & metadata engine. https:\/\/eng.lyft.com\/amundsen-lyfts-data-discovery-metadata-engine-62d27254fbb9, 2019. [Online; accessed 20-October-2019]."},{"key":"e_1_3_2_2_40_1","volume-title":"https:\/\/www.reportsanddata.com\/report-detail\/data-catalog-market","author":"Grover M.","year":"2020","unstructured":"M. Grover . Data Catalog Market | Size & Growth Report , 2020--2027. https:\/\/www.reportsanddata.com\/report-detail\/data-catalog-market , 2020 . [Online; accessed 28-March-2021]. M. Grover. Data Catalog Market | Size & Growth Report, 2020--2027. https:\/\/www.reportsanddata.com\/report-detail\/data-catalog-market, 2020. [Online; accessed 28-March-2021]."},{"key":"e_1_3_2_2_41_1","doi-asserted-by":"publisher","DOI":"10.5555\/645921.673295"},{"key":"e_1_3_2_2_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/3186728.3164145"},{"key":"e_1_3_2_2_43_1","doi-asserted-by":"publisher","DOI":"10.1080\/01621459.1963.10500830"},{"key":"e_1_3_2_2_44_1","doi-asserted-by":"publisher","DOI":"10.1080\/00031305.2018.1437077"},{"key":"e_1_3_2_2_45_1","doi-asserted-by":"publisher","DOI":"10.14778\/3372716.3372726"},{"key":"e_1_3_2_2_46_1","doi-asserted-by":"publisher","DOI":"10.5555\/1315451.1315455"},{"key":"e_1_3_2_2_47_1","volume-title":"Cumulated gain-based evaluation of ir techniques. ACM Transactions on Information Systems (TOIS), 20(4):422--446","author":"K.","year":"2002","unstructured":"K. J\"arvelin and J. Kek\"al\"ainen. Cumulated gain-based evaluation of ir techniques. ACM Transactions on Information Systems (TOIS), 20(4):422--446 , 2002 . K. J\"arvelin and J. Kek\"al\"ainen. Cumulated gain-based evaluation of ir techniques. ACM Transactions on Information Systems (TOIS), 20(4):422--446, 2002."},{"key":"e_1_3_2_2_48_1","volume-title":"CIDR 2019, 9th Biennial Conference on Innovative Data Systems Research, Asilomar, CA, USA, January 13--16, 2019, Online Proceedings. www.cidrdb.org","author":"Kipf A.","year":"2019","unstructured":"A. Kipf , T. Kipf , B. Radke , V. Leis , P. A. Boncz , and A. Kemper . Learned cardinalities: Estimating correlated joins with deep learning . In CIDR 2019, 9th Biennial Conference on Innovative Data Systems Research, Asilomar, CA, USA, January 13--16, 2019, Online Proceedings. www.cidrdb.org , 2019 . A. Kipf, T. Kipf, B. Radke, V. Leis, P. A. Boncz, and A. Kemper. Learned cardinalities: Estimating correlated joins with deep learning. In CIDR 2019, 9th Biennial Conference on Innovative Data Systems Research, Asilomar, CA, USA, January 13--16, 2019, Online Proceedings. www.cidrdb.org, 2019."},{"key":"e_1_3_2_2_49_1","volume-title":"Addison-Wesley","author":"Knuth D.","year":"1997","unstructured":"D. Knuth , Addison-Wesley, and P. Education . The Art of Computer Programming. Number v. 3 in Addison-Wesley series in computer science and information processing . Addison-Wesley , 1997 . D. Knuth, Addison-Wesley, and P. Education. The Art of Computer Programming. Number v. 3 in Addison-Wesley series in computer science and information processing. Addison-Wesley, 1997."},{"key":"e_1_3_2_2_50_1","volume-title":"A generalized metadata search & discovery tool. https:\/\/engineering.linkedin.com\/blog\/2019\/data-hub","author":"Lan M.","year":"2019","unstructured":"M. Lan . DataHub : A generalized metadata search & discovery tool. https:\/\/engineering.linkedin.com\/blog\/2019\/data-hub , 2019 . [Online; accessed 22-June-2020]. M. Lan. DataHub: A generalized metadata search & discovery tool. https:\/\/engineering.linkedin.com\/blog\/2019\/data-hub, 2019. [Online; accessed 22-June-2020]."},{"key":"e_1_3_2_2_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/2872518.2889386"},{"issue":"159","key":"e_1_3_2_2_52_1","first-page":"166","article-title":"The mannheim search join engine","volume":"35","author":"Lehmberg O.","year":"2015","unstructured":"O. Lehmberg , D. Ritze , P. Ristoski , R. Meusel , H. Paulheim , and C. Bizer . The mannheim search join engine . Journal of Web Semantics , 35 : 159 -- 166 , 2015 . O. Lehmberg, D. Ritze, P. Ristoski, R. Meusel, H. Paulheim, and C. Bizer. The mannheim search join engine. Journal of Web Semantics, 35:159 -- 166, 2015.","journal-title":"Journal of Web Semantics"},{"key":"e_1_3_2_2_53_1","volume-title":"CIDR 2017, 8th Biennial Conference on Innovative Data Systems Research, Chaminade, CA, USA, January 8--11, 2017, Online Proceedings. www.cidrdb.org","author":"Leis V.","year":"2017","unstructured":"V. Leis , B. Radke , A. Gubichev , A. Kemper , and T. Neumann . Cardinality estimation done right: Index-based join sampling . In CIDR 2017, 8th Biennial Conference on Innovative Data Systems Research, Chaminade, CA, USA, January 8--11, 2017, Online Proceedings. www.cidrdb.org , 2017 . V. Leis, B. Radke, A. Gubichev, A. Kemper, and T. Neumann. Cardinality estimation done right: Index-based join sampling. In CIDR 2017, 8th Biennial Conference on Innovative Data Systems Research, Chaminade, CA, USA, January 8--11, 2017, Online Proceedings. www.cidrdb.org, 2017."},{"key":"e_1_3_2_2_54_1","doi-asserted-by":"publisher","DOI":"10.1145\/93605.93611"},{"key":"e_1_3_2_2_55_1","doi-asserted-by":"publisher","DOI":"10.1037\/0033-2909.105.1.156"},{"key":"e_1_3_2_2_56_1","doi-asserted-by":"publisher","DOI":"10.14778\/3192965.3192973"},{"key":"e_1_3_2_2_57_1","unstructured":"Nyc vision zero initiative. http:\/\/www1.nyc.gov\/site\/visionzero\/index.page.  Nyc vision zero initiative. http:\/\/www1.nyc.gov\/site\/visionzero\/index.page."},{"key":"e_1_3_2_2_58_1","unstructured":"NYC OpenData. https:\/\/opendata.cityofnewyork.us.  NYC OpenData. https:\/\/opendata.cityofnewyork.us."},{"key":"e_1_3_2_2_59_1","unstructured":"United States Government Open Data. https:\/\/www.data.gov.  United States Government Open Data. https:\/\/www.data.gov."},{"key":"e_1_3_2_2_60_1","doi-asserted-by":"publisher","DOI":"10.1145\/872757.872835"},{"key":"e_1_3_2_2_61_1","doi-asserted-by":"publisher","DOI":"10.1002\/9780470316436"},{"key":"e_1_3_2_2_62_1","doi-asserted-by":"publisher","DOI":"10.2307\/2685263"},{"key":"e_1_3_2_2_63_1","doi-asserted-by":"publisher","DOI":"10.1145\/1386118.1386121"},{"key":"e_1_3_2_2_64_1","volume-title":"Correlation sketches for approximate join-correlation queries. arXiv preprint arXiv:2104.03353","author":"Santos A.","year":"2021","unstructured":"A. Santos , A. Bessa , F. Chirigati , C. Musco , and J. Freire . Correlation sketches for approximate join-correlation queries. arXiv preprint arXiv:2104.03353 , 2021 . A. Santos, A. Bessa, F. Chirigati, C. Musco, and J. Freire. Correlation sketches for approximate join-correlation queries. arXiv preprint arXiv:2104.03353, 2021."},{"key":"e_1_3_2_2_65_1","doi-asserted-by":"publisher","DOI":"10.1002\/9781119264507"},{"key":"e_1_3_2_2_66_1","doi-asserted-by":"publisher","DOI":"10.3758\/BRM.42.4.906"},{"key":"e_1_3_2_2_67_1","unstructured":"The Socrata Open Data API. https:\/\/dev.socrata.com.  The Socrata Open Data API. https:\/\/dev.socrata.com."},{"key":"e_1_3_2_2_68_1","doi-asserted-by":"publisher","DOI":"10.1214\/009053607000000505"},{"key":"e_1_3_2_2_69_1","unstructured":"The Tablesaw Library. https:\/\/github.com\/jtablesaw\/tablesaw.  The Tablesaw Library. https:\/\/github.com\/jtablesaw\/tablesaw."},{"key":"e_1_3_2_2_70_1","doi-asserted-by":"publisher","DOI":"10.1145\/2939672.2939772"},{"key":"e_1_3_2_2_71_1","doi-asserted-by":"publisher","DOI":"10.1145\/2247596.2247642"},{"key":"e_1_3_2_2_72_1","doi-asserted-by":"publisher","DOI":"10.14778\/2824032.2824051"},{"key":"e_1_3_2_2_73_1","doi-asserted-by":"publisher","DOI":"10.1016\/0167-9473(95)00038-0"},{"key":"e_1_3_2_2_74_1","volume-title":"https:\/\/medium.com\/airbnb-engineering\/democratizing-data-at-airbnb-852d76c51770","author":"Williams C. C.","year":"2017","unstructured":"C. C. Williams . Democratizing Data at Airbnb . https:\/\/medium.com\/airbnb-engineering\/democratizing-data-at-airbnb-852d76c51770 , 2017 . [Online; accessed 22-June-2020]. C. C. Williams. Democratizing Data at Airbnb. https:\/\/medium.com\/airbnb-engineering\/democratizing-data-at-airbnb-852d76c51770, 2017. [Online; accessed 22-June-2020]."},{"key":"e_1_3_2_2_75_1","unstructured":"World Bank Open Data. https:\/\/data.worldbank.org.  World Bank Open Data. https:\/\/data.worldbank.org."},{"key":"e_1_3_2_2_76_1","unstructured":"World Bank Group Finances. https:\/\/finances.worldbank.org.  World Bank Group Finances. https:\/\/finances.worldbank.org."},{"key":"e_1_3_2_2_77_1","volume-title":"Efficient similarity joins for near-duplicate detection. ACM Transactions on Database Systems (TODS), 36(3):1--41","author":"Xiao C.","year":"2011","unstructured":"C. Xiao , W. Wang , X. Lin , J. X. Yu , and G. Wang . Efficient similarity joins for near-duplicate detection. ACM Transactions on Database Systems (TODS), 36(3):1--41 , 2011 . C. Xiao, W. Wang, X. Lin, J. X. Yu, and G. Wang. Efficient similarity joins for near-duplicate detection. ACM Transactions on Database Systems (TODS), 36(3):1--41, 2011."},{"key":"e_1_3_2_2_78_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2019.00048"},{"key":"e_1_3_2_2_79_1","doi-asserted-by":"publisher","DOI":"10.14778\/3368289.3368294"},{"key":"e_1_3_2_2_80_1","doi-asserted-by":"publisher","DOI":"10.1006\/jmva.1999.1858"},{"key":"e_1_3_2_2_81_1","doi-asserted-by":"publisher","DOI":"10.1177\/0049124105280200"},{"key":"e_1_3_2_2_82_1","volume-title":"Proceedings of the 2018 World Wide Web Conference, WWW '18, pages 1553--1562, Republic and Canton of Geneva, Switzerland, 2018. International World Wide Web Conferences Steering Committee.","author":"Zhang S.","unstructured":"S. Zhang and K. Balog . Ad hoc table retrieval using semantic similarity . In Proceedings of the 2018 World Wide Web Conference, WWW '18, pages 1553--1562, Republic and Canton of Geneva, Switzerland, 2018. International World Wide Web Conferences Steering Committee. S. Zhang and K. Balog. Ad hoc table retrieval using semantic similarity. In Proceedings of the 2018 World Wide Web Conference, WWW '18, pages 1553--1562, Republic and Canton of Geneva, Switzerland, 2018. International World Wide Web Conferences Steering Committee."},{"key":"e_1_3_2_2_83_1","volume-title":"Web table extraction, retrieval, and augmentation: A survey. ACM Transactions on Intelligent Systems and Technology (TIST), 11(2):1--35","author":"Zhang S.","year":"2020","unstructured":"S. Zhang and K. Balog . Web table extraction, retrieval, and augmentation: A survey. ACM Transactions on Intelligent Systems and Technology (TIST), 11(2):1--35 , 2020 . S. Zhang and K. Balog. Web table extraction, retrieval, and augmentation: A survey. ACM Transactions on Intelligent Systems and Technology (TIST), 11(2):1--35, 2020."},{"key":"e_1_3_2_2_84_1","doi-asserted-by":"publisher","DOI":"10.1145\/3318464.3389726"},{"key":"e_1_3_2_2_85_1","doi-asserted-by":"publisher","DOI":"10.1145\/3299869.3300065"},{"key":"e_1_3_2_2_86_1","doi-asserted-by":"publisher","DOI":"10.14778\/2994509.2994534"}],"event":{"name":"SIGMOD\/PODS '21: International Conference on Management of Data","location":"Virtual Event China","acronym":"SIGMOD\/PODS '21","sponsor":["SIGMOD ACM Special Interest Group on Management of Data"]},"container-title":["Proceedings of the 2021 International Conference on Management of Data"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3448016.3458456","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3448016.3458456","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3448016.3458456","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T21:25:04Z","timestamp":1750195504000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3448016.3458456"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,6,9]]},"references-count":86,"alternative-id":["10.1145\/3448016.3458456","10.1145\/3448016"],"URL":"https:\/\/doi.org\/10.1145\/3448016.3458456","relation":{},"subject":[],"published":{"date-parts":[[2021,6,9]]},"assertion":[{"value":"2021-06-18","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}