{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,11]],"date-time":"2026-01-11T05:40:28Z","timestamp":1768110028549,"version":"3.49.0"},"publisher-location":"New York, NY, USA","reference-count":63,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,6,9]],"date-time":"2022-06-09T00:00:00Z","timestamp":1654732800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,6,9]]},"DOI":"10.1145\/3519939.3523705","type":"proceedings-article","created":{"date-parts":[[2022,6,2]],"date-time":"2022-06-02T21:05:05Z","timestamp":1654203905000},"page":"993-1009","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["Landmarks and regions: a robust approach to data extraction"],"prefix":"10.1145","author":[{"given":"Suresh","family":"Parthasarathy","sequence":"first","affiliation":[{"name":"Microsoft, UK"}]},{"given":"Lincy","family":"Pattanaik","sequence":"additional","affiliation":[{"name":"Microsoft Research, India"}]},{"given":"Anirudh","family":"Khatry","sequence":"additional","affiliation":[{"name":"Microsoft Research, India"}]},{"given":"Arun","family":"Iyer","sequence":"additional","affiliation":[{"name":"Microsoft Research, India"}]},{"given":"Arjun","family":"Radhakrishna","sequence":"additional","affiliation":[{"name":"Microsoft, USA"}]},{"given":"Sriram K.","family":"Rajamani","sequence":"additional","affiliation":[{"name":"Microsoft Research, India"}]},{"given":"Mohammad","family":"Raza","sequence":"additional","affiliation":[{"name":"Microsoft, USA"}]}],"member":"320","published-online":{"date-parts":[[2022,6,9]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"[n.d.]. 
Beautiful soup: We called him tortoise because he taught us. https:\/\/www.crummy.com\/software\/BeautifulSoup\/"},{"key":"e_1_3_2_1_2_1","unstructured":"[n.d.]. imacros. https:\/\/wiki.imacros.net\/Main_Page"},{"key":"e_1_3_2_1_3_1","unstructured":"[n.d.]. Selenium-web browser automation. https:\/\/www.selenium.dev\/"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/276304.276330"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/336597.336644"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.3233\/978-1-61499-495-4-1"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-21668-3_10"},{"key":"e_1_3_2_1_8_1","volume-title":"Axel Legay and Tiziana Margaria (Eds.) (Lecture Notes in Computer Science","volume":"336","author":"Alur Rajeev","year":"2017","unstructured":"Rajeev Alur, Arjun Radhakrishna, and Abhishek Udupa. 2017. Scaling Enumerative Program Synthesis via Divide and Conquer. In TACAS (1), Axel Legay and Tiziana Margaria (Eds.) (Lecture Notes in Computer Science, Vol. 10205). 319\u2013336. 
isbn:978-3-662-54577-5 http:\/\/dblp.uni-trier.de\/db\/conf\/tacas\/tacas2017-1.html##AlurRU17"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/872757.872799"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/2983990.2984020"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"crossref","unstructured":"Ranjita Bhagwan, Sonu Mehta, Arjun Radhakrishna, and Sahil Garg. 2021. Learning Patterns in Configuration. In ASE.","DOI":"10.1109\/ASE51524.2021.9678525"},{"key":"e_1_3_2_1_12_1","volume-title":"International workshop on the world wide web and databases. 172\u2013183","author":"Brin Sergey","year":"1998","unstructured":"Sergey Brin. 1998. Extracting patterns and relations from the world wide web. In International workshop on the world wide web and databases. 172\u2013183."},{"key":"e_1_3_2_1_13_1","unstructured":"Mengli Cheng, Minghui Qiu, Xing Shi, Jun Huang, and Wei Lin. 2020. One-shot Text Field labeling using Attention and Belief Propagation for Structure Information Extraction. In ACM Multimedia, Chang Wen Chen, Rita Cucchiara, Xian-Sheng Hua, Guo-Jun Qi, Elisa Ricci, Zhengyou Zhang, and Roger Zimmermann (Eds.). ACM, 340\u2013348. 
isbn:978-1-4503-7988-5 http:\/\/dblp.uni-trier.de\/db\/conf\/mm\/mm2020.html##ChengQSH020"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/1142473.1142555"},{"key":"e_1_3_2_1_15_1","volume-title":"VLDB 2001, Proceedings of 27th International Conference on Very Large Data Bases","author":"Crescenzi Valter","year":"2001","unstructured":"Valter Crescenzi, Giansalvatore Mecca, and Paolo Merialdo. 2001. RoadRunner: Towards Automatic Data Extraction from Large Web Sites. In VLDB 2001, Proceedings of 27th International Conference on Very Large Data Bases, September 11-14, 2001, Roma, Italy, Peter M. G. Apers, Paolo Atzeni, Stefano Ceri, Stefano Paraboschi, Kotagiri Ramamohanarao, and Richard T. Snodgrass (Eds.). Morgan Kaufmann, 109\u2013118. http:\/\/www.vldb.org\/conf\/2001\/P109.pdf"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.14778\/1938545.1938547"},{"key":"e_1_3_2_1_17_1","volume-title":"A Machine Learning Framework for Data Ingestion in Document Images.. CoRR, abs\/2003.00838","author":"Fu Han","year":"2020","unstructured":"Han Fu, Yunyu Bai, Zhuo Li, Jun Shen, and Jianling Sun. 2020. A Machine Learning Framework for Data Ingestion in Document Images. 
CoRR, abs\/2003.00838 (2020), http:\/\/dblp.uni-trier.de\/db\/journals\/corr\/corr2003.html##abs-2003-00838"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/2882903.2915214"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.81"},{"key":"e_1_3_2_1_20_1","volume-title":"Kleister: A novel task for Information Extraction involving Long Documents with Complex Layout.. CoRR, abs\/2003.02356","author":"Gralinski Filip","year":"2020","unstructured":"Filip Gralinski, Tomasz Stanislawek, Anna Wr\u00f3blewska, Dawid Lipinski, Agnieszka Kaliska, Paulina Rosalska, Bartosz Topolski, and Przemyslaw Biecek. 2020. Kleister: A novel task for Information Extraction involving Long Documents with Complex Layout. CoRR, abs\/2003.02356 (2020), http:\/\/dblp.uni-trier.de\/db\/journals\/corr\/corr2003.html##abs-2003-02356"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/1925844.1926423"},{"key":"e_1_3_2_1_22_1","volume-title":"Generating finite-state transducers for semi-structured data extraction from the web. Information systems, 23, 8","author":"Hsu Chun-Nan","year":"1998","unstructured":"Chun-Nan Hsu and Ming-Tzung Dung. 1998. Generating finite-state transducers for semi-structured data extraction from the web. 
Information systems, 23, 8 (1998), 521\u2013538."},{"key":"e_1_3_2_1_23_1","volume-title":"Rajamani","author":"Iyer Arun Shankar","year":"2019","unstructured":"Arun Shankar Iyer, Manohar Jonnalagedda, Suresh Parthasarathy, Arjun Radhakrishna, and Sriram K. Rajamani. 2019. Synthesis and machine learning for heterogeneous extraction. In PLDI, Kathryn S. McKinley and Kathleen Fisher (Eds.). ACM, 301\u2013315. isbn:978-1-4503-6712-7 http:\/\/dblp.uni-trier.de\/db\/conf\/pldi\/pldi2019.html##IyerJPRR19"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/2479787.2479798"},{"key":"e_1_3_2_1_25_1","volume-title":"Wrapper induction for information extraction","author":"Kushmerick Nicholas","unstructured":"Nicholas Kushmerick. 1997. Wrapper induction for information extraction. University of Washington."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0004-3702(99)00100-9"},{"key":"e_1_3_2_1_27_1","unstructured":"Nicholas Kushmerick. 1999. Regression testing for wrapper maintenance. In Aaai\/iaai. 74\u201379."},{"key":"e_1_3_2_1_28_1","volume-title":"PLDI, Michael F. P. O\u2019Boyle and Keshav Pingali (Eds.). ACM, 542\u2013553. isbn:978-1-4503-2784-8 http:\/\/dblp.uni-trier.de\/db\/conf\/pldi\/pldi2014.html##LeG14","author":"Le Vu","unstructured":"Vu Le and Sumit Gulwani. 2014. FlashExtract: a framework for data extraction by examples. In PLDI, Michael F. P. 
O\u2019Boyle and Keshav Pingali (Eds.). ACM, 542\u2013553. isbn:978-1-4503-2784-8 http:\/\/dblp.uni-trier.de\/db\/conf\/pldi\/pldi2014.html##LeG14"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1002\/smr.1771"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.5555\/1622420.1622425"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/1357054.1357323"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/1753326.1753432"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/1502650.1502667"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-86549-8_35"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/2396761.2398464"},{"key":"e_1_3_2_1_36_1","unstructured":"Microsoft. [n.d.]. Azure Form Recognizer. https:\/\/azure.microsoft.com\/en-in\/services\/form-recognizer\/"},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/301136.301191"},{"key":"e_1_3_2_1_38_1","unstructured":"Jussi Myllymaki and Jared Jackson. 2002. Robust web data extraction with xml path expressions. 
Technical report."},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/3018661.3018740"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/3276520"},{"key":"e_1_3_2_1_41_1","volume-title":"FlashMeta: a framework for inductive program synthesis","author":"Polozov Oleksandr","year":"2015","unstructured":"Oleksandr Polozov and Sumit Gulwani. 2015. FlashMeta: a framework for inductive program synthesis. In OOPSLA, Jonathan Aldrich and Patrick Eugster (Eds.). ACM, 107\u2013126. isbn:978-1-4503-3689-5 http:\/\/dblp.uni-trier.de\/db\/conf\/oopsla\/oopsla2015.html##PolozovG15"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/1066677.1066826"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/2837614.2837671"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v31i1.10668"},{"key":"e_1_3_2_1_45_1","volume-title":"Disjunctive Program Synthesis: A Robust Approach to Programming by Example","author":"Raza Mohammad","year":"2018","unstructured":"Mohammad Raza and Sumit Gulwani. 2018. Disjunctive Program Synthesis: A Robust Approach to Programming by Example. In AAAI, Sheila A. McIlraith and Kilian Q. Weinberger (Eds.). AAAI Press, 1403\u20131412. 
http:\/\/dblp.uni-trier.de\/db\/conf\/aaai\/aaai2018.html##RazaG18"},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/3318464.3380608"},{"key":"e_1_3_2_1_47_1","volume-title":"Barrett","author":"Reynolds Andrew","year":"2015","unstructured":"Andrew Reynolds, Morgan Deters, Viktor Kuncak, Cesare Tinelli, and Clark W. Barrett. 2015. Counterexample-Guided Quantifier Instantiation for Synthesis in SMT. In CAV (2), Daniel Kroening and Corina S. Pasareanu (Eds.) (Lecture Notes in Computer Science, Vol. 9207). Springer, 198\u2013216. isbn:978-3-319-21667-6 http:\/\/dblp.uni-trier.de\/db\/conf\/cav\/cav2015.html##ReynoldsDKTB15"},{"key":"e_1_3_2_1_48_1","volume-title":"Alchemist: Learning Guarded Affine Functions.. In CAV (1), Daniel Kroening and Corina S","author":"Saha Shambwaditya","year":"2015","unstructured":"Shambwaditya Saha, Pranav Garg, and P. Madhusudan. 2015. Alchemist: Learning Guarded Affine Functions. In CAV (1), Daniel Kroening and Corina S. Pasareanu (Eds.) (Lecture Notes in Computer Science, Vol. 9206). Springer, 440\u2013446. 
isbn:978-3-319-21689-8 http:\/\/dblp.uni-trier.de\/db\/conf\/cav\/cav2015-1.html##Saha0M15"},{"key":"e_1_3_2_1_49_1","first-page":"738","article-title":"Building light-weight wrappers for legacy web data-sources using W4F","volume":"99","author":"Sahuguet Arnaud","year":"1999","unstructured":"Arnaud Sahuguet and Fabien Azavant. 1999. Building light-weight wrappers for legacy web data-sources using W4F. In Vldb. 99, 738\u2013741.","journal-title":"Vldb."},{"key":"e_1_3_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/1963192.1963251"},{"key":"e_1_3_2_1_51_1","volume-title":"Web information extraction using markov logic networks","author":"Satpal Sandeepkumar","year":"2011","unstructured":"Sandeepkumar Satpal, Sahely Bhadra, Sundararajan Sellamanickam, Rajeev Rastogi, and Prithviraj Sen. 2011. Web information extraction using markov logic networks. In KDD, Chid Apt\u00e9, Joydeep Ghosh, and Padhraic Smyth (Eds.). ACM, 1406\u20131414. isbn:978-1-4503-0813-7 http:\/\/dblp.uni-trier.de\/db\/conf\/kdd\/kdd2011.html##SatpalBSRS11"},{"key":"e_1_3_2_1_52_1","volume-title":"14th International Workshop On Neural-symbolic Learning And Reasoning. abs\/1906","author":"Sunder Vishal","year":"2019","unstructured":"Vishal Sunder, Ashwin Srinivasan, Lovekesh Vig, Gautam M. Shroff, and Rohit Rahul. 2019. One-shot Information Extraction from Document Images using Neuro-Deductive Program Synthesis. In 14th International Workshop On Neural-symbolic Learning And Reasoning. 
abs\/1906.02427, http:\/\/dblp.uni-trier.de\/db\/journals\/corr\/corr1906.html##abs-1906-02427"},{"key":"e_1_3_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1145\/2491956.2462174"},{"key":"e_1_3_2_1_54_1","unstructured":"Wikipedia. [n.d.]. Precision and Recall. https:\/\/en.wikipedia.org\/wiki\/Precision_and_recall"},{"key":"e_1_3_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1145\/3394486.3403172"},{"key":"e_1_3_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1145\/2610384.2610390"},{"key":"e_1_3_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2006.197"},{"key":"e_1_3_2_1_58_1","volume-title":"25th International Conference on Pattern Recognition (ICPR).","author":"Zhang Mengshi","year":"2020","unstructured":"Mengshi Zhang, Daniel Perelman, Vu Le, and Sumit Gulwani. 2020. An Integrated Approach of Deep Learning and Symbolic Analysis for Digital PDF Table Extraction. In 25th International Conference on Pattern Recognition (ICPR)."},{"key":"e_1_3_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1145\/2783258.2788580"},{"key":"e_1_3_2_1_60_1","volume-title":"Smola","author":"Zhang Weinan","year":"2015","unstructured":"Weinan Zhang, Amr Ahmed, Jie Yang, Vanja Josifovski, and Alexander J. Smola. 2015. 
Annotating Needles in the Haystack without Looking: Product Information Extraction from Emails. In KDD, Longbing Cao, Chengqi Zhang, Thorsten Joachims, Geoffrey I. Webb, Dragos D. Margineantu, and Graham Williams (Eds.). ACM, 2257\u20132266. isbn:978-1-4503-3664-2 http:\/\/dblp.uni-trier.de\/db\/conf\/kdd\/kdd2015.html##ZhangAYJS15"},{"key":"e_1_3_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.1145\/1102351.1102483"},{"key":"e_1_3_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.1145\/1150402.1150457"},{"key":"e_1_3_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.5555\/1390681.1442784"}],"event":{"name":"PLDI '22: 43rd ACM SIGPLAN International Conference on Programming Language Design and Implementation","location":"San Diego CA USA","acronym":"PLDI '22","sponsor":["SIGPLAN ACM Special Interest Group on Programming Languages"]},"container-title":["Proceedings of the 43rd ACM SIGPLAN International Conference on Programming Language Design and 
Implementation"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3519939.3523705","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3519939.3523705","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T18:10:30Z","timestamp":1750183830000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3519939.3523705"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,6,9]]},"references-count":63,"alternative-id":["10.1145\/3519939.3523705","10.1145\/3519939"],"URL":"https:\/\/doi.org\/10.1145\/3519939.3523705","relation":{},"subject":[],"published":{"date-parts":[[2022,6,9]]},"assertion":[{"value":"2022-06-09","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}