{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,1]],"date-time":"2025-12-01T11:25:44Z","timestamp":1764588344078,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":46,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T00:00:00Z","timestamp":1665360000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,10,10]]},"DOI":"10.1145\/3503161.3547751","type":"proceedings-article","created":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T15:43:01Z","timestamp":1665416581000},"page":"4614-4624","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":7,"title":["Relational Representation Learning in Visually-Rich Documents"],"prefix":"10.1145","author":[{"given":"Xin","family":"Li","sequence":"first","affiliation":[{"name":"Tencent YouTu Lab, Hefei, China"}]},{"given":"Yan","family":"Zheng","sequence":"additional","affiliation":[{"name":"Tencent YouTu Lab, Hefei, China"}]},{"given":"Yiqing","family":"Hu","sequence":"additional","affiliation":[{"name":"Tencent YouTu Lab, Hefei, China"}]},{"given":"Haoyu","family":"Cao","sequence":"additional","affiliation":[{"name":"Tencent YouTu Lab, Hefei, China"}]},{"given":"Yunfei","family":"Wu","sequence":"additional","affiliation":[{"name":"Tencent YouTu Lab, Hefei, China"}]},{"given":"Deqiang","family":"Jiang","sequence":"additional","affiliation":[{"name":"Tencent YouTu Lab, Hefei, China"}]},{"given":"Yinsong","family":"Liu","sequence":"additional","affiliation":[{"name":"Tencent YouTu Lab, Hefei, China"}]},{"given":"Bo","family":"Ren","sequence":"additional","affiliation":[{"name":"Tencent YouTu Lab, Hefei, China"}]}],"member":"320","published-online":{"date-parts":[[2022,10,10]]},"reference":[{"key":"e_1_3_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.00103"},{"key":"e_1_3_2_2_2_1","volume-title":"International conference on machine learning. PMLR, 1597--1607","author":"Chen Ting","year":"2020","unstructured":"Ting Chen , Simon Kornblith , Mohammad Norouzi , and Geoffrey Hinton . 2020 b. A simple framework for contrastive learning of visual representations . In International conference on machine learning. PMLR, 1597--1607 . Ting Chen, Simon Kornblith, Mohammad Norouzi, and Geoffrey Hinton. 2020b. A simple framework for contrastive learning of visual representations. In International conference on machine learning. PMLR, 1597--1607."},{"key":"e_1_3_2_2_3_1","volume-title":"Big self-supervised models are strong semi-supervised learners. Advances in neural information processing systems","author":"Chen Ting","year":"2020","unstructured":"Ting Chen , Simon Kornblith , Kevin Swersky , Mohammad Norouzi , and Geoffrey E Hinton . 2020c. Big self-supervised models are strong semi-supervised learners. Advances in neural information processing systems , Vol. 33 ( 2020 ), 22243--22255. Ting Chen, Simon Kornblith, Kevin Swersky, Mohammad Norouzi, and Geoffrey E Hinton. 2020c. Big self-supervised models are strong semi-supervised learners. Advances in neural information processing systems, Vol. 33 (2020), 22243--22255."},{"key":"e_1_3_2_2_4_1","volume-title":"Improved baselines with momentum contrastive learning. arXiv preprint arXiv:2003.04297","author":"Chen Xinlei","year":"2020","unstructured":"Xinlei Chen , Haoqi Fan , Ross Girshick , and Kaiming He. 2020a. Improved baselines with momentum contrastive learning. arXiv preprint arXiv:2003.04297 ( 2020 ). Xinlei Chen, Haoqi Fan, Ross Girshick, and Kaiming He. 2020a. Improved baselines with momentum contrastive learning. arXiv preprint arXiv:2003.04297 (2020)."},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.00950"},{"key":"e_1_3_2_2_6_1","volume-title":"Complicated table structure recognition. arXiv preprint arXiv:1908.04729","author":"Chi Zewen","year":"2019","unstructured":"Zewen Chi , Heyan Huang , Heng-Da Xu , Houjin Yu , Wanxuan Yin , and Xian-Ling Mao . 2019. Complicated table structure recognition. arXiv preprint arXiv:1908.04729 ( 2019 ). Zewen Chi, Heyan Huang, Heng-Da Xu, Houjin Yu, Wanxuan Yin, and Xian-Ling Mao. 2019. Complicated table structure recognition. arXiv preprint arXiv:1908.04729 (2019)."},{"key":"e_1_3_2_2_7_1","unstructured":"Zihang Dai Zhilin Yang Yiming Yang Jaime Carbonell Quoc Le and Ruslan Salakhutdinov. 2019. Transformer-XL: Attentive Language Models beyond a Fixed-Length Context. In ACL. ACL 2978--2988.  Zihang Dai Zhilin Yang Yiming Yang Jaime Carbonell Quoc Le and Ruslan Salakhutdinov. 2019. Transformer-XL: Attentive Language Models beyond a Fixed-Length Context. In ACL. ACL 2978--2988."},{"key":"e_1_3_2_2_8_1","volume-title":"Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805","author":"Devlin Jacob","year":"2018","unstructured":"Jacob Devlin , Ming-Wei Chang , Kenton Lee , and Kristina Toutanova . 2018 . Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018). Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)."},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v32i1.12063"},{"key":"e_1_3_2_2_10_1","volume-title":"ICDAR 2019 competition on table detection and recognition (cTDaR). In 2019 International Conference on Document Analysis and Recognition (ICDAR). IEEE, 1510--1515","author":"Gao Liangcai","year":"2019","unstructured":"Liangcai Gao , Yilun Huang , Herv\u00e9 D\u00e9jean , Jean-Luc Meunier , Qinqin Yan , Yu Fang , Florian Kleber , and Eva Lang . 2019 . ICDAR 2019 competition on table detection and recognition (cTDaR). In 2019 International Conference on Document Analysis and Recognition (ICDAR). IEEE, 1510--1515 . Liangcai Gao, Yilun Huang, Herv\u00e9 D\u00e9jean, Jean-Luc Meunier, Qinqin Yan, Yu Fang, Florian Kleber, and Eva Lang. 2019. ICDAR 2019 competition on table detection and recognition (cTDaR). In 2019 International Conference on Document Analysis and Recognition (ICDAR). IEEE, 1510--1515."},{"key":"e_1_3_2_2_11_1","volume-title":"ICDAR 2013 table competition. In 2013 12th International Conference on Document Analysis and Recognition. IEEE, 1449--1453","author":"G\u00f6bel Max","year":"2013","unstructured":"Max G\u00f6bel , Tamir Hassan , Ermelinda Oro , and Giorgio Orsi . 2013 . ICDAR 2013 table competition. In 2013 12th International Conference on Document Analysis and Recognition. IEEE, 1449--1453 . Max G\u00f6bel, Tamir Hassan, Ermelinda Oro, and Giorgio Orsi. 2013. ICDAR 2013 table competition. In 2013 12th International Conference on Document Analysis and Recognition. IEEE, 1449--1453."},{"key":"e_1_3_2_2_12_1","first-page":"21271","article-title":"Bootstrap your own latent-a new approach to self-supervised learning","volume":"33","author":"Grill Jean-Bastien","year":"2020","unstructured":"Jean-Bastien Grill , Florian Strub , Florent Altch\u00e9 , Corentin Tallec , Pierre Richemond , Elena Buchatskaya , Carl Doersch , Bernardo Avila Pires , Zhaohan Guo , Mohammad Gheshlaghi Azar , 2020 . Bootstrap your own latent-a new approach to self-supervised learning . Advances in Neural Information Processing Systems , Vol. 33 (2020), 21271 -- 21284 . Jean-Bastien Grill, Florian Strub, Florent Altch\u00e9, Corentin Tallec, Pierre Richemond, Elena Buchatskaya, Carl Doersch, Bernardo Avila Pires, Zhaohan Guo, Mohammad Gheshlaghi Azar, et al. 2020. Bootstrap your own latent-a new approach to self-supervised learning. Advances in Neural Information Processing Systems, Vol. 33 (2020), 21271--21284.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_2_13_1","volume-title":"Advances in Neural Information Processing Systems","volume":"34","author":"Gu Jiuxiang","year":"2021","unstructured":"Jiuxiang Gu , Jason Kuen , Vlad Morariu , Handong Zhao , Rajiv Jain , Nikolaos Barmpalios , Ani Nenkova , and Tong Sun . 2021 . UniDoc: Unified Pretraining Framework for Document Understanding . Advances in Neural Information Processing Systems , Vol. 34 (2021). Jiuxiang Gu, Jason Kuen, Vlad Morariu, Handong Zhao, Rajiv Jain, Nikolaos Barmpalios, Ani Nenkova, and Tong Sun. 2021. UniDoc: Unified Pretraining Framework for Document Understanding. Advances in Neural Information Processing Systems, Vol. 34 (2021)."},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00975"},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.322"},{"key":"e_1_3_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_2_17_1","volume-title":"BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents. arXiv preprint arXiv:2108.04539","author":"Hong Teakgyu","year":"2021","unstructured":"Teakgyu Hong , Donghyun Kim , Mingi Ji , Wonseok Hwang , Daehyun Nam , and Sungrae Park . 2021 . BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents. arXiv preprint arXiv:2108.04539 (2021). Teakgyu Hong, Donghyun Kim, Mingi Ji, Wonseok Hwang, Daehyun Nam, and Sungrae Park. 2021. BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents. arXiv preprint arXiv:2108.04539 (2021)."},{"key":"e_1_3_2_2_18_1","volume-title":"Spatial dependency parsing for semi-structured document information extraction. arXiv preprint arXiv:2005.00642","author":"Hwang Wonseok","year":"2020","unstructured":"Wonseok Hwang , Jinyeong Yim , Seunghyun Park , Sohee Yang , and Minjoon Seo . 2020. Spatial dependency parsing for semi-structured document information extraction. arXiv preprint arXiv:2005.00642 ( 2020 ). Wonseok Hwang, Jinyeong Yim, Seunghyun Park, Sohee Yang, and Minjoon Seo. 2020. Spatial dependency parsing for semi-structured document information extraction. arXiv preprint arXiv:2005.00642 (2020)."},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDARW.2019.10029"},{"key":"e_1_3_2_2_20_1","volume-title":"StructuralLM: Structural Pre-training for Form Understanding. arXiv preprint arXiv:2105.11210","author":"Li Chenliang","year":"2021","unstructured":"Chenliang Li , Bin Bi , Ming Yan , Wei Wang , Songfang Huang , Fei Huang , and Luo Si. 2021a. StructuralLM: Structural Pre-training for Form Understanding. arXiv preprint arXiv:2105.11210 ( 2021 ). Chenliang Li, Bin Bi, Ming Yan, Wei Wang, Songfang Huang, Fei Huang, and Luo Si. 2021a. StructuralLM: Structural Pre-training for Form Understanding. arXiv preprint arXiv:2105.11210 (2021)."},{"key":"e_1_3_2_2_21_1","volume-title":"Advances in Neural Information Processing Systems","volume":"34","author":"Li Junnan","year":"2021","unstructured":"Junnan Li , Ramprasaath Selvaraju , Akhilesh Gotmare , Shafiq Joty , Caiming Xiong , and Steven Chu Hong Hoi . 2021 c. Align before fuse: Vision and language representation learning with momentum distillation . Advances in Neural Information Processing Systems , Vol. 34 (2021). Junnan Li, Ramprasaath Selvaraju, Akhilesh Gotmare, Shafiq Joty, Caiming Xiong, and Steven Chu Hong Hoi. 2021c. Align before fuse: Vision and language representation learning with momentum distillation. Advances in Neural Information Processing Systems, Vol. 34 (2021)."},{"key":"e_1_3_2_2_22_1","volume-title":"DocBank: A benchmark dataset for document layout analysis. arXiv preprint arXiv:2006.01038","author":"Li Minghao","year":"2020","unstructured":"Minghao Li , Yiheng Xu , Lei Cui , Shaohan Huang , Furu Wei , Zhoujun Li , and Ming Zhou . 2020. DocBank: A benchmark dataset for document layout analysis. arXiv preprint arXiv:2006.01038 ( 2020 ). Minghao Li, Yiheng Xu, Lei Cui, Shaohan Huang, Furu Wei, Zhoujun Li, and Ming Zhou. 2020. DocBank: A benchmark dataset for document layout analysis. arXiv preprint arXiv:2006.01038 (2020)."},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/3474085.3475345"},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.106"},{"key":"e_1_3_2_2_25_1","volume-title":"Neural Collaborative Graph Machines for Table Structure Recognition. arXiv preprint arXiv:2111.13359","author":"Liu Hao","year":"2021","unstructured":"Hao Liu , Xin Li , Bing Liu , Deqiang Jiang , Yinsong Liu , and Bo Ren . 2021a. Neural Collaborative Graph Machines for Table Structure Recognition. arXiv preprint arXiv:2111.13359 ( 2021 ). Hao Liu, Xin Li, Bing Liu, Deqiang Jiang, Yinsong Liu, and Bo Ren. 2021a. Neural Collaborative Graph Machines for Table Structure Recognition. arXiv preprint arXiv:2111.13359 (2021)."},{"key":"e_1_3_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/3474085.3481534"},{"key":"e_1_3_2_2_27_1","volume-title":"Graph convolution for multimodal information extraction from visually rich documents. arXiv preprint arXiv:1903.11279","author":"Liu Xiaojing","year":"2019","unstructured":"Xiaojing Liu , Feiyu Gao , Qiong Zhang , and Huasha Zhao . 2019. Graph convolution for multimodal information extraction from visually rich documents. arXiv preprint arXiv:1903.11279 ( 2019 ). Xiaojing Liu, Feiyu Gao, Qiong Zhang, and Huasha Zhao. 2019. Graph convolution for multimodal information extraction from visually rich documents. arXiv preprint arXiv:1903.11279 (2019)."},{"key":"e_1_3_2_2_28_1","volume-title":"Proceedings of the 40th annual meeting of the Association for Computational Linguistics. 311--318","author":"Papineni Kishore","year":"2002","unstructured":"Kishore Papineni , Salim Roukos , Todd Ward , and Wei-Jing Zhu . 2002 . Bleu: a method for automatic evaluation of machine translation . In Proceedings of the 40th annual meeting of the Association for Computational Linguistics. 311--318 . Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. Bleu: a method for automatic evaluation of machine translation. In Proceedings of the 40th annual meeting of the Association for Computational Linguistics. 311--318."},{"key":"e_1_3_2_2_29_1","volume-title":"Workshop on Document Intelligence at NeurIPS","author":"Park Seunghyun","year":"2019","unstructured":"Seunghyun Park , Seung Shin , Bado Lee , Junyeop Lee , Jaeheung Surh , Minjoon Seo , and Hwalsuk Lee . 2019 . CORD: a consolidated receipt dataset for post-OCR parsing . In Workshop on Document Intelligence at NeurIPS 2019. Seunghyun Park, Seung Shin, Bado Lee, Junyeop Lee, Jaeheung Surh, Minjoon Seo, and Hwalsuk Lee. 2019. CORD: a consolidated receipt dataset for post-OCR parsing. In Workshop on Document Intelligence at NeurIPS 2019."},{"key":"e_1_3_2_2_30_1","unstructured":"Adam Paszke Sam Gross Soumith Chintala Gregory Chanan Edward Yang Zachary DeVito Zeming Lin Alban Desmaison Luca Antiga and Adam Lerer. 2017. Automatic differentiation in pytorch. (2017).  Adam Paszke Sam Gross Soumith Chintala Gregory Chanan Edward Yang Zachary DeVito Zeming Lin Alban Desmaison Luca Antiga and Adam Lerer. 2017. Automatic differentiation in pytorch. (2017)."},{"key":"e_1_3_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2019.00031"},{"key":"e_1_3_2_2_32_1","volume-title":"International Conference on Machine Learning. PMLR, 8748--8763","author":"Radford Alec","year":"2021","unstructured":"Alec Radford , Jong Wook Kim , Chris Hallacy , Aditya Ramesh , Gabriel Goh , Sandhini Agarwal , Girish Sastry , Amanda Askell , Pamela Mishkin , Jack Clark , 2021 . Learning transferable visual models from natural language supervision . In International Conference on Machine Learning. PMLR, 8748--8763 . Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, et al. 2021. Learning transferable visual models from natural language supervision. In International Conference on Machine Learning. PMLR, 8748--8763."},{"key":"e_1_3_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58604-1_5"},{"key":"e_1_3_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2007.4376991"},{"key":"e_1_3_2_2_35_1","article-title":"Visualizing data using t-SNE","volume":"9","author":"der Maaten Laurens Van","year":"2008","unstructured":"Laurens Van der Maaten and Geoffrey Hinton . 2008 . Visualizing data using t-SNE . Journal of machine learning research , Vol. 9 , 11 (2008). Laurens Van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE. Journal of machine learning research, Vol. 9, 11 (2008).","journal-title":"Journal of machine learning research"},{"key":"e_1_3_2_2_36_1","volume-title":"Attention is all you need. Advances in neural information processing systems","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani , Noam Shazeer , Niki Parmar , Jakob Uszkoreit , Llion Jones , Aidan N Gomez , \u0141ukasz Kaiser , and Illia Polosukhin . 2017. Attention is all you need. Advances in neural information processing systems , Vol. 30 ( 2017 ). Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, \u0141ukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems, Vol. 30 (2017)."},{"key":"e_1_3_2_2_37_1","volume-title":"LayoutReader: Pre-training of Text and Layout for Reading Order Detection. arXiv preprint arXiv:2108.11591","author":"Wang Zilong","year":"2021","unstructured":"Zilong Wang , Yiheng Xu , Lei Cui , Jingbo Shang , and Furu Wei . 2021. LayoutReader: Pre-training of Text and Layout for Reading Order Detection. arXiv preprint arXiv:2108.11591 ( 2021 ). Zilong Wang, Yiheng Xu, Lei Cui, Jingbo Shang, and Furu Wei. 2021. LayoutReader: Pre-training of Text and Layout for Reading Order Detection. arXiv preprint arXiv:2108.11591 (2021)."},{"key":"e_1_3_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/3397271.3401442"},{"key":"e_1_3_2_2_39_1","volume-title":"Spurthi Amba Hombaiah, and Michael Bendersky","author":"Wu Te-Lin","year":"2021","unstructured":"Te-Lin Wu , Cheng Li , Mingyang Zhang , Tao Chen , Spurthi Amba Hombaiah, and Michael Bendersky . 2021 . LAMPRET : Layout-Aware Multimodal PreTraining for Document Understanding . arXiv preprint arXiv:2104.08405 (2021). Te-Lin Wu, Cheng Li, Mingyang Zhang, Tao Chen, Spurthi Amba Hombaiah, and Michael Bendersky. 2021. LAMPRET: Layout-Aware Multimodal PreTraining for Document Understanding. arXiv preprint arXiv:2104.08405 (2021)."},{"key":"e_1_3_2_2_40_1","unstructured":"Yonghui Wu Mike Schuster Zhifeng Chen Quoc V Le Mohammad Norouzi Wolfgang Macherey Maxim Krikun Yuan Cao Qin Gao Klaus Macherey etal 2016. Google's neural machine translation system: Bridging the gap between human and machine translation. arXiv preprint arXiv:1609.08144 (2016).  Yonghui Wu Mike Schuster Zhifeng Chen Quoc V Le Mohammad Norouzi Wolfgang Macherey Maxim Krikun Yuan Cao Qin Gao Klaus Macherey et al. 2016. Google's neural machine translation system: Bridging the gap between human and machine translation. arXiv preprint arXiv:1609.08144 (2016)."},{"key":"e_1_3_2_2_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/3394486.3403172"},{"key":"e_1_3_2_2_42_1","unstructured":"Yang Xu Yiheng Xu Tengchao Lv Lei Cui Furu Wei Guoxin Wang Yijuan Lu Dinei Florencio Cha Zhang Wanxiang Che etal 2020b. Layoutlmv2: Multi-modal pre-training for visually-rich document understanding. arXiv preprint arXiv:2012.14740 (2020).  Yang Xu Yiheng Xu Tengchao Lv Lei Cui Furu Wei Guoxin Wang Yijuan Lu Dinei Florencio Cha Zhang Wanxiang Che et al. 2020b. Layoutlmv2: Multi-modal pre-training for visually-rich document understanding. arXiv preprint arXiv:2012.14740 (2020)."},{"key":"e_1_3_2_2_43_1","volume-title":"Hugo Chen","author":"Yu Licheng","year":"2022","unstructured":"Licheng Yu , Jun Chen , Animesh Sinha , Mengjiao MJ Wang , Hugo Chen , Tamara L Berg , and Ning Zhang. 2022 . CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval. arXiv preprint arXiv:2202.07247 (2022). Licheng Yu, Jun Chen, Animesh Sinha, Mengjiao MJ Wang, Hugo Chen, Tamara L Berg, and Ning Zhang. 2022. CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval. arXiv preprint arXiv:2202.07247 (2022)."},{"key":"e_1_3_2_2_44_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D15-1203"},{"key":"e_1_3_2_2_45_1","volume-title":"Document-level relation extraction as semantic segmentation. arXiv preprint arXiv:2106.03618","author":"Zhang Ningyu","year":"2021","unstructured":"Ningyu Zhang , Xiang Chen , Xin Xie , Shumin Deng , Chuanqi Tan , Mosha Chen , Fei Huang , Luo Si , and Huajun Chen . 2021. Document-level relation extraction as semantic segmentation. arXiv preprint arXiv:2106.03618 ( 2021 ). Ningyu Zhang, Xiang Chen, Xin Xie, Shumin Deng, Chuanqi Tan, Mosha Chen, Fei Huang, Luo Si, and Huajun Chen. 2021. Document-level relation extraction as semantic segmentation. arXiv preprint arXiv:2106.03618 (2021)."},{"key":"e_1_3_2_2_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/3394171.3413900"}],"event":{"name":"MM '22: The 30th ACM International Conference on Multimedia","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Lisboa Portugal","acronym":"MM '22"},"container-title":["Proceedings of the 30th ACM International Conference on Multimedia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3503161.3547751","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3503161.3547751","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:30:40Z","timestamp":1750188640000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3503161.3547751"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,10,10]]},"references-count":46,"alternative-id":["10.1145\/3503161.3547751","10.1145\/3503161"],"URL":"https:\/\/doi.org\/10.1145\/3503161.3547751","relation":{},"subject":[],"published":{"date-parts":[[2022,10,10]]},"assertion":[{"value":"2022-10-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}