{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,13]],"date-time":"2026-04-13T12:48:07Z","timestamp":1776084487737,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":23,"publisher":"ACM","funder":[{"name":"Ministry of Economic Development of the RF","award":["000000C313925P4G0002"],"award-info":[{"award-number":["000000C313925P4G0002"]}]},{"name":"Ivannikov Institute for System Programming of the Russian Academy of Sciences","award":["139-15-2025-011"],"award-info":[{"award-number":["139-15-2025-011"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,10,27]]},"DOI":"10.1145\/3746027.3758231","type":"proceedings-article","created":{"date-parts":[[2025,10,25]],"date-time":"2025-10-25T07:37:21Z","timestamp":1761377841000},"page":"12875-12881","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["MuMMy: Multimodal Dataset supporting VLM-based Egyptology Research Assistant"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0679-6981","authenticated-orcid":false,"given":"Maksim","family":"Golyadkin","sequence":"first","affiliation":[{"name":"AIRI, Moscow, Russian Federation"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2621-507X","authenticated-orcid":false,"given":"Innokentiy","family":"Humonen","sequence":"additional","affiliation":[{"name":"AIRI, Moscow, Russian Federation"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0004-7628-078X","authenticated-orcid":false,"given":"Valeria","family":"Rubanova","sequence":"additional","affiliation":[{"name":"ITMO University, Saint Petersburg, Russian Federation"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-6082-5680","authenticated-orcid":false,"given":"Danil","family":"Kalin","sequence":"additional","affiliation":[{"name":"ITMO University, Saint Petersburg, Russian Federation"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-2077-8509","authenticated-orcid":false,"given":"Ianis","family":"Plevokas","sequence":"additional","affiliation":[{"name":"HSE University, Moscow, Russian Federation"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-2025-7997","authenticated-orcid":false,"given":"Dmitry","family":"Nikolotov","sequence":"additional","affiliation":[{"name":"MIPT, Dolgoprudny, Russian Federation"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-2817-7337","authenticated-orcid":false,"given":"Aleksandr","family":"Utkov","sequence":"additional","affiliation":[{"name":"ITMO University, Saint Petersburg, Russian Federation"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0004-8180-3557","authenticated-orcid":false,"given":"Nikita","family":"Sidelnikov","sequence":"additional","affiliation":[{"name":"ITMO University, Saint Petersburg, Russian Federation"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0527-8354","authenticated-orcid":false,"given":"Petr","family":"Ivanov","sequence":"additional","affiliation":[{"name":"ITMO University, Saint Petersburg, Russian Federation"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-8825-6931","authenticated-orcid":false,"given":"Ekaterina","family":"Bureeva","sequence":"additional","affiliation":[{"name":"HSE University, Moscow, Russian Federation"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6246-4722","authenticated-orcid":false,"given":"Ekaterina","family":"Alexandrova","sequence":"additional","affiliation":[{"name":"HSE University, Moscow, Russian Federation"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3308-8825","authenticated-orcid":false,"given":"Ilya","family":"Makarov","sequence":"additional","affiliation":[{"name":"AIRI, Moscow, Russian Federation, ISP RAS, Moscow, Russian Federation, and ITMO University, Saint Petersburg, Russian Federation"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2025,10,27]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-20302-2_10"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2021.3110082"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2024.ml4al-1.9"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.46298\/jdmdh.5581"},{"key":"e_1_3_2_1_5_1","unstructured":"Marta R Costa-juss\u00e0 James Cross Onur \u00c7elebi Maha Elbayad Kenneth Heafield Kevin Heffernan Elahe Kalbassi Janice Lam Daniel Licht Jean Maillard et al. 2022. No language left behind: Scaling human-centered machine translation. arXiv preprint arXiv:2207.04672 (2022)."},{"key":"e_1_3_2_1_6_1","first-page":"4171","volume-title":"Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies","volume":"1","author":"Devlin Jacob","year":"2019","unstructured":"Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. Bert: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 1 (long and short papers). 4171-4186."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/2502081.2502199"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/3721250.3743025"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/3746027.3754797"},{"key":"e_1_3_2_1_10_1","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV). 1-8.","author":"Golyadkin Maksim","year":"2025","unstructured":"Maksim Golyadkin, Valeria Rubanova, Aleksandr Utkov, Dmitry Nikolotov, and Ilya Makarov. 2025c. MEH: A Multi-Style Dataset and Toolkit for Advancing Egyptian Hieroglyph Recognition. In Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV). 1-8."},{"key":"e_1_3_2_1_11_1","first-page":"3929","volume-title":"Proceedings of the 37th International Conference on Machine Learning (Proceedings of Machine Learning Research","author":"Guu Kelvin","year":"2020","unstructured":"Kelvin Guu, Kenton Lee, Zora Tung, Panupong Pasupat, and Mingwei Chang. 2020. Retrieval Augmented Language Model Pre-Training. In Proceedings of the 37th International Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 119), Hal Daum\u00e9 III and Aarti Singh (Eds.). PMLR, 3929-3938. https:\/\/proceedings.mlr.press\/v119\/guu20a.html"},{"key":"e_1_3_2_1_12_1","unstructured":"Heidi Jauhiainen and Tommi Jauhiainen. 2023. Automatic Word Segmentation for Egyptian Hieroglyphic Texts.. In DH."},{"key":"e_1_3_2_1_13_1","volume-title":"Martin","author":"Jurafsky Daniel","year":"2009","unstructured":"Daniel Jurafsky and James H. Martin. 2009. Speech and Language Processing (2Nd Edition). Prentice-Hall, Inc., Upper Saddle River, NJ, USA."},{"key":"e_1_3_2_1_14_1","first-page":"22199","volume-title":"Oh (Eds.)","volume":"35","author":"Kojima Takeshi","year":"2022","unstructured":"Takeshi Kojima, Shixiang (Shane) Gu, Machel Reid, Yutaka Matsuo, and Yusuke Iwasawa. 2022. Large Language Models are Zero-Shot Reasoners. In Advances in Neural Information Processing Systems, S. Koyejo, S. Mohamed, A. Agarwal, D. Belgrave, K. Cho, and A. Oh (Eds.), Vol. 35. Curran Associates, Inc., 22199-22213. https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2022\/file\/8bb0d291acd4acf06ef112099c16f326-Paper-Conference.pdf"},{"key":"e_1_3_2_1_15_1","first-page":"9459","volume-title":"Lin (Eds.)","volume":"33","author":"Lewis Patrick","year":"2020","unstructured":"Patrick Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich K\u00fcttler, Mike Lewis, Wen-tau Yih, Tim Rockt\u00e4schel, Sebastian Riedel, and Douwe Kiela. 2020. Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. In Advances in Neural Information Processing Systems, H. Larochelle, M. Ranzato, R. Hadsell, M.F. Balcan, and H. Lin (Eds.), Vol. 33. Curran Associates, Inc., 9459-9474. https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2020\/file\/6b493230205f780e1bc26945df7481e5-Paper.pdf"},{"key":"e_1_3_2_1_16_1","first-page":"34892","volume-title":"Levine (Eds.)","volume":"36","author":"Liu Haotian","year":"2023","unstructured":"Haotian Liu, Chunyuan Li, Qingyang Wu, and Yong Jae Lee. 2023. Visual Instruction Tuning. In Advances in Neural Information Processing Systems, A. Oh, T. Naumann, A. Globerson, K. Saenko, M. Hardt, and S. Levine (Eds.), Vol. 36. Curran Associates, Inc., 34892-34916. https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2023\/file\/6dcf277ea32ce3288914faf369fe6de0-Paper-Conference.pdf"},{"key":"e_1_3_2_1_17_1","volume-title":"Proceedings of the 38th International Conference on Machine Learning (Proceedings of Machine Learning Research","volume":"8763","author":"Radford Alec","year":"2021","unstructured":"Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, and Ilya Sutskever. 2021. Learning Transferable Visual Models From Natural Language Supervision. In Proceedings of the 38th International Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 139), Marina Meila and Tong Zhang (Eds.). PMLR, 8748-8763. https:\/\/proceedings.mlr.press\/v139\/radford21a.html"},{"key":"e_1_3_2_1_18_1","volume-title":"Teilauszug der Datenbank des Vorhabens'' Strukturen und Transformationen des Wortschatzes der \u00e4gyptischen Sprache'' vom Januar","author":"Richter Tonio Sebastian","year":"2018","unstructured":"Tonio Sebastian Richter, Ingelore Hafemann, Hans-Werner Fischer-Elfert, and Peter Dils. 2018. Teilauszug der Datenbank des Vorhabens'' Strukturen und Transformationen des Wortschatzes der \u00e4gyptischen Sprache'' vom Januar 2018. Berlin-Brandenburgische Akademie der Wissenschaften."},{"key":"e_1_3_2_1_19_1","unstructured":"Tonio Sebastian Richter Daniel A. Werning Hans-Werner Fischer-Elfert and Peter Dils. 2025. Thesaurus Linguae Aegyptiae. Online Resource. https:\/\/thesaurus-linguae-aegyptiae.de Corpus issue 19 Web app version 2.2.0 11\/5\/2024 ed. by Tonio Sebastian Richter & Daniel A. Werning on behalf of the Berlin-Brandenburgische Akademie der Wissenschaften and Hans-Werner Fischer-Elfert & Peter Dils on behalf of the S\u00e4chsische Akademie der Wissenschaften zu Leipzig (accessed: 11.2.2025)."},{"key":"e_1_3_2_1_20_1","volume-title":"Word segmentation in Chinese language processing. Statistics and its Interface","author":"Shu Xinxin","year":"2017","unstructured":"Xinxin Shu, Junhui Wang, Xiaotong Shen, and Annie Qu. 2017. Word segmentation in Chinese language processing. Statistics and its Interface, Vol. 10, 2 (2017), 165-173."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2023.3267981"},{"key":"e_1_3_2_1_22_1","first-page":"24824","volume-title":"Oh (Eds.)","volume":"35","author":"Wei Jason","year":"2022","unstructured":"Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, brian ichter, Fei Xia, Ed Chi, Quoc V Le, and Denny Zhou. 2022. Chain-of-Thought Prompting Elicits Reasoning in Large Language Models. In Advances in Neural Information Processing Systems, S. Koyejo, S. Mohamed, A. Agarwal, D. Belgrave, K. Cho, and A. Oh (Eds.), Vol. 35. Curran Associates, Inc., 24824-24837. https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2022\/file\/9d5609613524ecf4f15af0f7b31abca4-Paper-Conference.pdf"},{"key":"e_1_3_2_1_23_1","volume-title":"Proceedings of the International Workshop on Spoken Language Translation (2019","author":"Wiesenbach Philipp","year":"2019","unstructured":"Philipp Wiesenbach and Stefan Riezler. 2019. Multi-Task Modeling of Phonographic Languages: Translating Middle Egyptian Hieroglyphs. Proceedings of the International Workshop on Spoken Language Translation (2019). https:\/\/www.cl.uni-heidelberg.de\/statnlpgroup\/publications\/IWSLT2019_v2.pdf"}],"event":{"name":"MM '25: The 33rd ACM International Conference on Multimedia","location":"Dublin Ireland","acronym":"MM '25","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["Proceedings of the 33rd ACM International Conference on Multimedia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3746027.3758231","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,12,10]],"date-time":"2025-12-10T05:00:45Z","timestamp":1765342845000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3746027.3758231"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,10,27]]},"references-count":23,"alternative-id":["10.1145\/3746027.3758231","10.1145\/3746027"],"URL":"https:\/\/doi.org\/10.1145\/3746027.3758231","relation":{},"subject":[],"published":{"date-parts":[[2025,10,27]]},"assertion":[{"value":"2025-10-27","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}