{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,23]],"date-time":"2026-04-23T07:59:16Z","timestamp":1776931156176,"version":"3.51.2"},"publisher-location":"New York, NY, USA","reference-count":26,"publisher":"ACM","funder":[{"DOI":"10.13039\/501100002347","name":"Bundesministerium f\u00fcr Bildung und Forschung","doi-asserted-by":"publisher","award":["16SV9094"],"award-info":[{"award-number":["16SV9094"]}],"id":[{"id":"10.13039\/501100002347","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,10,13]]},"DOI":"10.1145\/3716553.3750772","type":"proceedings-article","created":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T13:13:16Z","timestamp":1760188396000},"page":"155-163","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["Lightweight Transformers for Isolated Sign Language Recognition"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5369-856X","authenticated-orcid":false,"given":"Cristina","family":"Luna-Jim\u00e9nez","sequence":"first","affiliation":[{"name":"Human-Centered Artificial Intelligence, University of Augsburg, Augsburg, Bayern, Germany"}]},{"ORCID":"https:\/\/orcid.org\/0009-0005-8855-7659","authenticated-orcid":false,"given":"Lennart","family":"Eing","sequence":"additional","affiliation":[{"name":"Human-Centered Artificial Intelligence, University of Augsburg, Augsburg, Bayern, Germany"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5634-5556","authenticated-orcid":false,"given":"Annalena","family":"Bea Aicher","sequence":"additional","affiliation":[{"name":"Human-Centered Artificial Intelligence, University of Augsburg, Augsburg, Bayern, Germany"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1596-4043","authenticated-orcid":false,"given":"Fabrizio","family":"Nunnari","sequence":"additional","affiliation":[{"name":"German Research Center for Artificial Intelligence (DFKI), Saarbr\u00fccken, Germany"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2367-162X","authenticated-orcid":false,"given":"Elisabeth","family":"Andr\u00e9","sequence":"additional","affiliation":[{"name":"Human-Centered Artificial Intelligence, University of Augsburg, Augsburg, Bayern, Germany"}]}],"member":"320","published-online":{"date-parts":[[2025,10,12]]},"reference":[{"key":"e_1_3_3_3_2_2","unstructured":"Apple. [n. d.]. VisionAPI. https:\/\/developer.apple.com\/documentation\/vision Accessed on 12-12-2024."},{"key":"e_1_3_3_3_3_2","unstructured":"Adrien Bardes Quentin Garrido Jean Ponce Xinlei Chen Michael Rabbat Yann LeCun Mahmoud Assran and Nicolas Ballas. 2024. Revisiting Feature Prediction for Learning Visual Representations from Video. arxiv:https:\/\/arXiv.org\/abs\/2404.08471\u00a0[cs.CV] https:\/\/arxiv.org\/abs\/2404.08471"},{"key":"e_1_3_3_3_4_2","doi-asserted-by":"publisher","unstructured":"Tobias Baur Alexander Heimerl Florian Lingenfelser Johannes Wagner Michel\u00a0F. Valstar Bj\u00f6rn Schuller and Elisabeth Andr\u00e9. 2020. eXplainable Cooperative Machine Learning with NOVA. KI - K\u00fcnstliche Intelligenz (19 Jan 2020). 10.1007\/s13218-020-00632-3","DOI":"10.1007\/s13218-020-00632-3"},{"key":"e_1_3_3_3_5_2","doi-asserted-by":"publisher","DOI":"10.1145\/3529190.3529202"},{"key":"e_1_3_3_3_6_2","doi-asserted-by":"publisher","DOI":"10.1109\/WACVW54805.2022.00024"},{"key":"e_1_3_3_3_7_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00812"},{"key":"e_1_3_3_3_8_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01004"},{"key":"e_1_3_3_3_9_2","doi-asserted-by":"publisher","unstructured":"Xiaokang Chen Mingyu Ding Xiaodi Wang Ying Xin Shentong Mo Yunhao Wang Shumin Han Ping Luo Gang Zeng and Jingdong Wang. 2023. Context Autoencoder for Self-supervised Representation Learning. International Journal of Computer Vision 132 1 (Aug. 2023) 208\u2013223. 10.1007\/s11263-023-01852-4","DOI":"10.1007\/s11263-023-01852-4"},{"key":"e_1_3_3_3_10_2","doi-asserted-by":"publisher","DOI":"10.18653\/V1\/N19-1423"},{"key":"e_1_3_3_3_11_2","volume-title":"9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, May 3-7, 2021","author":"Dosovitskiy Alexey","year":"2021","unstructured":"Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, Jakob Uszkoreit, and Neil Houlsby. 2021. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. In 9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, May 3-7, 2021. OpenReview.net. https:\/\/openreview.net\/forum?id=YicbFdNTTy"},{"key":"e_1_3_3_3_12_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00276"},{"key":"e_1_3_3_3_13_2","first-page":"98","volume-title":"Proceedings of the LREC2004 Workshop on the Representation and Processing of Sign Languages: From SignWriting to Image Processing. Information techniques and their implications for teaching, documentation and communication","author":"Elliott Ralph","year":"2004","unstructured":"Ralph Elliott, John Glauert, Vince Jennings, and Richard Kennaway. 2004. An Overview of the SiGML Notation and SiGMLSigning Software System. In Proceedings of the LREC2004 Workshop on the Representation and Processing of Sign Languages: From SignWriting to Image Processing. Information techniques and their implications for teaching, documentation and communication, Oliver Streiter and Chiara Vettori (Eds.). European Language Resources Association (ELRA), Lisbon, Portugal, 98\u2013104. https:\/\/www.sign-lang.uni-hamburg.de\/lrec\/pub\/04020.pdf"},{"key":"e_1_3_3_3_14_2","doi-asserted-by":"publisher","unstructured":"Hezhen Hu Weichao Zhao Wengang Zhou and Houqiang Li. 2023. SignBERT+: Hand-Model-Aware Self-Supervised Pre-Training for Sign Language Understanding. IEEE Transactions on Pattern Analysis and Machine Intelligence 45 9 (2023) 11221\u201311239. 10.1109\/TPAMI.2023.3269220","DOI":"10.1109\/TPAMI.2023.3269220"},{"key":"e_1_3_3_3_15_2","first-page":"100","volume-title":"30th British Machine Vision Conference 2019, BMVC 2019, Cardiff, UK, September 9-12, 2019","author":"Joze Hamid Reza\u00a0Vaezi","year":"2019","unstructured":"Hamid Reza\u00a0Vaezi Joze and Oscar Koller. 2019. MS-ASL: A Large-Scale Data Set and Benchmark for Understanding American Sign Language. In 30th British Machine Vision Conference 2019, BMVC 2019, Cardiff, UK, September 9-12, 2019. BMVA Press, 100. https:\/\/bmvc2019.org\/wp-content\/uploads\/papers\/0254-paper.pdf"},{"key":"e_1_3_3_3_16_2","doi-asserted-by":"crossref","unstructured":"Oscar Koller Jens Forster and Hermann Ney. 2015. Continuous sign language recognition: Towards large vocabulary statistical recognition systems handling multiple signers. Computer Vision and Image Understanding 141 (Dec. 2015) 108\u2013125.","DOI":"10.1016\/j.cviu.2015.09.013"},{"key":"e_1_3_3_3_17_2","doi-asserted-by":"publisher","unstructured":"Reiner Konrad Thomas Hanke Gabriele Langer Dolly Blanck Julian Bleicken Ilona Hofmann Olga Jeziorski Lutz K\u00f6nig Susanne K\u00f6nig Rie Nishio Anja Regen Uta Salden Sven Wagner Satu Worseck Oliver B\u00f6se Elena Jahn and Marc Schulder. 2020. MEINE DGS \u2013 annotiert. \u00d6ffentliches Korpus der Deutschen Geb\u00e4rdensprache 3. Release \/ MY DGS \u2013 annotated. Public Corpus of German Sign Language 3rd release. 10.25592\/dgs.corpus-3.0","DOI":"10.25592\/dgs.corpus-3.0"},{"key":"e_1_3_3_3_18_2","doi-asserted-by":"publisher","unstructured":"Reiner Konrad Thomas Hanke Gabriele Langer Susanne K\u00f6nig Lutz K\u00f6nig Rie Nishio and Anja Regen. 2022. Public DGS Corpus: Annotation Conventions \/ \u00d6ffentliches DGS-Korpus: Annotationskonventionen. 10.25592\/uhhfdm.10251","DOI":"10.25592\/uhhfdm.10251"},{"key":"e_1_3_3_3_19_2","doi-asserted-by":"publisher","DOI":"10.1109\/WACV45572.2020.9093512"},{"key":"e_1_3_3_3_20_2","doi-asserted-by":"crossref","unstructured":"Dongxu Li Cristian Rodriguez-Opazo Xin Yu and Hongdong Li. 2019. Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison. 2020 IEEE Winter Conference on Applications of Computer Vision (WACV) (2019) 1448\u20131458. https:\/\/api.semanticscholar.org\/CorpusID:204851909","DOI":"10.1109\/WACV45572.2020.9093512"},{"key":"e_1_3_3_3_21_2","volume-title":"Third Workshop on Computer Vision for AR\/VR at IEEE Computer Vision and Pattern Recognition (CVPR) 2019","author":"Lugaresi Camillo","year":"2019","unstructured":"Camillo Lugaresi, Jiuqiang Tang, Hadon Nash, Chris McClanahan, Esha Uboweja, Michael Hays, Fan Zhang, Chuo-Ling Chang, Ming Yong, Juhyun Lee, Wan-Teh Chang, Wei Hua, Manfred Georg, and Matthias Grundmann. 2019. MediaPipe: A Framework for Perceiving and Processing Reality. In Third Workshop on Computer Vision for AR\/VR at IEEE Computer Vision and Pattern Recognition (CVPR) 2019. https:\/\/mixedreality.cs.cornell.edu\/s\/NewTitle_May1_MediaPipe_CVPR_CV4ARVR_Workshop_2019.pdf"},{"key":"e_1_3_3_3_22_2","doi-asserted-by":"publisher","DOI":"10.1145\/3577190.3614143"},{"key":"e_1_3_3_3_23_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCS56119.2022.9918744"},{"key":"e_1_3_3_3_24_2","first-page":"4847","volume-title":"Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)","author":"Nunnari Fabrizio","year":"2024","unstructured":"Fabrizio Nunnari, Eleftherios Avramidis, Cristina Espa\u00f1a-Bonet, Marco Gonz\u00e1lez, Anna Hennes, and Patrick Gebhard. 2024. DGS-Fabeln-1: A Multi-Angle Parallel Corpus of Fairy Tales between German Sign Language and German Text. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024). Torino, Italia, 4847\u20134857. https:\/\/aclanthology.org\/2024.lrec-main.434"},{"key":"e_1_3_3_3_25_2","doi-asserted-by":"publisher","unstructured":"Nina-Kristi Pendzich Jens-Michael Cramer Thomas Finkbeiner Annika Herrmann and Markus Steinbach. 2022. How do signers mark conditionals in German Sign Language? Insights from a Sentence Reproduction Task on the use of nonmanual and manual markers. Hrvatska revija za rehabilitacijska istra\u017eivanja 58 (10 2022) 206\u2013226. 10.31299\/hrri.58.si.11","DOI":"10.31299\/hrri.58.si.11"},{"key":"e_1_3_3_3_26_2","doi-asserted-by":"crossref","unstructured":"Anirudh Tunga Sai\u00a0Vidyaranya Nuthalapati and Juan\u00a0Pablo Wachs. 2020. Pose-based Sign Language Recognition using GCN and BERT. 2021 IEEE Winter Conference on Applications of Computer Vision Workshops (WACVW) (2020) 31\u201340. https:\/\/api.semanticscholar.org\/CorpusID:227247599","DOI":"10.1109\/WACVW52041.2021.00008"},{"key":"e_1_3_3_3_27_2","doi-asserted-by":"publisher","DOI":"10.5555\/3295222.3295349"}],"event":{"name":"ICMI '25: International Conference on Multimodal Interaction","location":"Canberra Australia","acronym":"ICMI '25","sponsor":["SIGCHI ACM Special Interest Group on Computer-Human Interaction"]},"container-title":["Proceedings of the 27th International Conference on Multimodal Interaction"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3716553.3750772","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,1,26]],"date-time":"2026-01-26T22:28:57Z","timestamp":1769466537000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3716553.3750772"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,10,12]]},"references-count":26,"alternative-id":["10.1145\/3716553.3750772","10.1145\/3716553"],"URL":"https:\/\/doi.org\/10.1145\/3716553.3750772","relation":{},"subject":[],"published":{"date-parts":[[2025,10,12]]},"assertion":[{"value":"2025-10-12","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}