{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,7]],"date-time":"2026-03-07T18:00:22Z","timestamp":1772906422988,"version":"3.50.1"},"reference-count":18,"publisher":"Springer Science and Business Media LLC","issue":"4","license":[{"start":{"date-parts":[[2025,7,18]],"date-time":"2025-07-18T00:00:00Z","timestamp":1752796800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,7,18]],"date-time":"2025-07-18T00:00:00Z","timestamp":1752796800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Fraunhofer-Institut f\u00fcr Intelligente Analyse- und Informationssysteme IAIS"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Lang Resources &amp; Evaluation"],"published-print":{"date-parts":[[2025,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>We introduce a tool for automated translation checking of financial reports in German-English. It uses a heuristic matching algorithm followed by a transformer encoder based error detection model on sentence pair level. For generating the training data, we leverage state-of-the-art large language models such as GPT-4o, thereby alleviating the need for expert annotations. The results suggest that smaller models fine-tuned specifically for this task significantly outperform large multi-purpose generative models like GPT-4 for this particular problem, and that a combination of informed and deep learning approaches works best in this case. The tool is being made publicly available as a demonstrator.<\/jats:p>","DOI":"10.1007\/s10579-025-09862-z","type":"journal-article","created":{"date-parts":[[2025,7,18]],"date-time":"2025-07-18T08:22:01Z","timestamp":1752826921000},"page":"3873-3887","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Automating translation checks of financial documents using large language models"],"prefix":"10.1007","volume":"59","author":[{"given":"Maren","family":"Pielka","sequence":"first","affiliation":[]},{"given":"Max","family":"Hahnb\u00fcck","sequence":"additional","affiliation":[]},{"given":"Tobias","family":"Deu\u00dfer","sequence":"additional","affiliation":[]},{"given":"Daniel","family":"Uedelhoven","sequence":"additional","affiliation":[]},{"given":"Moinam","family":"Chatterjee","sequence":"additional","affiliation":[]},{"given":"Vijul","family":"Shah","sequence":"additional","affiliation":[]},{"given":"Osama","family":"Soliman","sequence":"additional","affiliation":[]},{"given":"Jannis","family":"von der Bank","sequence":"additional","affiliation":[]},{"given":"Writwick","family":"Das","sequence":"additional","affiliation":[]},{"given":"Maria Chiara","family":"Talarico","sequence":"additional","affiliation":[]},{"given":"Cong","family":"Zhao","sequence":"additional","affiliation":[]},{"given":"Carolina","family":"Held Celis","sequence":"additional","affiliation":[]},{"given":"Christian","family":"Temath","sequence":"additional","affiliation":[]},{"given":"Rafet","family":"Sifa","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2025,7,18]]},"reference":[{"key":"9862_CR1","volume-title":"Natural language processing with python","author":"S Bird","year":"2009","unstructured":"Bird, S., Loper, E., & Klein, E. (2009). Natural language processing with python. Sebastopol: O\u2019Reilly Media Inc."},{"key":"9862_CR2","doi-asserted-by":"crossref","unstructured":"Conneau, A., Khandelwal, K., Goyal, N., Chaudhary, V., Wenzek, G., Guzm\u00e1n, F., Grave, E., Ott, M., Zettlemoyer, L., & Stoyanov, V. (2019). Unsupervised cross-lingual representation learning at scale. In: Annual Meeting of the Association for Computational Linguistics. https:\/\/api.semanticscholar.org\/CorpusID:207880568","DOI":"10.18653\/v1\/2020.acl-main.747"},{"key":"9862_CR3","doi-asserted-by":"crossref","unstructured":"Deu\u00dfer, T., Pielka, M., Pucknat, L., Jacob, B., Dilmaghani, T., Nourimand, M., Kliem, B., Loitz, R., Bauckhage, C., & Sifa, R. (2023). Contradiction detection in financial reports. In: Proceedings of the Northern Lights Deep Learning Workshop (NLDL), vol. 4.","DOI":"10.7557\/18.6799"},{"key":"9862_CR4","unstructured":"Devlin, J., Chang, M. -W., Lee, K., & Toutanova, K. (2019). Bert: Pre-training of deep bidirectional transformers for language understanding. In: North American Chapter of the Association for Computational Linguistics. https:\/\/api.semanticscholar.org\/CorpusID:52967399"},{"issue":"1","key":"9862_CR5","first-page":"75","volume":"19","author":"WA Gale","year":"1994","unstructured":"Gale, W. A., Church, K. W., et al. (1994). A program for aligning sentences in bilingual corpora. Computational Linguistics, 19(1), 75\u2013102.","journal-title":"Computational Linguistics"},{"key":"9862_CR6","doi-asserted-by":"crossref","unstructured":"Hillebrand, L., Deu\u00dfer, T., Dilmaghani, T., Kliem, B., Loitz, R., Bauckhage, C., & Sifa, R. (2022). Kpi-bert: A joint named entity recognition and relation extraction model for financial reports. In: Proceedings of the International Conference on Pattern Recognition (ICPR).","DOI":"10.1109\/ICPR56361.2022.9956191"},{"key":"9862_CR7","doi-asserted-by":"crossref","unstructured":"Hillebrand, L., Deu\u00dfer, T., Dilmaghani, T., Kliem, B., Loitz, R., Bauckhage, C., & Sifa, R. (2022). Towards automating numerical consistency checks in financial reports. In: Proceedings of the IEEE International Conference on Big Data (IEEE BigData).","DOI":"10.1109\/BigData55660.2022.10020308"},{"key":"9862_CR8","doi-asserted-by":"crossref","unstructured":"Lu, Q., Qiu, B., Ding, L., Zhang, K., Kocmi, T., & Tao, D. (2024). Error analysis prompting enables human-like translation evaluation in large language models. In: Findings of the Association for Computational Linguistics: ACL 2024. Association for Computational Linguistics, Bangkok, Thailand.","DOI":"10.18653\/v1\/2024.findings-acl.520"},{"key":"9862_CR9","unstructured":"OpenAI, Achiam, J., Adler, S., Agarwal, S., Ahmad, L., Akkaya, I., Aleman, F. L., Almeida, D., Altenschmidt, J., Altman, S., Anadkat, S., Avila, R., Babuschkin, I., Balaji, S., Balcom, V., Baltescu, P., Bao, H., Bavarian, M., Belgum, J., Bello, I., Berdine, J., Bernadett-Shapiro, G., Berner, C., Bogdonoff, L., Boiko, O., Boyd, M., Brakman, A.-L., Brockman, G., Brooks, T., Brundage, M., Button, K., Cai, T., Campbell, R., Cann, A., Carey, B., Carlson, C., Carmichael, R., Chan, B., Chang, C., Chantzis, F., Chen, D., Chen, S., Chen, R., Chen, J., Chen, M., Chess, B., Cho, C., Chu, C., Chung, H.W., Cummings, D., Currier, J., Dai, Y., Decareaux, C., Degry, T., Deutsch, N., Deville, D., Dhar, A., Dohan, D., Dowling, S., Dunning, S., Ecoffet, A., Eleti, A., Eloundou, T., Farhi, D., Fedus, L., Felix, N., Fishman, S. P., Forte, J., Fulford, I., Gao, L., Georges, E., Gibson, C., Goel, V., Gogineni, T., Goh, G., Gontijo-Lopes, R., Gordon, J., Grafstein, M., Gray, S., Greene, R., Gross, J., Gu, S. S., Guo, Y., Hallacy, C., Han, J., Harris, J., He, Y., Heaton, M., Heidecke, J., Hesse, C., Hickey, A., Hickey, W., Hoeschele, P., Houghton, B., Hsu, K., Hu, S., Hu, X., Huizinga, J., Jain, S., Jain, S., Jang, J., Jiang, A., Jiang, R., Jin, H., Jin, D., Jomoto, S., Jonn, B., Jun, H., Kaftan, T., Kaiser, Kamali, A., Kanitscheider, I., Keskar, N. S., Khan, T., Kilpatrick, L., Kim, J. W., Kim, C., Kim, Y., Kirchner, J. H., Kiros, J., Knight, M., Kokotajlo, D., Kondraciuk, Kondrich, A., Konstantinidis, A., Kosic, K., Krueger, G., Kuo, V., Lampe, M., Lan, I., Lee, T., Leike, J., Leung, J., Levy, D., Li, C. M., Lim, R., Lin, M., Lin, S., Litwin, M., Lopez, T., Lowe, R., Lue, P., Makanju, A., Malfacini, K., Manning, S., Markov, T., Markovski, Y., Martin, B., Mayer, K., Mayne, A., McGrew, B., McKinney, S. M., McLeavey, C., McMillan, P., McNeil, J., Medina, D., Mehta, A., Menick, J., Metz, L., Mishchenko, A., Mishkin, P., Monaco, V., Morikawa, E., Mossing, D., Mu, T., Murati, M., Murk, O., M\u00e9ly, D., Nair, A., Nakano, R., Nayak, R., Neelakantan, A., Ngo, R., Noh, H., Ouyang, L., O\u2019Keefe, C., Pachocki, J., Paino, A., Palermo, J., Pantuliano, A., Parascandolo, G., Parish, J., Parparita, E., Passos, A., Pavlov, M., Peng, A., Perelman, A., Avila Belbute Peres, F., Petrov, M., Oliveira\u00a0Pinto, H. P., Michael, Pokorny, Pokrass, M., Pong, V. H., Powell, T., Power, A., Power, B., Proehl, E., Puri, R., Radford, A., Rae, J., Ramesh, A., Raymond, C., Real, F., Rimbach, K., Ross, C., Rotsted, B., Roussez, H., Ryder, N., Saltarelli, M., Sanders, T., Santurkar, S., Sastry, G., Schmidt, H., Schnurr, D., Schulman, J., Selsam, D., Sheppard, K., Sherbakov, T., Shieh, J., Shoker, S., Shyam, P., Sidor, S., Sigler, E., Simens, M., Sitkin, J., Slama, K., Sohl, I., Sokolowsky, B., Song, Y., Staudacher, N., Such, F. P., Summers, N., Sutskever, I., Tang, J., Tezak, N., Thompson, M. B., Tillet, P., Tootoonchian, A., Tseng, E., Tuggle, P., Turley, N., Tworek, J., Uribe, J. F. C., Vallone, A., Vijayvergiya, A., Voss, C., Wainwright, C., Wang, J. J., Wang, A., Wang, B., Ward, J., Wei, J., Weinmann, C., Welihinda, A., Welinder, P., Weng, J., Weng, L., Wiethoff, M., Willner, D., Winter, C., Wolrich, S., Wong, H., Workman, L., Wu, S., Wu, J., Wu, M., Xiao, K., Xu, T., Yoo, S., Yu, K., Yuan, Q., Zaremba, W., Zellers, R., Zhang, C., Zhang, M., Zhao, S., Zheng, T., Zhuang, J., Zhuk, W., & Zoph, B. (2024). GPT-4 Technical Report."},{"key":"9862_CR10","doi-asserted-by":"crossref","unstructured":"Peng, K., Ding, L., Zhong, Q., Shen, L., Liu, X., Zhang, M., Ouyang, Y., & Tao, D. (2023). Towards making the most of chatgpt for machine translation. In: Conference on Empirical Methods in Natural Language Processing. https:\/\/api.semanticscholar.org\/CorpusID:257704711","DOI":"10.2139\/ssrn.4390455"},{"key":"9862_CR11","unstructured":"Pucknat, L., Pielka, M., & Sifa, R. (2022). Towards informed pre-training for critical error detection in english-german. In: Proceedings of Lernen. Wissen. Daten. Analysen. (LWDA) 2022."},{"key":"9862_CR12","unstructured":"Sanh, V., Debut, L., Chaumond, J., & Wolf, T. (2019). Distilbert, a distilled version of bert: smaller, faster, cheaper and lighter. In: 5th Workshop on Energy Efficient Machine Learning and Cognitive Computing @ NeurIPS 2019."},{"key":"9862_CR13","doi-asserted-by":"crossref","unstructured":"Sifa, R., Ladi, A., Pielka, M., Ramamurthy, R., Hillebrand, L., Kirsch, B., Biesner, D., Stenzel, R., Bell, T., & L\u00fcbbering, M., et al. (2019). Towards automated auditing with machine learning. In: Proceedings of the ACM Symposium on Document Engineering (DocEng).","DOI":"10.1145\/3342558.3345421"},{"key":"9862_CR14","unstructured":"Specia, L., Blain, F., Fomicheva, M., Zerva, C., Li, Z., Chaudhary, V., & Martins, A. F. (2021). Findings of the wmt 2021 shared task on quality estimation. In: Sixth Conference on Machine Translation (WMT)."},{"key":"9862_CR15","unstructured":"Touvron, H., Martin, L., Stone, K., Albert, P., Almahairi, A., Babaei, Y., Bashlykov, N., Batra, S., Bhargava, P., Bhosale, S., Bikel, D., Blecher, L., Ferrer, C. C., Chen, M., Cucurull, G., Esiobu, D., Fernandes, J., Fu, J., Fu, W., Fuller, B., Gao, C., Goswami, V., Goyal, N., Hartshorn, A., Hosseini, S., Hou, R., Inan, H., Kardas, M., Kerkez, V., Khabsa, M., Kloumann, I., Korenev, A., Koura, P. S., Lachaux, M. -A., Lavril, T., Lee, J., Liskovich, D., Lu, Y., Mao, Y., Martinet, X., Mihaylov, T., Mishra, P., Molybog, I., Nie, Y., Poulton, A., Reizenstein, J., Rungta, R., Saladi, K., Schelten, A., Silva, R., Smith, E. M., Subramanian, R., Tan, X. E., Tang, B., Taylor, R., Williams, A., Kuan, J. X., Xu, P., Yan, Z., Zarov, I., Zhang, Y., Fan, A., Kambadur, M., Narang, S., Rodriguez, A., Stojnic, R., Edunov, S., & Scialom, T. (2023) Llama 2: Open Foundation and Fine-Tuned Chat Models."},{"key":"9862_CR16","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, L., & Polosukhin, I. (2017). Attention is all you need. In: Proceedings of the 31st International Conference on Neural Information Processing Systems. NIPS\u201917, pp. 6000\u20136010. Curran Associates Inc., Red Hook, NY, USA."},{"key":"9862_CR17","unstructured":"Wang, L., Yang, N., Huang, X., Yang, L., Majumder, R., & Wei, F. (2024). Multilingual e5 text embeddings: a technical report. arXiv preprint arXiv:2402.05672"},{"key":"9862_CR18","unstructured":"Zerva, C., Blain, F., Rei, R., Lertvittayakumjorn, P., Souza, J. G., Eger, S., Kanojia, D., Alves, D., Or\u0103san, C., Fomicheva, M., Martins, A. F. T., & Specia, L. (2022). Findings of the WMT 2022 shared task on quality estimation. In: Koehn, P., Barrault, L., Bojar, O., Bougares, F., Chatterjee, R., Costa-juss\u00e0, M.R., Federmann, C., Fishel, M., Fraser, A., Freitag, M., Graham, Y., Grundkiewicz, R., Guzman, P., Haddow, B., Huck, M., Jimeno\u00a0Yepes, A., Kocmi, T., Martins, A., Morishita, M., Monz, C., Nagata, M., Nakazawa, T., Negri, M., N\u00e9v\u00e9ol, A., Neves, M., Popel, M., Turchi, M., Zampieri, M. (eds.) Proceedings of the Seventh Conference on Machine Translation (WMT), pp. 69\u201399. Association for Computational Linguistics, Abu Dhabi, United Arab Emirates (Hybrid). https:\/\/aclanthology.org\/2022.wmt-1.3"}],"container-title":["Language Resources and Evaluation"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10579-025-09862-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10579-025-09862-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10579-025-09862-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,12,9]],"date-time":"2025-12-09T05:15:49Z","timestamp":1765257349000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10579-025-09862-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,7,18]]},"references-count":18,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2025,12]]}},"alternative-id":["9862"],"URL":"https:\/\/doi.org\/10.1007\/s10579-025-09862-z","relation":{},"ISSN":["1574-020X","1574-0218"],"issn-type":[{"value":"1574-020X","type":"print"},{"value":"1574-0218","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,7,18]]},"assertion":[{"value":"28 February 2025","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 June 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 July 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no Conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}