{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T01:06:25Z","timestamp":1760058385013,"version":"build-2065373602"},"reference-count":23,"publisher":"MDPI AG","issue":"4","license":[{"start":{"date-parts":[[2025,3,28]],"date-time":"2025-03-28T00:00:00Z","timestamp":1743120000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Edith-Haberland-Wagner-Stiftung"},{"name":"Buhl-Strohmaier-Foundation"},{"name":"W\u00fcrth Foundation"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Information"],"abstract":"<jats:p>As the prevalence of machine-written texts grows, it has become increasingly important to distinguish between human- and machine-generated content, especially when such texts are not explicitly labeled. Current artificial intelligence (AI) detection methods primarily focus on human-like characteristics, such as emotionality and subjectivity. However, these features can be easily modified through AI humanization, which involves altering word choice. In contrast, altering the underlying grammar without affecting the conveyed information is considerably more challenging. Thus, the grammatical characteristics of a text can be used as additional indicators of its origin. To address this, we employ a newly developed rule-based parser to analyze the grammatical structures in human- and machine-written texts. Our findings reveal systematic grammatical differences between human- and machine-written texts, providing a reliable criterion for the determination of the text origin. We further examine the stability of this criterion in the context of AI humanization and translation to other languages.<\/jats:p>","DOI":"10.3390\/info16040274","type":"journal-article","created":{"date-parts":[[2025,3,31]],"date-time":"2025-03-31T03:25:02Z","timestamp":1743391502000},"page":"274","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Comparison of Grammar Characteristics of Human-Written Corpora and Machine-Generated Texts Using a Novel Rule-Based Parser"],"prefix":"10.3390","volume":"16","author":[{"given":"Simon","family":"Str\u00fcbbe","sequence":"first","affiliation":[{"name":"Research Unit of the Buhl-Strohmaier Foundation for Cerebral Palsy and Paediatric Neuroorthopaedics, Department of Orthopaedics and Sports Orthopaedics, Klinikum Rechts der Isar, TUM School of Medicine and Health, Technical University of Munich, Ismaningerstr 22, 81675 Munich, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4158-9416","authenticated-orcid":false,"given":"Irina","family":"Sidorenko","sequence":"additional","affiliation":[{"name":"Research Unit of the Buhl-Strohmaier Foundation for Cerebral Palsy and Paediatric Neuroorthopaedics, Department of Orthopaedics and Sports Orthopaedics, Klinikum Rechts der Isar, TUM School of Medicine and Health, Technical University of Munich, Ismaningerstr 22, 81675 Munich, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ren\u00e9e","family":"Lampe","sequence":"additional","affiliation":[{"name":"Research Unit of the Buhl-Strohmaier Foundation for Cerebral Palsy and Paediatric Neuroorthopaedics, Department of Orthopaedics and Sports Orthopaedics, Klinikum Rechts der Isar, TUM School of Medicine and Health, Technical University of Munich, Ismaningerstr 22, 81675 Munich, Germany"},{"name":"Markus W\u00fcrth Professorship, Technical University of Munich, 81675 Munich, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2025,3,28]]},"reference":[{"key":"ref_1","unstructured":"OpenAI (2024, December 12). ChatGPT (GPT-4, GPT-3.5, DALLE, etc.). Available online: https:\/\/openai.com."},{"key":"ref_2","unstructured":"Tian, E. (2024, December 12). GPTZero: AI Content Detection Tool. Available online: https:\/\/gptzero.me."},{"key":"ref_3","unstructured":"Copyleaks Ltd. (2024, December 12). Copyleaks: AI-Based Plagiarism & AI Content Detection. Available online: https:\/\/copyleaks.com."},{"key":"ref_4","unstructured":"Crossplag (2024, December 12). Crossplag: AI Content Detection Tool. Available online: https:\/\/crossplag.com."},{"key":"ref_5","unstructured":"HIX.AI (2024, December 12). AI Humanizer. Available online: https:\/\/bypass.hix.ai."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"012019","DOI":"10.1088\/1742-6596\/2514\/1\/012019","article-title":"Semantic approach for solving the decision problem in natural language","volume":"2514","author":"Sidorenko","year":"2023","journal-title":"J. Phys. Conf. Ser."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Str\u00fcbbe, S.M., Gr\u00fcnwald, A.T., Sidorenko, I., and Lampe, R. (2024). A Rule-Based Parser in Comparison with Statistical Neuronal Approaches in Terms of Grammar Competence. Appl. Sci., 15.","DOI":"10.3390\/app15010087"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"138","DOI":"10.1111\/j.1749-818X.2010.00187.x","article-title":"Dependency parsing","volume":"4","author":"Nivre","year":"2010","journal-title":"Lang. Linguist. Compass"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"051915","DOI":"10.1103\/PhysRevE.69.051915","article-title":"Patterns in syntactic dependency networks","volume":"69","year":"2004","journal-title":"Phys. Rev. E-Stat. Nonlinear Soft Matter Phys."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"1898","DOI":"10.1007\/s11431-020-1666-4","article-title":"A survey of syntactic-semantic parsing based on constituent and dependency structures","volume":"63","author":"Zhang","year":"2020","journal-title":"Sci. China Technol. Sci."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"259","DOI":"10.2307\/412278","article-title":"Dependency structures and transformational rules","volume":"46","author":"Robinson","year":"1970","journal-title":"Language"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Opara, C. (2024, January 8\u201312). StyloAI: Distinguishing AI-generated content with stylometric analysis. Proceedings of the International Conference on Artificial Intelligence in Education, Recife, Brazil.","DOI":"10.1007\/978-3-031-64312-5_13"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Alamleh, H., AlQahtani, A.A.S., and ElSaid, A. (2023, January 27\u201328). Distinguishing human-written and ChatGPT-generated text using machine learning. Proceedings of the 2023 Systems and Information Engineering Design Symposium (SIEDS), Charlottesville, VA, USA.","DOI":"10.1109\/SIEDS58326.2023.10137767"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"513","DOI":"10.1016\/0306-4573(88)90021-0","article-title":"Term-weighting approaches in automatic text retrieval","volume":"24","author":"Salton","year":"1988","journal-title":"Inf. Process. Manag."},{"key":"ref_15","first-page":"88320","article-title":"Detective: Detecting ai-generated text via multi-level contrastive learning","volume":"37","author":"Guo","year":"2024","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_16","unstructured":"Devlin, J., Chang, M.W., Lee, K., and Toutanova, K. (2019, January 2\u20137). Bert: Pre-training of deep bidirectional transformers for language understanding. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Minneapolis, MN, USA."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Mu\u00f1oz-Ortiz, A., G\u00f3mez-Rodr\u00edguez, C., and Vilares, D. (2023). Contrasting linguistic patterns in human and llm-generated text. arXiv.","DOI":"10.21203\/rs.3.rs-4077382\/v1"},{"key":"ref_18","unstructured":"Goldhahn, D., Eckart, T., and Quasthoff, U. (2012, January 21\u201327). Building large monolingual dictionaries at the leipzig corpora collection: From 100 to 200 languages. Proceedings of the LREC, Istanbul, Turkey."},{"key":"ref_19","unstructured":"Lezius, W. (2000, January 8\u201312). Morphy-German morphology, part-of-speech tagging and applications. Proceedings of the 9th EURALEX International Congress, Stuttgart, Germany."},{"key":"ref_20","unstructured":"(2024, December 12). DeepL GmbH DeepL Translator. Available online: https:\/\/www.deepl.com."},{"key":"ref_21","unstructured":"Schmieder \u00dcbersetzungen & Sprachmanagement (2025, March 19). DeepL vs. ChatGPT: Wer Liefert Die Bessere \u00dcbersetzung?. Available online: https:\/\/www.schmieder-uebersetzungen.de\/blog\/detail\/deepl-vs-chatgpt."},{"key":"ref_22","unstructured":"DOG GmbH (2025, March 19). DeepL\u2014Ein Erfahrungsbericht. n.d. Available online: https:\/\/www.dog-gmbh.de\/blog\/deepl-ein-erfahrungsbericht."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"100012","DOI":"10.1016\/j.nlp.2023.100012","article-title":"Development and verification of a user-friendly software for German text simplification focused on patients with cerebral palsy","volume":"3","author":"Sidorenko","year":"2023","journal-title":"Nat. Lang. Process. J."}],"container-title":["Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2078-2489\/16\/4\/274\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T17:04:52Z","timestamp":1760029492000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2078-2489\/16\/4\/274"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,3,28]]},"references-count":23,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2025,4]]}},"alternative-id":["info16040274"],"URL":"https:\/\/doi.org\/10.3390\/info16040274","relation":{},"ISSN":["2078-2489"],"issn-type":[{"type":"electronic","value":"2078-2489"}],"subject":[],"published":{"date-parts":[[2025,3,28]]}}}