{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,24]],"date-time":"2026-06-24T09:47:58Z","timestamp":1782294478864,"version":"3.54.5"},"reference-count":189,"publisher":"Cambridge University Press (CUP)","issue":"5","license":[{"start":{"date-parts":[[2021,7,2]],"date-time":"2021-07-02T00:00:00Z","timestamp":1625184000000},"content-version":"unspecified","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["cambridge.org"],"crossmark-restriction":true},"short-container-title":["Nat. Lang. Eng."],"published-print":{"date-parts":[[2022,9]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Automatic deception detection is a crucial task that has many applications both in direct physical and in computer-mediated human communication. Our focus is on automatic deception detection in text across cultures. In this context, we view culture through the prism of the individualism\/collectivism dimension, and we approximate culture by using country as a proxy. Having as a starting point recent conclusions drawn from the social psychology discipline, we explore if differences in the usage of specific linguistic features of deception across cultures can be confirmed and attributed to cultural norms in respect to the individualism\/collectivism divide. In addition, we investigate if a universal feature set for cross-cultural text deception detection tasks exists. We evaluate the predictive power of different feature sets and approaches. We create culture\/language-aware classifiers by experimenting with a wide range of n-gram features from several levels of linguistic analysis, namely phonology, morphology and syntax, other linguistic cues like word and phoneme counts, pronouns use, etc., and token embeddings. We conducted our experiments over eleven data sets from five languages (English, Dutch, Russian, Spanish, and Romanian), from six countries (United States of America, Belgium, India, Russia, Mexico, and Romania), and we applied two classification methods, namely logistic regression and fine-tuned BERT models. The results showed that the undertaken task is fairly complex and demanding. Furthermore, there are indications that some linguistic cues of deception have cultural origins and are consistent in the context of diverse domains and data set settings for the same language. This is more evident for the usage of pronouns and the expression of sentiment in deceptive language. The results of this work show that the automatic deception detection across cultures and languages cannot be handled in unified manners and that such approaches should be augmented with knowledge about cultural differences and the domains of interest.<\/jats:p>","DOI":"10.1017\/s1351324921000152","type":"journal-article","created":{"date-parts":[[2021,7,2]],"date-time":"2021-07-02T13:53:41Z","timestamp":1625234021000},"page":"545-606","update-policy":"https:\/\/doi.org\/10.1017\/policypage","source":"Crossref","is-referenced-by-count":15,"title":["Deception detection in text and its relation to the cultural dimension of individualism\/collectivism"],"prefix":"10.1017","volume":"28","author":[{"given":"Katerina","family":"Papantoniou","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Panagiotis","family":"Papadakos","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Theodore","family":"Patkos","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"George","family":"Flouris","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Ion","family":"Androutsopoulos","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Dimitris","family":"Plexousakis","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"56","published-online":{"date-parts":[[2021,7,2]]},"reference":[{"key":"S1351324921000152_ref46","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/E14-1030"},{"key":"S1351324921000152_ref164","doi-asserted-by":"publisher","DOI":"10.1177\/0146167208318067"},{"key":"S1351324921000152_ref30","volume-title":"Through the Language Glass: Why the World Looks Different in Other Languages.","author":"Deutscher","year":"2010"},{"key":"S1351324921000152_ref64","doi-asserted-by":"publisher","DOI":"10.1148\/radiology.143.1.7063747"},{"key":"S1351324921000152_ref1","volume-title":"Language Shock: Understanding the Culture of Conversation","author":"Agar","year":"1994"},{"key":"S1351324921000152_ref12","doi-asserted-by":"publisher","DOI":"10.1145\/3184558.3191577"},{"key":"S1351324921000152_ref109","doi-asserted-by":"publisher","DOI":"10.3115\/1667583.1667679"},{"key":"S1351324921000152_ref113","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-99722-3_33"},{"key":"S1351324921000152_ref68","doi-asserted-by":"publisher","DOI":"10.1007\/s00500-016-2409-2"},{"key":"S1351324921000152_ref171","author":"Vogel","year":"2019"},{"key":"S1351324921000152_ref177","doi-asserted-by":"publisher","DOI":"10.1111\/j.1744-6570.1980.tb02165.x"},{"key":"S1351324921000152_ref5","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00288"},{"key":"S1351324921000152_ref34","doi-asserted-by":"publisher","DOI":"10.1177\/0093650213480175"},{"key":"S1351324921000152_ref91","doi-asserted-by":"publisher","DOI":"10.2307\/2347628"},{"key":"S1351324921000152_ref81","doi-asserted-by":"publisher","DOI":"10.1111\/1556-4029.13645"},{"key":"S1351324921000152_ref42","doi-asserted-by":"publisher","DOI":"10.1080\/00437956.1961.11659754"},{"key":"S1351324921000152_ref85","unstructured":"Kuratov, Y. and Arkhipov, M. (2019). Adaptation of deep bidirectional multilingual transformers for russian language. CoRR, abs\/1905.07213."},{"key":"S1351324921000152_ref143","unstructured":"Sahlgren, M. (2006). The Word-Space Model: Using distributional analysis to represent syntagmatic and paradigmatic relations between words in high-dimensional vector spaces. PhD thesis, University of Stockholm. http:\/\/eprints.sics.se\/437\/1\/TheWordSpaceModel.pdf"},{"key":"S1351324921000152_ref100","unstructured":"Loshchilov, I. and Hutter, F. (2019). Decoupled weight decay regularization. In International Conference on Learning Representations."},{"key":"S1351324921000152_ref61","volume-title":"Beyond Culture","author":"Hall","year":"1976"},{"key":"S1351324921000152_ref75","doi-asserted-by":"crossref","unstructured":"Jawahar, G. , Sagot, B. and Seddah, D. (2019). What does BERT learn about the structure of language? In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Florence, Italy: Association for Computational Linguistics\u201e pp. 3651\u20133657.","DOI":"10.18653\/v1\/P19-1356"},{"key":"S1351324921000152_ref187","doi-asserted-by":"publisher","DOI":"10.1109\/HICSS.2008.109"},{"key":"S1351324921000152_ref167","first-page":"26","article-title":"Beurteilung der glaubhaftigkeit von aussagen [evaluation of statement credibility\/ statement validity assessment]","volume":"11","author":"Undeutsch","year":"1967","journal-title":"Forensische Psychologie"},{"key":"S1351324921000152_ref78","unstructured":"Karimi, H. and Tang, J. (2019). Learning hierarchical discourse-level structure for fake news detection. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pp. 3432\u20133442. Minneapolis, Minnesota: Association for Computational Linguistics."},{"key":"S1351324921000152_ref117","doi-asserted-by":"publisher","DOI":"10.1177\/0146167203029005010"},{"key":"S1351324921000152_ref29","unstructured":"DePaulo, B. , Stone, J. and Lassiter, D. (1985). Deceiving and detecting deceit. In The Self and Social Life, pp. 323\u2013370."},{"key":"S1351324921000152_ref146","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-98932-7_13"},{"key":"S1351324921000152_ref89","doi-asserted-by":"publisher","DOI":"10.1038\/242190a0"},{"key":"S1351324921000152_ref97","author":"Ling","year":"2003"},{"key":"S1351324921000152_ref176","doi-asserted-by":"publisher","DOI":"10.2466\/pms.1999.89.1.19"},{"key":"S1351324921000152_ref139","unstructured":"Rubin, V.L. , Conroy, N. and Chen, Y. (2015). Towards news verification: Deception detection methods for news discourse. In Proceedings of the Rapid Screening Technologies, Deception Detection and Credibility Assessment Symposium, Hawaii International Conference on System Sciences (HICSS 48)."},{"key":"S1351324921000152_ref130","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W17-3608"},{"key":"S1351324921000152_ref133","unstructured":"Qin, T. , Burgoon, J.K. , Blair, J.P. and Nunamaker, J.F. (2005). Modality effects in deception detection and applications in automatic-deception-detection. In Proceedings of the 38th Annual Hawaii International Conference on System Sciences, pp. 23b\u201323b."},{"key":"S1351324921000152_ref178","volume-title":"Language, Thought, and Reality: Selected Writings of Benjamin Lee Whorf","author":"Whorf","year":"1956"},{"key":"S1351324921000152_ref138","doi-asserted-by":"crossref","unstructured":"Rubin, V.L. (2010). On deception and deception detection: Content analysis of computer-mediated stated beliefs. In Proceedings of the 73rd ASIS&T Annual Meeting on Navigating Streams in an Information Ecosystem \u2013 Volume 47, ASIS&T 2010. Silver Springs, MD, USA: American Society for Information Science, pp. 32:1\u201332:10.","DOI":"10.1002\/meet.14504701124"},{"key":"S1351324921000152_ref3","unstructured":"Almela, A. , Valencia-Garca, R. and Cantos, P. (2012). Seeing through deception: A computational approach to deceit detection in written communication. In Proceedings of the Workshop on Computational Approaches to Deception Detection, EACL 2012, Stroudsburg, PA, USA: Association for Computational Linguistics, pp. 15\u201322."},{"key":"S1351324921000152_ref165","doi-asserted-by":"publisher","DOI":"10.1037\/0022-3514.54.2.323"},{"key":"S1351324921000152_ref131","doi-asserted-by":"publisher","DOI":"10.1007\/BF01498980"},{"key":"S1351324921000152_ref53","author":"Fusilier","year":"2015"},{"key":"S1351324921000152_ref48","doi-asserted-by":"publisher","DOI":"10.1037\/10012-000"},{"key":"S1351324921000152_ref31","unstructured":"Devlin, J. , Chang, M.-W. , Lee, K. and Toutanova, K. (2019). BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Minneapolis, Minnesota: Association for Computational Linguistics, pp. 4171\u20134186."},{"key":"S1351324921000152_ref59","doi-asserted-by":"publisher","DOI":"10.1145\/2939672.2939754"},{"key":"S1351324921000152_ref88","doi-asserted-by":"publisher","DOI":"10.1007\/s10994-005-0466-3"},{"key":"S1351324921000152_ref90","unstructured":"Le, Q. and Mikolov, T. (2014). Distributed representations of sentences and documents. In Proceedings of the 31st International Conference on International Conference on Machine Learning \u2013 Volume 32, ICML 2014, II\u20131188\u2013II\u20131196. JMLR.org."},{"key":"S1351324921000152_ref114","unstructured":"Mukherjee, A. , Venkataraman, V. , Liu, B. and Glance, N.S. (2013a). What Yelp fake review filter might be doing? In Kiciman E., Ellison N.B., Hogan B., Resnick P. and Soboroff I. (eds.), ICWSM. The AAAI Press."},{"key":"S1351324921000152_ref157","doi-asserted-by":"publisher","DOI":"10.1007\/11564126_72"},{"key":"S1351324921000152_ref94","doi-asserted-by":"publisher","DOI":"10.1016\/j.chb.2008.05.002"},{"key":"S1351324921000152_ref73","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-23281-8_7"},{"key":"S1351324921000152_ref101","unstructured":"Loukachevitch, N. and Levchik, A. (2016). Creating a general russian sentiment lexicon. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016). Paris, France: European Language Resources Association (ELRA)."},{"key":"S1351324921000152_ref125","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/P14-2072"},{"key":"S1351324921000152_ref162","doi-asserted-by":"crossref","unstructured":"Taylor, P.J. , Larner, S. , Conchie, S.M. and van der Zee, S. (2014). Cross-Cultural Deception Detection, Chapter 8. Wiley-Blackwell, pp. 175\u2013201.","DOI":"10.1002\/9781118510001.ch8"},{"key":"S1351324921000152_ref179","doi-asserted-by":"publisher","DOI":"10.3115\/1220575.1220619"},{"key":"S1351324921000152_ref11","doi-asserted-by":"publisher","DOI":"10.1109\/ICWR.2019.8765275"},{"key":"S1351324921000152_ref128","doi-asserted-by":"crossref","unstructured":"Pisarevskaya, D. and Galitsky, B. (2019). An anatomy of a lie: Discourse patterns in customer complaints deception dataset. In Computational Linguistics and Intellectual Technologies: Proceedings of the International Conference \u201cDialogue 2019\", Dialogue 2019, pp. 513\u2013531.","DOI":"10.1145\/3308560.3316604"},{"key":"S1351324921000152_ref137","unstructured":"Rotman, L. (2012). How culture influences the telling and detection of lies: Differences between low- and high-context individuals. Master\u2019s thesis, Lancaster University, Twente University, Enschede, the Netherlands. https:\/\/essay.utwente.nl\/62456\/1\/Rotman"},{"key":"S1351324921000152_ref23","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.747"},{"key":"S1351324921000152_ref118","unstructured":"Ott, M. , Cardie, C. and Hancock, J.T. (2013). Negative deceptive opinion spam. In Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, pp. 497\u2013501."},{"key":"S1351324921000152_ref153","doi-asserted-by":"publisher","DOI":"10.1145\/3349536"},{"key":"S1351324921000152_ref25","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1269"},{"key":"S1351324921000152_ref119","unstructured":"Ott, M. , Choi, Y. , Cardie, C. and Hancock, J.T. (2011). Finding deceptive opinion spam by any stretch of the imagination. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies \u2013 Volume 1, HLT 2011. Stroudsburg, PA, USA: Association for Computational Linguistics, pp. 309\u2013319."},{"key":"S1351324921000152_ref13","unstructured":"Blei, D.M. , Ng, A.Y. and Jordan, M.I. (2003). Latent dirichlet allocation. Journal of Machine Learning Research 3, 993\u20131022."},{"key":"S1351324921000152_ref36","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2018.10.474"},{"key":"S1351324921000152_ref84","unstructured":"Krishnamurthy, G. , Majumder, N. , Poria, S. and Cambria, E. (2018). A deep learning approach for multimodal deception detection. In Proceedings of the 19th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing 2018)."},{"key":"S1351324921000152_ref145","unstructured":"Salvetti, F. , Lowe, J.B. and Martin, J.H. (2016). A tangled web: The faint signals of deception in text \u2013 boulder lies and truth corpus (BLT-C). In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016). Paris, France: European Language Resources Association (ELRA)."},{"key":"S1351324921000152_ref15","doi-asserted-by":"publisher","DOI":"10.1007\/BF00996226"},{"key":"S1351324921000152_ref33","unstructured":"Dumitrescu, S.D. and Avram, A.-M. (2020b). Introducing RONEC \u2013 The Romanian named entity corpus. In Proceedings of The 12th Language Resources and Evaluation Conference. Marseille, France: European Language Resources Association, pp. 4436\u20134443."},{"key":"S1351324921000152_ref169","doi-asserted-by":"publisher","DOI":"10.1037\/0022-3514.77.2.279"},{"key":"S1351324921000152_ref2","doi-asserted-by":"publisher","DOI":"10.1109\/SPW.2018.00022"},{"key":"S1351324921000152_ref80","doi-asserted-by":"publisher","DOI":"10.21236\/ADA006655"},{"key":"S1351324921000152_ref37","doi-asserted-by":"publisher","DOI":"10.1023\/A:1022966505471"},{"key":"S1351324921000152_ref175","first-page":"239","article-title":"A linguistic-based measure of cultural distance and its relationship to managerial values","volume":"44","author":"West","year":"2004","journal-title":"Management International Review"},{"key":"S1351324921000152_ref121","unstructured":"Pennebaker, J.W. , Francis, M.E. and Booth, R.J. (2001). Linguistic Inquiry and Word Count. Mahwah, NJ: Lawerence Erlbaum Associates."},{"key":"S1351324921000152_ref62","doi-asserted-by":"publisher","DOI":"10.1145\/1656274.1656278"},{"key":"S1351324921000152_ref24","unstructured":"Conneau, A. and Lample, G. (2019). Cross-lingual language model pretraining. In Advances in Neural Information Processing Systems, volume 32. Curran Associates, Inc., pp. 7059\u20137069."},{"key":"S1351324921000152_ref132","doi-asserted-by":"crossref","unstructured":"Potthast, M. , Kiesel, J. , Reinartz, K. , Bevendorff, J. and Stein, B. (2018). A stylometric inquiry into hyperpartisan and fake news. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 231\u2013240. Melbourne, Australia: Association for Computational Linguistics.","DOI":"10.18653\/v1\/P18-1022"},{"key":"S1351324921000152_ref74","doi-asserted-by":"publisher","DOI":"10.1080\/13183222.2018.1418964"},{"key":"S1351324921000152_ref39","unstructured":"Feng, S. , Banerjee, R. and Choi, Y. (2012). Syntactic stylometry for deception detection. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers \u2013 Volume 2, ACL 2012. Stroudsburg, PA, USA: Association for Computational Linguistics, pp. 171\u2013175."},{"key":"S1351324921000152_ref60","volume-title":"Managerial Thinking: An International Study","author":"Haire","year":"1966"},{"key":"S1351324921000152_ref71","unstructured":"Hu, J. , Ruder, S. , Siddhant, A. , Neubig, G. , Firat, O. and Johnson, M. (2020). XTREME: A massively multilingual multi-task benchmark for evaluating cross-lingual generalisation. In Daum\u00c9 III H. and Singh A. (eds.), Proceedings of the 37th International Conference on Machine Learning, volume 119. Proceedings of Machine Learning Research, Virtual. PMLR, pp. 4411\u20134421."},{"key":"S1351324921000152_ref77","unstructured":"Jones, T. and Newburn, T. (2001). Widening access: Improving police relations with hard to reach groups. Police Research Series 138."},{"key":"S1351324921000152_ref98","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/E17-4005"},{"key":"S1351324921000152_ref19","unstructured":"Caete, J. , Chaperon, G. , Fuentes, R. and Prez, J. (2020). Spanish pre-trained BERT model and evaluation data. In to appear in PML4DC at ICLR 2020."},{"key":"S1351324921000152_ref16","doi-asserted-by":"publisher","DOI":"10.1002\/0470018860.s00567"},{"key":"S1351324921000152_ref7","unstructured":"Ba, J.L. , Kiros, J.R. and Hinton, G.E. (2016). Layer Normalization."},{"key":"S1351324921000152_ref79","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-2048"},{"key":"S1351324921000152_ref183","doi-asserted-by":"publisher","DOI":"10.1109\/ASONAM.2018.8508314"},{"key":"S1351324921000152_ref54","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-45442-5_18"},{"key":"S1351324921000152_ref28","unstructured":"de Vries, W. , van Cranenburgh, A. , Bisazza, A. , Caselli, T. , van Noord, G. and Nissim, M. (2019). BERTje: A Dutch BERT Model."},{"key":"S1351324921000152_ref32","unstructured":"Dumitrescu, S. and Avram, A.-M. (2020a). RomanianBERT."},{"key":"S1351324921000152_ref63","doi-asserted-by":"publisher","DOI":"10.1080\/01638530701739181"},{"key":"S1351324921000152_ref159","doi-asserted-by":"publisher","DOI":"10.1007\/s11575-016-0283-x"},{"key":"S1351324921000152_ref102","unstructured":"Mafela, M.J. (2013). Cultural diversity and the element of negation. Intercultural Communication Studies."},{"key":"S1351324921000152_ref50","volume-title":"The Language Parallax: Linguistic Relativism and Poetic indeterminacy","author":"Friedrich","year":"1989"},{"key":"S1351324921000152_ref38","unstructured":"Feng, F. , Yang, Y. , Cer, D. , Arivazhagan, N. and Wang, W. (2020). Language-Agnostic BERT Sentence Embedding."},{"key":"S1351324921000152_ref124","unstructured":"P\u00e9rez-Rosas, V. , Kleinberg, B. , Lefevre, A. and Mihalcea, R. (2017). Automatic detection of fake news. CoRR, abs\/1708.07104."},{"key":"S1351324921000152_ref51","doi-asserted-by":"publisher","DOI":"10.1111\/j.1467-7687.2008.00695.x"},{"key":"S1351324921000152_ref173","volume-title":"Detecting Lies and Deceit: Pitfalls and Opportunities","author":"Vrij","year":"2008"},{"key":"S1351324921000152_ref186","doi-asserted-by":"publisher","DOI":"10.1023\/B:GRUP.0000011944.62889.6f"},{"key":"S1351324921000152_ref174","doi-asserted-by":"publisher","DOI":"10.1177\/1529100610390861"},{"key":"S1351324921000152_ref52","doi-asserted-by":"publisher","DOI":"10.1016\/j.dss.2008.11.001"},{"key":"S1351324921000152_ref155","first-page":"1929","article-title":"Dropout: A simple way to prevent neural networks from overfitting","volume":"15","author":"Srivastava","year":"2014","journal-title":"Journal of Machine Learning Research"},{"key":"S1351324921000152_ref65","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2019.8682553"},{"key":"S1351324921000152_ref189","doi-asserted-by":"publisher","DOI":"10.1145\/3395046"},{"key":"S1351324921000152_ref69","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2005-580"},{"key":"S1351324921000152_ref8","unstructured":"Baccianella, S. , Esuli, A. and Sebastiani, F. (2010). SentiWordNet 3.0: An enhanced lexical resource for sentiment analysis and opinion mining. In Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC 2010), Valletta, Malta: European Language Resources Association (ELRA)."},{"key":"S1351324921000152_ref184","doi-asserted-by":"publisher","DOI":"10.1037\/0033-295X.96.3.395"},{"key":"S1351324921000152_ref66","doi-asserted-by":"publisher","DOI":"10.1177\/1088868314556539"},{"key":"S1351324921000152_ref154","doi-asserted-by":"publisher","DOI":"10.3389\/fpsyg.2012.00453"},{"key":"S1351324921000152_ref110","unstructured":"Mikolov, T. , Sutskever, I. , Chen, K. , Corrado, G. and Dean, J. (2013). Distributed representations of words and phrases and their compositionality. In Proceedings of the 26th International Conference on Neural Information Processing Systems \u2013 Volume 2, NIPS 2013. Red Hook, NY, USA: Curran Associates Inc., pp. 3111\u20133119"},{"key":"S1351324921000152_ref9","doi-asserted-by":"publisher","DOI":"10.3115\/1599081.1599087"},{"key":"S1351324921000152_ref188","doi-asserted-by":"publisher","DOI":"10.1145\/1378727.1389972"},{"key":"S1351324921000152_ref103","unstructured":"Maks, I. , Izquierdo, R. , Frontini, F. , Agerri, R. , Vossen, P. and Azpeitia, A. (2014). Generating polarity lexicons with WordNet propagation in 5 languages. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014), pp. 1155\u20131161. Reykjavik, Iceland: European Language Resources Association (ELRA)."},{"key":"S1351324921000152_ref126","doi-asserted-by":"publisher","DOI":"10.5195\/LESLI.2013.2"},{"key":"S1351324921000152_ref41","unstructured":"Fitzpatrick, E. and Bachenko, J. (2012). Building a data collection for deception research. In Proceedings of the Workshop on Computational Approaches to Deception Detection, Avignon, France: Association for Computational Linguistics, pp. 31\u201338."},{"key":"S1351324921000152_ref115","unstructured":"Mukherjee, A. , Venkataraman, V.V. , Liu, B. and Glance, N.S. (2013b). Fake review detection: Classification and analysis of real and pseudo reviews. Technical report, University of Illinois at Chicago."},{"key":"S1351324921000152_ref18","doi-asserted-by":"publisher","DOI":"10.3389\/fpsyg.2015.01965"},{"key":"S1351324921000152_ref35","doi-asserted-by":"publisher","DOI":"10.1037\/0003-066X.46.9.913"},{"key":"S1351324921000152_ref150","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2012.6289079"},{"key":"S1351324921000152_ref170","unstructured":"Verhoeven, B. and Daelemans, W. (2014). CLiPS stylometry investigation (CSI) corpus: A Dutch corpus for the detection of age, gender, personality, sentiment and deception in text. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC-2014). European Language Resources Association (ELRA)."},{"key":"S1351324921000152_ref142","unstructured":"Saeed, R.M. , Rady, S. and Gharib, T.F. (2019). An ensemble approach for spam detection in arabic opinion texts. Journal of King Saud University \u2013 Computer and Information Sciences."},{"key":"S1351324921000152_ref141","first-page":"318","volume-title":"Learning Internal Representations by Error Propagation","author":"Rumelhart","year":"1987"},{"key":"S1351324921000152_ref87","doi-asserted-by":"publisher","DOI":"10.1007\/BF00262952"},{"key":"S1351324921000152_ref127","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1493"},{"key":"S1351324921000152_ref22","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W19-4330"},{"key":"S1351324921000152_ref120","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P16-1203"},{"key":"S1351324921000152_ref112","unstructured":"Mohammad, S.M. , Salameh, M. and Kiritchenko, S. (2016). Sentiment lexicons for Arabic social media. In Proceedings of 10th edition of the the Language Resources and Evaluation Conference (LREC), Portoro\u017e, Slovenia."},{"key":"S1351324921000152_ref180","doi-asserted-by":"publisher","DOI":"10.1111\/j.1083-6101.2006.tb00313.x"},{"key":"S1351324921000152_ref57","doi-asserted-by":"publisher","DOI":"10.1002\/9781118510001"},{"key":"S1351324921000152_ref96","unstructured":"Libovick\u00fd, J. , Rosa, R. and Fraser, A. (2019). How language-neutral is multilingual BERT? arXiv e-prints, arXiv:1911.03310."},{"key":"S1351324921000152_ref181","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-demos.12"},{"key":"S1351324921000152_ref149","doi-asserted-by":"publisher","DOI":"10.3389\/fpsyg.2014.00080"},{"key":"S1351324921000152_ref108","unstructured":"Mihalcea, R. (2014). Romanian-english dictionary. LINDAT\/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (\u00daFAL), Faculty of Mathematics and Physics, Charles University."},{"key":"S1351324921000152_ref58","doi-asserted-by":"publisher","DOI":"10.1109\/ASRU.2013.6707742"},{"key":"S1351324921000152_ref17","unstructured":"Bradley, M. and Lang, P. (1999). Affective Norms for English Words (ANEW): Instruction Manual and Affective Ratings. Technical report. Gainesville, FL: UF Center for the Study of Emotion and Attention."},{"key":"S1351324921000152_ref70","volume-title":"2nd and enlarged edition","author":"Hofstede","year":"2001"},{"key":"S1351324921000152_ref160","doi-asserted-by":"publisher","DOI":"10.1037\/h0022737"},{"key":"S1351324921000152_ref166","first-page":"177","volume-title":"An Analysis Towards Dialogue-Based Deception Detection","author":"Tsunomori","year":"2015"},{"key":"S1351324921000152_ref116","unstructured":"Nastase, V. , Sokolova, M. and Shirabad, J.S. (2007). Do happy words sound happy? A study of the relation between form and meaning for English words expressing emotions. In Proceedings of Recent Advances in Natural Language Processing (RANLP 2007)."},{"key":"S1351324921000152_ref4","unstructured":"Alyafeai, Z. , AlShaibani, M.S. and Ahmad, I. (2020). A Survey on Transfer Learning in Natural Language Processing."},{"key":"S1351324921000152_ref161","doi-asserted-by":"publisher","DOI":"10.1098\/rsos.170128"},{"key":"S1351324921000152_ref76","doi-asserted-by":"publisher","DOI":"10.1109\/INFOMAN.2018.8392850"},{"key":"S1351324921000152_ref163","doi-asserted-by":"publisher","DOI":"10.1109\/HICSS.2005.284"},{"key":"S1351324921000152_ref14","doi-asserted-by":"publisher","DOI":"10.1177\/0146167200265010"},{"key":"S1351324921000152_ref105","doi-asserted-by":"publisher","DOI":"10.1016\/j.tourman.2019.06.003"},{"key":"S1351324921000152_ref47","doi-asserted-by":"publisher","DOI":"10.1037\/0022-3514.72.6.1429"},{"key":"S1351324921000152_ref99","unstructured":"Liu, Y. , Ott, M. , Goyal, N. , Du, J. , Joshi, M. , Chen, D. , Levy, O. , Lewis, M. , Zettlemoyer, L. and Stoyanov, V. (2019). RoBERTa: A robustly optimized BERT pretraining approach. CoRR, abs\/1907.11692."},{"key":"S1351324921000152_ref26","doi-asserted-by":"publisher","DOI":"10.1007\/BF00994018"},{"key":"S1351324921000152_ref6","doi-asserted-by":"publisher","DOI":"10.1080\/01638531003674894"},{"key":"S1351324921000152_ref10","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1389"},{"key":"S1351324921000152_ref45","unstructured":"Fornaciari, T. and Poesio, M. (2012). DeCour: A corpus of DEceptive statements in Italian COURts. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC 2012). Istanbul, Turkey: European Language Resources Association (ELRA), pp. 1585\u20131590."},{"key":"S1351324921000152_ref43","doi-asserted-by":"publisher","DOI":"10.1109\/DSAA.2017.51"},{"key":"S1351324921000152_ref72","doi-asserted-by":"publisher","DOI":"10.1145\/1014052.1014073"},{"key":"S1351324921000152_ref134","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2017.01.015"},{"key":"S1351324921000152_ref27","first-page":"666","volume-title":"Electrodermal Activity (EDA)","author":"Critchley","year":"2013"},{"key":"S1351324921000152_ref40","doi-asserted-by":"publisher","DOI":"10.3115\/1219840.1219885"},{"key":"S1351324921000152_ref56","first-page":"2672","volume-title":"Advances in Neural Information Processing Systems","volume":"27","author":"Goodfellow","year":"2014"},{"key":"S1351324921000152_ref92","doi-asserted-by":"publisher","DOI":"10.1111\/lcrp.12131"},{"key":"S1351324921000152_ref122","unstructured":"P\u00e9rez-Rosas, V. , Banea, C. and Mihalcea, R. (2012). Learning sentiment lexicons in spanish. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC-2012). Istanbul, Turkey: European Language Resources Association (ELRA), pp. 3077\u20133081. ACL Anthology Identifier: L12-1645."},{"key":"S1351324921000152_ref185","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2018.03.007"},{"key":"S1351324921000152_ref107","doi-asserted-by":"publisher","DOI":"10.1016\/j.ijintrel.2007.06.002"},{"key":"S1351324921000152_ref111","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298958"},{"key":"S1351324921000152_ref151","doi-asserted-by":"publisher","DOI":"10.1080\/10570310209374731"},{"key":"S1351324921000152_ref135","doi-asserted-by":"crossref","unstructured":"Riloff, E. and Wiebe, J. (2003). Learning extraction patterns for subjective expressions. In Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing, EMNLP 2003, pp. 105\u2013112. Stroudsburg, PA, USA: Association for Computational Linguistics.","DOI":"10.3115\/1119355.1119369"},{"key":"S1351324921000152_ref104","doi-asserted-by":"publisher","DOI":"10.1037\/0033-295X.98.2.224"},{"key":"S1351324921000152_ref21","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1424"},{"key":"S1351324921000152_ref136","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00349"},{"key":"S1351324921000152_ref172","unstructured":"Vrij, A. (2008a). Detecting Lies and Deceit: Pitfalls and Opportunities. Wiley Series in the Psychology of Crime, Policing and Law. Wiley."},{"key":"S1351324921000152_ref20","unstructured":"Capuozzo, P. , Lauriola, I. , Strapparava, C. , Aiolli, F. and Sartori, G. (2020). DecOp: A multilingual and multi-domain corpus for detecting deception in typed text. In Proceedings of the 12th Language Resources and Evaluation Conference. Marseille, France: European Language Resources Association, pp. 1423\u20131430."},{"key":"S1351324921000152_ref168","first-page":"101","volume-title":"The Development of Statement Reality Analysis","author":"Undeutsch","year":"1989"},{"key":"S1351324921000152_ref49","doi-asserted-by":"publisher","DOI":"10.1214\/aos\/1016218223"},{"key":"S1351324921000152_ref67","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.naacl-main.201"},{"key":"S1351324921000152_ref158","doi-asserted-by":"publisher","DOI":"10.3389\/fpsyg.2014.00590"},{"key":"S1351324921000152_ref86","unstructured":"Lafferty, J.D. , McCallum, A. and Pereira, F.C.N. (2001). Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of the Eighteenth International Conference on Machine Learning, ICML 2001, San Francisco, CA, USA: Morgan Kaufmann Publishers Inc., pp. 282\u2013289"},{"key":"S1351324921000152_ref140","unstructured":"Rubin, V.L. and Vashchilko, T. (2012). Identification of truth and deception in text: Application of vector space model to rhetorical structure theory. In Proceedings of the EACL 2012 Workshop on Computational Approaches to Deception Detection, EACL 2012. Association for Computational Linguistics, pp. 97\u2013106."},{"key":"S1351324921000152_ref152","first-page":"105","volume-title":"The Language and Culture Debate","author":"Shaules","year":"2019"},{"key":"S1351324921000152_ref129","doi-asserted-by":"publisher","DOI":"10.26615\/978-954-452-038-0_001"},{"key":"S1351324921000152_ref147","unstructured":"Sanh, V. , Debut, L. , Chaumond, J. and Wolf, T. (2019). DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter."},{"key":"S1351324921000152_ref144","unstructured":"Salvetti, F. (2014). Detecting Deception in Text: A Corpus-Driven Approach. PhD thesis, University of Colorado Boulder, Department of Computer Science."},{"key":"S1351324921000152_ref95","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/P14-1147"},{"key":"S1351324921000152_ref123","first-page":"157","volume-title":"Deception Detection Within and Across Cultures","author":"P\u00e9rez-Rosas","year":"2014"},{"key":"S1351324921000152_ref82","doi-asserted-by":"publisher","DOI":"10.1017\/S0261444810000431"},{"key":"S1351324921000152_ref83","doi-asserted-by":"publisher","DOI":"10.3389\/fpsyg.2016.01779"},{"key":"S1351324921000152_ref148","unstructured":"Sapir, E. (1921). Language, An Introduction to the Study of Speech. Brace, NY."},{"key":"S1351324921000152_ref106","doi-asserted-by":"publisher","DOI":"10.1177\/0022022107311854"},{"key":"S1351324921000152_ref44","doi-asserted-by":"publisher","DOI":"10.1109\/EISIC.2013.8"},{"key":"S1351324921000152_ref93","unstructured":"Levitan, S.I. , Levine, M. , Hirschberg, J. , Nishmar, C. , Guozhen, A. and Rosenberg, A. (2015). Individual differences in deception and deception detection. In Proceedings of COGNITIVE 2015."},{"key":"S1351324921000152_ref182","first-page":"5753","article-title":"XLNet: Generalized autoregressive pretraining for language understanding","volume":"32","author":"Yang","year":"2019","journal-title":"In Advances in Neural Information Processing Systems"},{"key":"S1351324921000152_ref55","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N16-1047"},{"key":"S1351324921000152_ref156","doi-asserted-by":"publisher","DOI":"10.1002\/meet.2014.14505101086"}],"container-title":["Natural Language Engineering"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.cambridge.org\/core\/services\/aop-cambridge-core\/content\/view\/S1351324921000152","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,8,9]],"date-time":"2022-08-09T09:24:07Z","timestamp":1660037047000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.cambridge.org\/core\/product\/identifier\/S1351324921000152\/type\/journal_article"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,7,2]]},"references-count":189,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2022,9]]}},"alternative-id":["S1351324921000152"],"URL":"https:\/\/doi.org\/10.1017\/s1351324921000152","relation":{},"ISSN":["1351-3249","1469-8110"],"issn-type":[{"value":"1351-3249","type":"print"},{"value":"1469-8110","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,7,2]]},"assertion":[{"value":"\u00a9 The Author(s), 2021. Published by Cambridge University Press","name":"copyright","label":"Copyright","group":{"name":"copyright_and_licensing","label":"Copyright and Licensing"}},{"value":"This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http:\/\/creativecommons.org\/licenses\/by\/4.0\/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.","name":"license","label":"License","group":{"name":"copyright_and_licensing","label":"Copyright and Licensing"}},{"value":"This content has been made available to all.","name":"free","label":"Free to read"}]}}