{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,11]],"date-time":"2026-03-11T03:35:51Z","timestamp":1773200151613,"version":"3.50.1"},"reference-count":31,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2021,11,1]],"date-time":"2021-11-01T00:00:00Z","timestamp":1635724800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Asian Low-Resour. Lang. Inf. Process."],"published-print":{"date-parts":[[2022,1,31]]},"abstract":"<jats:p>\n            Nowadays, broadcasting news on social media and websites has grown at a swifter pace, which has had negative impacts on both the general public and governments; hence, this has urged us to build a fake news detection system. Contextualized word embeddings have achieved great success in recent years due to their power to embed both syntactic and semantic features of textual contents. In this article, we aim to address the problem of the lack of fake news datasets in Persian by introducing a new dataset crawled from different news agencies, and propose two deep models based on the Bidirectional Encoder Representations from Transformers model (BERT), which is a deep contextualized pre-trained model for extracting valuable features. In our proposed models, we benefit from two different settings of BERT, namely pool-based representation, which provides a representation for the whole document, and sequence representation, which provides a representation for each token of the document. In the former one, we connect a Single Layer Perceptron (SLP) to the BERT to use the embedding directly for detecting fake news. The latter one uses Convolutional Neural Network (CNN) after the BERT\u2019s embedding layer to extract extra features based on the collocation of words in a corpus. Furthermore, we present the TAJ dataset, which is a new Persian fake news dataset crawled from news agencies\u2019 websites. We evaluate our proposed models on the newly provided TAJ dataset as well as the two different Persian rumor datasets as baselines. The results indicate the effectiveness of using deep contextualized embedding approaches for the fake news detection task. We also show that both BERT-SLP and BERT-CNN models achieve superior performance to the previous baselines and traditional machine learning models, with 15.58% and 17.1% improvement compared to the reported results by Zamani et\u00a0al. [\n            <jats:xref ref-type=\"bibr\">30<\/jats:xref>\n            ], and 11.29% and 11.18% improvement compared to the reported results by Jahanbakhsh-Nagadeh et\u00a0al. [\n            <jats:xref ref-type=\"bibr\">9<\/jats:xref>\n            ].\n          <\/jats:p>","DOI":"10.1145\/3472620","type":"journal-article","created":{"date-parts":[[2021,11,1]],"date-time":"2021-11-01T13:54:37Z","timestamp":1635774877000},"page":"1-11","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":9,"title":["Persian Fake News Detection: Neural Representation and Classification at Word and Text Levels"],"prefix":"10.1145","volume":"21","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-9297-1693","authenticated-orcid":false,"given":"Mohammadreza","family":"Samadi","sequence":"first","affiliation":[{"name":"Department of Computer Engineering, Amirkabir University of Technology, Tehran, Iran"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Maryam","family":"Mousavian","sequence":"additional","affiliation":[{"name":"Department of Computer Engineering, Amirkabir University of Technology, Tehran, Iran"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8110-1342","authenticated-orcid":false,"given":"Saeedeh","family":"Momtazi","sequence":"additional","affiliation":[{"name":"Department of Computer Engineering, Amirkabir University of Technology, Tehran, Iran"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2021,11]]},"reference":[{"key":"e_1_3_2_2_2","first-page":"127","volume-title":"International Conference on Intelligent, Secure, and Dependable Systems in Distributed and Cloud Environments","author":"Ahmed Hadeer","year":"2017","unstructured":"Hadeer Ahmed, Issa Traore, and Sherif Saad. 2017. Detection of online fake news using n-gram analysis and machine learning techniques. In International Conference on Intelligent, Secure, and Dependable Systems in Distributed and Cloud Environments. Springer, 127\u2013138."},{"key":"e_1_3_2_3_2","doi-asserted-by":"publisher","DOI":"10.1002\/spy2.9"},{"key":"e_1_3_2_4_2","doi-asserted-by":"publisher","DOI":"10.1613\/jair.953"},{"key":"e_1_3_2_5_2","first-page":"4171","volume-title":"Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers)","author":"Devlin Jacob","year":"2019","unstructured":"Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, Minneapolis, MN, 4171\u20134186. DOI:https:\/\/doi.org\/10.18653\/v1\/N19-1423"},{"key":"e_1_3_2_6_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.asoc.2020.106991"},{"key":"e_1_3_2_7_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2020.102418"},{"key":"e_1_3_2_8_2","doi-asserted-by":"crossref","first-page":"900","DOI":"10.1109\/UKRCON.2017.8100379","volume-title":"2017 IEEE 1st Ukraine Conference on Electrical and Computer Engineering (UKRCON\u201917)","author":"Granik Mykhailo","year":"2017","unstructured":"Mykhailo Granik and Volodymyr Mesyura. 2017. Fake news detection using naive Bayes classifier. In 2017 IEEE 1st Ukraine Conference on Electrical and Computer Engineering (UKRCON\u201917). IEEE, 900\u2013903."},{"key":"e_1_3_2_9_2","volume-title":"Human Language Technology: Proceedings of a Workshop held at Plainsboro, New Jersey","author":"Isotani Ryosuke","year":"1994","unstructured":"Ryosuke Isotani and Shoichi Matsunaga. 1994. Speech recognition using a stochastic language model integrating local and global constraints. In Human Language Technology: Proceedings of a Workshop held at Plainsboro, New Jersey. https:\/\/www.aclweb.org\/anthology\/H94-1015."},{"key":"e_1_3_2_10_2","article-title":"A model to measure the spread power of rumors","author":"Jahanbakhsh-Nagadeh Zoleikha","year":"2020","unstructured":"Zoleikha Jahanbakhsh-Nagadeh, Mohammad-Reza Feizi-Derakhshi, Majid Ramezani, Taymaz Rahkar-Farshi, Meysam Asgari-Chenaghlu, Narjes Nikzad-Khasmakhi, Ali-Reza Feizi-Derakhshi, Mehrdad Ranjbar-Khadivi, Elnaz Zafarani-Moattar, and Mohammad-Ali Balafar. 2020. A model to measure the spread power of rumors. arXiv preprint arXiv:2002.07563 (2020).","journal-title":"arXiv preprint arXiv:2002.07563"},{"key":"e_1_3_2_11_2","doi-asserted-by":"crossref","first-page":"2915","DOI":"10.1145\/3308558.3313552","volume-title":"The World Wide Web Conference","author":"Khattar Dhruv","year":"2019","unstructured":"Dhruv Khattar, Jaipal Singh Goud, Manish Gupta, and Vasudeva Varma. 2019. Mvae: Multimodal variational autoencoder for fake news detection. In The World Wide Web Conference. 2915\u20132921."},{"key":"e_1_3_2_12_2","article-title":"Text summarization with pretrained encoders","author":"Liu Yang","year":"2019","unstructured":"Yang Liu and Mirella Lapata. 2019. Text summarization with pretrained encoders. arXiv preprint arXiv:1908.08345 (2019).","journal-title":"arXiv preprint arXiv:1908.08345"},{"key":"e_1_3_2_13_2","first-page":"1","volume-title":"2019 28th International Conference on Computer Communication and Networks (ICCCN\u201919)","author":"Liu Zhan","year":"2019","unstructured":"Zhan Liu, Shaban Shabani, Nicole Glassey Balet, and Maria Sokhn. 2019. Detection of satiric news on social media: Analysis of the phenomenon with a French dataset. In 2019 28th International Conference on Computer Communication and Networks (ICCCN\u201919). 1\u20136."},{"key":"e_1_3_2_14_2","doi-asserted-by":"crossref","first-page":"597","DOI":"10.1109\/ISTEL.2018.8661007","volume-title":"2018 9th International Symposium on Telecommunications (IST\u201918)","author":"Mahmoodabad Sajjad Dehghani","year":"2018","unstructured":"Sajjad Dehghani Mahmoodabad, Saeed Farzi, and Danial Bidekani Bakhtiarvand. 2018. Persian rumor detection on twitter. In 2018 9th International Symposium on Telecommunications (IST\u201918). IEEE, 597\u2013602."},{"key":"e_1_3_2_15_2","first-page":"3111","volume-title":"Advances in Neural Information Processing Systems","author":"Mikolov Tomas","year":"2013","unstructured":"Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S. Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality. In Advances in Neural Information Processing Systems. 3111\u20133119."},{"key":"e_1_3_2_16_2","doi-asserted-by":"publisher","DOI":"10.1145\/1835449.1835619"},{"key":"e_1_3_2_17_2","first-page":"1","volume-title":"2019 Artificial Intelligence for Transforming Business and Society","author":"Munikar M.","year":"2019","unstructured":"M. Munikar, S. Shakya, and A. Shrestha. 2019. Fine-grained sentiment classification using BERT. In 2019 Artificial Intelligence for Transforming Business and Society (AITB\u201919), Vol. 1. 1\u20135."},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1162"},{"key":"e_1_3_2_19_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N18-1202"},{"key":"e_1_3_2_20_2","doi-asserted-by":"publisher","DOI":"10.1145\/290941.291008"},{"key":"e_1_3_2_21_2","unstructured":"Jes\u00fas Reyes and Leon Palafox. 2019. Detection of fake news based on readability. In Proceedings of Reuni\u00f3n Internacional de Inteligencia Artificial y sus Aplicaciones ."},{"key":"e_1_3_2_22_2","doi-asserted-by":"publisher","DOI":"10.1002\/pra2.2015.145052010083"},{"key":"e_1_3_2_23_2","article-title":"Hyperpartisan Facebook pages are publishing false and misleading information at an alarming rate","volume":"20","author":"Silverman Craig","year":"2016","unstructured":"Craig Silverman, Lauren Strapagiel, Hamza Shaban, Ellie Hall, and Jeremy Singer-Vine. 2016. Hyperpartisan Facebook pages are publishing false and misleading information at an alarming rate. Buzzfeed News 20 (2016).","journal-title":"Buzzfeed News"},{"key":"e_1_3_2_24_2","unstructured":"Ehsan Taher Seyed Abbas Hoseini and Mehrnoush Shamsfard. 2019. Beheshti-NER: Persian named entity recognition using BERT. In Proceedings of International Conference on Natural Language and Speech Processing Trento Italy ."},{"key":"e_1_3_2_25_2","first-page":"61","volume-title":"Proceedings of the ACL-08: HLT Student Research Workshop","author":"Trnka Keith","year":"2008","unstructured":"Keith Trnka. 2008. Adaptive language modeling for word prediction. In Proceedings of the ACL-08: HLT Student Research Workshop. Association for Computational Linguistics, Columbus, OH, 61\u201366. https:\/\/www.aclweb.org\/anthology\/P08-3011."},{"key":"e_1_3_2_26_2","doi-asserted-by":"publisher","DOI":"10.1145\/3316809"},{"key":"e_1_3_2_27_2","first-page":"288","volume-title":"International Conference on Theory and Practice of Digital Libraries","author":"Vogel Inna","year":"2019","unstructured":"Inna Vogel and Peter Jiang. 2019. Fake news detection with the new German dataset \u201cGermanFakeNC.\u201d In International Conference on Theory and Practice of Digital Libraries. Springer, 288\u2013295."},{"key":"e_1_3_2_28_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P17-2067"},{"key":"e_1_3_2_29_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-1599"},{"key":"e_1_3_2_30_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i05.6479"},{"key":"e_1_3_2_31_2","doi-asserted-by":"crossref","first-page":"1532","DOI":"10.1109\/IranianCEE.2017.7985287","volume-title":"2017 Iranian Conference on Electrical Engineering (ICEE\u201917)","author":"Zamani Somayeh","year":"2017","unstructured":"Somayeh Zamani, Masoud Asadpour, and Dara Moazzami. 2017. Rumor detection for persian tweets. In 2017 Iranian Conference on Electrical Engineering (ICEE\u201917). IEEE, 1532\u20131536."},{"key":"e_1_3_2_32_2","doi-asserted-by":"crossref","unstructured":"Majid Zarharan Samane Ahangar Fateme Rezvaninejad Mahdi Bidhendi Mohammad Taher Pilevar Behrouz Minaei and Sauleh Eetemadi. 2019. Persian stance classification data set. In Conference For Truth and Trust Online . DOI:https:\/\/doi.org\/10.36370\/tto.2019.30","DOI":"10.36370\/tto.2019.30"}],"container-title":["ACM Transactions on Asian and Low-Resource Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3472620","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3472620","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:17:24Z","timestamp":1750191444000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3472620"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,11]]},"references-count":31,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2022,1,31]]}},"alternative-id":["10.1145\/3472620"],"URL":"https:\/\/doi.org\/10.1145\/3472620","relation":{},"ISSN":["2375-4699","2375-4702"],"issn-type":[{"value":"2375-4699","type":"print"},{"value":"2375-4702","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,11]]},"assertion":[{"value":"2020-11-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-06-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-11-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}