{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,4]],"date-time":"2026-04-04T21:14:55Z","timestamp":1775337295686,"version":"3.50.1"},"reference-count":44,"publisher":"Springer Science and Business Media LLC","issue":"6","license":[{"start":{"date-parts":[[2023,5,24]],"date-time":"2023-05-24T00:00:00Z","timestamp":1684886400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,5,24]],"date-time":"2023-05-24T00:00:00Z","timestamp":1684886400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100023674","name":"Deanship of Scientific Research, King Khalid University","doi-asserted-by":"crossref","award":["(RGP1\/357\/43)"],"award-info":[{"award-number":["(RGP1\/357\/43)"]}],"id":[{"id":"10.13039\/501100023674","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Complex Intell. Syst."],"published-print":{"date-parts":[[2023,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>With the rapid growth of social networks and technology, knowing what news to believe and what not to believe become a challenge in this digital era. Fake news is defined as provably erroneous information transmitted intending to defraud. This kind of misinformation poses a serious threat to social cohesion and well-being, since it fosters political polarisation and can destabilize trust in the government or the service provided. As a result, fake news detection has emerged as an important field of study, with the goal of identifying whether a certain piece of content is real or fake. In this paper, we propose a novel hybrid fake news detection system that combines a BERT-based (bidirectional encoder representations from transformers) with a light gradient boosting machine (LightGBM) model. We compare the performance of the proposed method to four different classification approaches using different word embedding techniques on three real-world fake news datasets to validate the performance of the proposed method compared to other methods. The proposed method is evaluated to detect fake news based on the headline-only or full text of the news content. The results show the superiority of the proposed method for fake news detection compared to many state-of-the-art methods.<\/jats:p>","DOI":"10.1007\/s40747-023-01098-0","type":"journal-article","created":{"date-parts":[[2023,5,24]],"date-time":"2023-05-24T05:01:51Z","timestamp":1684904511000},"page":"6581-6592","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":46,"title":["Fake news detection based on a hybrid BERT and LightGBM models"],"prefix":"10.1007","volume":"9","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3360-7285","authenticated-orcid":false,"given":"Ehab","family":"Essa","sequence":"first","affiliation":[]},{"given":"Karima","family":"Omar","sequence":"additional","affiliation":[]},{"given":"Ali","family":"Alqahtani","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,5,24]]},"reference":[{"issue":"6380","key":"1098_CR1","doi-asserted-by":"publisher","first-page":"1094","DOI":"10.1126\/science.aao2998","volume":"359","author":"DMJ Lazer","year":"2018","unstructured":"Lazer DMJ, Baum MA, Benkler Y, Berinsky AJ, Greenhill KM, Menczer F, Metzger MJ, Nyhan B, Pennycook G, Rothschild D, Schudson M, Sloman SA, Sunstein CR, Thorson EA, Watts DJ, Zittrain JL (2018) The science of fake news. Science 359(6380):1094\u20131096. https:\/\/doi.org\/10.1126\/science.aao2998","journal-title":"Science"},{"key":"1098_CR2","unstructured":"Weedon J, Nuland W, Stamos A (2017) Information operations and Facebook. Retrieved from Facebook: https:\/\/fbnewsroomus.files.wordpress.com\/2017\/04\/facebook-and-information-operations-v1.pdf, https:\/\/about.fb.com\/br\/wpcontent\/uploads\/sites\/3\/2017\/09\/facebook-and-information-operations-v1.pdf. Accessed 15 Aug 2022"},{"key":"1098_CR3","unstructured":"Gunther R, Beck PA, Nisbet EC (2018) Fake news may have contributed to trump\u2019s 2016 victory. Ohio state university, [Online]. https:\/\/www.documentcloud.org\/documents\/4429952-Fake-News-May-Have-Contributed-to-Trump-s-2016.html. Accessed 24 Aug 2022"},{"key":"1098_CR4","unstructured":"The Economic Times (2022) Fake news of Tesla acquiring lithium miner sent its stock up over 250%. [Online]. https:\/\/economictimes.indiatimes.com\/tech\/tech-bytes\/fake-news-of-tesla-acquiring-lithium-miner-sent-its-stock-up-over-250\/articleshow\/90835997.cms. Accessed 24 Aug 2022"},{"issue":"3","key":"1098_CR5","volume":"12","author":"R Kouzy","year":"2020","unstructured":"Kouzy R, Abi Jaoude J, Kraitem A, El Alam MB, Karam B, Adib E, Zarka J, Traboulsi C, Akl EW, Baddour K (2020) Coronavirus goes viral: quantifying the covid-19 misinformation epidemic on twitter. Cureus 12(3):e7255","journal-title":"Cureus"},{"key":"1098_CR6","doi-asserted-by":"publisher","first-page":"62097","DOI":"10.1109\/ACCESS.2022.3181184","volume":"10","author":"N Seddari","year":"2022","unstructured":"Seddari N, Derhab A, Belaoued M, Halboob W, Al-Muhtadi J, Bouras A (2022) A hybrid linguistic and knowledge-based analysis approach for fake news detection on social media. IEEE Access 10:62097\u201362109","journal-title":"IEEE Access"},{"key":"1098_CR7","doi-asserted-by":"crossref","unstructured":"Essa E, Xie X, Jones J-L (2015) Minimum s-excess graph for segmenting and tracking multiple borders with hmm. In: International conference on medical image computing and computer-assisted intervention, pp 28\u201335","DOI":"10.1007\/978-3-319-24571-3_4"},{"issue":"2","key":"1098_CR8","doi-asserted-by":"publisher","first-page":"2916","DOI":"10.1002\/cnm.2916","volume":"34","author":"E Essa","year":"2018","unstructured":"Essa E, Xie X (2018) Phase contrast cell detection using multilevel classification. Int J Numer Methods Biomed Eng 34(2):2916. https:\/\/doi.org\/10.1002\/cnm.2916","journal-title":"Int J Numer Methods Biomed Eng"},{"issue":"1","key":"1098_CR9","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s40537-021-00444-8","volume":"8","author":"L Alzubaidi","year":"2021","unstructured":"Alzubaidi L, Zhang J, Humaidi AJ, Al-Dujaili A, Duan Y, Al-Shamma O, Santamar\u00eda J, Fadhel MA, Al-Amidie M, Farhan L (2021) Review of deep learning: concepts, cnn architectures, challenges, applications, future directions. J Big Data 8(1):1\u201374","journal-title":"J Big Data"},{"key":"1098_CR10","doi-asserted-by":"publisher","unstructured":"Devlin J, Chang M-W, Lee K, Toutanova K (2019) BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol 1 (long and short papers), pp 4171\u20134186. https:\/\/doi.org\/10.18653\/v1\/N19-1423","DOI":"10.18653\/v1\/N19-1423"},{"key":"1098_CR11","unstructured":"Yang Z, Dai Z, Yang Y, Carbonell J, Salakhutdinov RR, Le QV (2019) Xlnet: generalized autoregressive pretraining for language understanding. In: Advances in neural information processing systems, vol 32, pp 5753\u20135763"},{"key":"1098_CR12","unstructured":"Liu Y, Ott M, Goyal N, Du J, Joshi M, Chen D, Levy O, Lewis M, Zettlemoyer L, Stoyanov V (2019) Roberta: a robustly optimized bert pretraining approach. arXiv preprint. arXiv:1907.11692"},{"key":"1098_CR13","doi-asserted-by":"crossref","unstructured":"Ahmed H, Traore I, Saad S (2017) Detection of online fake news using n-gram analysis and machine learning techniques. In: International conference on intelligent, secure, and dependable systems in distributed and cloud environments, pp 127\u2013138","DOI":"10.1007\/978-3-319-69155-8_9"},{"key":"1098_CR14","unstructured":"Wijeratne Y (2021) How much bullshit do we need? benchmarking classical machine learning for fake news classification. LIRNEasia, [Online]. https:\/\/lirneasia.net\/2021\/07\/how-much-bullshit-do-we-need-benchmarking-classical-machine-learning-for-fake-news-classification\/. Accessed 24 Aug 2022"},{"key":"1098_CR15","doi-asserted-by":"publisher","first-page":"123174","DOI":"10.1016\/j.physa.2019.123174","volume":"540","author":"FA Ozbay","year":"2020","unstructured":"Ozbay FA, Alatas B (2020) Fake news detection within online social media using supervised artificial intelligence algorithms. Phys A 540:123174. https:\/\/doi.org\/10.1016\/j.physa.2019.123174","journal-title":"Phys A"},{"issue":"26","key":"1098_CR16","doi-asserted-by":"publisher","first-page":"34333","DOI":"10.1007\/s11042-021-11006-8","volume":"80","author":"FA Ozbay","year":"2021","unstructured":"Ozbay FA, Alatas B (2021) Adaptive salp swarm optimization algorithms with inertia weights for novel fake news detection model in online social media. Multimed Tools Appl 80(26):34333\u201334357","journal-title":"Multimed Tools Appl"},{"key":"1098_CR17","doi-asserted-by":"publisher","DOI":"10.1080\/19361610.2021.1963605","author":"A Kansal","year":"2021","unstructured":"Kansal A (2021) Fake news detection using pos tagging and machine learning. J Appl Secur Res. https:\/\/doi.org\/10.1080\/19361610.2021.1963605","journal-title":"J Appl Secur Res"},{"issue":"1","key":"1098_CR18","doi-asserted-by":"publisher","first-page":"100007","DOI":"10.1016\/j.jjimei.2020.100007","volume":"1","author":"JA Nasir","year":"2021","unstructured":"Nasir JA, Khan OS, Varlamis I (2021) Fake news detection: a hybrid cnn-rnn based deep learning approach. Int J Inf Manag Data Insights 1(1):100007. https:\/\/doi.org\/10.1016\/j.jjimei.2020.100007","journal-title":"Int J Inf Manag Data Insights"},{"key":"1098_CR19","doi-asserted-by":"publisher","first-page":"32","DOI":"10.1016\/j.cogsys.2019.12.005","volume":"61","author":"RK Kaliyar","year":"2020","unstructured":"Kaliyar RK, Goswami A, Narang P, Sinha S (2020) Fndnet\u2014a deep convolutional neural network for fake news detection. Cogn Syst Res 61:32\u201344. https:\/\/doi.org\/10.1016\/j.cogsys.2019.12.005","journal-title":"Cogn Syst Res"},{"issue":"3","key":"1098_CR20","doi-asserted-by":"publisher","first-page":"396","DOI":"10.1016\/j.icte.2021.10.003","volume":"8","author":"IK Sastrawan","year":"2022","unstructured":"Sastrawan IK, Bayupati IPA, Arsa DMS (2022) Detection of fake news using deep learning cnn-rnn based methods. ICT Express 8(3):396\u2013408. https:\/\/doi.org\/10.1016\/j.icte.2021.10.003","journal-title":"ICT Express"},{"key":"1098_CR21","doi-asserted-by":"publisher","first-page":"129471","DOI":"10.1109\/ACCESS.2021.3112806","volume":"9","author":"H Saleh","year":"2021","unstructured":"Saleh H, Alharbi A, Alsamhi SH (2021) Opcnn-fake: optimized convolutional neural network for fake news detection. IEEE Access 9:129471\u2013129489. https:\/\/doi.org\/10.1109\/ACCESS.2021.3112806","journal-title":"IEEE Access"},{"key":"1098_CR22","unstructured":"Yang Y, Zheng L, Zhang J, Cui Q, Li Z, Yu PS (2018) TI-CNN: convolutional neural networks for fake news detection. arXiv preprint. arXiv:1806.00749"},{"issue":"11","key":"1098_CR23","doi-asserted-by":"publisher","first-page":"8132","DOI":"10.1007\/s10489-021-02345-y","volume":"51","author":"C Raj","year":"2021","unstructured":"Raj C, Meel P (2021) Convnet frameworks for multi-modal fake news detection. Appl Intell 51(11):8132\u20138148","journal-title":"Appl Intell"},{"key":"1098_CR24","doi-asserted-by":"crossref","unstructured":"Sch\u00fctz M, Schindler A, Siegel M, Nazemi K (2021) Automatic fake news detection with pre-trained transformer models. In: International conference on pattern recognition, pp 627\u2013641","DOI":"10.1007\/978-3-030-68787-8_45"},{"issue":"8","key":"1098_CR25","doi-asserted-by":"publisher","first-page":"11765","DOI":"10.1007\/s11042-020-10183-2","volume":"80","author":"RK Kaliyar","year":"2021","unstructured":"Kaliyar RK, Goswami A, Narang P (2021) Fakebert: fake news detection in social media with a Bert-based deep learning approach. Multimed Tools Appl 80(8):11765\u201311788","journal-title":"Multimed Tools Appl"},{"issue":"4","key":"1098_CR26","doi-asserted-by":"publisher","first-page":"569","DOI":"10.3390\/math10040569","volume":"10","author":"C-O Truic\u0103","year":"2022","unstructured":"Truic\u0103 C-O, Apostol E-S (2022) Misrob\u00e6rta: transformers versus misinformation. Mathematics 10(4):569","journal-title":"Mathematics"},{"key":"1098_CR27","doi-asserted-by":"publisher","unstructured":"Qazi M, Khan MUS, Ali M (2020) Detection of fake news using transformer model. In: 2020 3rd international conference on computing, mathematics and engineering technologies (iCoMET), pp 1\u20136. https:\/\/doi.org\/10.1109\/iCoMET48670.2020.9074071","DOI":"10.1109\/iCoMET48670.2020.9074071"},{"key":"1098_CR28","doi-asserted-by":"publisher","first-page":"101164","DOI":"10.1016\/j.csl.2020.101164","volume":"66","author":"Y Arase","year":"2021","unstructured":"Arase Y, Tsujii J (2021) Transfer fine-tuning of BERT with phrasal paraphrases. Comput Speech Lang 66:101164","journal-title":"Comput Speech Lang"},{"key":"1098_CR29","doi-asserted-by":"crossref","unstructured":"Williams A, Nangia N, Bowman SR (2017) A broad-coverage challenge corpus for sentence understanding through inference. arXiv preprint. arXiv:1704.05426","DOI":"10.18653\/v1\/N18-1101"},{"key":"1098_CR30","unstructured":"Storks S, Gao Q, Chai JY (2019) Recent advances in natural language inference: a survey of benchmarks, resources, and approaches. arXiv preprint. arXiv:1904.01172"},{"key":"1098_CR31","unstructured":"Souza F, Nogueira R, Lotufo R (2019) Portuguese named entity recognition using bert-crf. arXiv preprint. arXiv:1909.10649"},{"key":"1098_CR32","doi-asserted-by":"crossref","unstructured":"Yang W, Xie Y, Lin A, Li X, Tan L, Xiong K, Li M, Lin J (2019) End-to-end open-domain question answering with bertserini. arXiv preprint. arXiv:1902.01718","DOI":"10.18653\/v1\/N19-4013"},{"key":"1098_CR33","doi-asserted-by":"crossref","unstructured":"Yang Z, Garcia N, Chu C, Otani M, Nakashima Y, Takemura H (2020) Bert representations for video question answering. In: Proceedings of the IEEE\/CVF Winter conference on applications of computer vision (WACV)","DOI":"10.1109\/WACV45572.2020.9093596"},{"key":"1098_CR34","unstructured":"Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser \u0141, Polosukhin I (2017) Attention is all you need. In: Advances in neural information processing systems, vol 30"},{"issue":"5","key":"1098_CR35","doi-asserted-by":"publisher","first-page":"1189","DOI":"10.1214\/aos\/1013203451","volume":"29","author":"JH Friedman","year":"2001","unstructured":"Friedman JH (2001) Greedy function approximation: a gradient boosting machine. Ann Stat 29(5):1189\u20131232","journal-title":"Ann Stat"},{"key":"1098_CR36","unstructured":"Ke G, Meng Q, Finley T, Wang T, Chen W, Ma W, Ye Q, Liu T-Y (2017) Lightgbm: a highly efficient gradient boosting decision tree. In: Advances in neural information processing systems, vol 30"},{"key":"1098_CR37","doi-asserted-by":"crossref","unstructured":"Chen T, Guestrin C (2016) Xgboost: a scalable tree boosting system. In: Proceedings of the 22nd Acm Sigkdd international conference on knowledge discovery and data mining, pp 785\u2013794","DOI":"10.1145\/2939672.2939785"},{"key":"1098_CR38","unstructured":"Szpakowski M (2020) Fake News Corpus. [Online]. https:\/\/github.com\/several27\/FakeNewsCorpus. Accessed 24 Aug 2022"},{"key":"1098_CR39","unstructured":"Ramos J (2003) Using tf-idf to determine word relevance in document queries. In: Proceedings of the first instructional conference on machine learning, vol 242, pp 29\u201348"},{"key":"1098_CR40","doi-asserted-by":"crossref","unstructured":"Pennington J, Socher R, Manning CD (2014) Glove: global vectors for word representation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pp 1532\u20131543","DOI":"10.3115\/v1\/D14-1162"},{"key":"1098_CR41","first-page":"2825","volume":"12","author":"F Pedregosa","year":"2011","unstructured":"Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E (2011) Scikit-learn: machine learning in Python. J Mach Learn Res 12:2825\u20132830","journal-title":"J Mach Learn Res"},{"key":"1098_CR42","doi-asserted-by":"crossref","unstructured":"Wolf T, Debut L, Sanh V, Chaumond J, Delangue C, Moi A, Cistac P, Rault T, Louf R, Funtowicz M (2020) Transformers: state-of-the-art natural language processing. In: Proceedings of the 2020 conference on empirical methods in natural language processing: system demonstrations, pp 38\u201345","DOI":"10.18653\/v1\/2020.emnlp-demos.6"},{"key":"1098_CR43","unstructured":"Smith LN (2018) A disciplined approach to neural network hyper-parameters: part 1\u2014learning rate, batch size, momentum, and weight decay. arXiv preprint. arXiv:1803.09820"},{"key":"1098_CR44","doi-asserted-by":"crossref","unstructured":"Akiba T, Sano S, Yanase T, Ohta T, Koyama M (2019) Optuna: a next-generation hyperparameter optimization framework. In: Proceedings of the 25th ACM SIGKDD International conference on knowledge discovery and data mining, pp 2623\u20132631","DOI":"10.1145\/3292500.3330701"}],"container-title":["Complex &amp; Intelligent Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-023-01098-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s40747-023-01098-0\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-023-01098-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,10,27]],"date-time":"2023-10-27T19:19:33Z","timestamp":1698434373000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s40747-023-01098-0"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,5,24]]},"references-count":44,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2023,12]]}},"alternative-id":["1098"],"URL":"https:\/\/doi.org\/10.1007\/s40747-023-01098-0","relation":{},"ISSN":["2199-4536","2198-6053"],"issn-type":[{"value":"2199-4536","type":"print"},{"value":"2198-6053","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,5,24]]},"assertion":[{"value":"1 November 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"1 May 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"24 May 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}