{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,1]],"date-time":"2026-04-01T18:46:22Z","timestamp":1775069182488,"version":"3.50.1"},"reference-count":64,"publisher":"World Scientific Pub Co Pte Ltd","issue":"04","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Vietnam J. Comp. Sci."],"published-print":{"date-parts":[[2023,11]]},"abstract":"<jats:p> This paper presents an approach to improve file fragment classification by proposing new features for classification and evaluating them on a dataset that includes both low- and high-entropy file fragments. High-entropy fragments, which belong to compressed and encrypted files, are particularly challenging to classify because they lack exploitable patterns. To address this challenge, the proposed feature vectors are constructed based on the byte frequency distribution (BFD) of file fragments, along with discrete Fourier transform coefficients and several randomness measures. These feature vectors are tested using three machine learning models: support vector machines (SVMs), artificial neural networks (ANNs), and random forests (RFs). The proposed approach is evaluated on the govdocs1 dataset, which is freely available and widely used in this field, to enable reproducibility and fair comparison with other published research. The results show that the proposed approach outperforms existing methods and achieves better classification accuracy for both low- and high-entropy file fragments. 
<\/jats:p>","DOI":"10.1142\/s2196888823500070","type":"journal-article","created":{"date-parts":[[2023,7,28]],"date-time":"2023-07-28T16:02:41Z","timestamp":1690560161000},"page":"433-462","source":"Crossref","is-referenced-by-count":10,"title":["Classification of Low- and High-Entropy File Fragments Using Randomness Measures and Discrete Fourier Transform Coefficients"],"prefix":"10.1142","volume":"10","author":[{"given":"Kristian","family":"Skra\u010di\u0107","sequence":"first","affiliation":[{"name":"Department of Electronic Systems and Information Processing, University of Zagreb Faculty of Electrical Engineering and Computing, Unska 3, 10000 Zagreb, Croatia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Juraj","family":"Petrovi\u0107","sequence":"additional","affiliation":[{"name":"Department of Electronic Systems and Information Processing, University of Zagreb Faculty of Electrical Engineering and Computing, Unska 3, 10000 Zagreb, Croatia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Predrag","family":"Pale","sequence":"additional","affiliation":[{"name":"Department of Electronic Systems and Information Processing, University of Zagreb Faculty of Electrical Engineering and Computing, Unska 3, 10000 Zagreb, Croatia"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"219","published-online":{"date-parts":[[2023,7,28]]},"reference":[{"key":"S2196888823500070BIB001","first-page":"475","volume-title":"2013 Eighth Int. Conf. 
Availability, Reliability and Security (ARES)","author":"Poisel R."},{"key":"S2196888823500070BIB002","doi-asserted-by":"publisher","DOI":"10.1109\/TIFS.2019.2897912"},{"key":"S2196888823500070BIB003","doi-asserted-by":"publisher","DOI":"10.1016\/j.diin.2010.05.003"},{"key":"S2196888823500070BIB004","doi-asserted-by":"publisher","DOI":"10.1016\/j.diin.2009.06.016"},{"key":"S2196888823500070BIB005","doi-asserted-by":"publisher","DOI":"10.1109\/TIFS.2018.2823697"},{"key":"S2196888823500070BIB006","first-page":"S69","volume":"10","author":"Roussev V.","year":"2013","journal-title":"Proc. Thirteen. Annu. DFRWS Conf. Annu. Digit. Forensics Res. Conf."},{"key":"S2196888823500070BIB007","doi-asserted-by":"publisher","DOI":"10.1109\/TIFS.2013.2274728"},{"key":"S2196888823500070BIB008","first-page":"105","volume-title":"2014 IEEE Int. Conf. Bioinformatics and Bioengineering (BIBE)","author":"Qiu W."},{"key":"S2196888823500070BIB009","doi-asserted-by":"publisher","DOI":"10.1016\/j.dsp.2020.102952"},{"key":"S2196888823500070BIB010","first-page":"S44","volume":"9","author":"Fitzgerald S.","year":"2012","journal-title":"Proc. Twelfth Annu. DFRWS Conf. Annu. Digit. Forensics Res. Conf."},{"key":"S2196888823500070BIB011","doi-asserted-by":"publisher","DOI":"10.1109\/TIFS.2020.3004266"},{"key":"S2196888823500070BIB012","doi-asserted-by":"publisher","DOI":"10.1186\/s13104-019-4837-4"},{"key":"S2196888823500070BIB013","doi-asserted-by":"publisher","DOI":"10.1186\/s13104-019-4812-0"},{"key":"S2196888823500070BIB014","first-page":"1","volume":"31","author":"Li Z.","year":"2002","journal-title":"Chin. J. Electron."},{"key":"S2196888823500070BIB015","first-page":"S24","volume":"7","author":"Axelsson S.","year":"2010","journal-title":"Proc. Tenth Annu. 
DFRWS Conf."},{"key":"S2196888823500070BIB016","doi-asserted-by":"publisher","DOI":"10.1016\/j.diin.2013.08.004"},{"key":"S2196888823500070BIB017","doi-asserted-by":"publisher","DOI":"10.1145\/3231884.3231889"},{"key":"S2196888823500070BIB018","volume-title":"Proc. 36th Annual Hawaii Int. Conf. System Sciences","author":"McDaniel M."},{"key":"S2196888823500070BIB019","first-page":"68","volume-title":"Proc. 2011 10th Int. Conf. Machine Learning and Applications and Workshops","author":"Gopal S."},{"key":"S2196888823500070BIB020","first-page":"277","volume-title":"2018 10th Int. Conf. Electrical and Computer Engineering","author":"Bhat K."},{"key":"S2196888823500070BIB021","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-24212-0_5"},{"key":"S2196888823500070BIB023","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2015.09.156"},{"key":"S2196888823500070BIB024","first-page":"3371","volume":"11","author":"Vincent P.","year":"2010","journal-title":"J. Mach. Learn. Res."},{"key":"S2196888823500070BIB025","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-12148-7_12"},{"key":"S2196888823500070BIB026","first-page":"64","volume-title":"Proc. Sixth Annual IEEE SMC Information Assurance Workshop","author":"Li W.-J."},{"key":"S2196888823500070BIB027","doi-asserted-by":"publisher","DOI":"10.1002\/sec.553"},{"key":"S2196888823500070BIB028","first-page":"58","volume-title":"2010 IEEE Int. Conf. Intelligent Computing and Intelligent Systems","author":"Cao D."},{"key":"S2196888823500070BIB029","series-title":"Cryptology and Information Security Series","first-page":"129","volume-title":"A Systems Approach to Cyber Security","volume":"15","author":"Lee C.","year":"2017"},{"key":"S2196888823500070BIB030","first-page":"3","volume-title":"Fourth INT IEEE Workshop Systematic Approaches to Digital Forensic Eng.","author":"Roussev V."},{"key":"S2196888823500070BIB031","first-page":"1194","volume-title":"42nd Int. 
Convention on Information and Communication Technology, Electronics and Microelectronics","author":"Vulinovi\u0107 K."},{"issue":"1","key":"S2196888823500070BIB032","volume":"2","author":"Chang W.","year":"2010","journal-title":"J. Comput."},{"key":"S2196888823500070BIB033","doi-asserted-by":"publisher","DOI":"10.1002\/j.1538-7305.1948.tb01338.x"},{"key":"S2196888823500070BIB034","first-page":"S3","volume":"7","author":"Conti G.","year":"2010","journal-title":"Proc. Tenth Annu. DFRWS Conf."},{"key":"S2196888823500070BIB035","volume-title":"Proc. South Afr. Inf. Secur. Multi-Conf.","author":"Li Q.","year":"2010"},{"key":"S2196888823500070BIB036","volume-title":"Proc. 2011 Sixth Int. Conf. Availability, Reliability and Security","author":"Sportiello L."},{"key":"S2196888823500070BIB037","first-page":"67","volume":"25","author":"Nguyen K. T.","year":"2017","journal-title":"Log. J."},{"key":"S2196888823500070BIB038","doi-asserted-by":"publisher","DOI":"10.1016\/j.future.2021.09.019"},{"key":"S2196888823500070BIB039","first-page":"1","volume-title":"Proc. SoutheastCon 2015","author":"Alamri N. 
S."},{"key":"S2196888823500070BIB040","doi-asserted-by":"publisher","DOI":"10.1016\/j.diin.2017.06.004"},{"key":"S2196888823500070BIB041","doi-asserted-by":"publisher","DOI":"10.1109\/CITSM.2014.7042177"},{"key":"S2196888823500070BIB042","doi-asserted-by":"publisher","DOI":"10.1109\/SKG.2010.44"},{"key":"S2196888823500070BIB043","doi-asserted-by":"publisher","DOI":"10.1109\/JCN.2015.000091"},{"key":"S2196888823500070BIB044","doi-asserted-by":"publisher","DOI":"10.1109\/VPPC53923.2021.9699234"},{"key":"S2196888823500070BIB045","doi-asserted-by":"publisher","DOI":"10.1109\/TIT.1976.1055501"},{"key":"S2196888823500070BIB046","doi-asserted-by":"publisher","DOI":"10.2307\/2332226"},{"key":"S2196888823500070BIB048","doi-asserted-by":"publisher","DOI":"10.1109\/JPHOT.2013.2264276"},{"key":"S2196888823500070BIB049","doi-asserted-by":"publisher","DOI":"10.1214\/aoms\/1177731569"},{"key":"S2196888823500070BIB050","doi-asserted-by":"publisher","DOI":"10.1016\/S0045-7930(02)00083-X"},{"key":"S2196888823500070BIB051","doi-asserted-by":"publisher","DOI":"10.1109\/TBME.2006.883625"},{"key":"S2196888823500070BIB052","doi-asserted-by":"publisher","DOI":"10.1007\/s11749-016-0481-7"},{"key":"S2196888823500070BIB053","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2010.109"},{"key":"S2196888823500070BIB054","doi-asserted-by":"publisher","DOI":"10.1007\/BF00994018"},{"key":"S2196888823500070BIB055","doi-asserted-by":"publisher","DOI":"10.1023\/B:STCO.0000035301.49549.88"},{"key":"S2196888823500070BIB056","doi-asserted-by":"publisher","DOI":"10.1016\/S0167-8655(99)00087-2"},{"key":"S2196888823500070BIB057","first-page":"1103","volume-title":"Proc. Comput. Commun. IEEE Symp","author":"Amirani M. 
C."},{"key":"S2196888823500070BIB058","doi-asserted-by":"publisher","DOI":"10.1109\/5326.897072"},{"key":"S2196888823500070BIB059","doi-asserted-by":"publisher","DOI":"10.1016\/S0893-6080(98)00117-8"},{"key":"S2196888823500070BIB060","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2013.50"},{"key":"S2196888823500070BIB061","doi-asserted-by":"publisher","DOI":"10.1109\/JPROC.2017.2761740"},{"key":"S2196888823500070BIB062","doi-asserted-by":"publisher","DOI":"10.1109\/TNNLS.2014.2387439"},{"key":"S2196888823500070BIB063","doi-asserted-by":"publisher","DOI":"10.1177\/1094428107309326"},{"key":"S2196888823500070BIB064","first-page":"315","volume-title":"Proc. 14th Int. Conf. Artificial Intelligence and Statistics","author":"Glorot X.","year":"2011"},{"key":"S2196888823500070BIB065","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2005.10.010"},{"key":"S2196888823500070BIB067","doi-asserted-by":"publisher","DOI":"10.1002\/cpe.6154"}],"container-title":["Vietnam Journal of Computer Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S2196888823500070","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,12,18]],"date-time":"2023-12-18T08:14:41Z","timestamp":1702887281000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S2196888823500070"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,7,28]]},"references-count":64,"journal-issue":{"issue":"04","published-print":{"date-parts":[[2023,11]]}},"alternative-id":["10.1142\/S2196888823500070"],"URL":"https:\/\/doi.org\/10.1142\/s2196888823500070","relation":{},"ISSN":["2196-8888","2196-8896"],"issn-type":[{"value":"2196-8888","type":"print"},{"value":"2196-8896","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,7,28]]}}}