{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,25]],"date-time":"2026-02-25T17:10:15Z","timestamp":1772039415465,"version":"3.50.1"},"reference-count":36,"publisher":"MDPI AG","issue":"3","license":[{"start":{"date-parts":[[2022,9,16]],"date-time":"2022-09-16T00:00:00Z","timestamp":1663286400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"SAUDI ARAMCO Cybersecurity Chair at Imam Abdulrahman Bin Faisal University, Saudi Arabia"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["JSAN"],"abstract":"<jats:p>Phishing is still a major security threat in cyberspace. In phishing, attackers steal critical information from victims by presenting a spoofing\/fake site that appears to be a visual clone of a legitimate site. Several Unicode characters are visually identical to ASCII characters. This similarity in characters is generally known as homoglyphs. Malicious adversaries utilize homoglyphs in URLs and DNS domains to target organizations. To reduce the risks caused by phishing attacks, effective ways of detecting phishing websites are urgently required. This paper proposes a homoglyph attack detection model that combines a hash function and machine learning. There are two phases to the model approach. The machine was being trained during the development phase. The deployment phase involved deploying the model with a Java interface and testing the outcomes through actual user interaction. The results are more accurate when the URL is hashed, as any little changes to the URL can be recognized. The homoglyph detector can be developed as a stand-alone software that is used as the initial step in requesting a webpage as it enhances browser security and protects websites from phishing attempts. 
To verify the effectiveness, we compared the proposed model on several criteria to existing phishing detection methods. By using the hash function, the proposed security features increase the overall security of the homoglyph attack detection in terms of accuracy, integrity, and availability. The experiment results showed that the model can detect phishing sites with an accuracy of 99.8% using Random Forest, and the hash function improves the accuracy of homoglyph attack detection.<\/jats:p>","DOI":"10.3390\/jsan11030054","type":"journal-article","created":{"date-parts":[[2022,9,19]],"date-time":"2022-09-19T02:06:50Z","timestamp":1663553210000},"page":"54","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":15,"title":["Homoglyph Attack Detection Model Using Machine Learning and Hash Function"],"prefix":"10.3390","volume":"11","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-2004-5324","authenticated-orcid":false,"given":"Abdullah M.","family":"Almuhaideb","sequence":"first","affiliation":[{"name":"SAUDI ARAMCO Cybersecurity Chair, Department of Networks and Communications, College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, P.O. Box 1982, Dammam 31441, Saudi Arabia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1619-5733","authenticated-orcid":false,"given":"Nida","family":"Aslam","sequence":"additional","affiliation":[{"name":"Department of Computer Science, College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, P.O. Box 1982, Dammam 31441, Saudi Arabia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Almaha","family":"Alabdullatif","sequence":"additional","affiliation":[{"name":"Department of Computer Science, College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, P.O. 
Box 1982, Dammam 31441, Saudi Arabia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sarah","family":"Altamimi","sequence":"additional","affiliation":[{"name":"Department of Computer Science, College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, P.O. Box 1982, Dammam 31441, Saudi Arabia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shooq","family":"Alothman","sequence":"additional","affiliation":[{"name":"Department of Computer Science, College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, P.O. Box 1982, Dammam 31441, Saudi Arabia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Amnah","family":"Alhussain","sequence":"additional","affiliation":[{"name":"Department of Computer Science, College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, P.O. Box 1982, Dammam 31441, Saudi Arabia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Waad","family":"Aldosari","sequence":"additional","affiliation":[{"name":"Department of Computer Science, College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, P.O. Box 1982, Dammam 31441, Saudi Arabia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8262-5025","authenticated-orcid":false,"given":"Shikah J.","family":"Alsunaidi","sequence":"additional","affiliation":[{"name":"Department of Computer Science, College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, P.O. 
Box 1982, Dammam 31441, Saudi Arabia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3975-5879","authenticated-orcid":false,"given":"Khalid A.","family":"Alissa","sequence":"additional","affiliation":[{"name":"SAUDI ARAMCO Cybersecurity Chair, Department of Networks and Communications, College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, P.O. Box 1982, Dammam 31441, Saudi Arabia"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2022,9,16]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Woodbridge, J., Anderson, H.S., Ahuja, A., and Grant, D. (2018, January 24). Detecting Homoglyph Attacks with a Siamese Neural Network. Proceedings of the 2018 IEEE Security and Privacy Workshops (SPW), San Francisco, CA, USA.","DOI":"10.1109\/SPW.2018.00012"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Elsayed, Y., and Shosha, A. (2018, January 15\u201317). Large scale detection of IDN domain name masquerading. Proceedings of the 2018 APWG Symposium on Electronic Crime Research (eCrime), San Diego, CA, USA.","DOI":"10.1109\/ECRIME.2018.8376212"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Thao, T.P., Sawaya, Y., Nguyen-Son, H.-Q., Yamada, A., Kubota, A., Van Sang, T., and Yamaguchi, R.S. (2020). Human Factors in Homograph Attack Recognition. Applied Cryptography and Network Security, Springer.","DOI":"10.1007\/978-3-030-57878-7_20"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Simpson, G., Moore, T., and Clayton, R. (2020, January 16\u201319). Ten years of attacks on companies using visual impersonation of domain names. 
Proceedings of the 2020 APWG Symposium on Electronic Crime Research (eCrime), Boston, MA, USA.","DOI":"10.1109\/eCrime51433.2020.9493251"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1493","DOI":"10.1587\/transinf.2019ICP0002","article-title":"DomainScouter: Analyzing the Risks of Deceptive Internationalized Domain Names","volume":"E103.D","author":"Chiba","year":"2020","journal-title":"IEICE Trans. Inf. Syst."},{"key":"ref_6","unstructured":"(2022, July 18). Summary of the Phishing and Attempted Stealing Incident on Binance\u2013Binance. Available online: https:\/\/support.binance.com\/hc\/en-us\/articles\/360001547431-Summary-of-the-Phishing-and-Attempted-Stealing-Incident-on-Binance."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Helfrich, J.N., and Neff, R. (2012, January 23\u201324). Dual canonicalization: An answer to the homograph attack. Proceedings of the 2012 eCrime Researchers Summit, Las Croabas, PR, USA.","DOI":"10.1109\/eCrime.2012.6489517"},{"key":"ref_8","first-page":"6","article-title":"A Novel String Matching Algorithm and Comparison with KMP Algorithm","volume":"179","author":"Pandey","year":"2017","journal-title":"Int. J. Comput. Appl."},{"key":"ref_9","first-page":"4732","article-title":"A Study on String Matching Methodologies","volume":"5","author":"Pandey","year":"2014","journal-title":"Int. J. Comput. Sci. Inf. Technol."},{"key":"ref_10","unstructured":"Sawabe, Y., Chiba, D., Akiyama, M., and Goto, S. (2018, January 25\u201329). Detecting homograph IDNs using OCR. Proceedings of the Asia-Pacific Advanced Network, Singapore."},{"key":"ref_11","first-page":"3148","article-title":"An efficient character recognition technique using K-nearest neighbor classifier","volume":"7","author":"Barnouti","year":"2018","journal-title":"Int. J. Eng. Technol."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Suzuki, H., Chiba, D., Yoneya, Y., Mori, T., and Goto, S. (2019, January 21\u201323). 
ShamFinder: An automated framework for detecting IDN homographs. Proceedings of the Internet Measurement Conference, Amsterdam, The Netherlands.","DOI":"10.1145\/3355369.3355587"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1016\/j.icte.2019.05.002","article-title":"Siamese neural network architecture for homoglyph attacks detection","volume":"6","author":"Vinayakumar","year":"2020","journal-title":"ICT Express"},{"key":"ref_14","unstructured":"(2022, May 18). Alexa Top 1 Million Sites. Available online: https:\/\/www.kaggle.com\/datasets\/cheedcheed\/top1m."},{"key":"ref_15","unstructured":"Sahoo, D., Liu, C., and Hoi, S.C. (2017). Malicious URL Detection using Machine Learning: A Survey. arXiv."},{"key":"ref_16","unstructured":"Le, H., Pham, Q., Sahoo, D., and Hoi, S.C.H. (2018). URLNet: Learning a URL Representation with Deep Learning for Malicious URL Detection. arXiv."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"73271","DOI":"10.1109\/ACCESS.2019.2920655","article-title":"OFS-NN: An Effective Phishing Websites Detection Model Based on Optimal Feature Selection and Neural Network","volume":"7","author":"Zhu","year":"2019","journal-title":"IEEE Access"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Vanhoenshoven, F., Napoles, G., Falcon, R., Vanhoof, K., and Koppen, M. (2016, January 6\u20139). Detecting malicious URLs using machine learning techniques. Proceedings of the 2016 IEEE Symposium Series on Computational Intelligence (SSCI), Athens, Greece.","DOI":"10.1109\/SSCI.2016.7850079"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/2435215.2435218","article-title":"A comprehensive study of techniques for URL-based web page language classification","volume":"7","author":"Baykan","year":"2013","journal-title":"ACM Trans. Web"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Talavera, L. (2005). 
An Evaluation of Filter and Wrapper Methods for Feature Selection in Categorical Clustering. Advances in Intelligent Data Analysis VI, Springer.","DOI":"10.1007\/11552253_40"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Kotthoff, L., Thornton, C., Hoos, H.H., Hutter, F., and Leyton-Brown, K. (2019). Auto-WEKA: Automatic Model Selection and Hyperparameter Optimization. Automated Machine Learning: Methods, Systems, Challenges, Springer.","DOI":"10.1007\/978-3-030-05318-5_4"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"5948","DOI":"10.1016\/j.eswa.2014.03.019","article-title":"Phishing detection based Associative Classification data mining","volume":"41","author":"Abdelhamid","year":"2014","journal-title":"Expert Syst. Appl."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Desai, A., Jatakia, J., Naik, R., and Raul, N. (2017, January 19\u201320). Malicious web content detection using machine leaning. Proceedings of the 2017 2nd IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT), Bangalore, India.","DOI":"10.1109\/RTEICT.2017.8256834"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Machado, L., and Gadge, J. (2017, January 17\u201318). Phishing Sites Detection Based on C4.5 Decision Tree Algorithm. Proceedings of the 2017 International Conference on Computing, Communication, Control and Automation (ICCUBEA), Pune, India.","DOI":"10.1109\/ICCUBEA.2017.8463818"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Tyagi, I., Shad, J., Sharma, S., Gaur, S., and Kaur, G. (2018, January 22\u201323). A Novel Machine Learning Approach to Detect Phishing Websites. Proceedings of the 2018 5th International Conference on Signal Processing and Integrated Networks (SPIN), Noida, India.","DOI":"10.1109\/SPIN.2018.8474040"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Sonmez, Y., Tuncer, T., Gokal, H., and Avci, E. 
(2018, January 22\u201325). Phishing web sites features classification based on extreme learning machine. Proceedings of the 2018 6th International Symposium on Digital Forensic and Security (ISDFS), Antalya, Turkey.","DOI":"10.1109\/ISDFS.2018.8355342"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Karabatak, M., and Mustafa, T. (2018, January 22\u201325). Performance comparison of classifiers on reduced phishing website dataset. Proceedings of the 2018 6th International Symposium on Digital Forensic and Security (ISDFS), Antalya, Turkey.","DOI":"10.1109\/ISDFS.2018.8355357"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Ginsberg, A., and Yu, C. (2018, January 8\u201310). Rapid Homoglyph Prediction and Detection. Proceedings of the 2018 1st International Conference on Data Intelligence and Security (ICDIS), South Padre Island, TX, USA.","DOI":"10.1109\/ICDIS.2018.00010"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Parekh, S., Parikh, D., Kotak, S., and Sankhe, S. (2018, January 20\u201321). A New Method for Detection of Phishing Websites: URL Detection. Proceedings of the 2018 Second International Conference on Inventive Communication and Computational Technologies (ICICCT), Coimbatore, India.","DOI":"10.1109\/ICICCT.2018.8473085"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"153","DOI":"10.1016\/j.ins.2019.01.064","article-title":"A new hybrid ensemble feature selection framework for machine learning-based phishing detection system","volume":"484","author":"Chiew","year":"2019","journal-title":"Inf. Sci."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"345","DOI":"10.1016\/j.eswa.2018.09.029","article-title":"Machine learning based phishing detection from URLs","volume":"117","author":"Sahingoz","year":"2019","journal-title":"Expert Syst. 
Appl."},{"key":"ref_32","first-page":"99","article-title":"PhiDMA\u2013A phishing detection model with multi-filter approach","volume":"32","author":"Sonowal","year":"2020","journal-title":"J. King Saud Univ.-Comput. Inf. Sci."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"797","DOI":"10.1007\/s11280-016-0418-9","article-title":"Two-stage ELM for phishing Web pages detection using hybrid features","volume":"20","author":"Zhang","year":"2017","journal-title":"World Wide Web"},{"key":"ref_34","unstructured":"Mertooetomo, E.R., and Chen, J. (1997, January 21\u201324). Character recognition with fuzzy features and fuzzy regions. Proceedings of the 1997 Annual Meeting of the North American Fuzzy Information Processing Society-NAFIPS (Cat. No.97TH8297), Syracuse, NY, USA."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1145\/1143120.1143132","article-title":"The methodology and an application to fight against Unicode attacks","volume":"Volume 149","author":"Fu","year":"2006","journal-title":"Proceedings of the Second Symposium on Usable Privacy and Security"},{"key":"ref_36","first-page":"449","article-title":"Database security-attacks and control methods","volume":"4","author":"Burtescu","year":"2009","journal-title":"J. Appl. Quant. 
Methods"}],"container-title":["Journal of Sensor and Actuator Networks"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2224-2708\/11\/3\/54\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T00:32:49Z","timestamp":1760142769000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2224-2708\/11\/3\/54"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,9,16]]},"references-count":36,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2022,9]]}},"alternative-id":["jsan11030054"],"URL":"https:\/\/doi.org\/10.3390\/jsan11030054","relation":{},"ISSN":["2224-2708"],"issn-type":[{"value":"2224-2708","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,9,16]]}}}