{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,2]],"date-time":"2025-08-02T18:14:45Z","timestamp":1754158485446,"version":"3.41.2"},"reference-count":53,"publisher":"Emerald","issue":"5\/6","license":[{"start":{"date-parts":[[2020,11,30]],"date-time":"2020-11-30T00:00:00Z","timestamp":1606694400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["EL"],"published-print":{"date-parts":[[2020,11,30]]},"abstract":"<jats:sec>\n<jats:title content-type=\"abstract-subheading\">Purpose<\/jats:title>\n<jats:p>The aim of this study is to propose an efficient rule extraction and integration approach for identifying phishing websites. The proposed approach can elucidate patterns of phishing websites and identify them accurately.<\/jats:p>\n<\/jats:sec>\n<jats:sec>\n<jats:title content-type=\"abstract-subheading\">Design\/methodology\/approach<\/jats:title>\n<jats:p>Hyperlink indicators along with URL-based features are used to build the identification model. In the proposed approach, very simple rules are first extracted based on individual features to provide meaningful and easy-to-understand rules. Then, the <jats:italic>F<\/jats:italic>-measure score is used to select high-quality rules for identifying phishing websites. To construct a reliable and promising phishing website identification model, the selected rules are integrated using a simple neural network model.<\/jats:p>\n<\/jats:sec>\n<jats:sec>\n<jats:title content-type=\"abstract-subheading\">Findings<\/jats:title>\n<jats:p>Experiments conducted using self-collected and benchmark data sets show that the proposed approach outperforms 16 commonly used classifiers (including seven non\u2013rule-based and four rule-based classifiers as well as five deep learning models) in terms of interpretability and identification performance.<\/jats:p>\n<\/jats:sec>\n<jats:sec>\n<jats:title content-type=\"abstract-subheading\">Originality\/value<\/jats:title>\n<jats:p>Investigating patterns of phishing websites based on hyperlink indicators using the efficient rule-based approach is innovative. It is not only helpful for identifying phishing websites, but also beneficial for extracting simple and understandable rules.<\/jats:p>\n<\/jats:sec>","DOI":"10.1108\/el-01-2020-0016","type":"journal-article","created":{"date-parts":[[2020,11,27]],"date-time":"2020-11-27T04:38:48Z","timestamp":1606451928000},"page":"1073-1093","source":"Crossref","is-referenced-by-count":6,"title":["Identification of phishing websites through hyperlink analysis and rule extraction"],"prefix":"10.1108","volume":"38","author":[{"given":"Chaoqun","family":"Wang","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhongyi","family":"Hu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Raymond","family":"Chiong","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yukun","family":"Bao","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jiang","family":"Wu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"140","reference":[{"issue":"13","key":"key2020121210210927300_ref001","doi-asserted-by":"crossref","first-page":"5948","DOI":"10.1016\/j.eswa.2014.03.019","article-title":"Phishing detection based associative classification data mining","volume":"41","year":"2014","journal-title":"Expert Systems with Applications"},{"key":"key2020121210210927300_ref002","first-page":"60","article-title":"A comparison of machine learning techniques for phishing detection","volume-title":"in Anti-Phishing Working Groups 2nd Annual eCrime Researchers Summit (eCrime '07), 4-5 October","year":"2007"},{"key":"key2020121210210927300_ref003","first-page":"281","article-title":"Using case-based reasoning for phishing detection","volume-title":"Proceedings of the 8th International Conference on Ambient Systems, Networks and Technologies","year":"2017"},{"key":"key2020121210210927300_ref004","doi-asserted-by":"crossref","first-page":"300","DOI":"10.1016\/j.eswa.2018.07.067","article-title":"Intelligent web-phishing detection and protection scheme using integrated features of images, frames and text","volume":"115","year":"2019","journal-title":"Expert Systems with Applications"},{"issue":"3","key":"key2020121210210927300_ref005","doi-asserted-by":"crossref","first-page":"581","DOI":"10.1109\/TIFS.2010.2050767","article-title":"Web spam detection: new classification features based on qualified link analysis and language models","volume":"5","year":"2010","journal-title":"IEEE Transactions on Information Forensics and Security"},{"key":"key2020121210210927300_ref006","doi-asserted-by":"crossref","first-page":"82","DOI":"10.1016\/j.inffus.2019.12.012","article-title":"Explainable artificial intelligence (XAI): concepts, taxonomies, opportunities and challenges toward responsible AI","volume":"58","year":"2020","journal-title":"Information Fusion"},{"issue":"12","key":"key2020121210210927300_ref007","first-page":"4315","article-title":"Heuristic nonlinear regression strategy for detecting phishing websites","volume":"23","year":"2018","journal-title":"Soft Computing"},{"key":"key2020121210210927300_ref008","first-page":"51","article-title":"Anti-phishing based on automated individual white-list","volume-title":"Proceedings of the 4th ACM Workshop on Digital Identity Management (DIM '08), co-located with the 15th ACM Computer and Communications Security Conference (CCS '08), 27-31 October","year":"2008"},{"key":"key2020121210210927300_ref009","first-page":"157","article-title":"Countering web spam with credibility-based link analysis","volume-title":"Proceedings of the 26th Annual ACM Symposium on Principles of Distributed Computing (PODC '07)","year":"2007"},{"key":"key2020121210210927300_ref010","first-page":"227","article-title":"Detecting phishing websites through deep reinforcement learning","volume-title":"Proceedings of the IEEE 43rd Annual Computer Software and Applications Conference (COMPSAC'19)","year":"2019"},{"key":"key2020121210210927300_ref011","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1016\/j.cose.2015.07.006","article-title":"Utilisation of website logo for phishing detection","volume":"54","year":"2015","journal-title":"Computers and Security"},{"key":"key2020121210210927300_ref012","doi-asserted-by":"crossref","first-page":"153","DOI":"10.1016\/j.ins.2019.01.064","article-title":"A new hybrid ensemble feature selection framework for machine learning-based phishing detection system","volume":"484","year":"2019","journal-title":"Information Sciences"},{"volume-title":"Neural Networks with R: Smart Models Using CNN, RNN, Deep Learning, and Artificial Intelligence Principles","year":"2017","key":"key2020121210210927300_ref013"},{"key":"key2020121210210927300_ref014","doi-asserted-by":"crossref","first-page":"921","DOI":"10.1016\/j.ins.2015.04.022","article-title":"Entropy-based discretization methods for ranking data","volume":"329","year":"2016","journal-title":"Information Sciences"},{"key":"key2020121210210927300_ref015","doi-asserted-by":"crossref","first-page":"256","DOI":"10.1016\/j.cose.2019.03.018","article-title":"A keyword-based combination approach for detecting phishing webpages","volume":"84","year":"2019","journal-title":"Computers and Security"},{"key":"key2020121210210927300_ref016","first-page":"96","article-title":"Linear-time rule induction","volume-title":"in Proceedings of the Second International Conference on Knowledge Discovery and Data Mining (KDD '96)","year":"1996"},{"key":"key2020121210210927300_ref017","first-page":"194","article-title":"Supervised and unsupervised discretization of continuous features","volume-title":"Machine Learning Proceedings","year":"1995"},{"key":"key2020121210210927300_ref018","doi-asserted-by":"crossref","first-page":"729","DOI":"10.1016\/j.asoc.2016.08.005","article-title":"A new fast associative classification algorithm for detecting phishing websites","volume":"48","year":"2016","journal-title":"Applied Soft Computing"},{"key":"key2020121210210927300_ref019","doi-asserted-by":"crossref","first-page":"344","DOI":"10.1016\/j.asoc.2018.04.056","article-title":"Integrating associative rule-based classification with naive Bayes for text classification","volume":"69","year":"2018","journal-title":"Applied Soft Computing"},{"edition":"3rd ed.","volume-title":"Data Mining: Concepts and Techniques","year":"2011","key":"key2020121210210927300_ref020"},{"issue":"3","key":"key2020121210210927300_ref021","doi-asserted-by":"crossref","first-page":"464","DOI":"10.1016\/j.ijinfomgt.2013.01.001","article-title":"Social media competitive analysis and text mining: a case study in the pizza industry","volume":"33","year":"2013","journal-title":"International Journal of Information Management"},{"issue":"3","key":"key2020121210210927300_ref022","doi-asserted-by":"crossref","first-page":"676","DOI":"10.1108\/IMDS-02-2018-0072","article-title":"Malicious web domain identification using online credibility and performance data by considering the class imbalance issue","volume":"119","year":"2019","journal-title":"Industrial Management and Data Systems"},{"key":"key2020121210210927300_ref023","first-page":"5186","article-title":"Identifying malicious web domains using machine learning techniques with online credibility and performance data","year":"2016","journal-title":"Proceedings of the IEEE Congress on Evolutionary Computation (CEC '16)"},{"key":"key2020121210210927300_ref024","first-page":"339","article-title":"Analysis of phishing attacks and countermeasures","volume-title":"Proceedings of the 6th International Conference on Managing Information in Digital Economy (IBIMA '06)","year":"2006"},{"issue":"1","key":"key2020121210210927300_ref025","first-page":"1","article-title":"Intelligent phishing URL detection using association rule mining","volume":"6","year":"2016","journal-title":"Human-Centric Computing and Information Sciences"},{"issue":"5","key":"key2020121210210927300_ref026","doi-asserted-by":"crossref","first-page":"604","DOI":"10.1145\/324133.324140","article-title":"Authoritative sources in a hyperlinked environment","volume":"46","year":"1999","journal-title":"Journal of the ACM"},{"volume-title":"Machine Learning with R","year":"2013","key":"key2020121210210927300_ref027"},{"key":"key2020121210210927300_ref028","doi-asserted-by":"crossref","first-page":"473","DOI":"10.1145\/1273496.1273556","article-title":"An empirical evaluation of deep architectures on problems with many factors of variation","volume-title":"Proceedings of the 24th International Conference on Machine Learning","year":"2007"},{"key":"key2020121210210927300_ref029","doi-asserted-by":"crossref","first-page":"201","DOI":"10.1016\/j.cose.2019.02.004","article-title":"An effective security alert mechanism for real-time phishing tweet detection on Twitter","volume":"83","year":"2019","journal-title":"Computers and Security"},{"key":"key2020121210210927300_ref030","first-page":"1","article-title":"Phishing sites detection based on C4.5 decision tree algorithm","volume-title":"Proceedings of the International Conference on Computing, Communication, Control and Automation","year":"2017"},{"key":"key2020121210210927300_ref031","doi-asserted-by":"crossref","first-page":"231","DOI":"10.1016\/j.eswa.2016.01.028","article-title":"New rule-based phishing detection method","volume":"53","year":"2016","journal-title":"Expert Systems with Applications"},{"issue":"2","key":"key2020121210210927300_ref032","doi-asserted-by":"crossref","first-page":"443","DOI":"10.1007\/s00521-013-1490-z","article-title":"Predicting phishing websites based on self-structuring neural network","volume":"25","year":"2014","journal-title":"Neural Computing and Applications"},{"key":"key2020121210210927300_ref033","first-page":"557","article-title":"Web credibility: Features exploration and credibility prediction","volume-title":"Proceedings of the European Conference on Information Retrieval","year":"2013"},{"key":"key2020121210210927300_ref034","first-page":"381","article-title":"Anomaly based web phishing page detection","volume-title":"Proceedings of the 22nd Annual Computer Security Applications Conference","year":"2006"},{"key":"key2020121210210927300_ref035","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/978-1-4419-9326-7_1","article-title":"Ensemble learning","volume-title":"Ensemble Machine Learning: Methods and Applications","year":"2012"},{"key":"key2020121210210927300_ref036","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.knosys.2015.10.009","article-title":"Improving recall of software defect prediction models using association mining","volume":"90","year":"2015","journal-title":"Knowledge-Based Systems"},{"key":"key2020121210210927300_ref037","doi-asserted-by":"crossref","first-page":"345","DOI":"10.1016\/j.eswa.2018.09.029","article-title":"Machine learning based phishing detection from URLs","volume":"117","year":"2019","journal-title":"Expert Systems with Applications"},{"key":"key2020121210210927300_ref038","doi-asserted-by":"crossref","first-page":"237","DOI":"10.1016\/j.eswa.2018.07.042","article-title":"A novel software defect prediction based on atomic class-association rule mining","volume":"114","year":"2018","journal-title":"Expert Systems with Applications"},{"key":"key2020121210210927300_ref039","doi-asserted-by":"crossref","first-page":"88","DOI":"10.1016\/j.dss.2018.01.001","article-title":"Detection of online phishing email using dynamic evolving neural network based on reinforcement learning","volume":"107","year":"2018","journal-title":"Decision Support Systems"},{"issue":"1","key":"key2020121210210927300_ref040","first-page":"1","article-title":"Efficient deep learning techniques for the detection of phishing websites","volume":"45","year":"2020","journal-title":"S\u0101dhan\u0101"},{"issue":"3","key":"key2020121210210927300_ref041","doi-asserted-by":"crossref","first-page":"1831","DOI":"10.1007\/s11192-014-1374-8","article-title":"Linked title mentions: a new automated link search candidate","volume":"101","year":"2014","journal-title":"Scientometrics"},{"issue":"5","key":"key2020121210210927300_ref042","doi-asserted-by":"crossref","first-page":"1105","DOI":"10.1002\/asi.23486","article-title":"Uncovering information from social media hyperlinks: an investigation of Twitter","volume":"67","year":"2016","journal-title":"Journal of the Association for Information Science and Technology"},{"issue":"1","key":"key2020121210210927300_ref043","doi-asserted-by":"crossref","first-page":"217","DOI":"10.1007\/s11192-012-0640-x","article-title":"Exploring web keyword analysis as an alternative to link analysis: a multi-industry case","volume":"93","year":"2012","journal-title":"Scientometrics"},{"issue":"10","key":"key2020121210210927300_ref044","doi-asserted-by":"crossref","first-page":"1960","DOI":"10.1002\/asi.22659","article-title":"Web data as academic and business quality estimates: a comparison of three data sources","volume":"63","year":"2012","journal-title":"Journal of the American Society for Information Science and Technology"},{"key":"key2020121210210927300_ref045","first-page":"1","article-title":"Comparative study of the detection of malicious URLs using shallow and deep networks","volume-title":"Proceedings of the 9th International Conference on Computing, Communication and Networking Technologies (ICCCNT '18)","year":"2018"},{"key":"key2020121210210927300_ref046","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/2222444.2222456","article-title":"Evaluating Arabic spam classifiers using link analysis","volume-title":"Proceedings of the 3rd International Conference on Information and Communication Systems","year":"2012"},{"key":"key2020121210210927300_ref047","first-page":"119","article-title":"Phishing website detection using C4. 5 decision tree","volume-title":"Proceedings of the 2nd International Conference on Information Technology and Management Engineering (ITME'17)","year":"2017"},{"key":"key2020121210210927300_ref048","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1016\/j.neunet.2020.02.013","article-title":"CNN-MHSA: a convolutional neural network and multi-head self-attention combined approach for detecting phishing websites","volume":"125","year":"2020","journal-title":"Neural Networks"},{"issue":"2","key":"key2020121210210927300_ref049","doi-asserted-by":"crossref","first-page":"315","DOI":"10.1007\/s11192-007-1923-5","article-title":"Hyperlink analysis for government websites of Chinese provincial capitals","volume":"76","year":"2008","journal-title":"Scientometrics"},{"key":"key2020121210210927300_ref050","doi-asserted-by":"crossref","first-page":"15196","DOI":"10.1109\/ACCESS.2019.2892066","article-title":"Phishing website detection based on multidimensional features driven by deep learning","volume":"7","year":"2019","journal-title":"IEEE Access"},{"issue":"1","key":"key2020121210210927300_ref051","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1108\/EL-05-2019-0118","article-title":"Phishing web site detection using diverse machine learning algorithms","volume":"38","year":"2020","journal-title":"The Electronic Library"},{"first-page":"138","article-title":"Finding more bilingual webpages with high credibility via link analysis","year":"2013","key":"key2020121210210927300_ref052"},{"key":"key2020121210210927300_ref053","doi-asserted-by":"crossref","first-page":"73271","DOI":"10.1109\/ACCESS.2019.2920655","article-title":"OFS-NN: an effective phishing websites detection model based on optimal feature selection and neural network","volume":"7","year":"2019","journal-title":"IEEE Access"}],"container-title":["The Electronic Library"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/EL-01-2020-0016\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/EL-01-2020-0016\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,25]],"date-time":"2025-07-25T01:06:36Z","timestamp":1753405596000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/el\/article\/38\/5-6\/1073-1093\/47315"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,11,30]]},"references-count":53,"journal-issue":{"issue":"5\/6","published-print":{"date-parts":[[2020,11,30]]}},"alternative-id":["10.1108\/EL-01-2020-0016"],"URL":"https:\/\/doi.org\/10.1108\/el-01-2020-0016","relation":{},"ISSN":["0264-0473","0264-0473"],"issn-type":[{"type":"print","value":"0264-0473"},{"type":"print","value":"0264-0473"}],"subject":[],"published":{"date-parts":[[2020,11,30]]}}}