{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,11]],"date-time":"2026-05-11T18:15:28Z","timestamp":1778523328810,"version":"3.51.4"},"reference-count":58,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2022,12,28]],"date-time":"2022-12-28T00:00:00Z","timestamp":1672185600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["J. Data and Information Quality"],"published-print":{"date-parts":[[2023,3,31]]},"abstract":"<jats:p>Much of today\u2019s data are represented as graphs, ranging from social networks to bibliographic citations. Nodes in such graphs correspond to records that generally represent entities, while edges represent relationships between these entities. Both nodes and edges in a graph can have attributes that characterize the entities and their relationships. Relationships are either explicitly known (like friends in a social network), or they are inferred using link prediction (such as two babies are siblings because they have the same mother). Any graph representing real-world data likely contains nodes and edges that are abnormal, and identifying these can be important for outlier detection in applications ranging from crime and fraud detection to viral marketing. We propose a novel approach to the unsupervised detection of abnormal nodes and edges in graphs. We first characterize nodes and edges using a set of features, and then employ a one-class classifier to identify abnormal nodes and edges. We extract patterns of features from these abnormal nodes and edges, and apply clustering to identify groups of patterns with similar characteristics. We finally visualize these abnormal patterns to show co-occurrences of features and relationships between those features that mostly influence the abnormality of nodes and edges. We evaluate our approach on datasets from diverse domains, including historical birth certificates, COVID patient records, e-mails, books, and movies. This evaluation demonstrates that our approach is well suited to identify both abnormal nodes and edges in graphs in an unsupervised way, and it can outperform several baseline anomaly detection techniques.<\/jats:p>","DOI":"10.1145\/3546912","type":"journal-article","created":{"date-parts":[[2022,7,6]],"date-time":"2022-07-06T09:32:18Z","timestamp":1657099938000},"page":"1-37","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["Unsupervised Identification of Abnormal Nodes and Edges in Graphs"],"prefix":"10.1145","volume":"15","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3080-7847","authenticated-orcid":false,"given":"Asara","family":"Senaratne","sequence":"first","affiliation":[{"name":"School of Computing, The Australian National University, Canberra, ACT, Australia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3435-2015","authenticated-orcid":false,"given":"Peter","family":"Christen","sequence":"additional","affiliation":[{"name":"School of Computing, The Australian National University, Canberra, ACT, Australia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7041-4127","authenticated-orcid":false,"given":"Graham","family":"Williams","sequence":"additional","affiliation":[{"name":"School of Computing, The Australian National University, Canberra, ACT, Australia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4473-3877","authenticated-orcid":false,"given":"Pouya G.","family":"Omran","sequence":"additional","affiliation":[{"name":"School of Computing, The Australian National University, Canberra, ACT, Australia"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2022,12,28]]},"reference":[{"key":"e_1_3_3_2_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-14142-8_8"},{"key":"e_1_3_3_3_2","doi-asserted-by":"publisher","DOI":"10.1145\/375663.375668"},{"key":"e_1_3_3_4_2","first-page":"1","article-title":"Linking Scottish vital event records using family groups","volume":"53","author":"Akg\u00fcn \u00d6zg\u00fcr","year":"2019","unstructured":"\u00d6zg\u00fcr Akg\u00fcn, Alan Dearle, Graham Kirby, Eilidh Garrett, Tom Dalton, Peter Christen, Chris Dibben, and Lee Williamson. 2019. Linking Scottish vital event records using family groups. Historical Methods: A Journal of Quantitative and Interdisciplinary History 53, 2 (2019), 1\u201317.","journal-title":"Historical Methods: A Journal of Quantitative and Interdisciplinary History"},{"key":"e_1_3_3_5_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10618-014-0365-y"},{"key":"e_1_3_3_6_2","doi-asserted-by":"publisher","DOI":"10.1145\/2396761.2396816"},{"key":"e_1_3_3_7_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2020.105754"},{"issue":"2","key":"e_1_3_3_8_2","first-page":"18","article-title":"Feature selection based on information gain","volume":"2","author":"Azhagusundari B.","year":"2013","unstructured":"B. Azhagusundari, Antony Selvadoss Thanamani, et\u00a0al. 2013. Feature selection based on information gain. International Journal of Innovative Technology and Exploring Engineering 2, 2 (2013), 18\u201321.","journal-title":"International Journal of Innovative Technology and Exploring Engineering"},{"key":"e_1_3_3_9_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i04.5724"},{"key":"e_1_3_3_10_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-86520-7_3"},{"key":"e_1_3_3_11_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-19884-2"},{"key":"e_1_3_3_12_2","doi-asserted-by":"publisher","DOI":"10.1145\/342009.335388"},{"key":"e_1_3_3_13_2","doi-asserted-by":"publisher","DOI":"10.1145\/1132956.1132958"},{"key":"e_1_3_3_14_2","doi-asserted-by":"publisher","DOI":"10.1145\/1541880.1541882"},{"key":"e_1_3_3_15_2","volume-title":"Handbook of Data Visualization","author":"Chen Chun-houh","year":"2007","unstructured":"Chun-houh Chen, Wolfgang Karl H\u00e4rdle, and Antony Unwin. 2007. Handbook of Data Visualization. Springer Science and Business Media, Berlin."},{"key":"e_1_3_3_16_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.aei.2019.100959"},{"key":"e_1_3_3_17_2","doi-asserted-by":"publisher","DOI":"10.1145\/2889311"},{"key":"e_1_3_3_18_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-59706-1"},{"key":"e_1_3_3_19_2","doi-asserted-by":"publisher","DOI":"10.1145\/1281192.1281219"},{"key":"e_1_3_3_20_2","doi-asserted-by":"publisher","DOI":"10.1145\/1401890.1401915"},{"key":"e_1_3_3_21_2","doi-asserted-by":"publisher","DOI":"10.1145\/1143844.1143874"},{"key":"e_1_3_3_22_2","doi-asserted-by":"publisher","DOI":"10.1145\/2063576.2063749"},{"key":"e_1_3_3_23_2","doi-asserted-by":"publisher","DOI":"10.1137\/1.9781611975673.67"},{"key":"e_1_3_3_24_2","doi-asserted-by":"publisher","DOI":"10.1145\/3289600.3290964"},{"key":"e_1_3_3_25_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2020.10.102"},{"key":"e_1_3_3_26_2","doi-asserted-by":"publisher","DOI":"10.1145\/1835804.1835907"},{"key":"e_1_3_3_27_2","doi-asserted-by":"publisher","DOI":"10.32614\/RJ-2017-047"},{"key":"e_1_3_3_28_2","doi-asserted-by":"publisher","DOI":"10.1002\/j.1538-7305.1950.tb00463.x"},{"key":"e_1_3_3_29_2","volume-title":"Data Mining: Concepts and Techniques (2nd. ed.)","author":"Han Jiawei","year":"2006","unstructured":"Jiawei Han and Micheline Kamber. 2006. Data Mining: Concepts and Techniques (2nd. ed.). Morgan Kaufmann, Chicago."},{"key":"e_1_3_3_30_2","doi-asserted-by":"publisher","DOI":"10.1111\/j.1751-5823.2012.00183.x"},{"key":"e_1_3_3_31_2","doi-asserted-by":"publisher","DOI":"10.1023\/B:AIRE.0000045502.10941.a9"},{"key":"e_1_3_3_32_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jobcr.2020.08.003"},{"key":"e_1_3_3_33_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2020.107746"},{"key":"e_1_3_3_34_2","doi-asserted-by":"publisher","DOI":"10.1145\/1081870.1081891"},{"key":"e_1_3_3_35_2","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2017\/299"},{"key":"e_1_3_3_36_2","first-page":"4768","article-title":"A unified approach to interpreting model predictions","volume":"30","author":"Lundberg Scott M.","year":"2017","unstructured":"Scott M. Lundberg and Su-In Lee. 2017. A unified approach to interpreting model predictions. Advances in Neural Information Processing Dystems 30 (2017), 4768\u20134777.","journal-title":"Advances in Neural Information Processing Dystems"},{"key":"e_1_3_3_37_2","doi-asserted-by":"publisher","DOI":"10.4103\/0976-500X.83300"},{"key":"e_1_3_3_38_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDEW.2013.6547453"},{"key":"e_1_3_3_39_2","doi-asserted-by":"publisher","DOI":"10.5555\/1953048.2078195"},{"key":"e_1_3_3_40_2","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2018\/488"},{"key":"e_1_3_3_41_2","doi-asserted-by":"publisher","DOI":"10.1137\/1.9781611974348.24"},{"key":"e_1_3_3_42_2","doi-asserted-by":"publisher","DOI":"10.3366\/hac.2002.14.1-2.61"},{"key":"e_1_3_3_43_2","first-page":"1","article-title":"In-depth analysis of SVM kernel learning and its components","volume":"33","author":"Roman Ibai","year":"2020","unstructured":"Ibai Roman, Roberto Santana, Alexander Mendiburu, and Jose A. Lozano. 2020. In-depth analysis of SVM kernel learning and its components. Neural Computing and Applications 33, 10 (2020), 1\u201320.","journal-title":"Neural Computing and Applications"},{"key":"e_1_3_3_44_2","first-page":"1","article-title":"Fraud detection: Discovering connections with graph databases","volume":"13","author":"Sadowski Gorka","year":"2014","unstructured":"Gorka Sadowski and Philip Rathle. 2014. Fraud detection: Discovering connections with graph databases. White Paper-Neo Technology-Graphs are Everywhere 13 (2014), 1\u201313.","journal-title":"White Paper-Neo Technology-Graphs are Everywhere"},{"key":"e_1_3_3_45_2","doi-asserted-by":"publisher","DOI":"10.1145\/2618243.2618266"},{"key":"e_1_3_3_46_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2013.88"},{"key":"e_1_3_3_47_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.socnet.2014.05.002"},{"key":"e_1_3_3_48_2","doi-asserted-by":"publisher","DOI":"10.1162\/089976601750264965"},{"key":"e_1_3_3_49_2","doi-asserted-by":"publisher","DOI":"10.1162\/089976600300015565"},{"key":"e_1_3_3_50_2","volume-title":"R: Recipes for Analysis, Visualization and Machine Learning","author":"Viswanathan Viswa","year":"2016","unstructured":"Viswa Viswanathan, Shanthi Viswanathan, Atmajitsinh Gohil, and Chiu David Chiu Yu-Wei. 2016. R: Recipes for Analysis, Visualization and Machine Learning. Packt Publishing Ltd, Mumbai."},{"key":"e_1_3_3_51_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2019.112897"},{"key":"e_1_3_3_52_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11280-020-00849-0"},{"key":"e_1_3_3_53_2","first-page":"217","volume-title":"Proceedings of the National Conference on Artificial Intelligence","author":"Wong Weng-Keen","year":"2002","unstructured":"Weng-Keen Wong, Andrew Moore, Gregory Cooper, and Michael Wagner. 2002. Rule-based anomaly pattern detection for detecting disease outbreaks. In Proceedings of the National Conference on Artificial Intelligence. AAAI Press, California, 217\u2013223."},{"key":"e_1_3_3_54_2","doi-asserted-by":"publisher","DOI":"10.1145\/1281192.1281280"},{"key":"e_1_3_3_55_2","doi-asserted-by":"publisher","DOI":"10.1109\/IJCNN.2010.5596881"},{"key":"e_1_3_3_56_2","unstructured":"Chris Young. 2021. 17+ of the Oldest People in the World to Have Beaten COVID-19. Retrieved February 17 2021 from https:\/\/interestingengineering.com\/17-of-the-oldest-people-in-the-world-to-have-beaten-covid-19."},{"key":"e_1_3_3_57_2","doi-asserted-by":"publisher","DOI":"10.1109\/IJCNN.2019.8852354"},{"key":"e_1_3_3_58_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00521-020-05091-3"},{"key":"e_1_3_3_59_2","doi-asserted-by":"publisher","DOI":"10.1038\/s41746-020-00372-6"}],"container-title":["Journal of Data and Information Quality"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3546912","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3546912","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:00:41Z","timestamp":1750186841000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3546912"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,12,28]]},"references-count":58,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2023,3,31]]}},"alternative-id":["10.1145\/3546912"],"URL":"https:\/\/doi.org\/10.1145\/3546912","relation":{},"ISSN":["1936-1955","1936-1963"],"issn-type":[{"value":"1936-1955","type":"print"},{"value":"1936-1963","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,12,28]]},"assertion":[{"value":"2021-05-29","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-05-26","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-12-28","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}