{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,23]],"date-time":"2026-03-23T16:00:51Z","timestamp":1774281651201,"version":"3.50.1"},"reference-count":32,"publisher":"Springer Science and Business Media LLC","issue":"2","license":[{"start":{"date-parts":[[2022,4,29]],"date-time":"2022-04-29T00:00:00Z","timestamp":1651190400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,4,29]],"date-time":"2022-04-29T00:00:00Z","timestamp":1651190400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Data Sci. Eng."],"published-print":{"date-parts":[[2022,6]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>The main aim of the outlying aspect mining algorithm is to automatically detect the subspace(s) (a.k.a. aspect(s)), where a given data point is dramatically different than the rest of the data in each of those subspace(s) (aspect(s)). To rank the subspaces for a given data point, a scoring measure is required to compute the outlying degree of the given data in each subspace. In this paper, we introduce a new measure to compute outlying degree, called <jats:italic>Simple Isolation score using Nearest Neighbor Ensemble<\/jats:italic> (SiNNE), which not only detects the outliers but also provides an explanation on why the selected point is an outlier. SiNNE is a dimensionally unbias measure in its raw form, which means the scores produced by SiNNE are compared directly with subspaces having different dimensions. Thus, it does not require any normalization to make the score unbiased. Our experimental results on synthetic and publicly available real-world datasets revealed that (i) SiNNE produces better or at least the same results as existing scores. (ii) It improves the run time of the existing outlying aspect mining algorithm based on beam search by at least two orders of magnitude. SiNNE allows the existing outlying aspect mining algorithm to run in datasets with hundreds of thousands of instances and thousands of dimensions which was not possible before.<\/jats:p>","DOI":"10.1007\/s41019-022-00185-5","type":"journal-article","created":{"date-parts":[[2022,4,29]],"date-time":"2022-04-29T10:41:59Z","timestamp":1651228919000},"page":"120-135","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":15,"title":["A New Dimensionality-Unbiased Score for Efficient and Effective Outlying Aspect Mining"],"prefix":"10.1007","volume":"7","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-1042-7804","authenticated-orcid":false,"given":"Durgesh","family":"Samariya","sequence":"first","affiliation":[]},{"given":"Jiangang","family":"Ma","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,4,29]]},"reference":[{"issue":"1","key":"185_CR1","doi-asserted-by":"publisher","first-page":"134","DOI":"10.1007\/s10618-016-0458-x","volume":"31","author":"F Angiulli","year":"2017","unstructured":"Angiulli F, Fassetti F, Manco G, Palopoli L (2017) Outlying property detection with numerical attributes. Data Min Knowl Disc 31(1):134\u2013163","journal-title":"Data Min Knowl Disc"},{"key":"185_CR2","doi-asserted-by":"crossref","unstructured":"Bandaragoda TR, Ting KM, Albrecht D, Liu FT, Wells JR (2014) Efficient anomaly detection by isolation using nearest neighbour ensemble. In: 2014 IEEE international conference on data mining workshop, pp 698\u2013705","DOI":"10.1109\/ICDMW.2014.70"},{"issue":"4","key":"185_CR3","doi-asserted-by":"publisher","first-page":"968","DOI":"10.1111\/coin.12156","volume":"34","author":"TR Bandaragoda","year":"2018","unstructured":"Bandaragoda TR, Ting KM, Albrecht D, Liu FT, Zhu Y, Wells JR (2018) Isolation-based anomaly detection using nearest-neighbor ensembles. Comput Intell 34(4):968\u2013998. https:\/\/doi.org\/10.1111\/coin.12156","journal-title":"Comput Intell"},{"key":"185_CR4","doi-asserted-by":"crossref","unstructured":"Brockett PL, Xia X, Derrig RA (1998) Using Kohonen\u2019s self-organizing feature map to uncover automobile bodily injury claims fraud. J Risk Insur 65(2):245\u2013274. http:\/\/www.jstor.org\/stable\/253535","DOI":"10.2307\/253535"},{"issue":"4","key":"185_CR5","doi-asserted-by":"publisher","first-page":"891","DOI":"10.1007\/s10618-015-0444-8","volume":"30","author":"GO Campos","year":"2016","unstructured":"Campos GO, Zimek A, Sander J, Campello RJGB, Micenkov\u00e1 B, Schubert E, Assent I, Houle ME (2016) On the evaluation of unsupervised outlier detection: measures, datasets, and an empirical study. Data Min Knowl Disc 30(4):891\u2013927. https:\/\/doi.org\/10.1007\/s10618-015-0444-8","journal-title":"Data Min Knowl Disc"},{"issue":"6","key":"185_CR6","doi-asserted-by":"publisher","first-page":"67","DOI":"10.1109\/5254.809570","volume":"14","author":"PK Chan","year":"1999","unstructured":"Chan PK, Fan W, Prodromidis AL, Stolfo SJ (1999) Distributed data mining in credit card fraud detection. IEEE Intell Syst Appl 14(6):67\u201374","journal-title":"IEEE Intell Syst Appl"},{"key":"185_CR7","doi-asserted-by":"crossref","unstructured":"Dang XH, Micenkov\u00e1 B, Assent I, Ng RT (2013) Local outlier detection with interpretation. In: Blockeel H, Kersting K, Nijssen S, \u017delezn\u00fd F (eds) Machine learning and knowledge discovery in databases. Springer Berlin Heidelberg, Berlin, pp 304\u2013320","DOI":"10.1007\/978-3-642-40994-3_20"},{"issue":"5","key":"185_CR8","doi-asserted-by":"publisher","first-page":"1116","DOI":"10.1007\/s10618-014-0398-2","volume":"29","author":"L Duan","year":"2015","unstructured":"Duan L, Tang G, Pei J, Bailey J, Campbell A, Tang C (2015) Mining outlying aspects on numeric data. Data Min Knowl Disc 29(5):1116\u20131151. https:\/\/doi.org\/10.1007\/s10618-014-0398-2","journal-title":"Data Min Knowl Disc"},{"key":"185_CR9","doi-asserted-by":"publisher","first-page":"122","DOI":"10.1007\/978-3-030-10925-7_8","volume-title":"Machine learning and knowledge discovery in databases","author":"N Gupta","year":"2019","unstructured":"Gupta N, Eswaran D, Shah N, Akoglu L, Faloutsos C (2019) Beyond outlier detection: lookout for pictorial explanation. In: Berlingerio M, Bonchi F, G\u00e4rtner T, Hurley N, Ifrim G (eds) Machine learning and knowledge discovery in databases. Springer, Cham, pp 122\u2013138"},{"issue":"1","key":"185_CR10","doi-asserted-by":"publisher","first-page":"10","DOI":"10.1145\/1656274.1656278","volume":"11","author":"M Hall","year":"2009","unstructured":"Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009) The weka data mining software: an update. SIGKDD Explor Newsl 11(1):10\u201318. https:\/\/doi.org\/10.1145\/1656274.1656278","journal-title":"SIGKDD Explor Newsl"},{"key":"185_CR11","volume-title":"Smoothing techniques: with implementation in S","author":"W H\u00e4rdle","year":"2012","unstructured":"H\u00e4rdle W (2012) Smoothing techniques: with implementation in S. Springer, New York"},{"issue":"7825","key":"185_CR12","doi-asserted-by":"publisher","first-page":"357","DOI":"10.1038\/s41586-020-2649-2","volume":"585","author":"CR Harris","year":"2020","unstructured":"Harris CR, Millman KJ, van der Walt SJ, Gommers R, Virtanen P, Cournapeau D, Wieser E, Taylor J, Berg S, Smith NJ, Kern R, Picus M, Hoyer S, van Kerkwijk MH, Brett M, Haldane A, del R\u00edo JF, Wiebe M, Peterson P, G\u00e9rard-Marchant P, Sheppard K, Reddy T, Weckesser W, Abbasi H, Gohlke C, Oliphant TE (2020) Array programming with NumPy. Nature 585(7825):357\u2013362. https:\/\/doi.org\/10.1038\/s41586-020-2649-2","journal-title":"Nature"},{"key":"185_CR13","doi-asserted-by":"publisher","unstructured":"Keller F, Muller E, Bohm K (2012) Hics: high contrast subspaces for density-based outlier ranking. In: Proceedings of the 2012 IEEE 28th International Conference on Data Engineering, IEEE Computer Society, Washington, DC, USA, ICDE\u201912, pp 1037\u20131048, https:\/\/doi.org\/10.1109\/ICDE.2012.88","DOI":"10.1109\/ICDE.2012.88"},{"key":"185_CR14","doi-asserted-by":"crossref","unstructured":"Lin J, Keogh E, Ada Fu, Van Herle H (2005) Approximations to magic: finding unusual medical time series. In: 18th IEEE symposium on computer-based medical systems (CBMS\u201905), pp 329\u2013334","DOI":"10.1109\/CBMS.2005.34"},{"key":"185_CR15","doi-asserted-by":"crossref","unstructured":"Liu FT, Ting KM, Zhou Z (2008) Isolation forest. In: 2008 Eighth IEEE international conference on data mining, pp 413\u2013422","DOI":"10.1109\/ICDM.2008.17"},{"key":"185_CR16","doi-asserted-by":"crossref","unstructured":"Liu N, Shin D, Hu X (2018) Contextual outlier interpretation. In: Proceedings of the 27th international joint conference on artificial intelligence. AAAI Press, IJCAI\u201918, pp 2461\u20132467","DOI":"10.24963\/ijcai.2018\/341"},{"key":"185_CR17","doi-asserted-by":"publisher","first-page":"454","DOI":"10.1007\/978-3-642-03070-3_34","volume-title":"Machine learning and data mining in pattern recognition","author":"M Mej\u00eda-Lavalle","year":"2009","unstructured":"Mej\u00eda-Lavalle M, S\u00e1nchez Vivar A (2009) Outlier detection with explanation facility. In: Perner P (ed) Machine learning and data mining in pattern recognition. Springer Berlin Heidelberg, Berlin, pp 454\u2013464"},{"key":"185_CR18","doi-asserted-by":"publisher","unstructured":"Micenkov\u00e1 B, Ng RT, Dang X, Assent I (2013) Explaining outliers by subspace separability. In: 2013 IEEE 13th international conference on data mining, pp 518\u2013527, https:\/\/doi.org\/10.1109\/ICDM.2013.132","DOI":"10.1109\/ICDM.2013.132"},{"issue":"1\u20132","key":"185_CR19","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1561\/2200000060","volume":"10","author":"K Muandet","year":"2017","unstructured":"Muandet K, Fukumizu K, Sriperumbudur B, Sch\u00f6lkopf B (2017) Kernel mean embedding of distributions: a review and beyond. Found Trends Mach Learn 10(1\u20132):1\u2013141","journal-title":"Found Trends Mach Learn"},{"key":"185_CR20","doi-asserted-by":"publisher","first-page":"160","DOI":"10.1007\/978-3-030-90885-0_15","volume-title":"Health information science","author":"D Samariya","year":"2021","unstructured":"Samariya D, Ma J (2021) Mining outlying aspects on healthcare data. In: Siuly S, Wang H, Chen L, Guo Y, Xing C (eds) Health information science. Springer, Cham, pp 160\u2013170"},{"key":"185_CR21","doi-asserted-by":"publisher","DOI":"10.1007\/s40745-021-00362-9","author":"D Samariya","year":"2021","unstructured":"Samariya D, Thakkar A (2021) A comprehensive survey of anomaly detection algorithms. Ann Data Sci. https:\/\/doi.org\/10.1007\/s40745-021-00362-9","journal-title":"Ann Data Sci"},{"key":"185_CR22","doi-asserted-by":"publisher","first-page":"463","DOI":"10.1007\/978-3-030-62008-0_32","volume-title":"Web information systems engineering\u2014WISE 2020","author":"D Samariya","year":"2020","unstructured":"Samariya D, Aryal S, Ting KM, Ma J (2020) A new effective and efficient measure for outlying aspect mining. In: Huang Z, Beek W, Wang H, Zhou R, Zhang Y (eds) Web information systems engineering\u2014WISE 2020. Springer, Cham, pp 463\u2013474"},{"key":"185_CR23","unstructured":"Samariya D, Ma J, Aryal S (2020b) A comprehensive survey on outlying aspect mining methods. arXiv preprint arXiv:2005.02637"},{"key":"185_CR24","volume-title":"Density estimation for statistics and data analysis","author":"BW Silverman","year":"1986","unstructured":"Silverman BW (1986) Density estimation for statistics and data analysis. Chapman & Hall, London"},{"key":"185_CR25","doi-asserted-by":"publisher","DOI":"10.5281\/zenodo.4118697","author":"O Tange","year":"2020","unstructured":"Tange O (2020) Gnu parallel 20201022 (\u2018samuelpaty\u2019). Zenodo. https:\/\/doi.org\/10.5281\/zenodo.4118697","journal-title":"Zenodo"},{"key":"185_CR26","doi-asserted-by":"publisher","first-page":"422","DOI":"10.1007\/978-3-319-18032-8_33","volume-title":"Advances in knowledge discovery and data mining","author":"NX Vinh","year":"2015","unstructured":"Vinh NX, Chan J, Bailey J, Leckie C, Ramamohanarao K, Pei J (2015) Scalable outlying-inlying aspects discovery via feature ranking. In: Cao T, Lim EP, Zhou ZH, Ho TB, Cheung D, Motoda H (eds) Advances in knowledge discovery and data mining. Springer, Cham, pp 422\u2013434"},{"issue":"6","key":"185_CR27","doi-asserted-by":"publisher","first-page":"1520","DOI":"10.1007\/s10618-016-0453-2","volume":"30","author":"NX Vinh","year":"2016","unstructured":"Vinh NX, Chan J, Romano S, Bailey J, Leckie C, Ramamohanarao K, Pei J (2016) Discovering outlying aspects in large datasets. Data Min Knowl Disc 30(6):1520\u20131555. https:\/\/doi.org\/10.1007\/s10618-016-0453-2","journal-title":"Data Min Knowl Disc"},{"key":"185_CR28","doi-asserted-by":"publisher","first-page":"92","DOI":"10.1016\/j.patrec.2018.12.020","volume":"122","author":"JR Wells","year":"2019","unstructured":"Wells JR, Ting KM (2019) A new simple and efficient density estimator that enables fast systematic search. Pattern Recognit Lett 122:92\u201398. https:\/\/doi.org\/10.1016\/j.patrec.2018.12.020","journal-title":"Pattern Recognit Lett"},{"key":"185_CR29","doi-asserted-by":"publisher","unstructured":"Xu H, Wang Y, Jian S, Huang Z, Wang Y, Liu N, Li F (2021) Beyond outlier detection: Outlier interpretation by attention-guided triplet deviation network. In: Proceedings of the web conference 2021, association for computing machinery, New York, NY, USA, WWW\u201921, pp 1328\u20131339, https:\/\/doi.org\/10.1145\/3442381.3449868","DOI":"10.1145\/3442381.3449868"},{"key":"185_CR30","doi-asserted-by":"crossref","unstructured":"Xu YX, Pang M, Feng J, Ting KM, Jiang Y, Zhou ZH (2021) Reconstruction-based anomaly detection with completely random forest. In: Proceedings of the 2021 SIAM international conference on data mining (SDM), SIAM, pp 127\u2013135","DOI":"10.1137\/1.9781611976700.15"},{"key":"185_CR31","doi-asserted-by":"crossref","unstructured":"Zhang J, Lou M, Ling TW, Wang H (2004) Hos-miner: a system for detecting outlyting subspaces of high-dimensional data. In: Proceedings of the thirtieth international conference on very large data bases\u2014volume 30, VLDB endowment, Toronto, Canada, VLDB\u201904, pp 1265\u20131268, http:\/\/dl.acm.org\/citation.cfm?id=1316689.1316810","DOI":"10.1016\/B978-012088469-8\/50123-6"},{"issue":"2","key":"185_CR32","doi-asserted-by":"publisher","first-page":"213","DOI":"10.1007\/s11263-006-9794-4","volume":"73","author":"J Zhang","year":"2007","unstructured":"Zhang J, Marsza\u0142ek M, Lazebnik S, Schmid C (2007) Local features and kernels for classification of texture and object categories: a comprehensive study. Int J Comput Vis 73(2):213\u2013238. https:\/\/doi.org\/10.1007\/s11263-006-9794-4","journal-title":"Int J Comput Vis"}],"container-title":["Data Science and Engineering"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s41019-022-00185-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s41019-022-00185-5\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s41019-022-00185-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,5,12]],"date-time":"2022-05-12T17:11:22Z","timestamp":1652375482000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s41019-022-00185-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,4,29]]},"references-count":32,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2022,6]]}},"alternative-id":["185"],"URL":"https:\/\/doi.org\/10.1007\/s41019-022-00185-5","relation":{},"ISSN":["2364-1185","2364-1541"],"issn-type":[{"value":"2364-1185","type":"print"},{"value":"2364-1541","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,4,29]]},"assertion":[{"value":"13 December 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"9 March 2022","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 April 2022","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"29 April 2022","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors have no conflicts of interest to declare that are relevant to the content of this article.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}