{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,3]],"date-time":"2026-06-03T15:41:29Z","timestamp":1780501289865,"version":"3.54.1"},"reference-count":31,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2019,12,31]],"date-time":"2019-12-31T00:00:00Z","timestamp":1577750400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"the National Social Science Fund","award":["19BYY076"],"award-info":[{"award-number":["19BYY076"]}]},{"name":"the Science Foundation of the Ministry of Education of China","award":["14YJC860042"],"award-info":[{"award-number":["14YJC860042"]}]},{"name":"the Shandong Provincial Social Science Planning Project","award":["19BJCJ51"],"award-info":[{"award-number":["19BJCJ51"]}]},{"name":"the Shandong Provincial Social Science Planning Project","award":["18CXWJ01"],"award-info":[{"award-number":["18CXWJ01"]}]},{"name":"the Shandong Provincial Social Science Planning Project","award":["18BJYJ04"],"award-info":[{"award-number":["18BJYJ04"]}]},{"name":"the Shandong Provincial Social Science Planning Project","award":["16CFXJ18"],"award-info":[{"award-number":["16CFXJ18"]}]},{"name":"the Shandong Provincial Social Science Planning Project","award":["16CXWJ01"],"award-info":[{"award-number":["16CXWJ01"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Information"],"abstract":"<jats:p>Recently, outlier detection has widespread applications in different areas. The task is to identify outliers in the dataset and extract potential information. The existing outlier detection algorithms mainly do not solve the problems of parameter selection and high computational cost, which leaves enough room for further improvements. To solve the above problems, our paper proposes a parameter-free outlier detection algorithm based on dataset optimization method. Firstly, we propose a dataset optimization method (DOM), which initializes the original dataset in which density is greater than a specific threshold. In this method, we propose the concepts of partition function (P) and threshold function (T). Secondly, we establish a parameter-free outlier detection method. Similarly, we propose the concept of the number of residual neighbors, as the number of residual neighbors and the size of data clusters are used as the basis of outlier detection to obtain a more accurate outlier set. Finally, extensive experiments are carried out on a variety of datasets and experimental results show that our method performs well in terms of the efficiency of outlier detection and time complexity.<\/jats:p>","DOI":"10.3390\/info11010026","type":"journal-article","created":{"date-parts":[[2020,1,3]],"date-time":"2020-01-03T03:28:53Z","timestamp":1578022133000},"page":"26","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["A Parameter-Free Outlier Detection Algorithm Based on Dataset Optimization Method"],"prefix":"10.3390","volume":"11","author":[{"given":"Liying","family":"Wang","sequence":"first","affiliation":[{"name":"School of Information Science and Engineering, Shandong Normal University, Jinan 250358, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Lei","family":"Shi","sequence":"additional","affiliation":[{"name":"School of Information Science and Engineering, Shandong Normal University, Jinan 250358, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Liancheng","family":"Xu","sequence":"additional","affiliation":[{"name":"School of Information Science and Engineering, Shandong Normal University, Jinan 250358, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Peiyu","family":"Liu","sequence":"additional","affiliation":[{"name":"School of Information Science and Engineering, Shandong Normal University, Jinan 250358, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Lindong","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Information Science and Engineering, Shandong Normal University, Jinan 250358, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yanru","family":"Dong","sequence":"additional","affiliation":[{"name":"School of Information Science and Engineering, Shandong Normal University, Jinan 250358, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2019,12,31]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Prakobphol, K., and Zhan, J. (2008, January 24\u201326). A Novel Outlier Detection Scheme for Network Intrusion Detection Systems. Proceedings of the International Conference on Information Security Assurance, Washington, DC, USA.","DOI":"10.1109\/ISA.2008.26"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Mehnaz, S., and Bertino, E. (2017, January 22\u201324). Ghostbuster: A Fine-grained Approach for Anomalous Detection in File System Accesses. Proceedings of the ACM on Conference on Data and Application Security and Privacy, Scottsdale, AZ, USA.","DOI":"10.1145\/3029806.3029809"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Avdiienko, V., Kuznetsov, K., Rommelfanger, I., Rau, A., Gorla, A., and Zeller, A. (2017, January 20\u201328). Detecting behavior anomalies in graphical user interfaces. Proceedings of the International Conference on Software Engineering Companion, Buenos Aires, Argentina.","DOI":"10.1109\/ICSE-C.2017.130"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"347","DOI":"10.1002\/qre.1581","article-title":"Outlier Detection for Healthcare Quality Monitoring\u2014A Comparison of Four Approaches to Over Dispersed Proportions","volume":"30","author":"Vidmar","year":"2014","journal-title":"Qual. Reliab. Eng. Int."},{"key":"ref_5","unstructured":"Kumar, N., and Kumar, U. (2019, January 13\u201315). Anomaly-Based Network Intrusion Detection: An Outlier Detection Techniques. Proceedings of the International Conference on Soft Computing and Pattern Recognition, Hyderabad, India."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"406","DOI":"10.1016\/j.patcog.2017.09.037","article-title":"A comparative evaluation of outlier detection algorithms: Experiments and analyses","volume":"74","author":"Maurizio","year":"2018","journal-title":"Pattern Recognit."},{"key":"ref_7","first-page":"860","article-title":"Identification of Outliers","volume":"37","author":"Hawkins","year":"2018","journal-title":"Biometrics"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Aggarwal, C.C. (2017). An Introduction to Outlier Analysis. Outlier Analysis, Springer.","DOI":"10.1007\/978-3-319-47578-3"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1023\/B:AIRE.0000045502.10941.a9","article-title":"A Survey of Outlier Detection Methodologies","volume":"22","author":"Hodge","year":"2004","journal-title":"Artif. Intell. Rev."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1145\/1541880.1541882","article-title":"Anomaly Detection: A Survey","volume":"41","author":"Chandola","year":"2009","journal-title":"ACM Comput. Surv."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Wang, Y., Wang, X., and Wang, X.L. (2016). A Spectral Clustering Based Outlier Detection Technique. Machine Learning and Data Mining in Pattern Recognition, Springer International Publishing.","DOI":"10.1007\/978-3-319-41920-6_2"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Breuning, M.M., Kriegel, H.P., Ng, R.T., and Sander, J. (2000, January 15\u201318). LOF: Identifying density-based local outliers. Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, Dallas, TX, USA.","DOI":"10.1145\/342009.335388"},{"key":"ref_13","unstructured":"Kriegel, H.-P., Kr\u00f6ger, P., and Zimek, A. (2009, January 9). Outlier detection techniques. Proceedings of the 13th Pacific-Asia Conf. Knowl. Discovery Data Mining, Bangkok, Thailand."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1016\/j.knosys.2015.10.014","article-title":"A non-parameter outlier detection algorithm based on Natural Neighbor","volume":"92","author":"Huang","year":"2016","journal-title":"Knowl. Based Syst."},{"key":"ref_15","first-page":"1","article-title":"Outlier Detection for Robust Multi-dimensional Scaling","volume":"1","author":"Leonid","year":"2018","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.jal.2016.12.002","article-title":"A study on anomalous detection ensembles","volume":"21","author":"Chiang","year":"2017","journal-title":"J. Appl. Log."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Hawkins, D.M. (1980). Identification of Outliers, Chapman and Hall.","DOI":"10.1007\/978-94-015-3994-4"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Su, S., Xiao, L., and Zhang, Z. (2017, January 18\u201320). N2DLOF: A New Local Density-Based Outlier Detection Approach for Scattered Data. Proceedings of the IEEE International Conference on High Performance Computing & Communications, Bangkok, Thailand.","DOI":"10.1109\/HPCC-SmartCity-DSS.2017.60"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Wu, M. (2006, January 20\u201323). Outlier detection by sampling with accuracy guarantees. Proceedings of the Twelfth ACM Sigkdd International Conference on Knowledge Discovery & Data Mining, Philadelphia, PA, USA.","DOI":"10.1145\/1150402.1150501"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1016\/j.neucom.2017.02.039","article-title":"A local density-based approach for outlier detection","volume":"241","author":"Tang","year":"2017","journal-title":"Neurocomputing"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"88","DOI":"10.1016\/j.ins.2015.06.030","article-title":"A precise ranking method for outlier detection","volume":"324","author":"Ha","year":"2015","journal-title":"Inf. Sci."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"44707","DOI":"10.1109\/ACCESS.2018.2857834","article-title":"Unique Neighborhood Set Parameter Independent Density-Based Clustering with Outlier Detection","volume":"6","author":"Rahman","year":"2018","journal-title":"IEEE Access"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"30","DOI":"10.1016\/j.patrec.2016.05.007","article-title":"Natural Neighbor: A self-adaptive neighborhood method without parameter K","volume":"80","author":"Zhu","year":"2016","journal-title":"Pattern Recognit. Lett."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"589","DOI":"10.1109\/TKDE.2011.261","article-title":"Information-Theoretic Outlier Detection for Large-Scale Categorical Data","volume":"25","author":"Shu","year":"2013","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"1385","DOI":"10.1007\/s10586-016-0598-1","article-title":"Weighted natural neighborhood graph: An adaptive structure for clustering and outlier detection with no neighborhood parameter","volume":"19","author":"Zhu","year":"2016","journal-title":"Clust. Comput."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"106","DOI":"10.1109\/TPAMI.2017.2666151","article-title":"Generative Local Metric Learning for Nearest Neighbor Classification","volume":"40","author":"Noh","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"105","DOI":"10.1016\/j.neucom.2011.06.039","article-title":"Nearest-neighbor method using multiple neighborhood similarities for social media data mining","volume":"95","author":"Wang","year":"2012","journal-title":"Neurocomputing"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"101","DOI":"10.1142\/S0218195905001622","article-title":"Geometric proximity graphs for improving nearest neighbor methods in instance-based learning and data mining","volume":"15","author":"Toussaint","year":"2005","journal-title":"Int. J. Comput. Geom. Appl."},{"key":"ref_29","unstructured":"Ville, H., Ismo, K., and Pasi, F. (2004, January 26). Outlier detection using k-nearest neighbour graph. Proceedings of the International Conference on Pattern Recognition, Cambridge, UK."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1016\/S0167-7152(96)00213-1","article-title":"Connectivity of the mutual k-nearest-neighbor graph in clustering and outlier detection","volume":"35","author":"Brito","year":"1997","journal-title":"Stat. Probab. Lett."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"32","DOI":"10.1016\/j.knosys.2017.01.013","article-title":"A novel outlier cluster detection algorithm without top-n parameter","volume":"121","author":"Huang","year":"2017","journal-title":"Knowl. Based Syst."}],"container-title":["Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2078-2489\/11\/1\/26\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T13:47:05Z","timestamp":1760190425000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2078-2489\/11\/1\/26"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,12,31]]},"references-count":31,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2020,1]]}},"alternative-id":["info11010026"],"URL":"https:\/\/doi.org\/10.3390\/info11010026","relation":{},"ISSN":["2078-2489"],"issn-type":[{"value":"2078-2489","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,12,31]]}}}