{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,8]],"date-time":"2026-02-08T15:23:24Z","timestamp":1770564204797,"version":"3.49.0"},"reference-count":33,"publisher":"MDPI AG","issue":"3","license":[{"start":{"date-parts":[[2020,3,16]],"date-time":"2020-03-16T00:00:00Z","timestamp":1584316800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Government of the Russian Federation","award":["08-08"],"award-info":[{"award-number":["08-08"]}]},{"name":"ITMO Research and Development grant","award":["619421"],"award-info":[{"award-number":["619421"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Future Internet"],"abstract":"<jats:p>With the current trend of rapidly growing popularity of the Python programming language for machine learning applications, the gap between machine learning engineer needs and existing Python tools increases. Especially, it is noticeable for more classical machine learning fields, namely, feature selection, as the community attention in the last decade has mainly shifted to neural networks. This paper has two main purposes. First, we perform an overview of existing open-source Python and Python-compatible feature selection libraries, show their problems, if any, and demonstrate the gap between these libraries and the modern state of feature selection field. Then, we present new open-source scikit-learn compatible ITMO FS (Information Technologies, Mechanics and Optics University feature selection) library that is currently under development, explain how its architecture covers modern views on feature selection, and provide some code examples on how to use it with Python and its performance compared with other Python feature selection libraries.<\/jats:p>","DOI":"10.3390\/fi12030054","type":"journal-article","created":{"date-parts":[[2020,3,17]],"date-time":"2020-03-17T09:27:41Z","timestamp":1584437261000},"page":"54","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":38,"title":["Feature Selection Algorithms as One of the Python Data Analytical Tools"],"prefix":"10.3390","volume":"12","author":[{"given":"Nikita","family":"Pilnenskiy","sequence":"first","affiliation":[{"name":"Information Technologies and Programming Faculty, ITMO University, 197101 St. Petersburg, Russia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3376-9468","authenticated-orcid":false,"given":"Ivan","family":"Smetannikov","sequence":"additional","affiliation":[{"name":"Information Technologies and Programming Faculty, ITMO University, 197101 St. Petersburg, Russia"}]}],"member":"1968","published-online":{"date-parts":[[2020,3,16]]},"reference":[{"key":"ref_1","unstructured":"Hastie, T., Tibshirani, R., and Friedman, J. (2005). The Elements of Statistical Learning: Data Mining, Inference and Prediction, Springer."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"2507","DOI":"10.1093\/bioinformatics\/btm344","article-title":"A review of feature selection techniques in bioinformatics","volume":"23","author":"Saeys","year":"2007","journal-title":"Bioinformatics"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1016\/j.ymeth.2016.08.014","article-title":"Feature selection methods for big data bioinformatics: A survey from the search perspective","volume":"111","author":"Wang","year":"2016","journal-title":"Methods"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Li, J., Hu, X., Tang, J., and Liu, H. (2015, January 19\u201323). Unsupervised streaming feature selection in social media. Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, Melbourne, Australia.","DOI":"10.1145\/2806416.2806501"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Hira, Z.M., and Gillies, D.F. (2015). A review of feature selection and feature extraction methods applied on microarray data. Adv. Bioinform., 2015.","DOI":"10.1155\/2015\/198363"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"211","DOI":"10.6029\/smartcr.2014.03.007","article-title":"Feature selection: A literature review","volume":"4","author":"Kumar","year":"2014","journal-title":"SmartCR"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"483","DOI":"10.1007\/s10115-012-0487-8","article-title":"A review of feature selection methods on synthetic data","volume":"34","year":"2013","journal-title":"Knowl. Inf. Syst."},{"key":"ref_8","first-page":"1157","article-title":"An introduction to variable and feature selection","volume":"3","author":"Guyon","year":"2003","journal-title":"J. Mach. Learn. Res."},{"key":"ref_9","unstructured":"Raschka, S. (2017). Python Machine Learning, Packt Publishing Ltd."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Ketkar, N. (2017). Deep Learning with Python, Springer.","DOI":"10.1007\/978-1-4842-2766-4"},{"key":"ref_11","unstructured":"(2020, March 13). ITMO University Feature Selection Library. Available online: https:\/\/github.com\/ctlab\/ITMO_FS."},{"key":"ref_12","unstructured":"(2020, March 13). Arizona State University Feature Selection Library. Available online: https:\/\/github.com\/jundongl\/scikit-feature."},{"key":"ref_13","unstructured":"(2020, March 13). Scikit-Learn Feature Selection Module. Available online: https:\/\/scikit-learn.org\/stable\/modules\/feature_selection.html."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"2982","DOI":"10.1166\/asl.2016.7078","article-title":"MeLiF: filter ensemble learning algorithm for gene selection","volume":"22","author":"Smetannikov","year":"2016","journal-title":"Adv. Sci. Lett."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"111","DOI":"10.1016\/j.ins.2014.05.042","article-title":"A review of microarray datasets and applied feature selection methods","volume":"282","author":"Herrera","year":"2014","journal-title":"Inf. Sci."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Saeys, Y., Abeel, T., and Van de Peer, Y. (2008, January 15\u201319). Robust feature selection using ensemble feature selection techniques. Proceedings of the Joint European Conference on Machine Learning and Knowledge Discovery in Databases, Antwerp, Belgium.","DOI":"10.1007\/978-3-540-87481-2_21"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"531","DOI":"10.1016\/j.patcog.2011.06.006","article-title":"An ensemble of filters and classifiers for microarray data classification","volume":"45","year":"2012","journal-title":"Pattern Recognit."},{"key":"ref_18","first-page":"1","article-title":"Feature selection: A data perspective","volume":"50","author":"Li","year":"2017","journal-title":"ACM Comput. Surv."},{"key":"ref_19","unstructured":"(2020, March 13). Scikit-Feature Is an Open-Source Feature Selection Repository in Python Developed at Arizona State University. Available online: http:\/\/featureselection.asu.edu\/index.php."},{"key":"ref_20","unstructured":"(2020, March 13). Python Implementations of the Boruta All Relevant Feature Selection Method. Available online: https:\/\/github.com\/scikit-learn-contrib\/boruta_py."},{"key":"ref_21","unstructured":"(2020, March 13). General Features Selection Based on Certain Machine Learning Algorithm and Evaluation Methods. Available online: https:\/\/github.com\/duxuhao\/Feature-Selection."},{"key":"ref_22","unstructured":"(2020, March 13). General Feature Selection Based on Certain Machine Learning Algorithm and Evaluation Methods. Available online: https:\/\/pypi.org\/project\/MLFeatureSelection\/."},{"key":"ref_23","unstructured":"(2020, March 13). Code and Data Sets for Feature Engineering and Selection by Max Kuhn and Kjell Johnson (2019). Available online: https:\/\/github.com\/topepo\/FES."},{"key":"ref_24","unstructured":"(2020, March 13). Scikit-Learn-Compatible Python Implementation of ReBATE. Available online: https:\/\/github.com\/EpistasisLab\/scikit-rebate."},{"key":"ref_25","unstructured":"(2020, March 13). Python Machine Learning Library. Available online: https:\/\/github.com\/rasbt\/mlxtend."},{"key":"ref_26","unstructured":"(2020, March 13). Java Machine Learning Library. Available online: https:\/\/www.cs.waikato.ac.nz\/ml\/weka\/."},{"key":"ref_27","unstructured":"(2020, March 13). R Machine Learning Library. Available online: https:\/\/www.rdocumentation.org\/packages\/caret\/versions\/6.0-85."},{"key":"ref_28","unstructured":"(2020, March 13). R Machine Learning Library. Available online: https:\/\/mlr3.mlr-org.com\/."},{"key":"ref_29","unstructured":"(2020, March 13). Matlab Feature Selection. Available online: https:\/\/www.mathworks.com\/help\/stats\/feature-selection.html."},{"key":"ref_30","unstructured":"(2020, March 13). Iris Data Set. Available online: https:\/\/archive.ics.uci.edu\/ml\/datasets\/iris."},{"key":"ref_31","unstructured":"(2020, March 13). The Database of Faces. Available online: http:\/\/featureselection.asu.edu\/files\/datasets\/orlraws10P.mat."},{"key":"ref_32","unstructured":"(2020, March 13). The 20 Newsgroups Data Set. Available online: http:\/\/qwone.com\/~jason\/20Newsgroups\/."},{"key":"ref_33","unstructured":"(2020, March 13). Columbia University Image Library. Available online: http:\/\/www.cad.zju.edu.cn\/home\/dengcai\/Data\/MLData.html."}],"container-title":["Future Internet"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-5903\/12\/3\/54\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T09:07:22Z","timestamp":1760173642000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-5903\/12\/3\/54"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,3,16]]},"references-count":33,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2020,3]]}},"alternative-id":["fi12030054"],"URL":"https:\/\/doi.org\/10.3390\/fi12030054","relation":{},"ISSN":["1999-5903"],"issn-type":[{"value":"1999-5903","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,3,16]]}}}