{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,16]],"date-time":"2026-04-16T16:49:09Z","timestamp":1776358149049,"version":"3.51.2"},"reference-count":38,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2018,1,22]],"date-time":"2018-01-22T00:00:00Z","timestamp":1516579200000},"content-version":"unspecified","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Knowl Inf Syst"],"published-print":{"date-parts":[[2018,10]]},"DOI":"10.1007\/s10115-017-1145-y","type":"journal-article","created":{"date-parts":[[2018,1,22]],"date-time":"2018-01-22T04:31:03Z","timestamp":1516595463000},"page":"1-20","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":61,"title":["Distributed ReliefF-based feature selection in Spark"],"prefix":"10.1007","volume":"57","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-2550-7516","authenticated-orcid":false,"given":"Raul-Jose","family":"Palma-Mendoza","sequence":"first","affiliation":[]},{"given":"Daniel","family":"Rodriguez","sequence":"additional","affiliation":[]},{"given":"Luis","family":"de-Marcos","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2018,1,22]]},"reference":[{"key":"1145_CR1","unstructured":"Apache Software Foundation: Hadoop. https:\/\/hadoop.apache.org"},{"issue":"19","key":"1145_CR2","doi-asserted-by":"publisher","first-page":"2441","DOI":"10.1093\/bioinformatics\/bts472","volume":"28","author":"J Bacardit","year":"2012","unstructured":"Bacardit J, Widera P, M\u00e1rquez-chamorro A, Divina F, Aguilar-Ruiz JS, Krasnogor N (2012) Contact map prediction using a large-scale ensemble of rule sets and the fusion of multiple predicted structural features. Bioinformatics 28(19):2441\u20132448. https:\/\/doi.org\/10.1093\/bioinformatics\/bts472","journal-title":"Bioinformatics"},{"key":"1145_CR3","doi-asserted-by":"publisher","first-page":"694","DOI":"10.1038\/ncomms5308","volume":"5","author":"P Baldi","year":"2014","unstructured":"Baldi P, Sadowski P, Whiteson D, Neyman J, Pearson E, Hornik K, Stinchcombe M, White H, Hochreiter S, Bengio Y, Simard P, Frasconi P, Baldi P, Sadowski P, Hinton GE, Osindero S, Teh YW, Aad G, Aaltonen T, Alwall J, Sjostrand T, Cheng HC, Han Z, Barr A, Lester C, Stephens P, Hocker A, Aaltonen T (2014) Searching for exotic particles in high-energy physics with deep learning. Nat Commun 5:694\u2013706. https:\/\/doi.org\/10.1038\/ncomms5308","journal-title":"Nat Commun"},{"issue":"3","key":"1145_CR4","doi-asserted-by":"publisher","first-page":"483","DOI":"10.1007\/s10115-012-0487-8","volume":"34","author":"V Bol\u00f3n-Canedo","year":"2012","unstructured":"Bol\u00f3n-Canedo V, S\u00e1nchez-Maro\u00f1o N, Alonso-Betanzos A (2012) A review of feature selection methods on synthetic data. Knowl Inf Syst 34(3):483\u2013519. https:\/\/doi.org\/10.1007\/s10115-012-0487-8","journal-title":"Knowl Inf Syst"},{"key":"1145_CR5","doi-asserted-by":"publisher","first-page":"136","DOI":"10.1016\/j.asoc.2015.01.035","volume":"30","author":"V Bol\u00f3n-Canedo","year":"2015","unstructured":"Bol\u00f3n-Canedo V, S\u00e1nchez-Maro\u00f1o N, Alonso-Betanzos A (2015) Distributed feature selection: an application to microarray data classification. Appl Soft Comput 30:136\u2013150. https:\/\/doi.org\/10.1016\/j.asoc.2015.01.035","journal-title":"Appl Soft Comput"},{"issue":"1\u20132","key":"1145_CR6","doi-asserted-by":"publisher","first-page":"285","DOI":"10.14778\/1920841.1920881","volume":"3","author":"Y Bu","year":"2010","unstructured":"Bu Y, Howe B, Ernst MD (2010) HaLoop: efficient iterative data processing on large clusters. Proc VLDB Endow 3(1\u20132):285\u2013296. https:\/\/doi.org\/10.14778\/1920841.1920881","journal-title":"Proc VLDB Endow"},{"key":"1145_CR7","doi-asserted-by":"crossref","unstructured":"Dean J, Ghemawat S (2004) MapReduce: simplied data processing on large clusters. In: Proceedings of 6th symposium on operating systems design and implementation, pp 137\u2013149. https:\/\/doi.org\/10.1145\/1327452.1327492","DOI":"10.1145\/1327452.1327492"},{"key":"1145_CR8","doi-asserted-by":"crossref","unstructured":"Dean J, Ghemawat S (2008) MapReduce: simplified data processing on large clusters. Commun ACM 51(1):107. http:\/\/dl.acm.org\/citation.cfm?id=1327452.1327492","DOI":"10.1145\/1327452.1327492"},{"key":"1145_CR9","doi-asserted-by":"crossref","unstructured":"Ekanayake J, Li H, Zhang B, Gunarathne T, Bae SH, Qiu J, Fox G (2010) Twister: a runtime for iterative MapReduce. In: Proceedings of the 19th ACM international symposium on high performance distributed computing, HPDC \u201910, pp 810\u2013818. ACM, New York. https:\/\/doi.org\/10.1145\/1851476.1851593","DOI":"10.1145\/1851476.1851593"},{"key":"1145_CR10","doi-asserted-by":"crossref","unstructured":"Garc\u00eda S, Luengo J, Herrera F (2015) Feature selection. In: Data preprocessing in data mining, pp 163\u2013193. Springer International Publishing, Cham. https:\/\/doi.org\/10.1007\/978-3-319-10247-4_7","DOI":"10.1007\/978-3-319-10247-4_7"},{"issue":"1","key":"1145_CR11","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1186\/1756-0381-2-5","volume":"2","author":"CS Greene","year":"2009","unstructured":"Greene CS, Penrod NM, Kiralis J, Moore JH (2009) Spatially uniform ReliefF (SURF) for computationally-efficient filtering of gene\u2013gene interactions. BioData Min 2(1):5. https:\/\/doi.org\/10.1186\/1756-0381-2-5","journal-title":"BioData Min"},{"issue":"1","key":"1145_CR12","doi-asserted-by":"publisher","first-page":"10","DOI":"10.1145\/1656274.1656278","volume":"11","author":"M Hall","year":"2009","unstructured":"Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009) The WEKA data mining software. ACM SIGKDD Explor Newsl 11(1):10. https:\/\/doi.org\/10.1145\/1656274.1656278","journal-title":"ACM SIGKDD Explor Newsl"},{"issue":"5","key":"1145_CR13","doi-asserted-by":"publisher","first-page":"718","DOI":"10.1109\/69.634751","volume":"9","author":"SJ Hong","year":"1997","unstructured":"Hong SJ (1997) Use of contextual information for feature ranking and discretization. IEEE Trans Knowl Data Eng 9(5):718\u2013730. https:\/\/doi.org\/10.1109\/69.634751","journal-title":"IEEE Trans Knowl Data Eng"},{"issue":"11","key":"1145_CR14","doi-asserted-by":"publisher","first-page":"1348","DOI":"10.1016\/j.datak.2009.07.011","volume":"68","author":"Y Huang","year":"2009","unstructured":"Huang Y, McCullagh PJ, Black ND (2009) An optimization of ReliefF for classification in large datasets. Data Knowl Eng 68(11):1348\u20131356. https:\/\/doi.org\/10.1016\/j.datak.2009.07.011","journal-title":"Data Knowl Eng"},{"issue":"1","key":"1145_CR15","doi-asserted-by":"publisher","first-page":"95","DOI":"10.1007\/s10115-006-0040-8","volume":"12","author":"A Kalousis","year":"2006","unstructured":"Kalousis A, Prados J, Hilario M (2006) Stability of feature selection algorithms: a study on high-dimensional spaces. Knowl Inf Syst 12(1):95\u2013116. https:\/\/doi.org\/10.1007\/s10115-006-0040-8","journal-title":"Knowl Inf Syst"},{"key":"1145_CR16","doi-asserted-by":"crossref","unstructured":"Kira K, Rendell LA (1992) A practical approach to feature selection. In: Proceedings of the ninth international workshop on machine learning, pp 249\u2013256","DOI":"10.1016\/B978-1-55860-247-2.50037-1"},{"key":"1145_CR17","doi-asserted-by":"publisher","first-page":"171","DOI":"10.1007\/3-540-57868-4","volume":"784","author":"I Kononenko","year":"1994","unstructured":"Kononenko I (1994) Estimating attributes: analysis and extensions of RELIEF. Mach Learn ECML-94 784:171\u2013182. https:\/\/doi.org\/10.1007\/3-540-57868-4","journal-title":"Mach Learn ECML-94"},{"key":"1145_CR18","doi-asserted-by":"crossref","unstructured":"Kubica J, Singh S, Sorokina D (2011) Parallel large-scale feature selection. In: Scaling up machine learning, pp 352\u2013370. https:\/\/doi.org\/10.1017\/CBO9781139042918.018","DOI":"10.1017\/CBO9781139042918.018"},{"key":"1145_CR19","unstructured":"Kuncheva LI (2007) A stability index for feature selection. In: International multi-conference: artificial intelligence and applications, pp 390\u2013395."},{"key":"1145_CR20","doi-asserted-by":"crossref","unstructured":"Leskovec J, Rajaraman A, Ullman JD (2014) Mining massive datasets, 2nd edn. Cambridge University Press, Cambridge (2014). http:\/\/infolab.stanford.edu\/~ullman\/mmds\/book.pdf","DOI":"10.1017\/CBO9781139924801"},{"key":"1145_CR21","unstructured":"Li J, Cheng K, Wang S, Morstatter F, Trevino RP, Tang J, Liu H (2016) Feature selection: a data perspective. arXiv:1601.07996"},{"key":"1145_CR22","unstructured":"Lichman M (2013) UCI machine learning repository. http:\/\/archive.ics.uci.edu\/ml"},{"key":"1145_CR23","doi-asserted-by":"publisher","DOI":"10.1007\/s10766-016-0401-1","author":"Y Liu","year":"2016","unstructured":"Liu Y, Xu L, Li M (2016) The parallelization of back propagation neural network in mapreduce and spark. Int J Parallel Program. https:\/\/doi.org\/10.1007\/s10766-016-0401-1","journal-title":"Int J Parallel Program"},{"key":"1145_CR24","doi-asserted-by":"crossref","unstructured":"Ma J, Saul LK, Savage S, Voelker GM (2009) Identifying suspicious URLs: an application of large-scale online learning. In: Proceedings of the international conference on machine learning (ICML). Montreal, Quebec","DOI":"10.1145\/1553374.1553462"},{"key":"1145_CR25","unstructured":"Meng X, Bradley J, Yavuz B, Sparks E, Venkataraman S, Liu D, Freeman J, Tsai D, Amde M, Owen S, Xin D, Xin R, Franklin MJ, Zadeh R, Zaharia M, Talwalkar A (2015) MLlib: machine learning in apache spark. J Mach Learn 17:1\u20137. http:\/\/www.jmlr.org\/papers\/volume17\/15-237\/15-237.pdf"},{"key":"1145_CR26","doi-asserted-by":"crossref","unstructured":"Peng H, Long F, Ding C (2005) Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans Pattern Anal Mach Intell 27(8):1226\u201338. https:\/\/doi.org\/10.1109\/TPAMI.2005.159. http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/16119262","DOI":"10.1109\/TPAMI.2005.159"},{"key":"1145_CR27","doi-asserted-by":"crossref","unstructured":"Peralta D, del R\u00edo S, Ram\u00edrez-Gallego S, Riguero I, Benitez JM, Herrera F (2015) Evolutionary feature selection for big data classification: a mapreduce approach evolutinary feature selection for big data classification: a mapreduce approach. Math Probl Eng. https:\/\/doi.org\/10.1155\/2015\/246139. http:\/\/sci2s.ugr.es\/sites\/default\/files\/2015-hindawi-peralta.pdf","DOI":"10.1155\/2015\/246139"},{"key":"1145_CR28","doi-asserted-by":"publisher","DOI":"10.1002\/int.21833","author":"S Ram\u00edrez-Gallego","year":"2016","unstructured":"Ram\u00edrez-Gallego S, Lastra I, Mart\u00ednez-Rego D, Bol\u00f3n-Canedo V, Ben\u00edtez JM, Herrera F, Alonso-Betanzos A (2016) Fast-mRMR: fast minimum redundancy maximum relevance algorithm for high-dimensional big data. Int J Intell Syst. https:\/\/doi.org\/10.1002\/int.21833","journal-title":"Int J Intell Syst"},{"key":"1145_CR29","doi-asserted-by":"publisher","first-page":"168","DOI":"10.1016\/j.neucom.2015.02.045","volume":"161","author":"O Reyes","year":"2015","unstructured":"Reyes O, Morell C, Ventura S (2015) Scalable extensions of the ReliefF algorithm for weighting and selecting features on the multi-label learning context. Neurocomputing 161:168\u2013182. https:\/\/doi.org\/10.1016\/j.neucom.2015.02.045","journal-title":"Neurocomputing"},{"issue":"1\u20132","key":"1145_CR30","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1023\/A:1025667309714","volume":"53","author":"M Robnik-\u0160ikonja","year":"2003","unstructured":"Robnik-\u0160ikonja M, Kononenko I (2003) Theoretical and empirical analysis of ReliefF and RReliefF. Mach Learn 53(1\u20132):23\u201369","journal-title":"Mach Learn"},{"issue":"13","key":"1145_CR31","doi-asserted-by":"publisher","first-page":"2110","DOI":"10.14778\/2831360.2831365","volume":"8","author":"J Shi","year":"2015","unstructured":"Shi J, Qiu Y, Minhas UF, Jiao L, Wang C, Reinwald B, \u00d6zcan F (2015) Clash of the titans: mapreduce vs. spark for large scale data analytics. Proc VLDB Endow 8(13):2110\u20132121. https:\/\/doi.org\/10.14778\/2831360.2831365","journal-title":"Proc VLDB Endow"},{"key":"1145_CR32","doi-asserted-by":"crossref","unstructured":"Wang Y, Ke W, Tao X (2016) A feature selection method for large-scale network traffic classification based on spark. Information 7(1):6. https:\/\/doi.org\/10.3390\/info7010006. http:\/\/www.mdpi.com\/2078-2489\/7\/1\/6","DOI":"10.3390\/info7010006"},{"key":"1145_CR33","doi-asserted-by":"crossref","unstructured":"Xindong Wu X, Xingquan Zhu X, Gong-Qing Wu GQ, Wei Ding W (2014) Data mining with big data. IEEE Trans Knowl Data Eng 26(1):97\u2013107. https:\/\/doi.org\/10.1109\/TKDE.2013.109. http:\/\/ieeexplore.ieee.org\/lpdocs\/epic03\/wrapper.htm?arnumber=6547630","DOI":"10.1109\/TKDE.2013.109"},{"issue":"1","key":"1145_CR34","doi-asserted-by":"publisher","first-page":"210","DOI":"10.1016\/j.neucom.2011.03.052","volume":"75","author":"A Zafra","year":"2012","unstructured":"Zafra A, Pechenizkiy M, Ventura S (2012) ReliefF-MI: an extension of ReliefF to multiple instance learning. Neurocomputing 75(1):210\u2013218. https:\/\/doi.org\/10.1016\/j.neucom.2011.03.052","journal-title":"Neurocomputing"},{"key":"1145_CR35","unstructured":"Zaharia M, Chowdhury M, Das T, Dave A (2012) Resilient distributed datasets: a fault-tolerant abstraction for in-memory cluster computing. In: NSDI\u201912 proceedings of the 9th USENIX conference on networked systems design and implementation, pp 2\u20132. https:\/\/doi.org\/10.1111\/j.1095-8649.2005.00662.x. https:\/\/www.usenix.org\/system\/files\/conference\/nsdi12\/nsdi12-final138.pdf"},{"key":"1145_CR36","doi-asserted-by":"crossref","unstructured":"Zaharia M, Chowdhury M, Franklin MJ, Shenker S, Stoica I (2010) Spark: cluster computing with working sets. In: HotCloud\u201910 proceedings of the 2nd USENIX conference on hot topics in cloud computing, p\u00a010. https:\/\/doi.org\/10.1007\/s00256-009-0861-0","DOI":"10.1007\/s00256-009-0861-0"},{"issue":"Suppl 2","key":"1145_CR37","doi-asserted-by":"publisher","first-page":"S27","DOI":"10.1186\/1471-2164-9-S2-S27","volume":"9","author":"Y Zhang","year":"2008","unstructured":"Zhang Y, Ding C, Li T (2008) Gene selection algorithm by combining reliefF and mRMR. BMC Genomics 9(Suppl 2):S27. https:\/\/doi.org\/10.1186\/1471-2164-9-S2-S27","journal-title":"BMC Genomics"},{"issue":"PART 1","key":"1145_CR38","doi-asserted-by":"publisher","first-page":"237","DOI":"10.1007\/978-3-642-33460-3_21","volume":"7523 LNAI","author":"Z Zhao","year":"2012","unstructured":"Zhao Z, Cox J, Duling D, Sarle W (2012) Massively parallel feature selection: an approach based on variance preservation. Lect. Notes Comput Sci 7523 LNAI(PART 1):237\u2013252. https:\/\/doi.org\/10.1007\/978-3-642-33460-3_21 (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)","journal-title":"Lect. Notes Comput Sci"}],"container-title":["Knowledge and Information Systems"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s10115-017-1145-y\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10115-017-1145-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10115-017-1145-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,10,9]],"date-time":"2019-10-09T12:59:56Z","timestamp":1570625996000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s10115-017-1145-y"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,1,22]]},"references-count":38,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2018,10]]}},"alternative-id":["1145"],"URL":"https:\/\/doi.org\/10.1007\/s10115-017-1145-y","relation":{},"ISSN":["0219-1377","0219-3116"],"issn-type":[{"value":"0219-1377","type":"print"},{"value":"0219-3116","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,1,22]]}}}