{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,5]],"date-time":"2025-12-05T23:41:02Z","timestamp":1764978062731,"version":"3.46.0"},"reference-count":28,"publisher":"Walter de Gruyter GmbH","issue":"1","license":[{"start":{"date-parts":[[2019,7,17]],"date-time":"2019-07-17T00:00:00Z","timestamp":1563321600000},"content-version":"unspecified","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2019,12,18]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>The recent advancements in information technology and the web tend to increase the volume of data used in day-to-day life. The result is a big data era, which has become a key issue in research due to the complexity in the analysis of big data. This paper presents a technique called FPWhale-MRF for big data clustering using the MapReduce framework (MRF), by proposing two clustering algorithms. In FPWhale-MRF, the mapper function estimates the cluster centroids using the Fractional Tangential-Spherical Kernel clustering algorithm, which is developed by integrating the fractional theory into a Tangential-Spherical Kernel clustering approach. The reducer combines the mapper outputs to find the optimal centroids using the proposed Particle-Whale (P-Whale) algorithm, for the clustering. The P-Whale algorithm is proposed by combining Whale Optimization Algorithm with Particle Swarm Optimization, for effective clustering such that its performance is improved. Two datasets, namely localization and skin segmentation datasets, are used for the experimentation and the performance is evaluated regarding two performance evaluation metrics: clustering accuracy and DB-index. The maximum accuracy attained by the proposed FPWhale-MRF technique is 87.91% and 90% for the localization and skin segmentation datasets, respectively, thus proving its effectiveness in big data clustering.<\/jats:p>","DOI":"10.1515\/jisys-2018-0117","type":"journal-article","created":{"date-parts":[[2019,7,17]],"date-time":"2019-07-17T05:02:01Z","timestamp":1563339721000},"page":"1496-1513","source":"Crossref","is-referenced-by-count":8,"title":["Fractional Fuzzy Clustering and Particle Whale Optimization-Based MapReduce Framework for Big Data Clustering"],"prefix":"10.1515","volume":"29","author":[{"given":"Omkaresh","family":"Kulkarni","sequence":"first","affiliation":[{"name":"Research Scholar, Gandhi Institute of Technology and Management, GITAM University , Rudraram Mandal, Sangareddy district, Patancheru , Hyderabad, Telangana 502329 , India"}]},{"given":"Sudarson","family":"Jena","sequence":"additional","affiliation":[{"name":"Associate Professor, Department of Computer Science Engineering and Application , Sambalpur University Institute of Information Technology , Sambalpur, Orissa , India"}]},{"given":"C. H.","family":"Sanjay","sequence":"additional","affiliation":[{"name":"Distinguished Professor and Dean , GITAM University , Hyderabad , India"}]}],"member":"374","published-online":{"date-parts":[[2019,7,17]]},"reference":[{"key":"2025120523362758183_j_jisys-2018-0117_ref_001","doi-asserted-by":"crossref","unstructured":"N. Akthar, M. V. Ahamad and S. Khan, Clustering on big data using Hadoop MapReduce, in: 2015 International Conference on Computational Intelligence and Communication Networks (CICN), Jabalpur, pp. 789\u2013795, IEEE, Piscataway, NJ, USA, 2015.","DOI":"10.1109\/CICN.2015.161"},{"key":"2025120523362758183_j_jisys-2018-0117_ref_002","doi-asserted-by":"crossref","unstructured":"A. Banharnsakun, A MapReduce-based artificial bee colony for large-scale data clustering, Pattern Recognit. Lett. 93 (2017), 78\u201384.","DOI":"10.1016\/j.patrec.2016.07.027"},{"key":"2025120523362758183_j_jisys-2018-0117_ref_003","doi-asserted-by":"crossref","unstructured":"P. R. Bhaladhare and D. C. Jinwala, A clustering approach for the l-diversity model in privacy preserving data mining using fractional calculus-bacterial foraging optimization algorithm, Adv. Comput. Eng. 2014, (2014), 1\u201312.","DOI":"10.1155\/2014\/396529"},{"key":"2025120523362758183_j_jisys-2018-0117_ref_004","doi-asserted-by":"crossref","unstructured":"N. Bharill, A. Tiwari and A. Malviya, Fuzzy based scalable clustering algorithms for handling big data using Apache Spark, IEEE Trans. Big Data 2 (2016), 339\u2013352.","DOI":"10.1109\/TBDATA.2016.2622288"},{"key":"2025120523362758183_j_jisys-2018-0117_ref_005","doi-asserted-by":"crossref","unstructured":"C. Chen, K. Li, A. Ouyang and K. Li, A parallel approximate SS-ELM algorithm based on MapReduce for large-scale datasets, J. Parallel Distrib. Comput. 108 (2017), 85\u201394.","DOI":"10.1016\/j.jpdc.2017.01.007"},{"key":"2025120523362758183_j_jisys-2018-0117_ref_006","doi-asserted-by":"crossref","unstructured":"D. L. Davies and D. W. Bouldin, A cluster separation measure, IEEE Trans. Pattern Anal. Mach. Intell. 1 (1979), 224\u2013227.","DOI":"10.1109\/TPAMI.1979.4766909"},{"key":"2025120523362758183_j_jisys-2018-0117_ref_007","unstructured":"S. Fries, S. Wels and T. Seidl, Projected clustering for huge data sets in MapReduce, in: Proceedings of International Conference on Extending Database Technology (EDBT), Athens, Greece, pp. 49\u201360, OpenProceedings.org, Konstanz, Germany, 2014."},{"key":"2025120523362758183_j_jisys-2018-0117_ref_008","doi-asserted-by":"crossref","unstructured":"K. Grolinger, M. Hayes, W. A. Higashino, A. L\u2019Heureux, D. S. Allison and M. A. M. Capretz, Challenges for MapReduce in big data, in: 2014 IEEE World Congress on Services, Anchorage, AK, pp. 182\u2013189, IEEE, Piscataway, NJ, USA, 2014.","DOI":"10.1109\/SERVICES.2014.41"},{"key":"2025120523362758183_j_jisys-2018-0117_ref_009","unstructured":"N. Hans, S. Mahajan and S. N. Omkar, Big data clustering using genetic algorithm on Hadoop Mapreduce, Int. J. Sci. Technol. Res. 4 (2015), 58\u201362."},{"key":"2025120523362758183_j_jisys-2018-0117_ref_010","doi-asserted-by":"crossref","unstructured":"S. Kamal, S. H. Ripon, N. Dey, A. S. Ashour and V. Santhi, A MapReduce approach to diminish imbalance parameters for big deoxyribonucleic acid dataset, Comput. Methods Progr. Biomed. 131 (2016), 191\u2013206.","DOI":"10.1016\/j.cmpb.2016.04.005"},{"key":"2025120523362758183_j_jisys-2018-0117_ref_011","doi-asserted-by":"crossref","unstructured":"H. Ke, P. Li, S. Guo and M. Guo, On traffic-aware partition and aggregation in MapReduce for big data applications, IEEE Trans. Parallel Distrib. Syst. 27 (2016), 818\u2013828.","DOI":"10.1109\/TPDS.2015.2419671"},{"key":"2025120523362758183_j_jisys-2018-0117_ref_012","doi-asserted-by":"crossref","unstructured":"O. Kulkarni and S. Jena, MKS-MRF: a multiple kernel and a swarm-based MapReduce framework for big data clustering, Int. Rev. Comput. Softw. 11 (2016).","DOI":"10.15866\/irecos.v11i11.10117"},{"key":"2025120523362758183_j_jisys-2018-0117_ref_013","doi-asserted-by":"crossref","unstructured":"D. Kumar, J. C. Bezdek, M. Palaniswami, S. Rajasegarar, C. Leckie and T. C. Havens, A hybrid approach to clustering in big data, in: IEEE Trans. Cybernet. 46 (2016), 2372\u20132385.","DOI":"10.1109\/TCYB.2015.2477416"},{"key":"2025120523362758183_j_jisys-2018-0117_ref_014","doi-asserted-by":"crossref","unstructured":"O. Kurasova, V. Marcinkevicius, V. Medvedev, A. Rapecka and P. Stefanovic, Strategies for big data clustering, in: Proceedings of IEEE 26th International Conference on Tools with Artificial Intelligence, Limassol, pp. 740\u2013747, IEEE, Piscataway, NJ, USA, 2014.","DOI":"10.1109\/ICTAI.2014.115"},{"key":"2025120523362758183_j_jisys-2018-0117_ref_015","doi-asserted-by":"crossref","unstructured":"S. Mirjalili and A. Lewis, The whale optimization algorithm, Adv. Eng. Softw. 95 (2016), 51\u201367.","DOI":"10.1016\/j.advengsoft.2016.01.008"},{"key":"2025120523362758183_j_jisys-2018-0117_ref_016","doi-asserted-by":"crossref","unstructured":"F. Pulgar-Rubio, A. J. Rivera-Rivas, M. D. P\u00e9rez-Godoy, P. Gonz\u00e1lez, C. J. Carmona and M. J. del Jesus, MEFASD-BD: multi-objective evolutionary fuzzy algorithm for subgroup discovery in big data environments \u2013 a MapReduce solution, Knowl.-Based Syst. 117 (2017), 70\u201378.","DOI":"10.1016\/j.knosys.2016.08.021"},{"key":"2025120523362758183_j_jisys-2018-0117_ref_017","doi-asserted-by":"crossref","unstructured":"H. Rehioui, A. Idrissi, M. Abourezq and F. Zegrari, DENCLUE-IM: a new approach for big data clustering, Proc. Comput. Sci. 83 (2016), 560\u2013567.","DOI":"10.1016\/j.procs.2016.04.265"},{"key":"2025120523362758183_j_jisys-2018-0117_ref_018","doi-asserted-by":"crossref","unstructured":"A. S. Shirkhorshidi, S. Aghabozorgi, T. Y. Wah and T. Herawan, Big data clustering: a review, in: Proceedings of International Conference on Computational Science and Its Applications, ICCSA 2014, pp. 707\u2013720, 2014.","DOI":"10.1007\/978-3-319-09156-3_49"},{"key":"2025120523362758183_j_jisys-2018-0117_ref_019","doi-asserted-by":"crossref","unstructured":"H. Singh and S. Bawa, A MapReduce-based scalable discovery and indexing of structured big data, Future Gen. Comput. Syst. 73 (2017), 32\u201343.","DOI":"10.1016\/j.future.2017.03.028"},{"key":"2025120523362758183_j_jisys-2018-0117_ref_020","doi-asserted-by":"crossref","unstructured":"P. A. Traganitis, K. Slavakis and G. B. Giannakis, Sketch and validate for big data clustering,IEEE J. Select. Topics Signal Process. 9 (2015), 678\u2013690.","DOI":"10.1109\/JSTSP.2015.2396477"},{"key":"2025120523362758183_j_jisys-2018-0117_ref_021","unstructured":"UCI Machine Learning Repository, Localization data for person activity data set, Available at: https:\/\/archive.ics.uci.edu\/ml\/datasets\/Localization+Data+for+Person+Activity, Accessed 24 March, 2018."},{"key":"2025120523362758183_j_jisys-2018-0117_ref_022","unstructured":"UCI Machine Learning Repository, Skin segmentation data set, Available at: https:\/\/archive.ics.uci.edu\/ml\/datasets\/skin+segmentation, Accessed 24 March, 2018."},{"key":"2025120523362758183_j_jisys-2018-0117_ref_023","unstructured":"M. Vadivel and V. Raghunath, Enhancing Map-Reduce framework for big data with hierarchical clustering, Innov. Res. Comput. Commun. Eng. 2 (2014), 490\u2013498."},{"key":"2025120523362758183_j_jisys-2018-0117_ref_024","doi-asserted-by":"crossref","unstructured":"D. Xia, B. Wang, Y. Li, Z. Rong and Z. Zhang, An efficient MapReduce-based parallel clustering algorithm for distributed traffic subarea division, Discrete Dynam. Nat. Soc. 2015 (2015) Article ID 793010, 18 pp.","DOI":"10.1155\/2015\/793010"},{"key":"2025120523362758183_j_jisys-2018-0117_ref_025","doi-asserted-by":"crossref","unstructured":"Q. Yu and Z. Ding, An improved fuzzy C-means algorithm based on MapReduce, in: Proceedings of 8th International Conference on Biomedical Engineering and Informatics (BMEI), Shenyang, China; IEEE, Piscataway, NJ, USA, 2015.","DOI":"10.1109\/BMEI.2015.7401581"},{"key":"2025120523362758183_j_jisys-2018-0117_ref_026","doi-asserted-by":"crossref","unstructured":"M. Yuwono, S. W. Su, B. D. Moulton and H. T. Nguyen, Data clustering using variants of rapid centroid estimation, IEEE Trans. Evol. Comput. 18 (2014), 366\u2013377.","DOI":"10.1109\/TEVC.2013.2281545"},{"key":"2025120523362758183_j_jisys-2018-0117_ref_027","unstructured":"B. Zerhari, A. A. Lahcen and S. Mouline, Big data clustering: algorithms and challenges, in: Proceedings of International Conference on Big Data, Cloud and Applications BDCA\u201915, May 2015."},{"key":"2025120523362758183_j_jisys-2018-0117_ref_028","doi-asserted-by":"crossref","unstructured":"H. Zhu, Y. Guo, M. Niu, G. Yang and L. Jiao, Distributed SAR image change detection based on Spark, in: Proceedings of IEEE International Conference on Geoscience and Remote Sensing Symposium (IGARSS), Milan, Italy; IEEE, Piscataway, NJ, USA, 2015.","DOI":"10.1109\/IGARSS.2015.7326739"}],"container-title":["Journal of Intelligent Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.degruyter.com\/view\/journals\/jisys\/29\/1\/article-p1496.xml","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.degruyterbrill.com\/document\/doi\/10.1515\/jisys-2018-0117\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.degruyterbrill.com\/document\/doi\/10.1515\/jisys-2018-0117\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,12,5]],"date-time":"2025-12-05T23:37:54Z","timestamp":1764977874000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.degruyterbrill.com\/document\/doi\/10.1515\/jisys-2018-0117\/html"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,7,17]]},"references-count":28,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2018,4,25]]},"published-print":{"date-parts":[[2019,12,18]]}},"alternative-id":["10.1515\/jisys-2018-0117"],"URL":"https:\/\/doi.org\/10.1515\/jisys-2018-0117","relation":{},"ISSN":["2191-026X","0334-1860"],"issn-type":[{"type":"electronic","value":"2191-026X"},{"type":"print","value":"0334-1860"}],"subject":[],"published":{"date-parts":[[2019,7,17]]}}}