{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,30]],"date-time":"2025-07-30T14:56:11Z","timestamp":1753887371016,"version":"3.41.2"},"reference-count":22,"publisher":"Wiley","issue":"1","license":[{"start":{"date-parts":[[2021,7,30]],"date-time":"2021-07-30T00:00:00Z","timestamp":1627603200000},"content-version":"vor","delay-in-days":210,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["onlinelibrary.wiley.com"],"crossmark-restriction":true},"short-container-title":["Computational Intelligence and Neuroscience"],"published-print":{"date-parts":[[2021,1]]},"abstract":"<jats:p>Sensors, satellites, mobile devices, social media, e\u2010commerce, and the Internet, among others, saturate us with data. The Internet of Things, in particular, enables massive amounts of data to be generated more quickly. The Internet of Things is a term that describes the process of connecting computers, smart devices, and other data\u2010generating equipment to a network and transmitting data. As a result, data is produced and updated on a regular basis to reflect changes in all areas and activities. As a consequence of this exponential growth of data, a new term and idea known as big data have been coined. Big data is required to illuminate the relationships between things, forecast future trends, and provide more information to decision\u2010makers. The major problem at present, however, is how to effectively collect and evaluate massive amounts of diverse and complicated data. In some sectors or applications, machine learning models are the most frequently utilized methods for interpreting and analyzing data and obtaining important information. On their own, traditional machine learning methods are unable to successfully handle large data problems. This article gives an introduction to Spark architecture as a platform that machine learning methods may utilize to address issues regarding the design and execution of large data systems. This article focuses on three machine learning types, including regression, classification, and clustering, and how they can be applied on top of the Spark platform.<\/jats:p>","DOI":"10.1155\/2021\/1896953","type":"journal-article","created":{"date-parts":[[2021,7,30]],"date-time":"2021-07-30T17:35:50Z","timestamp":1627666550000},"update-policy":"https:\/\/doi.org\/10.1002\/crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["Usages of Spark Framework with Different Machine Learning Algorithms"],"prefix":"10.1155","volume":"2021","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5545-9757","authenticated-orcid":false,"given":"Mohamed","family":"Ali Mohamed","sequence":"first","affiliation":[]},{"given":"Ibrahim Mahmoud","family":"El-henawy","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3433-7640","authenticated-orcid":false,"given":"Ahmad","family":"Salah","sequence":"additional","affiliation":[]}],"member":"311","published-online":{"date-parts":[[2021,7,30]]},"reference":[{"key":"e_1_2_8_1_2","unstructured":"https:\/\/www.statista.com\/statistics\/871513\/worldwide-data-created."},{"key":"e_1_2_8_2_2","unstructured":"https:\/\/www.idc.com\/getdoc.jsp?containerId=prUS47560321."},{"key":"e_1_2_8_3_2","doi-asserted-by":"crossref","unstructured":"SagirogluS.andSinancD. Big data: a review Proceedings of the 2013 International Conference on Collaboration Technologies and Systems (CTS) May 2013 San Diego CA USA IEEE.","DOI":"10.1109\/CTS.2013.6567202"},{"key":"e_1_2_8_4_2","first-page":"S117","article-title":"Deep transfer learning with Apache Spark to detect COVID-19 in chest x-ray images","volume":"23","author":"Benbrahim H.","year":"2020","journal-title":"Romanian Journal of Information Science and Technology"},{"key":"e_1_2_8_5_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10586-019-02998-y"},{"key":"e_1_2_8_6_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-18305-3_1"},{"key":"e_1_2_8_7_2","doi-asserted-by":"publisher","DOI":"10.4236\/jilsa.2017.91001"},{"key":"e_1_2_8_8_2","doi-asserted-by":"publisher","DOI":"10.20894\/ijdmta.102.004.001.011"},{"key":"e_1_2_8_9_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neunet.2015.05.005"},{"key":"e_1_2_8_10_2","doi-asserted-by":"crossref","unstructured":"ShanthamalluU. S. SpaniasA. TepedelenliogluC. andStanleyM. A brief survey of machine learning methods and their sensor and IoT applications Proceedings of the 2017 8th International Conference on Information Intelligence Systems & Applications (IISA) August 2017 Larnaca Cyprus IEEE.","DOI":"10.1109\/IISA.2017.8316459"},{"key":"e_1_2_8_11_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-981-32-9949-8_16"},{"key":"e_1_2_8_12_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.cageo.2017.10.011"},{"key":"e_1_2_8_13_2","article-title":"Performance evaluation of linear regression algorithm in cluster environment","volume":"18","author":"Paramita C.","year":"2020","journal-title":"International Journal of Computer Science and Information Security (IJCSIS)"},{"key":"e_1_2_8_14_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jpdc.2020.10.010"},{"key":"e_1_2_8_15_2","doi-asserted-by":"publisher","DOI":"10.3233\/ica-180580"},{"key":"e_1_2_8_16_2","doi-asserted-by":"publisher","DOI":"10.7287\/peerj-cs.345v0.1\/reviews\/1"},{"key":"e_1_2_8_17_2","doi-asserted-by":"crossref","unstructured":"Siva reddyS. V.andSaravananS. Performance evaluation of classification algorithms in the design of Apache Spark based intrusion detection system Proceedings of the 2020 5th International Conference on Communication and Electronics Systems (ICCES) June 2020 Coimbatore India IEEE 443\u2013447 https:\/\/doi.org\/10.1109\/ICCES48766.2020.9138066.","DOI":"10.1109\/ICCES48766.2020.9138066"},{"key":"e_1_2_8_18_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-981-15-1420-3_79"},{"key":"e_1_2_8_19_2","doi-asserted-by":"crossref","unstructured":"AswadS. A.andSonucE. Classification of VPN network traffic flow using time related features on Apache Spark Proceedings of the 2020 4th International Symposium on Multidisciplinary Studies and Innovative Technologies (ISMSIT) October 2020 Istanbul Turkey IEEE 1\u20138 https:\/\/doi.org\/10.1109\/ISMSIT50672.2020.9254893.","DOI":"10.1109\/ISMSIT50672.2020.9254893"},{"key":"e_1_2_8_20_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11227-017-2040-8"},{"key":"e_1_2_8_21_2","doi-asserted-by":"crossref","unstructured":"BharillN. TiwariA. andMalviyaA. Fuzzy based clustering algorithms to handle big data with implementation on Apache Spark Proceedings of the 2016 IEEE Second International Conference on Big Data Computing Service and Applications (BigDataService) March 2016 Oxford UK IEEE 95\u2013104 https:\/\/doi.org\/10.1109\/BigDataService.2016.34 2-s2.0-84973650084.","DOI":"10.1109\/BigDataService.2016.34"},{"key":"e_1_2_8_22_2","doi-asserted-by":"publisher","DOI":"10.1109\/tase.2019.2910508"}],"container-title":["Computational Intelligence and Neuroscience"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/downloads.hindawi.com\/journals\/cin\/2021\/1896953.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/downloads.hindawi.com\/journals\/cin\/2021\/1896953.xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1155\/2021\/1896953","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,6]],"date-time":"2024-08-06T12:16:11Z","timestamp":1722946571000},"score":1,"resource":{"primary":{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/10.1155\/2021\/1896953"}},"subtitle":[],"editor":[{"given":"Ahmed Mostafa","family":"Khalil","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2021,1]]},"references-count":22,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,1]]}},"alternative-id":["10.1155\/2021\/1896953"],"URL":"https:\/\/doi.org\/10.1155\/2021\/1896953","archive":["Portico"],"relation":{},"ISSN":["1687-5265","1687-5273"],"issn-type":[{"type":"print","value":"1687-5265"},{"type":"electronic","value":"1687-5273"}],"subject":[],"published":{"date-parts":[[2021,1]]},"assertion":[{"value":"2021-07-07","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-07-24","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-07-30","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}],"article-number":"1896953"}}