{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,12]],"date-time":"2025-10-12T01:02:38Z","timestamp":1760230958275,"version":"build-2065373602"},"reference-count":46,"publisher":"MDPI AG","issue":"16","license":[{"start":{"date-parts":[[2022,8,22]],"date-time":"2022-08-22T00:00:00Z","timestamp":1661126400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"King Saud University Riyadh, Saudi Arabia","award":["RSP-2021\/18"],"award-info":[{"award-number":["RSP-2021\/18"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Advanced collaborative and communication technologies play a significant role in intelligent services and applications, including artificial intelligence, Internet of Things (IoT), remote sensing, robotics, future generation wireless, and aerial access networks. These technologies improve connectivity, energy efficiency, and quality of services of various smart city applications, particularly in transportation, monitoring, healthcare, public services, and surveillance. A large amount of data can be obtained by IoT systems and then examined by deep learning methods for various applications, e.g., object detection or recognition. However, it is a challenging and complex task in smart remote monitoring applications (aerial and drone). Nevertheless, it has gained special consideration in recent years and has performed a pivotal role in different control and monitoring applications. This article presents an IoT-enabled smart surveillance solution for multiple object detection through segmentation. In particular, we aim to provide the concept of collaborative drones, deep learning, and IoT for improving surveillance applications in smart cities. We present an artificial intelligence-based system using the deep learning based segmentation model PSPNet (Pyramid Scene Parsing Network) for segmenting multiple objects. We used an aerial drone data set, implemented data augmentation techniques, and leveraged deep transfer learning to boost the system\u2019s performance. We investigate and analyze the performance of the segmentation paradigm with different CNN (Convolution Neural Network) based architectures. The experimental results illustrate that data augmentation enhances the system\u2019s performance by producing good accuracy results of multiple object segmentation. The accuracy of the developed system is 92% with VGG-16 (Visual Geometry Group), 93% with ResNet-50 (Residual Neural Network), and 95% with MobileNet.<\/jats:p>","DOI":"10.3390\/rs14164107","type":"journal-article","created":{"date-parts":[[2022,8,22]],"date-time":"2022-08-22T05:18:17Z","timestamp":1661145497000},"page":"4107","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":24,"title":["IoT Enabled Deep Learning Based Framework for Multiple Object Detection in Remote Sensing Images"],"prefix":"10.3390","volume":"14","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7751-286X","authenticated-orcid":false,"given":"Imran","family":"Ahmed","sequence":"first","affiliation":[{"name":"School of Computing and Information Science, Anglia Ruskin University, Cambridge CB1 1PT, UK"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7013-0159","authenticated-orcid":false,"given":"Misbah","family":"Ahmad","sequence":"additional","affiliation":[{"name":"Center of Excellence in IT, Institute of Management Sciences, Peshawar 25000, Pakistan"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4193-6062","authenticated-orcid":false,"given":"Abdellah","family":"Chehri","sequence":"additional","affiliation":[{"name":"Department of Mathematics and Computer Science, Royal Military College of Canada, Kingston, ON K7K 7B4, Canada"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3479-3606","authenticated-orcid":false,"given":"Mohammad Mehedi","family":"Hassan","sequence":"additional","affiliation":[{"name":"Information Systems Department, College of Computer and Information Sciences, King Saud University, Riyadh 11543, Saudi Arabia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0651-4278","authenticated-orcid":false,"given":"Gwanggil","family":"Jeon","sequence":"additional","affiliation":[{"name":"Department of Embedded Systems Engineering, Incheon National University, Incheon 22012, Korea"}]}],"member":"1968","published-online":{"date-parts":[[2022,8,22]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Yang, C., Wong, D., Miao, Q., and Yang, R. (2010). Advanced GeoInformation Science, CRC Press.","DOI":"10.1201\/b10280"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"5737","DOI":"10.1109\/JIOT.2019.2951365","article-title":"Exploring deep learning models for overhead view multiple object detection","volume":"7","author":"Ahmed","year":"2019","journal-title":"IEEE Internet Things J."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"188","DOI":"10.1016\/j.comcom.2019.08.015","article-title":"Efficient topview person detector using point based transformation and lookup table","volume":"147","author":"Ahmed","year":"2019","journal-title":"Comput. Commun."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"1550147720934738","DOI":"10.1177\/1550147720934738","article-title":"Convolutional neural network\u2013based person tracking using overhead views","volume":"16","author":"Ahmad","year":"2020","journal-title":"Int. J. Distrib. Sens. Netw."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Ullah, K., Ahmed, I., Ahmad, M., Rahman, A.U., Nawaz, M., and Adnan, A. (2019). Rotation invariant person tracker using top view. J. Ambient. Intell. Humaniz. Comput., 1\u201317.","DOI":"10.1007\/s12652-019-01526-5"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"1745","DOI":"10.1007\/s11554-021-01166-z","article-title":"A real-time efficient object segmentation system based on U-Net using aerial drone images","volume":"18","author":"Ahmed","year":"2021","journal-title":"J. Real-Time Image Process."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"136361","DOI":"10.1109\/ACCESS.2020.3011406","article-title":"Comparison of deep-learning-based segmentation models: Using top view person images","volume":"8","author":"Ahmed","year":"2020","journal-title":"IEEE Access"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Pires de Lima, R., and Marfurt, K. (2020). Convolutional neural network for remote-sensing scene classification: Transfer learning analysis. Remote Sens., 12.","DOI":"10.3390\/rs12010086"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"102571","DOI":"10.1016\/j.scs.2020.102571","article-title":"A deep learning-based social distance monitoring framework for COVID-19","volume":"65","author":"Ahmed","year":"2021","journal-title":"Sustain. Cities Soc."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Ahmad, M., Ahmed, I., Ullah, K., Khan, I., Khattak, A., and Adnan, A. (2019). Person Detection from Overhead View: A Survey. Int. J. Adv. Comput. Sci. Appl., 10.","DOI":"10.14569\/IJACSA.2019.0100470"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"797","DOI":"10.1016\/j.procs.2015.09.027","article-title":"Survey on image segmentation techniques","volume":"65","author":"Zaitoun","year":"2015","journal-title":"Procedia Comput. Sci."},{"key":"ref_12","first-page":"3523","article-title":"Image segmentation using deep learning: A survey","volume":"44","author":"Minaee","year":"2021","journal-title":"IEee Trans. Pattern Anal. Mach. Intell."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"107489","DOI":"10.1016\/j.asoc.2021.107489","article-title":"Edge computing-based person detection system for top view surveillance: Using CenterNet with transfer learning","volume":"107","author":"Ahmed","year":"2021","journal-title":"Appl. Soft Comput."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Guzzo, A., Sacca, D., and Serra, E. (2009, January 6\u20139). An effective approach to inverse frequent set mining. Proceedings of the 2009 Ninth IEEE International Conference on Data Mining, Miami Beach, FL, USA.","DOI":"10.1109\/ICDM.2009.123"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"100190","DOI":"10.1016\/j.bdr.2021.100190","article-title":"A framework for pandemic prediction using big data analytics","volume":"25","author":"Ahmed","year":"2021","journal-title":"Big Data Res."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/2541268.2541271","article-title":"Solving inverse frequent itemset mining with infrequency constraints via large-scale linear programs","volume":"7","author":"Guzzo","year":"2013","journal-title":"ACM Trans. Knowl. Discov. Data (TKDD)"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"3053","DOI":"10.1007\/s13042-020-01220-5","article-title":"Top view multiple people tracking by detection using deep SORT and YOLOv3 with transfer learning: Within 5G infrastructure","volume":"12","author":"Ahmed","year":"2020","journal-title":"Int. J. Mach. Learn. Cybern."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"102908","DOI":"10.1016\/j.scs.2021.102908","article-title":"Adapting Gaussian YOLOv3 with transfer learning for overhead view human detection in smart cities and societies","volume":"70","author":"Ahmed","year":"2021","journal-title":"Sustain. Cities Soc."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"107226","DOI":"10.1016\/j.compeleceng.2021.107226","article-title":"IoT-based crowd monitoring system: Using SSD with transfer learning","volume":"93","author":"Ahmed","year":"2021","journal-title":"Comput. Electr. Eng."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Zhao, H., Shi, J., Qi, X., Wang, X., and Jia, J. (2017, January 21\u201326). Pyramid scene parsing network. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.660"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"1538","DOI":"10.1109\/JSTARS.2012.2199085","article-title":"Semi-automated road detection from high resolution satellite images by directional morphological enhancement and segmentation techniques","volume":"5","author":"Chaudhuri","year":"2012","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"4069","DOI":"10.1109\/JSTARS.2014.2308301","article-title":"Detection of buildings in multispectral very high spatial resolution images using the percentage occupancy hit-or-miss transform","volume":"7","author":"Stankov","year":"2014","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"86","DOI":"10.1109\/LGRS.2012.2193552","article-title":"Building detection in very high spatial resolution multispectral images using the hit-or-miss transform","volume":"10","author":"Stankov","year":"2012","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"140","DOI":"10.1016\/j.isprsjprs.2015.01.013","article-title":"Water flow based geometric active deformable model for road network","volume":"102","author":"Leninisha","year":"2015","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1016\/j.isprsjprs.2013.11.018","article-title":"Automated parameterisation for multi-scale image segmentation on multiple layers","volume":"88","author":"Csillik","year":"2014","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"28","DOI":"10.1016\/j.isprsjprs.2015.04.010","article-title":"Scale parameter selection by spatial statistics for GeOBIA: Using mean-shift based multi-scale segmentation as an example","volume":"106","author":"Ming","year":"2015","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"9705","DOI":"10.3390\/rs70809705","article-title":"Identification of forested landslides using LiDar data, object-based image analysis, and machine learning algorithms","volume":"7","author":"Li","year":"2015","journal-title":"Remote Sens."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"115","DOI":"10.1080\/15230406.2015.1029520","article-title":"Monitoring recovery after earthquakes through the integration of remote sensing, GIS, and ground observations: The case of L\u2019Aquila (Italy)","volume":"43","author":"Contreras","year":"2016","journal-title":"Cartogr. Geogr. Inf. Sci."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"6627","DOI":"10.1109\/TGRS.2014.2299540","article-title":"Detection of compound structures using a Gaussian mixture model with spectral and spatial constraints","volume":"52","author":"Aksoy","year":"2014","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1016\/j.isprsjprs.2015.02.006","article-title":"Multilayer Markov random field models for change detection in optical remote sensing images","volume":"107","author":"Benedek","year":"2015","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"1830","DOI":"10.1109\/JSTARS.2015.2416255","article-title":"Target detection based on random forest metric learning","volume":"8","author":"Dong","year":"2015","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"3144","DOI":"10.1080\/01431161.2015.1054049","article-title":"Road network extraction: A neural-dynamic framework based on deep learning and a finite state machine","volume":"36","author":"Wang","year":"2015","journal-title":"Int. J. Remote Sens."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"129","DOI":"10.1109\/MNET.011.2000643","article-title":"AI-enabled Object Detection in UAVs: Challenges, Design Choices, and Research Directions","volume":"35","author":"Jain","year":"2021","journal-title":"IEEE Netw."},{"key":"ref_34","unstructured":"Audebert, N., Le Saux, B., and Lef\u00e8vre, S. Semantic segmentation of earth observation data using multimodal and multi-scale deep networks. Proceedings of the Asian Conference on Computer Vision."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3418205","article-title":"Isdnet: Ai-enabled instance segmentation of aerial scenes for smart cities","volume":"21","author":"Garg","year":"2020","journal-title":"ACM Trans. Internet Technol."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"3355","DOI":"10.1080\/10106049.2020.1856199","article-title":"An ensemble architecture of deep convolutional Segnet and Unet networks for building semantic segmentation from high-resolution aerial images","volume":"37","author":"Abdollahi","year":"2022","journal-title":"Geocarto Int."},{"key":"ref_37","unstructured":"Marcu, A., Costea, D., Licaret, V., and Leordeanu, M. (2019). Towards automatic annotation for semantic segmentation in drone videos. arXiv."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1109\/MGRS.2016.2641240","article-title":"Remote Sensing Image Classification: A survey of support-vector-machine-based advanced techniques","volume":"5","author":"Maulik","year":"2017","journal-title":"IEEE Geosci. Remote Sens. Mag."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"232","DOI":"10.1080\/20964471.2019.1657720","article-title":"A survey of remote sensing image classification based on CNNs","volume":"3","author":"Song","year":"2019","journal-title":"Big Earth Data"},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"296","DOI":"10.1016\/j.isprsjprs.2019.11.023","article-title":"Object detection in optical remote sensing images: A survey and a new benchmark","volume":"159","author":"Li","year":"2020","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015, January 5\u20139). U-net: Convolutional networks for biomedical image segmentation. Proceedings of the International Conference on Medical Image Computing and computer-Assisted Intervention, Munich, Germany.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_42","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_44","unstructured":"Howard, A.G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., and Adam, H. (2017). Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv."},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Zheng, S., Jayasumana, S., Romera-Paredes, B., Vineet, V., Su, Z., Du, D., Huang, C., and Torr, P.H. (2015, January 7\u201313). Conditional random fields as recurrent neural networks. Proceedings of the IEEE International Conference on Computer Vision, Washington, DC, USA.","DOI":"10.1109\/ICCV.2015.179"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"He, C., Fang, P., Zhang, Z., Xiong, D., and Liao, M. (2019). An end-to-end conditional random fields and skip-connected generative adversarial segmentation network for remote sensing images. Remote Sens., 11.","DOI":"10.3390\/rs11131604"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/16\/4107\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T00:13:22Z","timestamp":1760141602000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/16\/4107"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,8,22]]},"references-count":46,"journal-issue":{"issue":"16","published-online":{"date-parts":[[2022,8]]}},"alternative-id":["rs14164107"],"URL":"https:\/\/doi.org\/10.3390\/rs14164107","relation":{},"ISSN":["2072-4292"],"issn-type":[{"type":"electronic","value":"2072-4292"}],"subject":[],"published":{"date-parts":[[2022,8,22]]}}}