{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,12]],"date-time":"2025-10-12T02:01:09Z","timestamp":1760234469086,"version":"build-2065373602"},"reference-count":57,"publisher":"MDPI AG","issue":"10","license":[{"start":{"date-parts":[[2021,5,18]],"date-time":"2021-05-18T00:00:00Z","timestamp":1621296000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"the National Key R&amp;D Program of China","award":["Grant No. 2019YFD1100803"],"award-info":[{"award-number":["Grant No. 2019YFD1100803"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>The volume of remote sensing images continues to grow as image sources become more diversified and with increasing spatial and spectral resolution. The handling of such large-volume datasets, which exceed available CPU memory, in a timely and efficient manner is becoming a challenge for single machines. The distributed cluster provides an effective solution with strong calculation power. There has been an increasing number of big data technologies that have been adopted to deal with large images using mature parallel technology. However, since most commercial big data platforms are not specifically developed for the remote sensing field, two main issues exist in processing large images with big data platforms using a distributed cluster. On the one hand, the quantities and categories of official algorithms used to process remote sensing images in big data platforms are limited compared to large amounts of sequential algorithms. On the other hand, the sequential algorithms employed directly to process large images in parallel over a distributed cluster may lead to incomplete objects in the tile edges and the generation of large communication volumes at the shuffle stage. It is, therefore, necessary to explore the distributed strategy and adapt the sequential algorithms over the distributed cluster. In this research, we employed two seed-based image segmentation algorithms to construct a distributed strategy based on the Spark platform. The proposed strategy focuses on modifying the incomplete objects by processing border areas and reducing the communication volume to a reasonable size by limiting the auxiliary bands and the buffer size to a small range during the shuffle stage. We calculated the F-measure and execution time to evaluate the accuracy and execution efficiency. The statistical data reveal that both segmentation algorithms maintained high accuracy, as achieved in the reference image segmented in the sequential way. Moreover, generally the strategy took less execution time compared to significantly larger auxiliary bands and buffer sizes. The proposed strategy can modify incomplete objects, with execution time being twice as fast as the strategies that do not employ communication volume reduction in the distributed cluster.<\/jats:p>","DOI":"10.3390\/rs13101969","type":"journal-article","created":{"date-parts":[[2021,5,18]],"date-time":"2021-05-18T12:17:16Z","timestamp":1621340236000},"page":"1969","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["A Strategy of Parallel Seed-Based Image Segmentation Algorithms for Handling Massive Image Tiles over the Spark Platform"],"prefix":"10.3390","volume":"13","author":[{"given":"Fang","family":"Chen","sequence":"first","affiliation":[{"name":"Key Laboratory of Digital Earth Science, Aerospace Information Research Institute, Chinese Academy of Sciences, No. 9 Dengzhuang South Road, Beijing 100094, China"},{"name":"University of Chinese Academy of Sciences, Beijing 100049, China"},{"name":"State Key Laboratory of Remote Sensing Science, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100101, China"},{"name":"Hainan Key Laboratory of Earth Observation, Aerospace Information Research Institute, Chinese Academy of Sciences, Sanya 572029, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ning","family":"Wang","sequence":"additional","affiliation":[{"name":"Key Laboratory of Digital Earth Science, Aerospace Information Research Institute, Chinese Academy of Sciences, No. 9 Dengzhuang South Road, Beijing 100094, China"},{"name":"University of Chinese Academy of Sciences, Beijing 100049, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Bo","family":"Yu","sequence":"additional","affiliation":[{"name":"Key Laboratory of Digital Earth Science, Aerospace Information Research Institute, Chinese Academy of Sciences, No. 9 Dengzhuang South Road, Beijing 100094, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yuchu","family":"Qin","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Remote Sensing Science, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100101, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7163-3644","authenticated-orcid":false,"given":"Lei","family":"Wang","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Remote Sensing Science, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100101, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2021,5,18]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"8077","DOI":"10.1109\/TGRS.2020.2987060","article-title":"High-Resolution Remote Sensing Image Scene Classification via Key Filter Bank Based on Convolutional Neural Network","volume":"58","author":"Li","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1016\/j.isprsjprs.2015.10.004","article-title":"Remote Sensing Platforms and Sensors: A Survey","volume":"115","author":"Toth","year":"2016","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Zhou, D.C., Xiao, J., Bonafoni, S., Berger, C., Deilami, K., Zhou, Y., Frolking, S., Yao, R., Qiao, Z., and Sobrino, J.A. (2019). Satellite Remote Sensing of Surface Urban Heat Islands: Progress, Challenges, and Perspectives. Remote Sens., 11.","DOI":"10.3390\/rs11010048"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"5294","DOI":"10.1109\/JSTARS.2016.2591519","article-title":"A Cloud Computing Strategy for Region-Growing Segmentation","volume":"9","author":"Happ","year":"2016","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"4002","DOI":"10.1109\/JSTARS.2017.2705718","article-title":"Extraction of Glacial Lake Outlines in Tibet Plateau Using Landsat 8 Imagery and Google Earth Engine","volume":"10","author":"Chen","year":"2017","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"952","DOI":"10.1109\/TGRS.2014.2330857","article-title":"Stable Mean-Shift Algorithm and Its Application to the Segmentation of Arbitrarily Large Remote Sensing Images","volume":"53","author":"Michel","year":"2015","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"112181","DOI":"10.1016\/j.rse.2020.112181","article-title":"Improving Landsat Multispectral Scanner (MSS) Geolocation by Least-Squares-Adjustment Based Time-Series Co-Registration","volume":"252","author":"Yan","year":"2021","journal-title":"Remote Sens. Environ."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"741","DOI":"10.5194\/essd-13-741-2021","article-title":"Annual 30 m dataset for glacial lakes in High Mountain Asia from 2008 to 2017","volume":"13","author":"Chen","year":"2021","journal-title":"Earth Syst. Sci. Data"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"104388","DOI":"10.1016\/j.cageo.2019.104388","article-title":"Landslide detection based on contour-based deep learning framework in case of national scale of Nepal in 2015","volume":"135","author":"Yu","year":"2020","journal-title":"Comput. Geosci."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"453","DOI":"10.1007\/s10346-017-0884-x","article-title":"A practical trial of landslide detection from single-temporal Landsat8 images using contour-based proposals and random forest: A case study of national Nepal","volume":"15","author":"Chen","year":"2018","journal-title":"Landslides"},{"key":"ref_11","unstructured":"(2021, April 20). Apache Hadoop. Available online: http:\/\/hadoop.apache.org\/."},{"key":"ref_12","unstructured":"(2021, April 20). Apache Spark. Available online: http:\/\/spark.apache.org\/."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1080\/20964471.2017.1405925","article-title":"Big data drives the development of Earth science","volume":"1","author":"Guo","year":"2017","journal-title":"Big Earth Data"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"8246","DOI":"10.1109\/TGRS.2020.2973363","article-title":"Nonlocal Graph Convolutional Networks for Hyperspectral Image Classification","volume":"58","author":"Mou","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"1923","DOI":"10.1109\/TIP.2018.2878958","article-title":"An Augmented Linear Mixing Model to Address Spectral Variability for Hyperspectral Unmixing","volume":"28","author":"Hong","year":"2019","journal-title":"IEEE Trans. Image Process."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1145\/2934664","article-title":"Apache Spark: A Unified Engine for Big Data Processing","volume":"59","author":"Zaharia","year":"2016","journal-title":"Commun. ACM"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Kertesz, G., Szenasi, S., and Vamossy, Z. (2015, January 3\u20135). Performance Measurement of a General Multi-Scale Template Matching Method. Proceedings of the 2015-IEEE 19th International Conference on Intelligent Engineering Systems, Bratislava, Slovakia.","DOI":"10.1109\/INES.2015.7329697"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1016\/j.isprsjprs.2020.02.012","article-title":"Segmentation of large-scale remotely sensed images on a Spark platform: A strategy for handling massive image tiles with the MapReduce model","volume":"162","author":"Wang","year":"2020","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"180","DOI":"10.1016\/j.isprsjprs.2013.09.014","article-title":"Geographic Object-Based Image Analysis\u2014Towards a new paradigm","volume":"87","author":"Blaschke","year":"2014","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"2","DOI":"10.1016\/j.isprsjprs.2009.06.004","article-title":"Object based image analysis for remote sensing","volume":"65","author":"Blaschke","year":"2010","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1016\/j.isprsjprs.2013.03.006","article-title":"Change detection from remotely sensed images: From pixel-based to object-based approaches","volume":"80","author":"Hussain","year":"2013","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Ventura, D., Bonifazi, A., Gravina, M.F., Belluscio, A., and Ardizzone, G. (2018). Mapping and Classification of Ecologically Sensitive Marine Habitats Using Unmanned Aerial Vehicle (UAV) Imagery and Object-Based Image Analysis (OBIA). Remote Sens., 10.","DOI":"10.3390\/rs10091331"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Pena, J.M., Torres-S\u00e1nchez, J., de Castro, A.I., Kelly, M., and L\u00f3pez-Granados, F. (2013). Weed Mapping in Early-Season Maize Fields Using Object-Based Analysis of Unmanned Aerial Vehicle (UAV) Images. PLoS ONE, 8.","DOI":"10.1371\/journal.pone.0077151"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"277","DOI":"10.1016\/j.isprsjprs.2017.06.001","article-title":"A review of supervised object-based land-cover image classification","volume":"130","author":"Ma","year":"2017","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"115","DOI":"10.1016\/j.isprsjprs.2019.02.009","article-title":"Segmentation for Object-Based Image Analysis (OBIA): A review of algorithms and challenges from remote sensing perspective","volume":"150","author":"Hossain","year":"2019","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"3252","DOI":"10.1109\/JSTARS.2018.2860989","article-title":"Semantic Segmentation for High Spatial Resolution Remote Sensing Images Based on Convolution Neural Network and Pyramid Pooling Module","volume":"11","author":"Yu","year":"2018","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"504","DOI":"10.1007\/978-3-319-02895-8_45","article-title":"The Divide and Segment Method for Parallel Image Segmentation","volume":"8192","author":"Koerting","year":"2013","journal-title":"Adv. Concepts Intell. Vis. Syst. Acivs."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Afshar, Y., and Sbalzarini, I.F. (2016). A Parallel Distributed-Memory Particle Method Enables Acquisition-Rate Segmentation of Large Fluorescence Microscopy Images. PLoS ONE, 11.","DOI":"10.1371\/journal.pone.0152528"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"413","DOI":"10.1007\/s11554-014-0464-4","article-title":"Accelerated hyperspectral image recursive hierarchical segmentation using GPUs, multicore CPUs, and hybrid CPU\/GPU cluster","volume":"14","author":"Hossam","year":"2014","journal-title":"J. Real-Time Image Process."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"5473","DOI":"10.1109\/TGRS.2015.2422848","article-title":"A Scalable Tile-Based Framework for Region-Merging Segmentation","volume":"53","author":"Lassalle","year":"2015","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"1376","DOI":"10.3390\/rs10091376","article-title":"RDCRMG: A Raster Dataset Clean & Reconstitution Multi-Grid Architecture for Remote Sensing Monitoring of Vegetation Dryness","volume":"10","author":"Ye","year":"2018","journal-title":"Remote Sens."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"2582","DOI":"10.1109\/TPDS.2018.2829724","article-title":"Parallel Computation of Component Trees on Distributed Memory Machines","volume":"29","author":"Gotz","year":"2018","journal-title":"IEEE Trans. Parallel Distrib. Syst."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Gu, H., Han, Y., Yang, Y., Li, H., Liu, Z., Soergel, U., Blaschke, T., and Cui, S. (2018). An Efficient Parallel Multi-Scale Segmentation Method for Remote Sensing Imagery. Remote Sens., 10.","DOI":"10.3390\/rs10040590"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"328","DOI":"10.1080\/17538947.2018.1432709","article-title":"Implementation of the parallel mean shift-based image segmentation algorithm on a GPU cluster","volume":"12","author":"Huang","year":"2018","journal-title":"Int. J. Digit. Earth"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"3664","DOI":"10.1109\/TIP.2021.3064223","article-title":"Distributed Connected Component Filtering and Analysis in 2D and 3D Tera-Scale Data Sets","volume":"30","author":"Gazagnes","year":"2021","journal-title":"IEEE Trans. Image Process."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"3073","DOI":"10.1109\/TGRS.2018.2880248","article-title":"Scaling Up SLIC Superpixels Using a Tile-Based Approach","volume":"57","author":"Derksen","year":"2019","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Lin, W., and Li, Y. (2020). Parallel Regional Segmentation Method of High-Resolution Remote Sensing Image Based on Minimum Spanning Tree. Remote Sens., 12.","DOI":"10.3390\/rs12050783"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Zhang, Z., Barbary, K., Nothaft, F.A., Sparks, E., Zahn, O., Franklin, M.J., Patterson, D.A., and Perlmutter, S. (November, January 29). Scientific Computing Meets Big Data Technology: An Astronomy Use Case. Proceedings of the 2015 IEEE International Conference on Big Data, Santa Clara, CA, USA.","DOI":"10.1109\/BigData.2015.7363840"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Tang, S., He, B., Yu, C., Li, Y., and Li, K. (2020). A Survey on Spark Ecosystem: Big Data Processing Infrastructure, Machine Learning, and Applications. IEEE Trans. Knowl. Data Eng., 1.","DOI":"10.1109\/TKDE.2020.2975652"},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"641","DOI":"10.1109\/34.295913","article-title":"Seeded Region Growing","volume":"16","author":"Adams","year":"1994","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1109\/34.87344","article-title":"Watersheds in Digital Spaces\u2014An Efficient Algorithm Based on Immersion Simulations","volume":"13","author":"Vincent","year":"1991","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_42","unstructured":"(2021, April 20). Copernicus Open Access Hub. Available online: https:\/\/scihub.copernicus.eu\/."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"36","DOI":"10.1016\/j.ndteint.2018.06.001","article-title":"Automatic seeded region growing for thermography debonding detection of CFRP","volume":"99","author":"Feng","year":"2018","journal-title":"NDT E Int."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Huang, Z.L., Wang, X., Wang, J., Liu, W., and Wang, J. (2018, January 18\u201323). Weakly-Supervised Semantic Segmentation Network with Deep Seeded Region Growing. Proceedings of the 2018 IEEE\/Cvf Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00733"},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1016\/j.postharvbio.2018.12.007","article-title":"Early detection of decay on apples using hyperspectral reflectance imaging combining both principal component analysis and improved watershed segmentation method","volume":"149","author":"Li","year":"2019","journal-title":"Postharvest Biol. Technol."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"104","DOI":"10.1016\/j.postharvbio.2017.09.007","article-title":"Detection of early bruises on peaches (Amygdalus persica L.) using hyperspectral imaging coupled with improved watershed segmentation algorithm","volume":"135","author":"Li","year":"2018","journal-title":"Postharvest Biol. Technol."},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Kornilov, A., and Safonov, I. (2018). An Overview of Watershed Algorithm Implementations in Open Source Libraries. J. Imaging, 4.","DOI":"10.3390\/jimaging4100123"},{"key":"ref_48","unstructured":"(2021, April 20). MathWorks\/rgb2gray. Available online: https:\/\/ww2.mathworks.cn\/help\/matlab\/ref\/rgb2gray.html."},{"key":"ref_49","unstructured":"(2021, April 20). Scikit-Image: Image Processing in Python. Available online: https:\/\/scikit-image.org\/."},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Sehrish, S., Kowalkowski, J., and Paterno, M. (June, January 29). Spark and HPC for High Energy Physics Data Analyses. Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), Orlando, FL, USA.","DOI":"10.1109\/IPDPSW.2017.112"},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"278","DOI":"10.1016\/j.ins.2017.11.064","article-title":"Mining maximal frequent patterns in transactional databases and dynamic data streams: A spark-based approach","volume":"432","author":"Karim","year":"2018","journal-title":"Inf. Sci."},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1007\/s10707-018-0330-9","article-title":"Spatial data management in apache spark: The GeoSpark perspective and beyond","volume":"23","author":"Yu","year":"2019","journal-title":"Geoinformatica"},{"key":"ref_53","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1016\/j.bdr.2017.05.001","article-title":"A Methodology for Spark Parameter Tuning","volume":"11","author":"Gounaris","year":"2018","journal-title":"Big Data Res."},{"key":"ref_54","first-page":"1218","article-title":"A parallel content-based image retrieval system using spark and tachyon frameworks","volume":"32","author":"Mezzoudj","year":"2020","journal-title":"J. King Saud Univ. Comput. Inf. Sci."},{"key":"ref_55","doi-asserted-by":"crossref","first-page":"73","DOI":"10.1016\/j.isprsjprs.2015.01.009","article-title":"Segmentation quality evaluation using region-based precision and recall measures for remote sensing images","volume":"102","author":"Zhang","year":"2015","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_56","doi-asserted-by":"crossref","first-page":"4062","DOI":"10.1109\/TGRS.2012.2187789","article-title":"A Scale-Synthesis Method for High Spatial Resolution Remote Sensing Image Segmentation","volume":"50","author":"Yi","year":"2012","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_57","doi-asserted-by":"crossref","first-page":"143","DOI":"10.1016\/j.cviu.2008.02.002","article-title":"Image segmentation algorithm development using ground truth image data sets","volume":"112","author":"Crevier","year":"2008","journal-title":"Comput. Vis. Image Underst."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/10\/1969\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T06:03:24Z","timestamp":1760162604000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/10\/1969"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,5,18]]},"references-count":57,"journal-issue":{"issue":"10","published-online":{"date-parts":[[2021,5]]}},"alternative-id":["rs13101969"],"URL":"https:\/\/doi.org\/10.3390\/rs13101969","relation":{},"ISSN":["2072-4292"],"issn-type":[{"type":"electronic","value":"2072-4292"}],"subject":[],"published":{"date-parts":[[2021,5,18]]}}}