{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T00:45:43Z","timestamp":1759970743341,"version":"build-2065373602"},"reference-count":34,"publisher":"MDPI AG","issue":"2","license":[{"start":{"date-parts":[[2025,1,30]],"date-time":"2025-01-30T00:00:00Z","timestamp":1738195200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"State Key Laboratory of Geo-Information Engineering","award":["SKLGIE2024-M-4-2"],"award-info":[{"award-number":["SKLGIE2024-M-4-2"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IJGI"],"abstract":"<jats:p>Map symbols play a crucial role in cartographic representation. Among these symbols, icons are particularly valued for their vivid and intuitive designs, making them widely utilized in tourist maps. However, the diversity and complexity of these symbols present significant challenges to cartographic workflows. Icon design often relies on manual drawing, which is not only time-consuming but also heavily dependent on specialized skills. Automating the extraction of symbols from existing maps could greatly enhance the map symbol database, offering a valuable resource to support both symbol design and map production. Nevertheless, the intricate shapes and dense distribution of symbols in tourist maps complicate the accurate and efficient detection and extraction using existing methods. Previous studies have shown that You Only Look Once (YOLO) series models demonstrate strong performance in object detection, offering high accuracy and speed. However, these models are less effective in fine-grained boundary segmentation. To address this limitation, this article proposes integrating YOLO models with the Segment Anything Model (SAM) to tackle the challenges of combining efficient detection with precise segmentation. This article developed a dataset consisting of both paper-based and digital tourist maps, with annotations for five main categories of symbols: human landscapes, natural sceneries, humans, animals, and cultural elements. The performance of various YOLO model variants was systematically evaluated using this dataset. Additionally, a user interaction mechanism was incorporated to review and refine detection results, which were subsequently used as prompts for the SAM to perform precise symbol segmentation. The results indicate that the YOLOv8x model achieved excellent performance on the tourist map dataset, with an average detection accuracy of 94.4% across the five symbol categories, fully meeting the requirements for symbol detection tasks. The inclusion of a user interaction mechanism enhanced the reliability and flexibility of detection outcomes, while the integration of the SAM significantly improved the precision of symbol boundary extraction. In conclusion, the integration of YOLOv8x and SAM provides a robust and effective solution for automating the extraction of map symbols. This approach not only reduces the manual workload involved in dataset annotation, but also offers valuable theoretical and practical insights for enhancing cartographic efficiency.<\/jats:p>","DOI":"10.3390\/ijgi14020055","type":"journal-article","created":{"date-parts":[[2025,1,30]],"date-time":"2025-01-30T11:08:23Z","timestamp":1738235303000},"page":"55","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["Automated Icon Extraction from Tourism Maps: A Synergistic Approach Integrating YOLOv8x and SAM"],"prefix":"10.3390","volume":"14","author":[{"given":"Di","family":"Cao","sequence":"first","affiliation":[{"name":"School of Geoscience and Technology, Zhengzhou University, Zhengzhou 450001, China"},{"name":"State Key Laboratory of Geo-Information Engineering, Xi\u2019an 710000, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xinran","family":"Yan","sequence":"additional","affiliation":[{"name":"School of Geoscience and Technology, Zhengzhou University, Zhengzhou 450001, China"},{"name":"State Key Laboratory of Geo-Information Engineering, Xi\u2019an 710000, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jingjing","family":"Li","sequence":"additional","affiliation":[{"name":"School of Geoscience and Technology, Zhengzhou University, Zhengzhou 450001, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jiayao","family":"Li","sequence":"additional","affiliation":[{"name":"School of Geoscience and Technology, Zhengzhou University, Zhengzhou 450001, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lili","family":"Wu","sequence":"additional","affiliation":[{"name":"School of Geoscience and Technology, Zhengzhou University, Zhengzhou 450001, China"},{"name":"State Key Laboratory of Geo-Information Engineering, Xi\u2019an 710000, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2025,1,30]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"179","DOI":"10.1007\/s12525-015-0196-8","article-title":"Smart tourism: Foundations and developments","volume":"25","author":"Gretzel","year":"2015","journal-title":"Electron. Mark."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"18","DOI":"10.1177\/1473871612472177","article-title":"Automatic tourist attraction and representative icon determination for tourist map generation","volume":"13","author":"Lin","year":"2014","journal-title":"Inf. Vis."},{"key":"ref_3","first-page":"644","article-title":"Frame Design for Point-shaped Map Symbol Based on Eye Movement Experiment","volume":"33","author":"Liu","year":"2016","journal-title":"J. Geomat. Sci. Technol."},{"key":"ref_4","unstructured":"Airikka, M., and Masoodian, M. (September, January 28). Comparing the Effects of Illustration Styles on the Functionality of Tourist Maps. Proceedings of the 19th International-Federation-for-Information-Processing-Technical-Committee-13 (IFIP TC13) International Conference on Human-Computer Interaction (INTERACT), University of York, York, UK."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Airikka, M., and Masoodian, M. (2019, January 15\u201319). A Survey of the Visual Design of Cartographic and other Elements of Illustrated Tourist Maps. Proceedings of the 23rd International Conference in Information Visualization (IV)\/16th International Conference Computer Graphics, Imaging and Visualization (CGiV), Flinders University, Adelaide, Australia.","DOI":"10.1109\/IV-2.2019.00011"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Kovacevic, D., Brozovic, M., and Mozina, K. (2024). Comprehension of City Map Pictograms Designed for Specific Tourists\u2019 Needs. Isprs Int. J. Geo-Inf., 13.","DOI":"10.3390\/ijgi13040137"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Robinson, A.C., \u00c7\u00f6ltekin, A., Griffin, A.L., and Ledermann, F. (2023, January 13). Cartography in GeoAI: Emerging themes and research challenges. Proceedings of the 6th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery, Hamburg, Germany.","DOI":"10.1145\/3615886.3627734"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Li, J., Mouchere, H., and Viard-Gaudin, C. (2011, January 18\u201321). Symbol Knowledge Extraction from a Simple Graphical Language. Proceedings of the 11th International Conference on Document Analysis and Recognition (ICDAR), Beijing, China.","DOI":"10.1109\/ICDAR.2011.128"},{"key":"ref_9","first-page":"632","article-title":"Extraction of symbol line-features based on improved Hough transformation","volume":"11","author":"Chen","year":"2003","journal-title":"Opt. Precis. Eng."},{"key":"ref_10","unstructured":"Chang, A., Kyu Sik, K., Sang Burm, R., and Kye Young, L. (1997, January 20\u201322). A Road Extraction Method from Topographical Map Images. Proceedings of the IEEE Pacific Rim Conference on Communications, Computers and Signal Processing, Victoria, BC, Canada."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1007\/s10032-011-0177-1","article-title":"A general approach for extracting road vector data from raster maps","volume":"16","author":"Chiang","year":"2013","journal-title":"Int. J. Doc. Anal. Recognit. (IJDAR)"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Chiang, Y.Y., Duan, W., Leyk, S., Uhl, J.H., and Knoblock, C.A. (2020). Using Historical Maps in Scientific Studies, Springer.","DOI":"10.1007\/978-3-319-66908-3"},{"key":"ref_13","first-page":"1646","article-title":"Automatic Extraction Method of Point Symbols in Modern Hand-Drawn Maps for We-Map","volume":"26","author":"Yu","year":"2024","journal-title":"J. Geo-Inf. Sci."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"He, H., Yang, D., Wang, S., Wang, S., and Li, Y. (2019). Road Extraction by Using Atrous Spatial Pyramid Pooling Integrated Encoder-Decoder Network and Structural Similarity Loss. Remote Sens., 11.","DOI":"10.3390\/rs11091015"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"012046","DOI":"10.1088\/1755-1315\/1418\/1\/012046","article-title":"Building Footprint Extraction from Fixed-Wing UAV Imagery using Mask R-CNN and Object-based Image Analysis Methods (Case Study: Banturejo Village, Malang Regency)","volume":"1418","author":"Hidayat","year":"2024","journal-title":"IOP Conf. Ser. Earth Environ. Sci."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"111363","DOI":"10.1016\/j.ecolind.2023.111363","article-title":"Unleashing the power of old maps: Extracting symbology from nineteenth century maps using convolutional neural networks to quantify modern land use on historic wetlands","volume":"158","author":"Rob","year":"2024","journal-title":"Ecol. Indic."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"187","DOI":"10.1002\/arp.1807","article-title":"Potential of deep learning segmentation for the extraction of archaeological features from historical map series","volume":"28","author":"Orengo","year":"2021","journal-title":"Archaeol. Prospect."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"104943","DOI":"10.1016\/j.cageo.2021.104943","article-title":"Deep learning framework for geological symbol detection on geological maps","volume":"157","author":"Guo","year":"2021","journal-title":"Comput. Geosci."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"112","DOI":"10.1109\/MCG.2019.2943333","article-title":"Deep Stroke-Based Sketched Symbol Reconstruction and Segmentation","volume":"40","author":"Kurmanbek","year":"2020","journal-title":"IEEE Comput. Graph. Appl."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1016\/j.neunet.2020.05.025","article-title":"Deep learning for symbols detection and classification in engineering drawings","volume":"129","author":"Elyan","year":"2020","journal-title":"Neural Netw."},{"key":"ref_21","first-page":"1","article-title":"An ensemble of deep transfer learning models for handwritten music symbol recognition","volume":"34","author":"Paul","year":"2021","journal-title":"Neural Comput. Appl."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"2426589","DOI":"10.1080\/15481603.2024.2426589","article-title":"SEMPNet: Enhancing few-shot remote sensing image semantic segmentation through the integration of the segment anything model","volume":"61","author":"Ao","year":"2024","journal-title":"GIScience Remote Sens."},{"key":"ref_23","unstructured":"Jocher, G., Ayush, C., and Qiu, J. (2023). YOLO, Ultralytics."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Zou, N., Xu, Q., Wu, Y., Zhu, X., and Su, Y. (2023). An Automated Method for Generating Prefabs of AR Map Point Symbols Based on Object Detection Model. ISPRS Int. J. Geo-Inf., 12.","DOI":"10.3390\/ijgi12110440"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"20939","DOI":"10.1007\/s00521-023-08809-1","article-title":"An improved fire detection approach based on YOLO-v8 for smart cities","volume":"35","author":"Talaat","year":"2023","journal-title":"Neural Comput. Appl."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Wang, G., Chen, Y., An, P., Hong, H., Hu, J., and Huang, T. (2023). UAV-YOLOv8: A small-object-detection model based on improved YOLOv8 for UAV aerial photography scenarios. Sensors, 23.","DOI":"10.3390\/s23167190"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Kirillov, A., Mintun, E., Ravi, N., Mao, H., Rolland, C., Gustafson, L., Xiao, T., Whitehead, S., Berg, A.C., and Lo, W.-Y. (2023). Segment Anything. arXiv.","DOI":"10.1109\/ICCV51070.2023.00371"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"654","DOI":"10.1038\/s41467-024-44824-z","article-title":"Segment anything in medical images","volume":"15","author":"Ma","year":"2024","journal-title":"Nat. Commun."},{"key":"ref_29","unstructured":"Jocher, G., Chaurasia, A., Stoken, A., Borovec, J., Kwon, Y., Michael, K., Fang, J., Yifu, Z., Wong, C., and Montes, D. (2022). ultralytics\/yolov5: v7. 0-yolov5 sota realtime instance segmentation. Zenodo, 5281."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Wang, C.-Y., Bochkovskiy, A., and Liao, H.-Y.M. (2023, January 17\u201324). YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52729.2023.00721"},{"key":"ref_31","unstructured":"Wang, A., Chen, H., Liu, L., Chen, K., Lin, Z., Han, J., and Ding, G. (2024). Yolov10: Real-time end-to-end object detection. arXiv."},{"key":"ref_32","unstructured":"Ren, S. (2015). Faster r-cnn: Towards real-time object detection with region proposal networks. arXiv."},{"key":"ref_33","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.-Y., and Berg, A.C. (2016). Ssd: Single shot multibox detector. Proceedings of the Computer Vision\u2013ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, 11\u201314 October 2016, Springer. Proceedings, Part I 14."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"318","DOI":"10.1109\/TPAMI.2018.2858826","article-title":"Focal Loss for Dense Object Detection","volume":"42","author":"Priya","year":"2020","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."}],"container-title":["ISPRS International Journal of Geo-Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2220-9964\/14\/2\/55\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,8]],"date-time":"2025-10-08T10:39:03Z","timestamp":1759919943000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2220-9964\/14\/2\/55"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,1,30]]},"references-count":34,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2025,2]]}},"alternative-id":["ijgi14020055"],"URL":"https:\/\/doi.org\/10.3390\/ijgi14020055","relation":{},"ISSN":["2220-9964"],"issn-type":[{"type":"electronic","value":"2220-9964"}],"subject":[],"published":{"date-parts":[[2025,1,30]]}}}