{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,28]],"date-time":"2026-04-28T15:05:41Z","timestamp":1777388741216,"version":"3.51.4"},"reference-count":36,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2025,1,10]],"date-time":"2025-01-10T00:00:00Z","timestamp":1736467200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Neurorobot."],"abstract":"<jats:p>With the rapid development of tourism, the concentration of visitor flows poses significant challenges for public safety management, especially in low-light and highly occluded environments, where existing pedestrian detection technologies often struggle to achieve satisfactory accuracy. Although infrared images perform well under low-light conditions, they lack color and detail, making them susceptible to background noise interference, particularly in complex outdoor environments where the similarity between heat sources and pedestrian features further reduces detection accuracy. To address these issues, this paper proposes the FusionU10 model, which combines information from both infrared and visible light images. The model first incorporates an Attention Gate mechanism (AGUNet) into an improved UNet architecture to focus on key features and generate pseudo-color images, followed by pedestrian detection using YOLOv10. During the prediction phase, the model optimizes the loss function with Complete Intersection over Union (CIoU), objectness loss (obj loss), and classification loss (cls loss), thereby enhancing the performance of the detection network and improving the quality and feature extraction capabilities of the pseudo-color images through a feedback mechanism. Experimental results demonstrate that FusionU10 significantly improves detection accuracy and robustness in complex scenes on the FLIR, M3FD, and LLVIP datasets, showing great potential for application in challenging environments.<\/jats:p>","DOI":"10.3389\/fnbot.2024.1504070","type":"journal-article","created":{"date-parts":[[2025,1,10]],"date-time":"2025-01-10T06:12:57Z","timestamp":1736489577000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":4,"title":["FusionU10: enhancing pedestrian detection in low-light complex tourist scenes through multimodal fusion"],"prefix":"10.3389","volume":"18","author":[{"given":"Xuefan","family":"Zhou","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jiapeng","family":"Li","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yingzheng","family":"Li","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1965","published-online":{"date-parts":[[2025,1,10]]},"reference":[{"key":"B1","first-page":"5349","article-title":"\u201cReal-time multi-class helmet violation detection using few-shot data sampling technique and yolov8,\u201d","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Aboah","year":"2023"},{"key":"B2","doi-asserted-by":"publisher","first-page":"10076","DOI":"10.1109\/TPAMI.2024.3435571","article-title":"Medical image segmentation review: The success of U-Net","volume":"46","author":"Azad","year":"2024","journal-title":"IEEE Trans. Pattern Anal Mach. Intell"},{"key":"B3","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/DSPA60853.2024.10510106","article-title":"\u201cTransforming aircraft detection through leo satellite imagery and yolov9 for improved aviation safety,\u201d","volume-title":"2024 26th International Conference on Digital Signal Processing and its Applications (DSPA)","author":"Bakirci","year":"2024"},{"key":"B4","doi-asserted-by":"publisher","first-page":"2934","DOI":"10.3390\/s23062934","article-title":"Dual-yolo architecture from infrared and visible images for object detection","volume":"23","author":"Bao","year":"2023","journal-title":"Sensors"},{"key":"B5","doi-asserted-by":"publisher","first-page":"103255","DOI":"10.1016\/j.annals.2021.103255","article-title":"Tourism demand forecasting with time series imaging: a deep learning model","volume":"90","author":"Bi","year":"2021","journal-title":"Ann. Tour. Res"},{"key":"B6","doi-asserted-by":"publisher","first-page":"104129","DOI":"10.1016\/j.tourman.2020.104129","article-title":"Using deep learning and visual analytics to explore hotel reviews and responses","volume":"80","author":"Chang","year":"2020","journal-title":"Tour. Managem"},{"key":"B7","doi-asserted-by":"publisher","DOI":"10.1109\/TFUZZ.2024.3427654","article-title":"Dual guidance enabled fuzzy inference for enhanced fine-grained recognition","author":"Chen","year":"2024","journal-title":"IEEE Trans. Fuzzy Syst. 2024"},{"key":"B8","doi-asserted-by":"crossref","first-page":"1111","DOI":"10.1109\/ICCASIT55263.2022.9986726","article-title":"\u201cTraffic lights detection method based on the improved yolov5 network,\u201d","volume-title":"2022 IEEE 4th International Conference on Civil Aviation Safety and Information Technology (ICCASIT)","author":"Chen","year":"2022"},{"key":"B9","doi-asserted-by":"publisher","DOI":"10.1109\/TIV.2024.3393015","article-title":"YOLO-adaptor: a fast adaptive one-stage detector for non-aligned visible-infrared object detection","author":"Fu","year":"2024","journal-title":"IEEE Trans. Intell. Vehicles"},{"key":"B10","doi-asserted-by":"crossref","first-page":"578","DOI":"10.1109\/ICBIR57571.2023.10147590","article-title":"\u201cA comparative study on satellite image analysis for road traffic detection using YOLOv3-SPP, keras retinanet and full convolutional network,\u201d","volume-title":"2023 8th International Conference on Business and Industrial Research (ICBIR)","author":"Guillermo","year":"2023"},{"key":"B11","doi-asserted-by":"publisher","first-page":"100211","DOI":"10.1016\/j.prime.2023.100211","article-title":"Real-time traffic control and monitoring","volume":"5","author":"Gupta","year":"2023","journal-title":"e-Prime-Adv. Elect. Eng. Electron. Energy"},{"key":"B12","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1016\/j.jhtm.2021.08.022","article-title":"Using sarima-cnn-lstm approach to forecast daily tourism demand","volume":"49","author":"He","year":"2021","journal-title":"J. Hosp. Tour. Managem"},{"key":"B13","first-page":"3489","article-title":"\u201cLLVIP: A visible-infrared paired dataset for low-light vision,\u201d","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision Workshops (ICCVW)","author":"Jia","year":"2021"},{"key":"B14","first-page":"140","article-title":"\u201cYOLO-D: dual-branch infrared distant target detection based on multi-level weighted feature fusion,\u201d","volume-title":"International Conference on Neural Information Processing","author":"Jing","year":"2023"},{"key":"B15","doi-asserted-by":"publisher","first-page":"20368","DOI":"10.1109\/JSEN.2023.3297205","article-title":"Sad: Sensor-based anomaly detection system for smart junctions","volume":"23","author":"Krishnendhu","year":"2023","journal-title":"IEEE Sensors J"},{"key":"B16","doi-asserted-by":"publisher","first-page":"108962","DOI":"10.1016\/j.sigpro.2023.108962","article-title":"Yolosr-ist: A deep learning method for small target detection in infrared remote sensing images based on super-resolution and yolo","volume":"208","author":"Li","year":"2023","journal-title":"Signal Proc"},{"key":"B17","doi-asserted-by":"publisher","first-page":"141861","DOI":"10.1109\/ACCESS.2021.3120870","article-title":"YOLO-FIRI: improved yolov5 for infrared image object detection","volume":"9","author":"Li","year":"2021","journal-title":"IEEE Access"},{"key":"B18","doi-asserted-by":"publisher","first-page":"7567","DOI":"10.1007\/s11042-022-13251-x","article-title":"Attention-yolov4: a real-time and high-accurate traffic sign detection algorithm","volume":"82","author":"Li","year":"2023","journal-title":"Multimed. Tools Appl"},{"key":"B19","first-page":"5802","article-title":"\u201cTarget-aware dual adversarial learning and a multi-scenario multi-modality benchmark to fuse infrared and visible for object detection,\u201d","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","author":"Liu","year":"2022"},{"key":"B20","doi-asserted-by":"publisher","first-page":"102467","DOI":"10.1016\/j.displa.2023.102467","article-title":"Pedestrian re-id based on feature consistency and contrast enhancement","volume":"79","author":"Ning","year":"2023","journal-title":"Displays"},{"key":"B21","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1804.03999","article-title":"Attention U-Net: Learning where to look for the pancreas","author":"Oktay","year":"2018","journal-title":"arXiv"},{"key":"B22","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s11042-023-16451-1","article-title":"A performance comparison of yolov8 models for traffic sign detection in the robotaxi-full scale autonomous vehicle competition","volume":"83","author":"Soylu","year":"2023","journal-title":"Multimedia Tools Appl"},{"key":"B23","doi-asserted-by":"publisher","first-page":"115277","DOI":"10.1016\/j.eswa.2021.115277","article-title":"A cultural heritage framework using a deep learning based chatbot for supporting tourist journey","volume":"183","author":"Sperl\u00ed","year":"2021","journal-title":"Expert Syst. Appl"},{"key":"B24","article-title":"\u201cDRMF: Degradation-robust multi-modal image fusion via composable diffusion prior,\u201d","author":"Tang","year":"2024","journal-title":"ACM Multimedia 2024"},{"key":"B25","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2405.14458","article-title":"YOLOv10: Real-time end-to-end object detection","author":"Wang","year":"2024","journal-title":"arXiv"},{"key":"B26","doi-asserted-by":"publisher","first-page":"168","DOI":"10.3390\/e21020168","article-title":"Dense u-net based on patch-based learning for retinal vessel segmentation","volume":"21","author":"Wang","year":"2019","journal-title":"Entropy"},{"key":"B27","first-page":"1","article-title":"\u201cThermal infrared image inpainting via edge-aware guidance,\u201d","volume-title":"ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","author":"Wang","year":"2023"},{"key":"B28","doi-asserted-by":"publisher","first-page":"747","DOI":"10.1007\/s00371-023-02813-1","article-title":"YOLOF-F: you only look one-level feature fusion for traffic sign detection","volume":"40","author":"Wei","year":"2023","journal-title":"Visual Comp"},{"key":"B29","doi-asserted-by":"publisher","first-page":"113442","DOI":"10.1016\/j.measurement.2023.113442","article-title":"Infrared pedestrian detection using improved unet and yolo through sharing visible light domain information","volume":"221","author":"Wei","year":"2023","journal-title":"Measurement"},{"key":"B30","doi-asserted-by":"publisher","first-page":"3510","DOI":"10.1038\/s41598-024-54146-1","article-title":"Cdyl for infrared and visible light image dense small object detection","volume":"14","author":"Wu","year":"2024","journal-title":"Sci. Rep"},{"key":"B31","doi-asserted-by":"publisher","first-page":"1374","DOI":"10.1109\/LSP.2024.3399119","article-title":"IAIFNet: an illumination-aware infrared and visible image fusion network","volume":"31","author":"Yang","year":"2024","journal-title":"IEEE Signal Process. Lett"},{"key":"B32","doi-asserted-by":"publisher","first-page":"749","DOI":"10.1109\/LGRS.2018.2802944","article-title":"Road extraction by deep residual U-Net","volume":"15","author":"Zhang","year":"2018","journal-title":"IEEE Geosci. Remote Sens. Letters"},{"key":"B33","doi-asserted-by":"publisher","first-page":"105395","DOI":"10.1016\/j.cmpb.2020.105395","article-title":"Dense-inception U-Net for medical image segmentation","volume":"192","author":"Zhang","year":"2020","journal-title":"Comput. Methods Programs Biomed"},{"key":"B34","first-page":"233","article-title":"\u201cYOLOv7-sea: Object detection of maritime uav images based on improved YOLOv7,\u201d","volume-title":"Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision","author":"Zhao","year":"2023"},{"key":"B35","doi-asserted-by":"publisher","first-page":"3778","DOI":"10.3390\/rs15153778","article-title":"YOLO-VIT-based method for unmanned aerial vehicle infrared vehicle target detection","volume":"15","author":"Zhao","year":"2023","journal-title":"Remote Sens"},{"key":"B36","first-page":"8082","article-title":"\u201cDDFM: denoising diffusion model for multi-modality image fusion,\u201d","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV)","author":"Zhao","year":"2023"}],"container-title":["Frontiers in Neurorobotics"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fnbot.2024.1504070\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,1,10]],"date-time":"2025-01-10T06:13:06Z","timestamp":1736489586000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fnbot.2024.1504070\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,1,10]]},"references-count":36,"alternative-id":["10.3389\/fnbot.2024.1504070"],"URL":"https:\/\/doi.org\/10.3389\/fnbot.2024.1504070","relation":{},"ISSN":["1662-5218"],"issn-type":[{"value":"1662-5218","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,1,10]]},"article-number":"1504070"}}