{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,20]],"date-time":"2026-05-20T16:04:09Z","timestamp":1779293049520,"version":"3.51.4"},"reference-count":42,"publisher":"Springer Science and Business Media LLC","issue":"6","license":[{"start":{"date-parts":[[2025,10,14]],"date-time":"2025-10-14T00:00:00Z","timestamp":1760400000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,10,14]],"date-time":"2025-10-14T00:00:00Z","timestamp":1760400000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100004913","name":"Universit\u00e0 degli Studi di Palermo","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100004913","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Machine Vision and Applications"],"published-print":{"date-parts":[[2025,11]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Predicting pedestrian crossing intentions is crucial for enhancing autonomous vehicles\u2019 safety and decision-making capabilities. Timely and accurate predictions made well before potential crossing events are essential for enabling vehicles to respond appropriately and prevent accidents. Unlike state-of-the-art models that rely on complex architectures and multiple input modalities, this article proposes Context-Ped, a minimalist effective model for pedestrian intention prediction. To capture critical contextual information, Context-Ped processes two key visual inputs: cropped pedestrian images and environmental road structure. By integrating residual networks with recurrent architectures, the model extracts robust spatiotemporal features through the aggregation of hidden states from ConvLSTM layers. This operation serves as a temporal integration strategy that accumulates the spatiotemporal patterns over the sequence, capturing both dynamic motion cues and static contextual information to encode both pedestrian behavior and environmental context. As a result, Context-Ped achieves accurate intention prediction while maintaining computational efficiency. Additionally, the proposed approach addresses the often-overlooked issue of class imbalance by employing the focal loss function, which significantly enhances performance for the minority class. For evaluation, AUC-ROC and recall metrics are prioritized over traditional accuracy, providing clearer insight into the model\u2019s performance. Experimental results on the JAAD and PIE datasets demonstrate competitive performance, achieving an AUC of 79% and a crossing recall of 82% on the JAAD dataset, outperforming more complex state-of-the-art models and by only using visual input data.<\/jats:p>","DOI":"10.1007\/s00138-025-01751-3","type":"journal-article","created":{"date-parts":[[2025,10,14]],"date-time":"2025-10-14T04:26:57Z","timestamp":1760416017000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Context-ped: multi-modal context fusion for pedestrian crossing intention prediction"],"prefix":"10.1007","volume":"36","author":[{"given":"Soulayma","family":"Gazzeh","sequence":"first","affiliation":[]},{"given":"Liliana","family":"Lo Presti","sequence":"additional","affiliation":[]},{"given":"Ali","family":"Douik","sequence":"additional","affiliation":[]},{"given":"Marco","family":"La Cascia","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2025,10,14]]},"reference":[{"key":"1751_CR1","first-page":"1","volume":"2016","author":"NHTS Administration","year":"2016","unstructured":"Administration, N.H.T.S., et al.: 2015 motor vehicle crashes: Overview. Traffic Saf. Facts: Res. Note 2016, 1\u20139 (2016)","journal-title":"Traffic Saf. Facts: Res. Note"},{"issue":"6","key":"1751_CR2","doi-asserted-by":"publisher","first-page":"3285","DOI":"10.1109\/TITS.2020.2981118","volume":"22","author":"X Song","year":"2020","unstructured":"Song, X., Chen, K., Li, X., Sun, J., Hou, B., Cui, Y., Zhang, B., Xiong, G., Wang, Z.: Pedestrian trajectory prediction based on deep convolutional lstm network. IEEE Trans. Intell. Transp. Syst. 22(6), 3285\u20133302 (2020)","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"1751_CR3","doi-asserted-by":"publisher","first-page":"3229","DOI":"10.1109\/TIP.2021.3058599","volume":"30","author":"R Quan","year":"2021","unstructured":"Quan, R., Zhu, L., Wu, Y., Yang, Y.: Holistic lstm for pedestrian trajectory prediction. IEEE Trans. Image Process. 30, 3229\u20133239 (2021)","journal-title":"IEEE Trans. Image Process."},{"issue":"10","key":"1751_CR4","doi-asserted-by":"publisher","first-page":"4301","DOI":"10.1109\/TIV.2023.3317032","volume":"8","author":"L Guo","year":"2023","unstructured":"Guo, L., Shan, C., Shi, T., Li, X., Wang, F.-Y.: A vectorized representation model for trajectory prediction of intelligent vehicles in challenging scenarios. IEEE Trans. Intell. Veh. 8(10), 4301\u20134306 (2023)","journal-title":"IEEE Trans. Intell. Veh."},{"key":"1751_CR5","doi-asserted-by":"crossref","unstructured":"Gazzeh, S., Lo\u00a0Presti, L., Douik, A., La\u00a0Cascia, M.: Rlstm: A novel residual and recurrent network for pedestrian action classification. In: International Conference on Computer Analysis of Images and Patterns, pp. 55\u201364 (2023). Springer","DOI":"10.1007\/978-3-031-44240-7_6"},{"issue":"6","key":"1751_CR6","doi-asserted-by":"publisher","first-page":"6023","DOI":"10.1109\/TITS.2023.3333328","volume":"25","author":"B Yang","year":"2023","unstructured":"Yang, B., Wei, Z., Hu, H., Wang, R., Yang, C., Ni, R.: Dpcian: A novel dual-channel pedestrian crossing intention anticipation network. IEEE Trans. Intell. Transportat. Syst. 25(6), 6023\u20136034 (2023)","journal-title":"IEEE Trans. Intell. Transportat. Syst."},{"key":"1751_CR7","doi-asserted-by":"crossref","unstructured":"Li, J., Shi, X., Chen, F., Stroud, J., Zhang, Z., Lan, T., Mao, J., Kang, J., Refaat, K.S., Yang, W., et al.: Pedestrian crossing action recognition and trajectory prediction with 3d human keypoints. In: 2023 IEEE International Conference on Robotics and Automation (ICRA), pp. 1463\u20131470 (2023). IEEE","DOI":"10.1109\/ICRA48891.2023.10160273"},{"key":"1751_CR8","doi-asserted-by":"crossref","unstructured":"Rasouli, A., Kotseruba, I.: Pedformer: Pedestrian behavior prediction via cross-modal attention modulation and gated multitask learning. In: 2023 IEEE International Conference on Robotics and Automation (ICRA), pp. 9844\u20139851 (2023). IEEE","DOI":"10.1109\/ICRA48891.2023.10161318"},{"issue":"3","key":"1751_CR9","doi-asserted-by":"publisher","first-page":"900","DOI":"10.1109\/TITS.2019.2901817","volume":"21","author":"A Rasouli","year":"2019","unstructured":"Rasouli, A., Tsotsos, J.K.: Autonomous vehicles that interact with pedestrians: A survey of theory and practice. IEEE Trans. Intell. Transp. Syst. 21(3), 900\u2013918 (2019)","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"1751_CR10","doi-asserted-by":"crossref","unstructured":"Rasouli, A., Yau, T., Rohani, M., Luo, J.: Multi-modal hybrid architecture for pedestrian action prediction. In: 2022 IEEE Intelligent Vehicles Symposium (IV), pp. 91\u201397 (2022). IEEE","DOI":"10.1109\/IV51971.2022.9827055"},{"key":"1751_CR11","unstructured":"Rasouli, A., Kotseruba, I., Tsotsos, J.K.: Pedestrian action anticipation using contextual feature fusion in stacked rnns. arXiv preprint arXiv:2005.06582 (2020)"},{"key":"1751_CR12","doi-asserted-by":"crossref","unstructured":"Kotseruba, I., Rasouli, A., Tsotsos, J.K.: Benchmark for evaluating pedestrian action prediction. In: Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision, pp. 1258\u20131268 (2021)","DOI":"10.1109\/WACV48630.2021.00130"},{"issue":"2","key":"1751_CR13","doi-asserted-by":"publisher","first-page":"27540","DOI":"10.1109\/JSEN.2023.3317426","volume":"23","author":"N Sharma","year":"2023","unstructured":"Sharma, N., Dhiman, C., Indu, S.: Visual-motion-interaction guided pedestrian intention prediction framework. IEEE Sensors J. 23(2), 27540 (2023)","journal-title":"IEEE Sensors J."},{"issue":"2","key":"1751_CR14","doi-asserted-by":"publisher","first-page":"221","DOI":"10.1109\/TIV.2022.3162719","volume":"7","author":"D Yang","year":"2022","unstructured":"Yang, D., Zhang, H., Yurtsever, E., Redmill, K.A., \u00d6zg\u00fcner, \u00dc.: Predicting pedestrian crossing intention with feature fusion and spatio-temporal attention. IEEE Trans. Intell. Veh. 7(2), 221\u2013230 (2022)","journal-title":"IEEE Trans. Intell. Veh."},{"issue":"10","key":"1751_CR15","doi-asserted-by":"publisher","first-page":"14713","DOI":"10.1007\/s11042-022-13814-y","volume":"82","author":"J Lian","year":"2023","unstructured":"Lian, J., Yu, F., Li, L., Zhou, Y.: Early intention prediction of pedestrians using contextual attention-based lstm. Multim. Tools Appl. 82(10), 14713\u201314729 (2023)","journal-title":"Multim. Tools Appl."},{"key":"1751_CR16","doi-asserted-by":"crossref","unstructured":"Zhang, Z., Tian, R., Ding, Z.: Trep: Transformer-based evidential prediction for pedestrian intention with uncertainty. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 37, pp. 3534\u20133542 (2023)","DOI":"10.1609\/aaai.v37i3.25463"},{"key":"1751_CR17","doi-asserted-by":"crossref","unstructured":"Landry, F.-G., Akhloufi, M.A.: Predicting pedestrian crossing intention in autonomous vehicles: A review. Neurocomputing, 129105 (2024)","DOI":"10.1016\/j.neucom.2024.129105"},{"key":"1751_CR18","doi-asserted-by":"crossref","unstructured":"Kotseruba, I., Rasouli, A., Tsotsos, J.K.: Do they want to cross? understanding pedestrian intention for behavior prediction. In: 2020 IEEE Intelligent Vehicles Symposium (IV), pp. 1688\u20131693 (2020). IEEE","DOI":"10.1109\/IV47402.2020.9304591"},{"key":"1751_CR19","doi-asserted-by":"crossref","unstructured":"Rasouli, A., Kotseruba, I., Kunic, T., Tsotsos, J.K.: Pie: A large-scale dataset and models for pedestrian intention estimation and trajectory prediction. In: Proceedings of the IEEE\/CVF International Conference on Computer Vision, pp. 6262\u20136271 (2019)","DOI":"10.1109\/ICCV.2019.00636"},{"key":"1751_CR20","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770\u2013778 (2016)","DOI":"10.1109\/CVPR.2016.90"},{"key":"1751_CR21","unstructured":"Shi, X., Chen, Z., Wang, H., Yeung, D.-Y., Wong, W.-K., Woo, W.-c.: Convolutional lstm network: A machine learning approach for precipitation nowcasting. Advances in neural information processing systems 28 (2015)"},{"key":"1751_CR22","doi-asserted-by":"crossref","unstructured":"Alofi, A., Greer, R., Gopalkrishnan, A., Trivedi, M.: Pedestrian safety by intent prediction: A lightweight lstm-attention architecture and experimental evaluations with real-world datasets. In: 2024 IEEE Intelligent Vehicles Symposium (IV), pp. 77\u201384 (2024). IEEE","DOI":"10.1109\/IV55156.2024.10588868"},{"key":"1751_CR23","doi-asserted-by":"crossref","unstructured":"Lin, T.-Y., Goyal, P., Girshick, R., He, K., Doll\u00e1r, P.: Focal loss for dense object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2980\u20132988 (2017)","DOI":"10.1109\/ICCV.2017.324"},{"key":"1751_CR24","doi-asserted-by":"crossref","unstructured":"Rasouli, A., Kotseruba, I., Tsotsos, J.K.: Are they going to cross? a benchmark dataset and baseline for pedestrian crosswalk behavior. In: Proceedings of the IEEE International Conference on Computer Vision Workshops, pp. 206\u2013213 (2017)","DOI":"10.1109\/ICCVW.2017.33"},{"key":"1751_CR25","doi-asserted-by":"crossref","unstructured":"Azarmi, M., Rezaei, M., Hussain, T., Qian, C.: Local and global contextual features fusion for pedestrian intention prediction. In: International Conference on Artificial Intelligence and Smart Vehicles, pp. 1\u201313 (2023). Springer","DOI":"10.1007\/978-3-031-43763-2_1"},{"issue":"12","key":"1751_CR26","doi-asserted-by":"publisher","first-page":"14213","DOI":"10.1109\/TITS.2023.3309309","volume":"24","author":"Y Zhou","year":"2023","unstructured":"Zhou, Y., Tan, G., Zhong, R., Li, Y., Gou, C.: Pit: Progressive interaction transformer for pedestrian crossing intention prediction. IEEE Trans. Intell. Transp. Syst. 24(12), 14213\u201314225 (2023)","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"1751_CR27","doi-asserted-by":"crossref","unstructured":"Du, Q., Xu, L., Wu, Q., Ning, H., Wang, X., Lin, L., Sun, C.: Safecrossnet: Multi-modal fusion with social-aware for pedestrian crossing intention prediction. Information Fusion, 103609 (2025)","DOI":"10.2139\/ssrn.5217031"},{"key":"1751_CR28","doi-asserted-by":"publisher","first-page":"3250","DOI":"10.1109\/TITS.2024.3524117","volume":"26","author":"A Elgazwy","year":"2025","unstructured":"Elgazwy, A., Elgazzar, K., Khamis, A.: Predicting pedestrian crossing intentions in adverse weather with self-attention models. IEEE Trans. Intell. Transp. Syst. 26, 3250\u20133261 (2025)","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"issue":"17","key":"1751_CR29","doi-asserted-by":"publisher","first-page":"5694","DOI":"10.3390\/s21175694","volume":"21","author":"J Lorenzo","year":"2021","unstructured":"Lorenzo, J., Alonso, I.P., Izquierdo, R., Ballardini, A.L., Saz, \u00c1.H., Llorca, D.F., Sotelo, M.\u00c1.: Capformer: pedestrian crossing action prediction using transformer. Sensors 21(17), 5694 (2021)","journal-title":"Sensors"},{"issue":"9","key":"1751_CR30","doi-asserted-by":"publisher","first-page":"4237","DOI":"10.1109\/TIV.2023.3309274","volume":"8","author":"X Liu","year":"2023","unstructured":"Liu, X., Zhou, Y., Gou, C.: Learning from interaction-enhanced scene graph for pedestrian collision risk assessment. IEEE Trans. Intell. Veh. 8(9), 4237\u20134248 (2023)","journal-title":"IEEE Trans. Intell. Veh."},{"key":"1751_CR31","doi-asserted-by":"publisher","DOI":"10.1109\/TIV.2024.3384989","author":"Y Zhou","year":"2024","unstructured":"Zhou, Y., Liu, X., Guo, Z., Cai, M., Gou, C.: Hktsg: A hierarchical knowledge-guided traffic scene graph representation learning framework for intelligent vehicles. IEEE Trans. Intell. Veh. (2024). https:\/\/doi.org\/10.1109\/TIV.2024.3384989","journal-title":"IEEE Trans. Intell. Veh."},{"key":"1751_CR32","doi-asserted-by":"publisher","first-page":"15562","DOI":"10.1109\/TITS.2024.3445652","volume":"25","author":"D Hu","year":"2024","unstructured":"Hu, D., Fang, Z., Fang, H., Li, T., Shen, C., Chen, L., Gao, Y.: Estimator: An effective and scalable framework for transportation mode classification over trajectories. IEEE Trans. Intell. Transp. Syst. 25, 15562 (2024)","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"issue":"11","key":"1751_CR33","doi-asserted-by":"publisher","first-page":"22343","DOI":"10.1109\/TITS.2022.3164450","volume":"23","author":"K Zhang","year":"2022","unstructured":"Zhang, K., Feng, X., Wu, L., He, Z.: Trajectory prediction for autonomous driving using spatial-temporal graph attention transformer. IEEE Trans. Intell. Transp. Syst. 23(11), 22343\u201322353 (2022)","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"1751_CR34","doi-asserted-by":"crossref","unstructured":"Gujjar, P., Vaughan, R.: Classifying pedestrian actions in advance using predicted video of urban driving scenes. In: 2019 International Conference on Robotics and Automation (ICRA), pp. 2097\u20132103 (2019). IEEE","DOI":"10.1109\/ICRA.2019.8794278"},{"key":"1751_CR35","doi-asserted-by":"crossref","unstructured":"Strickland, M., Fainekos, G., Amor, H.B.: Deep predictive models for collision risk assessment in autonomous driving. In: 2018 IEEE International Conference on Robotics and Automation (ICRA), pp. 4685\u20134692 (2018). IEEE","DOI":"10.1109\/ICRA.2018.8461160"},{"key":"1751_CR36","doi-asserted-by":"publisher","DOI":"10.1016\/j.autcon.2022.104328","volume":"140","author":"Y Wang","year":"2022","unstructured":"Wang, Y., Chew, A.W.Z., Zhang, L.: Building damage detection from satellite images after natural disasters on extremely imbalanced datasets. Autom. Constr. 140, 104328 (2022)","journal-title":"Autom. Constr."},{"key":"1751_CR37","doi-asserted-by":"publisher","DOI":"10.1016\/j.cie.2019.106266","volume":"140","author":"D Gan","year":"2020","unstructured":"Gan, D., Shen, J., An, B., Xu, M., Liu, N.: Integrating tanbn with cost sensitive classification algorithm for imbalanced data in medical diagnosis. Comput. Ind. Eng. 140, 106266 (2020)","journal-title":"Comput. Ind. Eng."},{"key":"1751_CR38","unstructured":"Chen, L.-C.: Rethinking atrous convolution for semantic image segmentation. arXiv preprint arXiv:1706.05587 (2017)"},{"key":"1751_CR39","doi-asserted-by":"crossref","unstructured":"Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L.: Imagenet: A large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248\u2013255 (2009). Ieee","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"1751_CR40","unstructured":"Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)"},{"key":"1751_CR41","doi-asserted-by":"crossref","unstructured":"Hochreiter, S.: Long short-term memory. Neural Computation MIT-Press (1997)","DOI":"10.1162\/neco.1997.9.8.1735"},{"key":"1751_CR42","unstructured":"Diederik, P.K.: Adam: A method for stochastic optimization. (No Title) (2014)"}],"container-title":["Machine Vision and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00138-025-01751-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s00138-025-01751-3\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00138-025-01751-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,11,10]],"date-time":"2025-11-10T17:00:51Z","timestamp":1762794051000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s00138-025-01751-3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,10,14]]},"references-count":42,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2025,11]]}},"alternative-id":["1751"],"URL":"https:\/\/doi.org\/10.1007\/s00138-025-01751-3","relation":{},"ISSN":["0932-8092","1432-1769"],"issn-type":[{"value":"0932-8092","type":"print"},{"value":"1432-1769","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,10,14]]},"assertion":[{"value":"12 May 2025","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 September 2025","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"25 September 2025","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 October 2025","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"28 October 2025","order":6,"name":"change_date","label":"Change Date","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Update","order":7,"name":"change_type","label":"Change Type","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Corresponding Author email updated.","order":8,"name":"change_details","label":"Change Details","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no Conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}],"article-number":"132"}}