{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,31]],"date-time":"2025-10-31T07:58:50Z","timestamp":1761897530994,"version":"build-2065373602"},"reference-count":77,"publisher":"MDPI AG","issue":"10","license":[{"start":{"date-parts":[[2021,5,17]],"date-time":"2021-05-17T00:00:00Z","timestamp":1621209600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>In this paper, we address various challenges in multi-pedestrian and vehicle tracking in high-resolution aerial imagery by intensive evaluation of a number of traditional and Deep Learning based Single- and Multi-Object Tracking methods. We also describe our proposed Deep Learning based Multi-Object Tracking method AerialMPTNet that fuses appearance, temporal, and graphical information using a Siamese Neural Network, a Long Short-Term Memory, and a Graph Convolutional Neural Network module for more accurate and stable tracking. Moreover, we investigate the influence of the Squeeze-and-Excitation layers and Online Hard Example Mining on the performance of AerialMPTNet. To the best of our knowledge, we are the first to use these two for regression-based Multi-Object Tracking. Additionally, we studied and compared the L1 and Huber loss functions. In our experiments, we extensively evaluate AerialMPTNet on three aerial Multi-Object Tracking datasets, namely AerialMPT and KIT AIS pedestrian and vehicle datasets. Qualitative and quantitative results show that AerialMPTNet outperforms all previous methods for the pedestrian datasets and achieves competitive results for the vehicle dataset. In addition, Long Short-Term Memory and Graph Convolutional Neural Network modules enhance the tracking performance. 
Moreover, using Squeeze-and-Excitation and Online Hard Example Mining helps significantly in some cases while degrading the results in others. In addition, according to the results, the L1 loss yields better results than the Huber loss in most scenarios. The presented results provide a deep insight into challenges and opportunities of the aerial Multi-Object Tracking domain, paving the way for future research.<\/jats:p>","DOI":"10.3390\/rs13101953","type":"journal-article","created":{"date-parts":[[2021,5,17]],"date-time":"2021-05-17T12:19:57Z","timestamp":1621253997000},"page":"1953","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":15,"title":["Multiple Pedestrians and Vehicles Tracking in Aerial Imagery Using a Convolutional Neural Network"],"prefix":"10.3390","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6084-2272","authenticated-orcid":false,"given":"Seyed Majid","family":"Azimi","sequence":"first","affiliation":[{"name":"German Aerospace Center (DLR), Remote Sensing Technology Institute (IMF), 82234 Wessling, Germany"},{"name":"Department of Aerospace and Geodesy, Technical University of Munich, 80333 Munich, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Maximilian","family":"Kraus","sequence":"additional","affiliation":[{"name":"Department of Informatics, Technical University of Munich, 85748 Garching, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6999-714X","authenticated-orcid":false,"given":"Reza","family":"Bahmanyar","sequence":"additional","affiliation":[{"name":"German Aerospace Center (DLR), Remote Sensing Technology Institute (IMF), 82234 Wessling, 
Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8122-1475","authenticated-orcid":false,"given":"Peter","family":"Reinartz","sequence":"additional","affiliation":[{"name":"German Aerospace Center (DLR), Remote Sensing Technology Institute (IMF), 82234 Wessling, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2021,5,17]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Bergmann, P., Meinhardt, T., and Leal-Taixe, L. (2019, January 16\u201320). Tracking without bells and whistles. Proceedings of the IEEE International Conference on Computer Vision (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/ICCV.2019.00103"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Xiang, Y., Alahi, A., and Savarese, S. (2015, January 13\u201316). Learning to track: Online multi-object tracking by decision making. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.534"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Bertinetto, L., Valmadre, J., Henriques, J.F., Vedaldi, A., and Torr, P.H. (2016, January 8\u201316). Fully-convolutional siamese networks for object tracking. Proceedings of the European Conference on Computer Vision (ECCV), Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-48881-3_56"},{"key":"ref_4","unstructured":"Cuevas, E.V., Zaldivar, D., and Rojas, R. (2005). Kalman Filter for Vision Tracking, Freie Universit\u00e4t Berlin. Technical Report."},{"key":"ref_5","first-page":"1","article-title":"Particle filter in vision tracking","volume":"5","author":"Cuevas","year":"2007","journal-title":"e-Gnosis"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Bolme, D.S., Beveridge, J.R., Draper, B.A., and Lui, Y.M. (2010, January 13\u201318). Visual object tracking using adaptive correlation filters. 
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), San Francisco, CA, USA.","DOI":"10.1109\/CVPR.2010.5539960"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Boudoukh, G., Leichter, I., and Rivlin, E. (2009, January 7\u201310). Visual tracking of object silhouettes. Proceedings of the IEEE International Conference on Image Processing (ICIP), Cairo, Egypt.","DOI":"10.1109\/ICIP.2009.5414280"},{"key":"ref_8","unstructured":"Dalal, N., and Triggs, B. (2005, January 20\u201326). Histograms of oriented gradients for human detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), San Diego, CA, USA."},{"key":"ref_9","unstructured":"Marvasti-Zadeh, S.M., Cheng, L., Ghanei-Yakhdan, H., and Kasaei, S. (2019). Deep Learning for Visual Tracking: A Comprehensive Survey. arXiv."},{"key":"ref_10","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (July, January 26). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA."},{"key":"ref_11","unstructured":"Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., and Wojna, Z. (July, January 26). Rethinking the inception architecture for computer vision. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA."},{"key":"ref_12","unstructured":"Ren, S., He, K., Girshick, R., and Sun, J. (2015). Faster r-cnn: Towards real-time object detection with region proposal networks. arXiv."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Wang, L., Ouyang, W., Wang, X., and Lu, H. (2015, January 13\u201316). Visual tracking with fully convolutional networks. 
Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.357"},{"key":"ref_14","first-page":"1779","article-title":"Robust visual tracking via convolutional networks without training","volume":"25","author":"Zhang","year":"2016","journal-title":"IEEE Trans. Image Process."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"1029","DOI":"10.1109\/LSP.2018.2835768","article-title":"Residual LSTM attention network for object tracking","volume":"25","author":"Kim","year":"2018","journal-title":"IEEE Signal Process. Lett."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Li, B., Yan, J., Wu, W., Zhu, Z., and Hu, X. (2018, January 18\u201322). High performance visual tracking with siamese region proposal network. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00935"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Held, D., Thrun, S., and Savarese, S. (2016, January 8\u201316). Learning to track at 100 fps with deep regression networks. Proceedings of the European Conference on Computer Vision (ECCV), Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_45"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Song, Y., Ma, C., Wu, X., Gong, L., Bao, L., Zuo, W., Shen, C., Lau, R.W., and Yang, M.H. (2018, January 18\u201322). Vital: Visual tracking via adversarial learning. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00937"},{"key":"ref_19","unstructured":"Zhang, D., Maei, H., Wang, X., and Wang, Y.F. (2017). Deep reinforcement learning for visual object tracking in videos. arXiv."},{"key":"ref_20","unstructured":"U.S. Government Printing Office (2020, January 02). 
Remote Sensing Data: Applications and Benefits; Technical Report; Subcommittee on Space and Aeronautics, Committee on Science and Technology, Serial No. 110-91, Available online: https:\/\/www.govinfo.gov\/content\/pkg\/CHRG-110hhrg41573\/html\/CHRG-110hhrg41573.html."},{"key":"ref_21","first-page":"1187","article-title":"The use of unmanned aerial vehicles (UAVs) for remote sensing and mapping","volume":"37","author":"Everaerts","year":"2008","journal-title":"Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"163","DOI":"10.5194\/isprs-archives-XLII-4-W18-163-2019","article-title":"Multiple vehicle and people tracking in aerial imagery using stack of micro single-object-tracking CNNs","volume":"42","author":"Bahmanyar","year":"2019","journal-title":"Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Reilly, V., Idrees, H., and Shah, M. (2010, January 5\u201311). Detection and tracking of large number of targets in wide area surveillance. Proceedings of the European Conference on Computer Vision, Crete, Greece.","DOI":"10.1007\/978-3-642-15558-1_14"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"146","DOI":"10.1109\/JSTARS.2011.2179639","article-title":"Object tracking using high resolution satellite imagery","volume":"5","author":"Meng","year":"2012","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_25","unstructured":"Milan, A., Leal-Taix\u00e9, L., Reid, I., Roth, S., and Schindler, K. (2016). MOT16: A benchmark for multi-object tracking. arXiv."},{"key":"ref_26","unstructured":"Kraus, M., Azimi, S.M., Ercelik, E., Bahmanyar, R., Reinartz, P., and Knoll, A. (2020, January 10\u201315). AerialMPTNet: Multi-Pedestrian Tracking in Aerial Imagery Using Temporal and Graphical Features. 
Proceedings of the International Conference on Pattern Recognition (ICPR), Milan, Italy."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Bewley, A., Ge, Z., Ott, L., Ramos, F., and Upcroft, B. (2016, January 25\u201328). Simple online and realtime tracking. Proceedings of the IEEE International Conference on Image Processing (ICIP), Phoenix, AZ, USA.","DOI":"10.1109\/ICIP.2016.7533003"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Shrivastava, A., Gupta, A., and Girshick, R. (July, January 26). Training Region-Based Object Detectors with Online Hard Example Mining. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.89"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3309665","article-title":"Handcrafted and deep trackers: Recent visual object tracking approaches and trends","volume":"52","author":"Fiaz","year":"2019","journal-title":"Acm Comput. Surv."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1115\/1.3662552","article-title":"A new approach to linear filtering and prediction problems","volume":"82","author":"Kalman","year":"1960","journal-title":"J. Basic Eng. Mar"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.dsp.2013.11.006","article-title":"Overview of Bayesian sequential Monte Carlo methods for group and extended object tracking","volume":"25","author":"Mihaylova","year":"2014","journal-title":"Digit. Signal Process."},{"key":"ref_32","unstructured":"Wang, Q., Gao, J., Xing, J., Zhang, M., and Hu, W. (2017). DCFNet: Discriminant Correlation Filters Network for Visual Tracking. arXiv."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Ma, C., Huang, J.B., Yang, X., and Yang, M.H. (2015, January 13\u201316). Hierarchical convolutional features for visual tracking. 
Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.352"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Wojke, N., Bewley, A., and Paulus, D. (2017, January 17\u201320). Simple online and realtime tracking with a deep association metric. Proceedings of the 2017 IEEE International Conference on Image Processing (ICIP), Beijing, China.","DOI":"10.1109\/ICIP.2017.8296962"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Huang, C., Wu, B., and Nevatia, R. (2008, January 12\u201318). Robust object tracking by hierarchical association of detection responses. Proceedings of the European Conference on Computer Vision, Marseille, France.","DOI":"10.1007\/978-3-540-88688-4_58"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Lu, X., Ma, C., Ni, B., Yang, X., Reid, I., and Yang, M.H. (2018, January 8\u201314). Deep Regression Tracking with Shrinkage Loss. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01264-9_22"},{"key":"ref_37","unstructured":"Wang, L., Ouyang, W., Wang, X., and Lu, H. (July, January 26). Stct: Sequentially training convolutional networks for visual tracking. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Huang, C., Lucey, S., and Ramanan, D. (2017, January 22\u201329). Learning policies for adaptive tracking with deep feature cascades. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.21"},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"167","DOI":"10.1016\/j.procs.2017.12.143","article-title":"Tracking people by detection using CNN features","volume":"124","author":"Chahyati","year":"2017","journal-title":"Procedia Comput. 
Sci."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"012068","DOI":"10.1088\/1742-6596\/887\/1\/012068","article-title":"Real-time vehicle detection and tracking in video based on faster R-CNN","volume":"887","author":"Zhang","year":"2017","journal-title":"J. Phys. Conf. Ser. IOP Publ."},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Okuma, K., Taleghani, A., De Freitas, N., Little, J.J., and Lowe, D.G. (2004, January 11\u201314). A boosted particle filter: Multitarget detection and tracking. Proceedings of the European Conference on Computer Vision, Prague, Czech Republic.","DOI":"10.1007\/978-3-540-24670-1_3"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Brunelli, R. (2009). Template Matching Techniques in Computer Vision: Theory and Practice, John Wiley & Sons.","DOI":"10.1002\/9780470744055"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Hager, G.D., and Belhumeur, P.N. (1996, January 18\u201320). Real-time tracking of image regions with changes in geometry and illumination. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), San Francisco, CA, USA.","DOI":"10.1109\/CVPR.1996.517104"},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"95","DOI":"10.1117\/12.421129","article-title":"Template matching using fast normalized cross correlation","volume":"Volume 4387","author":"Briechle","year":"2001","journal-title":"Optical Pattern Recognition XII"},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"261","DOI":"10.1109\/TPAMI.2007.35","article-title":"Ensemble tracking","volume":"29","author":"Avidan","year":"2007","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"2096","DOI":"10.1109\/TPAMI.2015.2509974","article-title":"Struck: Structured output tracking with kernels","volume":"38","author":"Hare","year":"2015","journal-title":"IEEE Trans. Pattern Anal. Mach. 
Intell."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1109\/TPAMI.2014.2345390","article-title":"High-speed tracking with kernelized correlation filters","volume":"37","author":"Henriques","year":"2014","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Sadeghian, A., Alahi, A., and Savarese, S. (2017, January 22\u201329). Tracking the untrackable: Learning to track multiple cues with long-term dependencies. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.41"},{"key":"ref_49","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (July, January 26). You only look once: Unified, real-time object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA."},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Kalal, Z., Mikolajczyk, K., and Matas, J. (2010, January 23\u201326). Forward-backward error: Automatic detection of tracking failures. Proceedings of the 2010 20th International Conference on Pattern Recognition, Istanbul, Turkey.","DOI":"10.1109\/ICPR.2010.675"},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Lukezic, A., Vojir, T., Cehovin Zajc, L., Matas, J., and Kristan, M. (2017, January 22\u201329). Discriminative correlation filter with channel and spatial reliability. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Venice, Italy.","DOI":"10.1109\/CVPR.2017.515"},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1002\/nav.3800020109","article-title":"The Hungarian method for the assignment problem","volume":"2","author":"Kuhn","year":"1955","journal-title":"Nav. Res. Logist. Q."},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Zheng, L., Bie, Z., Sun, Y., Wang, J., Su, C., Wang, S., and Tian, Q. (2016, January 8\u201316). 
Mars: A video benchmark for large-scale person re-identification. Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46466-4_52"},{"key":"ref_54","unstructured":"Yokoyama, M., and Poggio, T. (2005, January 7). A contour-based moving object detection and tracking. Proceedings of the 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance, Breckenridge, CO, USA."},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Jadhav, A., Mukherjee, P., Kaushik, V., and Lall, B. (2019). Aerial multi-object tracking by detection using deep association networks. arXiv.","DOI":"10.1109\/NCC48643.2020.9056035"},{"key":"ref_56","doi-asserted-by":"crossref","first-page":"2303","DOI":"10.1109\/TIP.2009.2025808","article-title":"Detection of object motion regions in aerial image pairs with a multilayer Markovian model","volume":"18","author":"Benedek","year":"2009","journal-title":"IEEE Trans. Image Process."},{"key":"ref_57","doi-asserted-by":"crossref","unstructured":"Butenuth, M., Burkert, F., Schmidt, F., Hinz, S., Hartmann, D., Kneidl, A., Borrmann, A., and Sirmacek, B. (2011, January 8\u201313). Integrating pedestrian simulation, tracking and event detection for crowd analysis. Proceedings of the IEEE International Conference on Computer Vision Workshops (ICCVW), Barcelona, Spain.","DOI":"10.1109\/ICCVW.2011.6130237"},{"key":"ref_58","doi-asserted-by":"crossref","unstructured":"Schmidt, F., and Hinz, S. (2011, January 5\u20137). A Scheme for the Detection and Tracking of People Tuned for Aerial Image Sequences. 
Proceedings of the ISPRS conference on Photogrammetric Image Analysis (PIA), Munich, Germany.","DOI":"10.1007\/978-3-642-24393-6_22"},{"key":"ref_59","doi-asserted-by":"crossref","first-page":"1938","DOI":"10.1109\/LGRS.2015.2439517","article-title":"Fast multiclass vehicle detection on aerial images","volume":"12","author":"Liu","year":"2015","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_60","doi-asserted-by":"crossref","first-page":"1451","DOI":"10.1109\/LGRS.2015.2408355","article-title":"Unsupervised ship detection based on saliency and S-HOG descriptor from optical satellite images","volume":"12","author":"Qi","year":"2015","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_61","unstructured":"Bahmanyar, R., Vig, E., and Reinartz, P. (2019, January 9\u201312). MRCNet: Crowd Counting and Density Map Estimation in Aerial and Ground Imagery. Proceedings of the BMVC\u2019s Workshop on Object Detection and Recognition for Security Screening (BMVC-ODRSS), Cardiff, UK."},{"key":"ref_62","doi-asserted-by":"crossref","unstructured":"Ristani, E., Solera, F., Zou, R., Cucchiara, R., and Tomasi, C. (2016, January 8\u201316). Performance measures and a data set for multi-target, multi-camera tracking. Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-48881-3_2"},{"key":"ref_63","unstructured":"Kingma, D.P., and Ba, J. (2014). Adam: A method for stochastic optimization. arXiv."},{"key":"ref_64","doi-asserted-by":"crossref","first-page":"687","DOI":"10.1061\/(ASCE)TE.1943-5436.0000251","article-title":"Design implications of walking speed for pedestrian facilities","volume":"137","author":"Rastogi","year":"2011","journal-title":"J. Transp. 
Eng."},{"key":"ref_65","doi-asserted-by":"crossref","first-page":"827","DOI":"10.1080\/00140130701812147","article-title":"Field observations of factors influencing walking speeds","volume":"51","author":"Finnis","year":"2006","journal-title":"Ergonomics"},{"key":"ref_66","doi-asserted-by":"crossref","first-page":"640","DOI":"10.1518\/hfes.46.4.640.56806","article-title":"Profiles in driver distraction: Effects of cell phone conversations on younger and older drivers","volume":"46","author":"Strayer","year":"2004","journal-title":"Hum. Factors"},{"key":"ref_67","doi-asserted-by":"crossref","first-page":"630","DOI":"10.1109\/TITS.2007.908146","article-title":"Characterizing driver behavior on signalized intersection approaches at the onset of a yellow-phase trigger","volume":"8","author":"Rakha","year":"2007","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_68","unstructured":"Alahi, A., Goel, K., Ramanathan, V., Robicquet, A., Fei-Fei, L., and Savarese, S. (July, January 26). Social LSTM: Human trajectory prediction in crowded spaces. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA."},{"key":"ref_69","doi-asserted-by":"crossref","unstructured":"Xue, H., Huynh, D.Q., and Reynolds, M. (2018, January 12\u201314). SS-LSTM: A hierarchical LSTM model for pedestrian trajectory prediction. Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), Stateline, NV, USA.","DOI":"10.1109\/WACV.2018.00135"},{"key":"ref_70","doi-asserted-by":"crossref","unstructured":"Vemula, A., Muelling, K., and Oh, J. (2018, January 21\u201325). Social attention: Modeling attention in human crowds. Proceedings of the 2018 IEEE International Conference on Robotics and Automation (ICRA), Brisbane, Australia.","DOI":"10.1109\/ICRA.2018.8460504"},{"key":"ref_71","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., and Sun, G. (2018, January 18\u201322). Squeeze-and-excitation networks. 
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"ref_72","unstructured":"Lin, M., Chen, Q., and Yan, S. (2013). Network in network. arXiv."},{"key":"ref_73","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Goyal, P., Girshick, R., He, K., and Doll\u00e1r, P. (2017, January 22\u201329). Focal loss for dense object detection. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.324"},{"key":"ref_74","doi-asserted-by":"crossref","first-page":"1505","DOI":"10.1109\/TVT.2019.2961625","article-title":"Real-Time Single-Stage Vehicle Detector Optimized by Multi-Stage Image-Based Online Hard Example Mining","volume":"69","author":"Lin","year":"2019","journal-title":"IEEE Trans. Veh. Technol."},{"key":"ref_75","doi-asserted-by":"crossref","unstructured":"Koga, Y., Miyazaki, H., and Shibasaki, R. (2018). A CNN-based method of vehicle detection from aerial images using hard example mining. Remote Sens., 10.","DOI":"10.3390\/rs10010124"},{"key":"ref_76","doi-asserted-by":"crossref","unstructured":"Huber, P.J. (1992). Robust estimation of a location parameter. Breakthroughs in Statistics, Springer.","DOI":"10.1007\/978-1-4612-4380-9_35"},{"key":"ref_77","doi-asserted-by":"crossref","unstructured":"Xu, Y., Osep, A., Ban, Y., Horaud, R., Leal-Taix\u00e9, L., and Alameda-Pineda, X. (2020, January 14\u201319). How To Train Your Deep Multi-Object Tracker. 
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00682"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/10\/1953\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T06:02:48Z","timestamp":1760162568000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/10\/1953"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,5,17]]},"references-count":77,"journal-issue":{"issue":"10","published-online":{"date-parts":[[2021,5]]}},"alternative-id":["rs13101953"],"URL":"https:\/\/doi.org\/10.3390\/rs13101953","relation":{},"ISSN":["2072-4292"],"issn-type":[{"type":"electronic","value":"2072-4292"}],"subject":[],"published":{"date-parts":[[2021,5,17]]}}}