{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,26]],"date-time":"2026-03-26T00:51:32Z","timestamp":1774486292726,"version":"3.50.1"},"reference-count":42,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2026,3,26]],"date-time":"2026-03-26T00:00:00Z","timestamp":1774483200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2026,3,26]],"date-time":"2026-03-26T00:00:00Z","timestamp":1774483200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Auton. Intell. Syst."],"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>\n                    We present a modular, production-ready approach that integrates compact Neural Network (NN) into a Kalman-filter-based Multi-Object Tracking (MOT) pipeline. We design three tiny task-specific networks to retain modularity, interpretability and real-time suitability for embedded Automotive Driver Assistance Systems:\n                    <jats:list list-type=\"order\">\n                      <jats:list-item>\n                        <jats:p>SPENT (Single-Prediction Network) \u2014 predicts per-track states and replaces heuristic motion models used by the Kalman Filter (KF).<\/jats:p>\n                      <\/jats:list-item>\n                      <jats:list-item>\n                        <jats:p>SANT (Single-Association Network) \u2014 assigns a single incoming sensor object to existing tracks, without relying on heuristic distance and association metrics.<\/jats:p>\n                      <\/jats:list-item>\n                      <jats:list-item>\n                        <jats:p>MANTa (Multi-Association Network) \u2014 jointly associates multiple sensor objects to multiple tracks in a single step.<\/jats:p>\n                      <\/jats:list-item>\n                    <\/jats:list>\n                    Each module has less than 50k trainable parameters. Furthermore, all three can be operated in real-time, are trained from tracking data, and expose modular interfaces so they can be integrated with standard Kalman-filter state updates and track management. This makes them drop-in compatible with many existing trackers. Modularity is ensured, as each network can be trained and evaluated independently of the others. Our evaluation on the KITTI tracking benchmark shows that SPENT reduces prediction RMSE by more than 50% compared to a standard Kalman filter, while SANT and MANTa achieve up to 95% assignment accuracy. These results demonstrate that small, task-specific neural modules can substantially improve tracking accuracy and robustness without sacrificing modularity, interpretability, or the real-time constraints required for automotive deployment.\n                  <\/jats:p>","DOI":"10.1007\/s43684-026-00127-2","type":"journal-article","created":{"date-parts":[[2026,3,26]],"date-time":"2026-03-26T00:02:09Z","timestamp":1774483329000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Tiny neural networks for multi-object tracking in a modular Kalman framework"],"prefix":"10.1007","volume":"6","author":[{"given":"Christian","family":"Holz","sequence":"first","affiliation":[]},{"given":"Christian","family":"Bader","sequence":"additional","affiliation":[]},{"given":"Markus","family":"Enzweiler","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6375-7743","authenticated-orcid":false,"given":"Matthias","family":"Dr\u00fcppel","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2026,3,26]]},"reference":[{"key":"127_CR1","doi-asserted-by":"publisher","first-page":"3464","DOI":"10.1109\/ICIP.2016.7533003","volume-title":"2016 IEEE International Conference on Image Processing (ICIP)","author":"A. Bewley","year":"2016","unstructured":"A. Bewley, Z. Ge, L. Ott, F. Ramos, B. Upcroft, Sort: simple online and realtime tracking, in 2016 IEEE International Conference on Image Processing (ICIP) (IEEE Press, New York, 2016), pp. 3464\u20133468"},{"key":"127_CR2","doi-asserted-by":"crossref","unstructured":"J. Seidenschwarz, G. Bras\u00f3, V. Serrano, I. Elezi, L. Leal-Taix\u00e9, Simple cues lead to a strong multi-object tracker. (2022) Available https:\/\/doi.org\/10.48550\/arXiv.2206.04656","DOI":"10.1109\/CVPR52729.2023.01327"},{"key":"127_CR3","first-page":"1","volume-title":"2023 26th International Conference on Information Fusion (FUSION)","author":"J. Krejc\u00ed","year":"2023","unstructured":"J. Krejc\u00ed, O. Kost, O. Straka, J. Dun\u00edk, Bounding box dynamics in visual tracking: modeling and noise covariance estimation, in 2023 26th International Conference on Information Fusion (FUSION) (2023), pp. 1\u20136. [Online]. Available https:\/\/api.semanticscholar.org\/CorpusID:261126559"},{"key":"127_CR4","doi-asserted-by":"crossref","unstructured":"A. Milan, S. Rezatofighi, A. Dick, I. Reid, K. Schindler, Online multi-target tracking using recurrent neural networks. Association for the Advancement of Artificial Intelligence (AAAI) (2017) Available https:\/\/doi.org\/10.1609\/aaai.v31i1.11194","DOI":"10.1609\/aaai.v31i1.11194"},{"key":"127_CR5","volume-title":"2019 22th International Conference on Information Fusion (FUSION)","author":"H. Liu","year":"2019","unstructured":"H. Liu, H. Zhang, C. Mertz, DeepDA: LSTM-based deep data association network for multi-targets tracking in clutter, in 2019 22th International Conference on Information Fusion (FUSION) (2019)"},{"key":"127_CR6","volume-title":"2017 IEEE International Conference on Computer Vision (ICCV)","author":"Q. Chu","year":"2017","unstructured":"Q. Chu, W. Ouyang, H. Li, X. Wang, B. Liu, N. Yu, Online multi-object tracking using cnn-based single object tracker with spatial-temporal attention mechanism, in 2017 IEEE International Conference on Computer Vision (ICCV) (2017)"},{"key":"127_CR7","doi-asserted-by":"crossref","unstructured":"T. Meinhardt, A. Kirillov, L. Leal-Taixe, C. Feichtenhofer, Trackformer: multi-object tracking with transformers. (2022) Available https:\/\/doi.org\/10.48550\/arXiv.2101.02702","DOI":"10.1109\/CVPR52688.2022.00864"},{"key":"127_CR8","unstructured":"W.-C. Hung, H. Kretzschmar, T.-Y. Lin, Y. Chai, R. Yu, Soda: multi-object tracking with soft data association. Paper (2020) Available https:\/\/doi.org\/10.48550\/arXiv.2008.07725"},{"key":"127_CR9","doi-asserted-by":"crossref","unstructured":"Y. Zhang, P. Sun, Y. Jiang, D. Yu, Z. Yuan, P. Luo, W. Liu, X. Wang, Bytetrack: multi-object tracking by associating every detection box. arXiv (2021). [Online]. Available https:\/\/api.semanticscholar.org\/CorpusID:238744032. arXiv:2110.06864","DOI":"10.1007\/978-3-031-20047-2_1"},{"key":"127_CR10","doi-asserted-by":"publisher","first-page":"3069","DOI":"10.1007\/s11263-021-01513-4","volume":"129","author":"Y. Zhang","year":"2020","unstructured":"Y. Zhang, C. Wang, X. Wang, W. Zeng, W. Liu, Fairmot: on the fairness of detection and re-identification in multiple object tracking. Int. J. Comput. Vis. 129, 3069\u20133087 (2020). [Online]. Available https:\/\/api.semanticscholar.org\/CorpusID:221562313","journal-title":"Int. J. Comput. Vis."},{"key":"127_CR11","doi-asserted-by":"publisher","first-page":"289","DOI":"10.1016\/j.inffus.2019.06.012","volume":"53","author":"J. Liu","year":"2020","unstructured":"J. Liu, Z. Wang, M. Xu, Deepmtt: a deep learning maneuvering target-tracking algorithm based on bidirectional lstm network. Inf. Fusion 53, 289\u2013304 (2020)","journal-title":"Inf. Fusion"},{"key":"127_CR12","first-page":"156","volume-title":"VEHITS","author":"A. Kampker","year":"2018","unstructured":"A. Kampker, M. Sefati, A.S.A. Rachman, K. Kreisk\u00f6ther, P. Campoy, Towards multi-object detection and tracking in urban scenario under uncertainties, in VEHITS (2018), pp. 156\u2013167"},{"issue":"6","key":"127_CR13","doi-asserted-by":"publisher","first-page":"5668","DOI":"10.1109\/TITS.2021.3055616","volume":"23","author":"H. Wu","year":"2022","unstructured":"H. Wu, W. Han, C. Wen, X. Li, C. Wang, 3d multi-object tracking in point clouds based on prediction confidence-guided data association. IEEE Trans. Intell. Transp. Syst. 23(6), 5668\u20135677 (2022)","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"127_CR14","doi-asserted-by":"crossref","unstructured":"S.D. Pendleton, H. Andersen, X. Du, X. Shen, M. Meghjani, Y.H. Eng, D. Rus, M.H. Ang, Perception, planning, control, and coordination for autonomous vehicles. Machines 5(1) (2017). [Online]. Available https:\/\/www.mdpi.com\/2075-1702\/5\/1\/6","DOI":"10.3390\/machines5010006"},{"key":"127_CR15","volume-title":"Handbook of Driver Assistance Systems: Basic Information, Components and Systems for Active Safety and Comfort","author":"H. Winner","year":"2024","unstructured":"H. Winner, S. Hakuli, F. Lotz, C. Singer, C. Stiller, Handbook of Driver Assistance Systems: Basic Information, Components and Systems for Active Safety and Comfort (Springer, Berlin, 2024)"},{"key":"127_CR16","volume-title":"TinyML: Machine Learning with TensorFlow Lite on Arduino and Ultra-Low-Power Microcontrollers","author":"P. Warden","year":"2019","unstructured":"P. Warden, D. Situnayake, TinyML: Machine Learning with TensorFlow Lite on Arduino and Ultra-Low-Power Microcontrollers (O\u2019Reilly Media, 2019)"},{"key":"127_CR17","doi-asserted-by":"crossref","unstructured":"H.W. Kuhn, The Hungarian method for the assignment problem. Naval Research Logistics Quarterly (1955)","DOI":"10.1002\/nav.3800020109"},{"key":"127_CR18","unstructured":"B.-T. Vo, Code set for research use: multi-sensor multi-target tracking. Code (2013). [Online]. Available https:\/\/ba-tuong.vo-au.com\/codes.html"},{"key":"127_CR19","volume-title":"Particle Filters for Tracking Applications","author":"B. Ristic","year":"2004","unstructured":"B. Ristic, S. Arulampalam, N. Gordon, Particle Filters for Tracking Applications (Artech House, Norwood, 2004)"},{"issue":"3","key":"127_CR20","doi-asserted-by":"publisher","first-page":"401","DOI":"10.1109\/JPROC.2003.823141","volume":"92","author":"S.J. Julier","year":"2004","unstructured":"S.J. Julier, J.K. Uhlmann, The unscented Kalman filter for nonlinear estimation. Proc. IEEE 92(3), 401\u2013422 (2004)","journal-title":"Proc. IEEE"},{"key":"127_CR21","first-page":"153","volume-title":"Proceedings of the IEEE Adaptive Systems for Signal Processing, Communications, and Control Symposium (AS-SPCC)","author":"E.A. Wan","year":"2000","unstructured":"E.A. Wan, R. Van Der Merwe, The unscented Kalman filter for nonlinear estimation, in Proceedings of the IEEE Adaptive Systems for Signal Processing, Communications, and Control Symposium (AS-SPCC) (IEEE Press, New York, 2000), pp. 153\u2013158"},{"key":"127_CR22","unstructured":"X. Zhou, V. Koltun, P. Kr\u2019ahenb\u2019uhl, CenterTrack: tracking Objects as Points (2020). arXiv preprint. arXiv:2004.01177"},{"key":"127_CR23","unstructured":"P. Sun, Y. Jiang, R. Zhang, E. Xie, P. Luo, H. Li, TransTrack: multiple-Object Tracking with Transformer (2020). arXiv preprint. arXiv:2012.15460"},{"issue":"4","key":"127_CR24","doi-asserted-by":"publisher","first-page":"1275","DOI":"10.1109\/7.259531","volume":"29","author":"J. Dezert","year":"1993","unstructured":"J. Dezert, Y. Bar-Shalom, Joint probabilistic data association for autonomous navigation. IEEE Trans. Aerosp. Electron. Syst. 29(4), 1275\u20131286 (1993)","journal-title":"IEEE Trans. Aerosp. Electron. Syst."},{"key":"127_CR25","doi-asserted-by":"publisher","first-page":"3645","DOI":"10.1109\/ICIP.2017.8296962","volume-title":"2017 IEEE International Conference on Image Processing (ICIP)","author":"N. Wojke","year":"2017","unstructured":"N. Wojke, A. Bewley, D. Paulus, Deepsort: simple online and realtime tracking with a deep association metric, in 2017 IEEE International Conference on Image Processing (ICIP) (IEEE Press, New York, 2017), pp. 3645\u20133649"},{"key":"127_CR26","doi-asserted-by":"crossref","unstructured":"R. Henschel, L. Leal-Taix\u00e9, D. Cremers, B. Rosenhahn, Fusion of head and full-body detectors for multi-object tracking. Computer Vision and Pattern Recognition (2017) Available https:\/\/doi.org\/10.48550\/arXiv.1705.08314","DOI":"10.1109\/CVPRW.2018.00192"},{"key":"127_CR27","volume-title":"Proceedings of the IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","author":"Z. Wang","year":"2019","unstructured":"Z. Wang, L. Zheng, Y. Liu, et al., Towards real-time multi-object tracking, in Proceedings of the IEEE International Conference on Multimedia and Expo Workshops (ICMEW) (2019)"},{"key":"127_CR28","unstructured":"J. Fitz, Datenassoziation f\u00fcr multi-objekt-verfolgung mittels deep learning. Paper (2020) Available https:\/\/www.deutsche-digitale-bibliothek.de\/item\/JMGUN5VSWOARH4SDB62G54IRXB2VSHJN"},{"key":"127_CR29","unstructured":"A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, L. Kaiser, I. Polosukhin, Attention is all you need. NIPS (2017) Available https:\/\/doi.org\/10.48550\/arXiv.1706.03762"},{"key":"127_CR30","doi-asserted-by":"crossref","unstructured":"N. Carion, F. Massa, G. Synnaeve, N. Usunier, A. Kirillov, S. Zagoruyko, End-to-end object detection with transformers. arXiv (2020). [Online]. Available https:\/\/api.semanticscholar.org\/CorpusID:218889832. arXiv:2005.12872","DOI":"10.1007\/978-3-030-58452-8_13"},{"issue":"8","key":"127_CR31","doi-asserted-by":"publisher","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","volume":"9","author":"S. Hochreiter","year":"1997","unstructured":"S. Hochreiter, J. Schmidhuber, Long short-term memory. Neural Comput. 9(8), 1735\u20131780 (1997)","journal-title":"Neural Comput."},{"issue":"11","key":"127_CR32","doi-asserted-by":"publisher","first-page":"2673","DOI":"10.1109\/78.650093","volume":"45","author":"M. Schuster","year":"1997","unstructured":"M. Schuster, K. Paliwal, Bidirectional recurrent neural networks. IEEE Trans. Signal Process. 45(11), 2673\u20132681 (1997)","journal-title":"IEEE Trans. Signal Process."},{"key":"127_CR33","volume-title":"Conference on Computer Vision and Pattern Recognition (CVPR)","author":"A. Geiger","year":"2012","unstructured":"A. Geiger, P. Lenz, R. Urtasun, Are we ready for autonomous driving? The kitti vision benchmark suite, in Conference on Computer Vision and Pattern Recognition (CVPR) (2012)"},{"key":"127_CR34","doi-asserted-by":"crossref","unstructured":"X. Gao, Z. Wang, X. Wang, S. Zhang, S. Zhuang, H. Wang, Dettrack: an algorithm for multiple object tracking by improving occlusion object detection. Electronics 13(1) (2023). [Online]. Available https:\/\/www.mdpi.com\/2079-9292\/13\/1\/91","DOI":"10.3390\/electronics13010091"},{"key":"127_CR35","volume-title":"Deep Learning","author":"I. Goodfellow","year":"2019","unstructured":"I. Goodfellow, Y. Bengio, A. Courville, Deep Learning (MIT Press, Cambridge, 2019)"},{"key":"127_CR36","unstructured":"D.M. Reddy, N.V.S. Reddy, Effect of padding on lstms and cnns. Paper (2019) Available https:\/\/doi.org\/10.48550\/arXiv.1903.07288"},{"key":"127_CR37","unstructured":"S. Ioffe, C. Szegedy, Batch normalization: accelerating deep network training by reducing internal covariate shift. ICML (2015) Available https:\/\/doi.org\/10.48550\/arXiv.1502.03167"},{"issue":"1","key":"127_CR38","first-page":"1929","volume":"15","author":"N. Srivastava","year":"2014","unstructured":"N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, R. Salakhutdinov, Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15(1), 1929\u20131958 (2014)","journal-title":"J. Mach. Learn. Res."},{"key":"127_CR39","first-page":"249","volume":"9","author":"X. Glorot","year":"2010","unstructured":"X. Glorot, Y. Bengio, Understanding the difficulty of training deep feedforward neural networks. J. Mach. Learn. Res. 9, 249\u2013256 (2010)","journal-title":"J. Mach. Learn. Res."},{"key":"127_CR40","unstructured":"G. Welch, G. Bishop, An Introduction to the Kalman Filter. Department of Computer Science, University of North Carolina at Chapel Hill, Chapel Hill, NC, Tech. Rep. TR 95-041 (2006)"},{"key":"127_CR41","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV)","author":"P. Bergmann","year":"2019","unstructured":"P. Bergmann, T. Meinhardt, L. Leal-Taix\u00e9, Tracking without bells and whistles, in Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV) (2019)"},{"key":"127_CR42","doi-asserted-by":"crossref","unstructured":"T. Meinhardt, A. Kirillov, L. Leal-Taix\u00e9, C. Feichtenhofer, TrackFormer: multi-Object Tracking with Transformers (2021). arXiv preprint. arXiv:2101.02702","DOI":"10.1109\/CVPR52688.2022.00864"}],"container-title":["Autonomous Intelligent Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s43684-026-00127-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s43684-026-00127-2","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s43684-026-00127-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,3,26]],"date-time":"2026-03-26T00:02:15Z","timestamp":1774483335000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s43684-026-00127-2"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,3,26]]},"references-count":42,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2026,12]]}},"alternative-id":["127"],"URL":"https:\/\/doi.org\/10.1007\/s43684-026-00127-2","relation":{},"ISSN":["2730-616X"],"issn-type":[{"value":"2730-616X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,3,26]]},"assertion":[{"value":"10 November 2025","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"23 January 2026","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"22 February 2026","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 March 2026","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"All authors declare that there are no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"6"}}