{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,21]],"date-time":"2026-01-21T13:01:15Z","timestamp":1769000475819,"version":"3.49.0"},"reference-count":91,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2020,1,30]],"date-time":"2020-01-30T00:00:00Z","timestamp":1580342400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Florida Department of Transportation (FDOT) and NSF CNS","award":["1922782"],"award-info":[{"award-number":["1922782"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Spatial Algorithms Syst."],"published-print":{"date-parts":[[2020,6,30]]},"abstract":"<jats:p>Camera-based systems are increasingly used for collecting information on intersections and arterials. Unlike loop controllers that can generally be only used for detection and movement of vehicles, cameras can provide rich information about the traffic behavior. Vision-based frameworks for multiple-object detection, object tracking, and near-miss detection have been developed to derive this information. However, much of this work currently addresses processing videos offline. In this article, we propose an integrated two-stream convolutional networks architecture that performs real-time detection, tracking, and near-accident detection of road users in traffic video data. The two-stream model consists of a spatial stream network for object detection and a temporal stream network to leverage motion features for multiple-object tracking. We detect near-accidents by incorporating appearance features and motion features from these two networks. Further, we demonstrate that our approaches can be executed in real-time and at a frame rate that is higher than the video frame rate on a variety of videos collected from fisheye and overhead cameras.<\/jats:p>","DOI":"10.1145\/3373647","type":"journal-article","created":{"date-parts":[[2020,2,28]],"date-time":"2020-02-28T12:40:48Z","timestamp":1582893648000},"page":"1-28","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":86,"title":["Intelligent Intersection"],"prefix":"10.1145","volume":"6","author":[{"given":"Xiaohui","family":"Huang","sequence":"first","affiliation":[{"name":"University of Florida, Gainesville, FL, USA"}]},{"given":"Pan","family":"He","sequence":"additional","affiliation":[{"name":"University of Florida, Gainesville, FL, USA"}]},{"given":"Anand","family":"Rangarajan","sequence":"additional","affiliation":[{"name":"University of Florida, Gainesville, FL, USA"}]},{"given":"Sanjay","family":"Ranka","sequence":"additional","affiliation":[{"name":"University of Florida, Gainesville, FL, USA"}]}],"member":"320","published-online":{"date-parts":[[2020,1,30]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2012.120"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/ITSC.2002.1041184"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2010.161"},{"key":"e_1_2_1_4_1","volume-title":"Proceedings of the IEEE International Conference on Image Processing (ICIP\u201916)","author":"Bewley Alex","year":"2016"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/ITSC.2000.881040"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/TITS.2011.2119372"},{"key":"e_1_2_1_7_1","volume-title":"Proceedings of the Asian Conference on Computer Vision. Springer, 136--153","author":"Chan Fu-Hsiang","year":"2016"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICNC.2010.5583920"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICARM.2016.7606983"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0968-090X(98)00019-9"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2005.177"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-014-0733-5"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-009-0275-4"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1111\/1467-8667.t01-1-00311"},{"key":"e_1_2_1_15_1","volume-title":"Fast R-CNN. In Proceedings of the IEEE International Conference on Computer Vision. 1440--1448","author":"Girshick Ross","year":"2015"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.81"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2015.2437384"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.331"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/ITSC.2019.8917097"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/5254.708428"},{"key":"e_1_2_1_22_1","unstructured":"Alexander Hermans Lucas Beyer and Bastian Leibe. 2017. In defense of the triplet loss for person re-identification. arXiv preprint arXiv:1703.07737.  Alexander Hermans Lucas Beyer and Bastian Leibe. 2017. In defense of the triplet loss for person re-identification. arXiv preprint arXiv:1703.07737."},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/IVS.2009.5164354"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/AVSS.2011.6027304"},{"key":"e_1_2_1_25_1","volume-title":"IMPACTS: An image analysis tool for motorway surveillance. Traffic Eng. Control 33, 3","author":"Hoose Neil","year":"1992"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1186\/s41074-018-0045-8"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPR.2008.4761041"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/ITSC.1999.821154"},{"key":"e_1_2_1_29_1","unstructured":"Sergey Ioffe and Christian Szegedy. 2015. Batch normalization: Accelerating deep network training by reducing internal covariate shift. arXiv preprint arXiv:1502.03167.  Sergey Ioffe and Christian Szegedy. 2015. Batch normalization: Accelerating deep network training by reducing internal covariate shift. arXiv preprint arXiv:1502.03167."},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.3141\/2365-12"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01234-2_22"},{"key":"e_1_2_1_32_1","volume-title":"Proceedings of the IEEE International Conference on Image Processing","volume":"5","author":"Jiang Fan"},{"key":"e_1_2_1_33_1","volume-title":"Proceedings of the 11th World Congress on Intelligent Control and Automation. IEEE, 1035--1038","author":"Fu"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/WACV.2014.6836010"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1115\/1.3662552"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/6979.880968"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1061\/(ASCE)0733-947X(2002)128:3(232)"},{"key":"e_1_2_1_38_1","doi-asserted-by":"crossref","unstructured":"Hirokatsu Kataoka Teppei Suzuki Shoko Oikawa Yasuhiro Matsui and Yutaka Satoh. 2018. Drive video analysis for the detection of traffic near-miss incidents. arXiv preprint arXiv:1804.02555.  Hirokatsu Kataoka Teppei Suzuki Shoko Oikawa Yasuhiro Matsui and Yutaka Satoh. 2018. Drive video analysis for the detection of traffic near-miss incidents. arXiv preprint arXiv:1804.02555.","DOI":"10.1109\/ICRA.2018.8460812"},{"key":"e_1_2_1_39_1","volume-title":"Proceedings of the IEEE International Conference on Computer Vision. 4696--4704","author":"Kim Chanho"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/VNIS.1994.396867"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.5555\/166914.2684747"},{"key":"e_1_2_1_42_1","unstructured":"Alex Krizhevsky Geoffrey Hinton et al. 2009. Learning Multiple Layers of Features from Tiny Images. Technical Report. University of Toronto.  Alex Krizhevsky Geoffrey Hinton et al. 2009. Learning Multiple Layers of Features from Tiny Images. Technical Report. University of Toronto."},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1002\/nav.3800020109"},{"key":"e_1_2_1_44_1","unstructured":"Laura Leal-Taix\u00e9 Anton Milan Ian Reid Stefan Roth and Konrad Schindler. 2015. MOTChallenge 2015: Towards a benchmark for multi-target tracking. arXiv preprint arXiv:1504.01942.  Laura Leal-Taix\u00e9 Anton Milan Ian Reid Stefan Roth and Konrad Schindler. 2015. MOTChallenge 2015: Towards a benchmark for multi-target tracking. arXiv preprint arXiv:1504.01942."},{"key":"e_1_2_1_45_1","volume-title":"Proceedings of the European Conference on Computer Vision. Springer, 740--755","author":"Lin Tsung-Yi"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICIP.2010.5651958"},{"key":"e_1_2_1_47_1","volume-title":"Proceedings of the European Conference on Computer Vision. Springer, 21--37","author":"Liu Wei"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICME.2016.7553002"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.1999.790410"},{"key":"e_1_2_1_50_1","unstructured":"Wenhan Luo Junliang Xing Anton Milan Xiaoqin Zhang Wei Liu Xiaowei Zhao and Tae-Kyun Kim. 2014. Multiple-object tracking: A literature review. arXiv preprint arXiv:1409.7618.  Wenhan Luo Junliang Xing Anton Milan Xiaoqin Zhang Wei Liu Xiaowei Zhao and Tae-Kyun Kim. 2014. Multiple-object tracking: A literature review. arXiv preprint arXiv:1409.7618."},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/SIES.2017.7993382"},{"key":"e_1_2_1_52_1","volume-title":"Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE, 495--501","author":"McLauchlan Philip","year":"1997"},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1109\/VNIS.1995.518844"},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.91"},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.690"},{"key":"e_1_2_1_56_1","volume-title":"Advances in Neural Information Processing Systems","author":"Ren Shaoqing"},{"key":"e_1_2_1_57_1","unstructured":"Oren Rippel Manohar Paluri Piotr Dollar and Lubomir Bourdev. 2015. Metric learning with adaptive density discrimination. arXiv preprint arXiv:1511.05939.  Oren Rippel Manohar Paluri Piotr Dollar and Lubomir Bourdev. 2015. Metric learning with adaptive density discrimination. arXiv preprint arXiv:1511.05939."},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-015-0816-y"},{"key":"e_1_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPR.2010.817"},{"key":"e_1_2_1_60_1","unstructured":"Tim Salimans and Durk P. Kingma. 2016. Weight normalization: A simple reparameterization to accelerate training of deep neural networks. In Advances in Neural Information Processing Systems. MIT Press 901--909.  Tim Salimans and Durk P. Kingma. 2016. Weight normalization: A simple reparameterization to accelerate training of deep neural networks. In Advances in Neural Information Processing Systems. MIT Press 901--909."},{"key":"e_1_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.1080\/22797254.2017.1328978"},{"key":"e_1_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.1049\/ip-vis:20041302"},{"key":"e_1_2_1_63_1","unstructured":"Ankit Shah Jean Baptiste Lamare Tuan Nguyen Anh and Alexander Hauptmann. 2018. Accident forecasting in CCTV traffic camera videos. arXiv preprint arXiv:1809.05782.  Ankit Shah Jean Baptiste Lamare Tuan Nguyen Anh and Alexander Hauptmann. 2018. Accident forecasting in CCTV traffic camera videos. arXiv preprint arXiv:1809.05782."},{"key":"e_1_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.1109\/ITSC.2002.1041306"},{"key":"e_1_2_1_65_1","doi-asserted-by":"publisher","DOI":"10.1109\/TITS.2018.2835308"},{"key":"e_1_2_1_66_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCVW.2011.6130503"},{"key":"e_1_2_1_67_1","doi-asserted-by":"publisher","DOI":"10.1109\/ITSC.2001.948682"},{"key":"e_1_2_1_68_1","doi-asserted-by":"publisher","DOI":"10.1109\/TITS.2004.825084"},{"key":"e_1_2_1_69_1","doi-asserted-by":"publisher","DOI":"10.1109\/SIS.2003.1202260"},{"key":"e_1_2_1_70_1","doi-asserted-by":"crossref","unstructured":"Waqas Sultani Chen Chen and Mubarak Shah. 2018. Real-world anomaly detection in surveillance videos. Center for Research in Computer Vision University of Central Florida (2018).  Waqas Sultani Chen Chen and Mubarak Shah. 2018. Real-world anomaly detection in surveillance videos. Center for Research in Computer Vision University of Central Florida (2018).","DOI":"10.1109\/CVPR.2018.00678"},{"key":"e_1_2_1_71_1","doi-asserted-by":"crossref","unstructured":"Tomoyuki Suzuki Hirokatsu Kataoka Yoshimitsu Aoki and Yutaka Satoh. 2018. Anticipating traffic accidents with adaptive loss and large-scale incident db. arXiv preprint arXiv:1804.02675.  Tomoyuki Suzuki Hirokatsu Kataoka Yoshimitsu Aoki and Yutaka Satoh. 2018. Anticipating traffic accidents with adaptive loss and large-scale incident db. arXiv preprint arXiv:1804.02675.","DOI":"10.1109\/CVPR.2018.00371"},{"key":"e_1_2_1_72_1","doi-asserted-by":"publisher","DOI":"10.1109\/TITS.2004.843112"},{"key":"e_1_2_1_73_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46484-8_4"},{"key":"e_1_2_1_74_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-013-0620-5"},{"key":"e_1_2_1_75_1","volume-title":"Proceedings of the IEEE International Conference on Image Processing (ICIP). IEEE, 2470--2474","author":"Ullah Habib"},{"key":"e_1_2_1_76_1","doi-asserted-by":"publisher","DOI":"10.1049\/ip-vis:20041147"},{"key":"e_1_2_1_77_1","doi-asserted-by":"publisher","DOI":"10.1109\/TITS.2003.821212"},{"key":"e_1_2_1_78_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2001.990517"},{"key":"e_1_2_1_79_1","doi-asserted-by":"publisher","DOI":"10.1023\/B:VISI.0000013087.49260.fb"},{"key":"e_1_2_1_80_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICIP.2012.6467456"},{"key":"e_1_2_1_81_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCIS.2006.252312"},{"key":"e_1_2_1_82_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICOSP.2010.5655356"},{"key":"e_1_2_1_83_1","first-page":"207","article-title":"Distance metric learning for large margin nearest neighbor classification","author":"Weinberger Kilian Q.","year":"2009","journal-title":"J. Mach. Learn. Res. 10"},{"key":"e_1_2_1_84_1","doi-asserted-by":"publisher","DOI":"10.1109\/IVS.2012.6232277"},{"key":"e_1_2_1_85_1","doi-asserted-by":"publisher","DOI":"10.1109\/WACV.2018.00087"},{"key":"e_1_2_1_86_1","volume-title":"Proceedings of the IEEE International Conference on Image Processing (ICIP\u201917)","author":"Wojke Nicolai","year":"2017"},{"key":"e_1_2_1_87_1","volume-title":"Proceedings of the 10th Asian Control Conference (ASCC\u201915)","author":"Xia Siyu","year":"2015"},{"key":"e_1_2_1_88_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.534"},{"key":"e_1_2_1_89_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0165-0114(96)00229-1"},{"key":"e_1_2_1_90_1","unstructured":"Yupeng Yan Xiaohui Huang Anand Rangarajan and Sanjay Ranka. 2018. Densely labeling large-scale satellite images with generative adversarial networks. In Proceedings of the IEEE 16th International Conference on Dependable Autonomic and Secure Computing 16th International Conference on Pervasive Intelligence and Computing 4th International Conference on Big Data Intelligence and Computing and Cyber Science and Technology Congress (DASC\/PiCom\/DataCom\/CyberSciTech\u201918). IEEE 927--934.  Yupeng Yan Xiaohui Huang Anand Rangarajan and Sanjay Ranka. 2018. Densely labeling large-scale satellite images with generative adversarial networks. In Proceedings of the IEEE 16th International Conference on Dependable Autonomic and Secure Computing 16th International Conference on Pervasive Intelligence and Computing 4th International Conference on Big Data Intelligence and Computing and Cyber Science and Technology Congress (DASC\/PiCom\/DataCom\/CyberSciTech\u201918). IEEE 927--934."},{"key":"e_1_2_1_91_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICNC.2008.54"},{"key":"e_1_2_1_92_1","doi-asserted-by":"publisher","DOI":"10.4304\/jcp.3.10.36-43"}],"container-title":["ACM Transactions on Spatial Algorithms and Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3373647","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3373647","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:02:34Z","timestamp":1750197754000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3373647"}},"subtitle":["Two-stream Convolutional Networks for Real-time Near-accident Detection in Traffic Video"],"short-title":[],"issued":{"date-parts":[[2020,1,30]]},"references-count":91,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2020,6,30]]}},"alternative-id":["10.1145\/3373647"],"URL":"https:\/\/doi.org\/10.1145\/3373647","relation":{},"ISSN":["2374-0353","2374-0361"],"issn-type":[{"value":"2374-0353","type":"print"},{"value":"2374-0361","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,1,30]]},"assertion":[{"value":"2018-12-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2019-11-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-01-30","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}