{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:10:14Z","timestamp":1750219814489,"version":"3.41.0"},"reference-count":41,"publisher":"Association for Computing Machinery (ACM)","issue":"5","license":[{"start":{"date-parts":[[2023,8,11]],"date-time":"2023-08-11T00:00:00Z","timestamp":1691712000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Intell. Syst. Technol."],"published-print":{"date-parts":[[2023,10,31]]},"abstract":"<jats:p>Traffic congestion incurs long delay in travel time, which seriously affects our daily travel experiences. Exploring why traffic congestion occurs is significantly important to effectively address the problem of traffic congestion and improve user experience. Traditional approaches to mine the congestion causes depend on human efforts, which is time consuming and cost-intensive. Hence, we aim at discovering the known and unknown causes of traffic congestion in a systematic way. However, to achieve it, there are three challenges: (1) traffic congestion is affected by several factors with complex spatio-temporal relations; (2) there are a few samples of congestion data with known causes due to the limitation of human label; (3) more unknown congestion causes are unexplored since several factors contribute to traffic congestion. To address above challenges, we design a congestion cause discovery system consisting of two modules: (1) congestion feature extraction module, which extracts the important features distinguishing between different causes of congestion; and (2) congestion cause discovery module, which designs a deep semi-supervised learning based framework to discover the causes of traffic congestion with limited labeled data. Specifically, in pre-training stage, it first leverages a few labeled data as prior knowledge to pre-train the model. Then, in clustering stage, we propose two different clustering methods to discover the congestion causes. For the first clustering method, we extend the classic deep embedded clustering model to produce clusters via soft assignment. For the second one, we iteratively use<jats:italic>k<\/jats:italic>-means to group the latent features extracted from the pre-trained model, and use the cluster results as pseudo-labels to fine-tune the network. Extensive experiments show that the performance of our methods is superior to the state-of-the-art baselines, which demonstrates the effectiveness of the proposed cause discovery system. Additionally, our system is deployed and used in the practical production environment at Amap.<\/jats:p>","DOI":"10.1145\/3604810","type":"journal-article","created":{"date-parts":[[2023,6,17]],"date-time":"2023-06-17T09:14:59Z","timestamp":1686993299000},"page":"1-24","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["Discovering Causes of Traffic Congestion via Deep Transfer Clustering"],"prefix":"10.1145","volume":"14","author":[{"ORCID":"https:\/\/orcid.org\/0009-0000-4628-2048","authenticated-orcid":false,"given":"Mudan","family":"Wang","sequence":"first","affiliation":[{"name":"Beijing National Research Center for Information Science and Technology (BNRist), Department of Electronic Engineering, Tsinghua University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1701-2588","authenticated-orcid":false,"given":"Yuan","family":"Yuan","sequence":"additional","affiliation":[{"name":"Beijing National Research Center for Information Science and Technology (BNRist), Department of Electronic Engineering, Tsinghua University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9626-5676","authenticated-orcid":false,"given":"Huan","family":"Yan","sequence":"additional","affiliation":[{"name":"Beijing National Research Center for Information Science and Technology (BNRist), Department of Electronic Engineering, Tsinghua University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-8702-234X","authenticated-orcid":false,"given":"Hongjie","family":"Sui","sequence":"additional","affiliation":[{"name":"Beijing National Research Center for Information Science and Technology (BNRist), Department of Electronic Engineering, Tsinghua University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0001-6358-3392","authenticated-orcid":false,"given":"Fan","family":"Zuo","sequence":"additional","affiliation":[{"name":"Alibaba Group, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0008-1396-5270","authenticated-orcid":false,"given":"Yue","family":"Liu","sequence":"additional","affiliation":[{"name":"Alibaba Group, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5617-1659","authenticated-orcid":false,"given":"Yong","family":"Li","sequence":"additional","affiliation":[{"name":"Beijing National Research Center for Information Science and Technology (BNRist), Department of Electronic Engineering, Tsinghua University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0419-5514","authenticated-orcid":false,"given":"Depeng","family":"Jin","sequence":"additional","affiliation":[{"name":"Beijing National Research Center for Information Science and Technology (BNRist), Department of Electronic Engineering, Tsinghua University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2023,8,11]]},"reference":[{"key":"e_1_3_1_2_2","doi-asserted-by":"publisher","DOI":"10.1145\/3356995.3364537"},{"key":"e_1_3_1_3_2","doi-asserted-by":"crossref","unstructured":"Yue Bao Feng Xiao Zaihan Gao and Ziyou Gao. 2017. Investigation of the traffic congestion during public holiday and the impact of the toll-exemption policy. Transportation Research Part B: Methodological 104 (2017) 58\u201381.","DOI":"10.1016\/j.trb.2017.05.011"},{"key":"e_1_3_1_4_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01264-9_9"},{"key":"e_1_3_1_5_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2012.104"},{"key":"e_1_3_1_6_2","doi-asserted-by":"crossref","unstructured":"Meng Chen Xiaohui Yu and Yang Liu. 2018. PCNN: Deep convolutional networks for short-term traffic congestion prediction. IEEE Transactions on Intelligent Transportation Systems 19 11 (2018) 3550\u20133559.","DOI":"10.1109\/TITS.2018.2835523"},{"key":"e_1_3_1_7_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v36i6.20587"},{"key":"e_1_3_1_8_2","doi-asserted-by":"crossref","unstructured":"Kai Han Sylvestre-Alvise Rebuffi Sebastien Ehrhardt Andrea Vedaldi and Andrew Zisserman. 2021. Autonovel: Automatically discovering and learning novel visual categories. IEEE Transactions on Pattern Analysis and Machine Intelligence 44 10 (2021) 6767\u20136781.","DOI":"10.1109\/TPAMI.2021.3091944"},{"key":"e_1_3_1_9_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00849"},{"key":"e_1_3_1_10_2","unstructured":"Yen-Chang Hsu Zhaoyang Lv and Zsolt Kira. 2018. Learning to cluster in order to transfer across domains and tasks. In Proceedings of International Conference on Learning Representations (ICLR) . 1\u201320."},{"key":"e_1_3_1_11_2","unstructured":"Yen-Chang Hsu Zhaoyang Lv Joel Schlosser Phillip Odom and Zsolt Kira. 2019. Multi-class classification without multi-class labels. In Proceedings of International Conference on Learning Representations (ICLR) . 1\u201316."},{"key":"e_1_3_1_12_2","doi-asserted-by":"crossref","unstructured":"Stephen C. Johnson. 1967. Hierarchical clustering schemes. Psychometrika 32 3 (1967) 241\u2013254.","DOI":"10.1007\/BF02289588"},{"key":"e_1_3_1_13_2","doi-asserted-by":"crossref","unstructured":"Yiannis Kamarianakis and Poulicos Prastacos. 2003. Forecasting traffic flow conditions in an urban network: Comparison of multivariate and univariate approaches. Transportation Research Record 1857 1 (2003) 74\u201384.","DOI":"10.3141\/1857-09"},{"key":"e_1_3_1_14_2","doi-asserted-by":"crossref","unstructured":"Hans-Peter Kriegel Peer Kr\u00f6ger J\u00f6rg Sander and Arthur Zimek. 2011. Density-based clustering. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 1 3 (2011) 231\u2013240.","DOI":"10.1002\/widm.30"},{"key":"e_1_3_1_15_2","doi-asserted-by":"crossref","unstructured":"Harold W. Kuhn. 1955. The hungarian method for the assignment problem. Naval Research Logistics Quarterly 2 1\u20132 (1955) 83\u201397.","DOI":"10.1002\/nav.3800020109"},{"key":"e_1_3_1_16_2","first-page":"11906","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Lan Shiyong","year":"2022","unstructured":"Shiyong Lan, Yitong Ma, Weikang Huang, Wenwu Wang, Hongyu Yang, and Pyang Li. 2022. Dstagnn: Dynamic spatial-temporal aware graph neural network for traffic flow forecasting. In Proceedings of the International Conference on Machine Learning. PMLR, 11906\u201311917."},{"key":"e_1_3_1_17_2","unstructured":"Geoffrey Hinton Laurens van der Maaten. 2008. Visualizing data using t-sne. Journal of Machine Learning Research 9 11 (2008) 2579\u20132605."},{"key":"e_1_3_1_18_2","doi-asserted-by":"crossref","unstructured":"Junyi Li Fangce Guo Aruna Sivakumar Yanjie Dong and Rajesh Krishnan. 2021. Transferability improvement in short-term traffic prediction using stacked LSTM network. Transportation Research Part C: Emerging Technologies 124 (2021) 102977.","DOI":"10.1016\/j.trc.2021.102977"},{"key":"e_1_3_1_19_2","unstructured":"Yaguang Li Rose Yu Cyrus Shahabi and Yan Liu. 2018. Diffusion convolutional recurrent neural network: Data-driven traffic forecasting. In Proceedings of International Conference on Learning Representations . 1\u201316."},{"key":"e_1_3_1_20_2","doi-asserted-by":"publisher","DOI":"10.1145\/3139958.3139960"},{"key":"e_1_3_1_21_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i05.6353"},{"key":"e_1_3_1_22_2","doi-asserted-by":"publisher","DOI":"10.1145\/2020408.2020571"},{"key":"e_1_3_1_23_2","doi-asserted-by":"crossref","unstructured":"Xiaolei Ma Zhimin Tao Yinhai Wang Haiyang Yu and Yunpeng Wang. 2015. Long short-term memory neural network for traffic speed prediction using remote microwave sensor data. Transportation Research Part C: Emerging Technologies 54 (2015) 187\u2013197.","DOI":"10.1016\/j.trc.2015.03.014"},{"key":"e_1_3_1_24_2","first-page":"281","volume-title":"Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability","author":"MacQueen James","year":"1967","unstructured":"James MacQueen. 1967. Some methods for classification and analysis of multivariate observations. In Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability. 281\u2013297."},{"key":"e_1_3_1_25_2","unstructured":"P. R. Newswire. 2021. New harris survey reveals chronic nasal congestion is surprisingly widespread: Nearly 1 in 4 americans with nasal congestion experience symptoms almost every day. https:\/\/www.biospace.com\/article\/releases\/new-harris-survey-reveals-chronic-nasal-congestion-is-surprisingly-widespread-nearly-1-in-4-americans-with-nasal-congestion-experience-symptoms-almost-every-day\/."},{"key":"e_1_3_1_26_2","doi-asserted-by":"publisher","DOI":"10.1145\/3292500.3330884"},{"key":"e_1_3_1_27_2","doi-asserted-by":"crossref","unstructured":"Mingyu Pi Hanbyul Yeon Hyesook Son and Yun Jang. 2019. Visual cause analytics for traffic congestion. IEEE Transactions on Visualization and Computer Graphics 27 3 (2019) 2186\u20132201.","DOI":"10.1109\/TVCG.2019.2940580"},{"key":"e_1_3_1_28_2","doi-asserted-by":"crossref","unstructured":"William M. Rand. 1971. Objective criteria for the evaluation of clustering methods. Journal of the American Statistical Association 66 336 (1971) 846\u2013850.","DOI":"10.1080\/01621459.1971.10482356"},{"key":"e_1_3_1_29_2","unstructured":"David Schrank Bill Eisele and Tim Lomax. 2021. Urban mobility report 2021. https:\/\/static.tti.tamu.edu\/tti.tamu.edu\/documents\/mobility-report-2021.pdf."},{"key":"e_1_3_1_30_2","doi-asserted-by":"crossref","unstructured":"Jinchao Song Chunli Zhao Shaopeng Zhong Thomas Alexander Sick Nielsen and Alexander V. Prishchepov. 2019. Mapping spatio-temporal patterns and detecting the factors of traffic congestion with multi-source data fusion and mining techniques. Computers Environment and Urban Systems 77 (2019) 101364.","DOI":"10.1016\/j.compenvurbsys.2019.101364"},{"key":"e_1_3_1_31_2","unstructured":"Alexander Strehl and Joydeep Ghosh. 2002. Cluster ensembles\u2014a knowledge reuse framework for combining multiple partitions. Journal of Machine Learning Research 3 12 (2002) 583\u2013617."},{"key":"e_1_3_1_32_2","doi-asserted-by":"publisher","DOI":"10.1109\/SmartCity.2015.63"},{"key":"e_1_3_1_33_2","doi-asserted-by":"crossref","unstructured":"Mascha Van Der Voort Mark Dougherty and Susan Watson. 1996. Combining kohonen maps with ARIMA time series models to forecast traffic flow. Transportation Research Part C: Emerging Technologies 4 5 (1996) 307\u2013318.","DOI":"10.1016\/S0968-090X(97)82903-8"},{"key":"e_1_3_1_34_2","unstructured":"Pascal Vincent Hugo Larochelle Isabelle Lajoie Yoshua Bengio Pierre-Antoine Manzagol and L\u00e9on Bottou. 2010. Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion. Journal of Machine Learning Research 11 12 (2010) 3371\u20133408."},{"key":"e_1_3_1_35_2","unstructured":"Tiange Wang Zijun Zhang and Kwok-Leung Tsui. 2021. PSTN: Periodic spatial-temporal deep neural network for traffic condition prediction. arXiv:2108.02424. Retrieved from https:\/\/arxiv.org\/abs\/2108.02424."},{"key":"e_1_3_1_36_2","doi-asserted-by":"crossref","unstructured":"Zonghan Wu Shirui Pan Guodong Long Jing Jiang and Chengqi Zhang. 2019. Graph waveNet for deep spatial-temporal graph modeling. In Proceedings of the 28th International Joint Conference on Artificial Intelligence . 1907\u20131913.","DOI":"10.24963\/ijcai.2019\/264"},{"key":"e_1_3_1_37_2","first-page":"478","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Xie Junyuan","year":"2016","unstructured":"Junyuan Xie, Ross Girshick, and Ali Farhadi. 2016. Unsupervised deep embedding for clustering analysis. In Proceedings of the International Conference on Machine Learning. PMLR, 478\u2013487."},{"key":"e_1_3_1_38_2","first-page":"3861","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Yang Bo","year":"2017","unstructured":"Bo Yang, Xiao Fu, Nicholas D. Sidiropoulos, and Mingyi Hong. 2017. Towards k-means-friendly spaces: Simultaneous deep learning and clustering. In Proceedings of the International Conference on Machine Learning. PMLR, 3861\u20133870."},{"key":"e_1_3_1_39_2","doi-asserted-by":"crossref","unstructured":"Bing Yu Haoteng Yin and Zhanxing Zhu. 2018. Spatio-temporal graph convolutional networks: A deep learning framework for traffic forecasting. In Proceedings of the 27th International Joint Conference on Artificial Intelligence . 3634\u20133640.","DOI":"10.24963\/ijcai.2018\/505"},{"key":"e_1_3_1_40_2","doi-asserted-by":"crossref","unstructured":"Da Zhang and Mansur R. Kabuka. 2018. Combining weather condition data to predict traffic flow: A GRU-based deep learning approach. IET Intelligent Transport Systems 12 7 (2018) 578\u2013585.","DOI":"10.1049\/iet-its.2017.0313"},{"key":"e_1_3_1_41_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v35i16.17689"},{"key":"e_1_3_1_42_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v35i17.17761"}],"container-title":["ACM Transactions on Intelligent Systems and Technology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3604810","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3604810","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:46:05Z","timestamp":1750178765000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3604810"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,8,11]]},"references-count":41,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2023,10,31]]}},"alternative-id":["10.1145\/3604810"],"URL":"https:\/\/doi.org\/10.1145\/3604810","relation":{},"ISSN":["2157-6904","2157-6912"],"issn-type":[{"type":"print","value":"2157-6904"},{"type":"electronic","value":"2157-6912"}],"subject":[],"published":{"date-parts":[[2023,8,11]]},"assertion":[{"value":"2022-11-14","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-06-06","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-08-11","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}