{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,23]],"date-time":"2026-03-23T08:40:24Z","timestamp":1774255224577,"version":"3.50.1"},"reference-count":47,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2022,1,5]],"date-time":"2022-01-05T00:00:00Z","timestamp":1641340800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"RGC Theme-based Research Scheme","award":["T41-603\/20-R"],"award-info":[{"award-number":["T41-603\/20-R"]}]},{"name":"RGC Collaborative Research Fund","award":["C5026-18G"],"award-info":[{"award-number":["C5026-18G"]}]},{"name":"PolyU Internal Start-up Fund","award":["P0038876"],"award-info":[{"award-number":["P0038876"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Intell. Syst. Technol."],"published-print":{"date-parts":[[2022,4,30]]},"abstract":"<jats:p>Spatio-temporal (ST) data is a collection of multiple time series data with different spatial locations and is inherently stochastic and unpredictable. An accurate prediction over such data is an important building block for several urban applications, such as taxi demand prediction, traffic flow prediction, and so on. Existing deep learning based approaches assume that outcome is deterministic and there is only one plausible future; therefore, cannot capture the multimodal nature of future contents and dynamics. In addition, existing approaches learn spatial and temporal data separately as they assume weak correlation between them. To handle these issues, in this article, we propose a stochastic spatio-temporal generative model (named D-GAN) which adopts Generative Adversarial Networks (GANs)-based structure for more accurate ST prediction in multiple time steps. D-GAN consists of two components: (1) spatio-temporal correlation network which models spatio-temporal joint distribution of pixels and supports a stochastic sampling of latent variables for multiple plausible futures; (2) a stochastic adversarial network to jointly learn generation and variational inference of data through implicit distribution modeling. D-GAN also supports fusion of external factors through explicit objective to improve the model learning. Extensive experiments performed on two real-world datasets show that D-GAN achieves significant improvements and outperforms baseline models.<\/jats:p>","DOI":"10.1145\/3458025","type":"journal-article","created":{"date-parts":[[2022,1,5]],"date-time":"2022-01-05T15:07:50Z","timestamp":1641395270000},"page":"1-23","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":7,"title":["Multimodal Spatio-Temporal Prediction with Stochastic Adversarial Networks"],"prefix":"10.1145","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6847-585X","authenticated-orcid":false,"given":"Divya","family":"Saxena","sequence":"first","affiliation":[{"name":"Department of Computing, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong"}]},{"given":"Jiannong","family":"Cao","sequence":"additional","affiliation":[{"name":"Department of Computing, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong"}]}],"member":"320","published-online":{"date-parts":[[2022,1,5]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.5555\/2969239.2969329"},{"key":"e_1_3_2_3_2","doi-asserted-by":"publisher","DOI":"10.5555\/3298239.3298479"},{"key":"e_1_3_2_4_2","doi-asserted-by":"publisher","DOI":"10.5555\/3504035.3504351"},{"key":"e_1_3_2_5_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICNP.2017.8117559"},{"key":"e_1_3_2_6_2","doi-asserted-by":"publisher","DOI":"10.1145\/3378889"},{"key":"e_1_3_2_7_2","doi-asserted-by":"publisher","DOI":"10.1145\/3340531.3412054"},{"key":"e_1_3_2_8_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-16148-4_21"},{"key":"e_1_3_2_9_2","doi-asserted-by":"publisher","DOI":"10.1109\/TITS.2019.2900481"},{"key":"e_1_3_2_10_2","doi-asserted-by":"publisher","DOI":"10.1109\/TITS.2019.2955794"},{"key":"e_1_3_2_11_2","doi-asserted-by":"publisher","DOI":"10.1049\/iet-its.2019.0017"},{"key":"e_1_3_2_12_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.33015668"},{"key":"e_1_3_2_13_2","unstructured":"Haoxing Lin Weijia Jia Yongjian You and Yiping Sun. 2020. \u201cInterpretable crowd flow prediction with spatial-temporal self-attention.\u201d arXiv preprint arXiv:2002.09693."},{"key":"e_1_3_2_14_2","doi-asserted-by":"publisher","DOI":"10.5555\/3295222.3295313"},{"key":"e_1_3_2_15_2","doi-asserted-by":"publisher","DOI":"10.5555\/3304222.3304244"},{"key":"e_1_3_2_16_2","unstructured":"Senzhang Wang Jiannong Cao and Philip Yu. 2019. Deep Learning for Spatio-Temporal Data Mining: A Survey. arXiv preprint arXiv:1906.04928."},{"key":"e_1_3_2_17_2","doi-asserted-by":"publisher","DOI":"10.1145\/2996913.2997016"},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2019.2891537"},{"key":"e_1_3_2_19_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.33013656"},{"key":"e_1_3_2_20_2","unstructured":"Theis Lucas A\u00e4ron van den Oord and Matthias Bethge. 2015. \u201cA note on the evaluation of generative models.\u201c arXiv preprint arXiv:1511.01844."},{"key":"e_1_3_2_21_2","doi-asserted-by":"publisher","DOI":"10.5555\/2969033.2969125"},{"key":"e_1_3_2_22_2","doi-asserted-by":"publisher","DOI":"10.5555\/3045390.3045555"},{"key":"e_1_3_2_23_2","doi-asserted-by":"publisher","DOI":"10.5555\/3294771.3294816"},{"key":"e_1_3_2_24_2","unstructured":"Alex Lee Richard Zhang Frederik Ebert Pieter Abbeel Chelsea Finn and Sergey Levine. 2018. \u201cStochastic adversarial video prediction.\u201d arXiv preprint arXiv:1804.01523."},{"key":"e_1_3_2_25_2","doi-asserted-by":"publisher","DOI":"10.5555\/2125163.2125192"},{"key":"e_1_3_2_26_2","doi-asserted-by":"publisher","DOI":"10.1109\/TITS.2013.2262376"},{"issue":"2","key":"e_1_3_2_27_2","first-page":"653","article-title":"Traffic flow prediction for road transportation networks with limited traffic data","volume":"16","author":"Abadi Afshin","year":"2014","unstructured":"Afshin Abadi, Tooraj Rajabioun, and Petros A. Ioannou. 2014. Traffic flow prediction for road transportation networks with limited traffic data. IEEE Transactions on Intelligent Transportation Systems 16, 2 (2014), 653\u2013662.","journal-title":"IEEE Transactions on Intelligent Transportation Systems"},{"key":"e_1_3_2_28_2","first-page":"5643","volume-title":"Proceedings of the National Academy of Sciences","volume":"112","author":"Ricardo Silva","year":"2015","unstructured":"Silva Ricardo, Soong Moon Kang, and Edoardo M. Airoldi. 2015. Predicting traffic volumes and estimating the effects of shocks in massive transportation systems. Proceedings of the National Academy of Sciences 112, 18 (2015), 5643\u20135648."},{"key":"e_1_3_2_29_2","doi-asserted-by":"publisher","DOI":"10.1145\/2996913.2996962"},{"key":"e_1_3_2_30_2","doi-asserted-by":"publisher","DOI":"10.1145\/3097983.3098018"},{"key":"e_1_3_2_31_2","doi-asserted-by":"publisher","DOI":"10.1145\/2983323.2983667"},{"key":"e_1_3_2_32_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2017.83"},{"key":"e_1_3_2_33_2","unstructured":"Zhiyong Cui Ruimin Ke Ziyuan Pu and Yinhai Wang. 2018. \u201cDeep bidirectional and unidirectional LSTM recurrent neural network for network-wide traffic speed prediction.\u201d arXiv preprint arXiv:1801.02143."},{"key":"e_1_3_2_34_2","doi-asserted-by":"publisher","DOI":"10.5555\/3304889.3304985"},{"key":"e_1_3_2_35_2","doi-asserted-by":"publisher","DOI":"10.5555\/3304889.3305069"},{"key":"e_1_3_2_36_2","doi-asserted-by":"publisher","DOI":"10.1145\/3292500.3330884"},{"key":"e_1_3_2_37_2","unstructured":"Diederik P. Kingma and Max Welling. 2013. Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114."},{"key":"e_1_3_2_38_2","doi-asserted-by":"publisher","DOI":"10.1145\/3446374"},{"key":"e_1_3_2_39_2","first-page":"7354","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Zhang Han","year":"2019","unstructured":"Han Zhang, Ian Goodfellow, Dimitris Metaxas, and Augustus Odena. 2019. Self-attention generative adversarial networks. In Proceedings of the International Conference on Machine Learning. 7354\u20137363."},{"key":"e_1_3_2_40_2","doi-asserted-by":"crossref","unstructured":"Jianpeng Cheng Li Dong and Mirella Lapata. 2016. Long short-term memory-networks for machine reading. arXiv preprint arXiv:1601.06733.","DOI":"10.18653\/v1\/D16-1053"},{"key":"e_1_3_2_41_2","doi-asserted-by":"crossref","unstructured":"Ankur Parikh Oscar T\u00e4ckstr\u00f6m Dipanjan Das and Jakob Uszkoreit. 2016. \u201cA decomposable attention model for natural language inference.\u201d arXiv preprint arXiv:1606.01933.","DOI":"10.18653\/v1\/D16-1244"},{"key":"e_1_3_2_42_2","doi-asserted-by":"publisher","DOI":"10.5555\/3295222.3295349"},{"key":"e_1_3_2_43_2","doi-asserted-by":"publisher","DOI":"10.5555\/3298239.3298479"},{"key":"e_1_3_2_44_2","doi-asserted-by":"publisher","DOI":"10.1145\/2339530.2339561"},{"key":"e_1_3_2_45_2","volume-title":"Proceedings of the IEEE Transactions on Intelligent Transportation Systems","author":"Liu Lingbo","year":"2020","unstructured":"Lingbo Liu, Jiajie Zhen, Guanbin Li, Geng Zhan, Zhaocheng He, Bowen Du, and Liang Lin. 2020. Dynamic spatial-temporal representation learning for traffic flow prediction. In Proceedings of the IEEE Transactions on Intelligent Transportation Systems."},{"key":"e_1_3_2_46_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2002.1017616"},{"key":"e_1_3_2_47_2","doi-asserted-by":"publisher","DOI":"10.5555\/3304222.3304273"},{"key":"e_1_3_2_48_2","first-page":"3","volume-title":"Proceedings of the 30th International Conference on Machine Learning","author":"Maas Andrew","year":"2013","unstructured":"Andrew Maas, Awni Y. Hannun, and Andrew Y. Ng. 2013. Rectifier nonlinearities improve neural network acoustic models. In Proceedings of the 30th International Conference on Machine Learning. 3."}],"container-title":["ACM Transactions on Intelligent Systems and Technology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3458025","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3458025","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T21:24:38Z","timestamp":1750195478000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3458025"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,1,5]]},"references-count":47,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2022,4,30]]}},"alternative-id":["10.1145\/3458025"],"URL":"https:\/\/doi.org\/10.1145\/3458025","relation":{},"ISSN":["2157-6904","2157-6912"],"issn-type":[{"value":"2157-6904","type":"print"},{"value":"2157-6912","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,1,5]]},"assertion":[{"value":"2020-11-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-03-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-01-05","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}