{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,25]],"date-time":"2026-02-25T17:18:45Z","timestamp":1772039925023,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":57,"publisher":"ACM","license":[{"start":{"date-parts":[[2019,7,25]],"date-time":"2019-07-25T00:00:00Z","timestamp":1564012800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2019,7,25]]},"DOI":"10.1145\/3292500.3330932","type":"proceedings-article","created":{"date-parts":[[2019,7,26]],"date-time":"2019-07-26T13:17:26Z","timestamp":1564147046000},"page":"1480-1490","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":68,"title":["Sequential Anomaly Detection using Inverse Reinforcement Learning"],"prefix":"10.1145","author":[{"given":"Min-hwan","family":"Oh","sequence":"first","affiliation":[{"name":"Columbia University, New York, NY, USA"}]},{"given":"Garud","family":"Iyengar","sequence":"additional","affiliation":[{"name":"Columbia University, New York, NY, USA"}]}],"member":"320","published-online":{"date-parts":[[2019,7,25]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/1150402.1150459"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/1497577.1497581"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.robot.2008.10.024"},{"key":"e_1_3_2_1_4_1","volume-title":"Some asymptotic theory for the bootstrap. The Annals of Statistics","author":"Bickel Peter J","year":"1981","unstructured":"Peter J Bickel and David A Freedman . 1981. Some asymptotic theory for the bootstrap. The Annals of Statistics ( 1981 ), 1196--1217. Peter J Bickel and David A Freedman. 1981. Some asymptotic theory for the bootstrap. The Annals of Statistics (1981), 1196--1217."},{"key":"e_1_3_2_1_5_1","volume-title":"Weight uncertainty in neural networks. arXiv preprint arXiv:1505.05424","author":"Blundell Charles","year":"2015","unstructured":"Charles Blundell , Julien Cornebise , Koray Kavukcuoglu , and Daan Wierstra . 2015. Weight uncertainty in neural networks. arXiv preprint arXiv:1505.05424 ( 2015 ). Charles Blundell, Julien Cornebise, Koray Kavukcuoglu, and Daan Wierstra. 2015. Weight uncertainty in neural networks. arXiv preprint arXiv:1505.05424 (2015)."},{"key":"e_1_3_2_1_6_1","volume-title":"Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics . 182--189","author":"Boularias Abdeslam","year":"2011","unstructured":"Abdeslam Boularias , Jens Kober , and Jan Peters . 2011 . Relative entropy inverse reinforcement learning . In Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics . 182--189 . Abdeslam Boularias, Jens Kober, and Jan Peters. 2011. Relative entropy inverse reinforcement learning. In Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics . 182--189."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/335191.335388"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/1557019.1557043"},{"key":"e_1_3_2_1_9_1","unstructured":"Jaedeug Choi and Kee-Eung Kim. 2011. Map inference for bayesian inverse reinforcement learning. In Advances in Neural Information Processing Systems. 1989--1997.   Jaedeug Choi and Kee-Eung Kim. 2011. Map inference for bayesian inverse reinforcement learning. In Advances in Neural Information Processing Systems. 1989--1997."},{"key":"e_1_3_2_1_10_1","unstructured":"Hannah M Dee and David C Hogg. 2004. Detecting inexplicable behaviour.. In BMVC. 1--10.  Hannah M Dee and David C Hogg. 2004. Detecting inexplicable behaviour.. In BMVC. 1--10."},{"key":"e_1_3_2_1_11_1","volume-title":"An introduction to the bootstrap","author":"Efron Bradley","unstructured":"Bradley Efron and Robert J Tibshirani . 1994. An introduction to the bootstrap . CRC press . Bradley Efron and Robert J Tibshirani. 1994. An introduction to the bootstrap .CRC press."},{"key":"e_1_3_2_1_12_1","volume-title":"A connection between generative adversarial networks, inverse reinforcement learning, and energy-based models. arXiv preprint arXiv:1611.03852","author":"Finn Chelsea","year":"2016","unstructured":"Chelsea Finn , Paul Christiano , Pieter Abbeel , and Sergey Levine . 2016a. A connection between generative adversarial networks, inverse reinforcement learning, and energy-based models. arXiv preprint arXiv:1611.03852 ( 2016 ). Chelsea Finn, Paul Christiano, Pieter Abbeel, and Sergey Levine. 2016a. A connection between generative adversarial networks, inverse reinforcement learning, and energy-based models. arXiv preprint arXiv:1611.03852 (2016)."},{"key":"e_1_3_2_1_13_1","volume-title":"International Conference on Machine Learning. 49--58","author":"Finn Chelsea","year":"2016","unstructured":"Chelsea Finn , Sergey Levine , and Pieter Abbeel . 2016 b. Guided cost learning: Deep inverse optimal control via policy optimization . In International Conference on Machine Learning. 49--58 . Chelsea Finn, Sergey Levine, and Pieter Abbeel. 2016b. Guided cost learning: Deep inverse optimal control via policy optimization. In International Conference on Machine Learning. 49--58."},{"key":"e_1_3_2_1_14_1","volume-title":"international conference on machine learning . 1050--1059","author":"Gal Yarin","year":"2016","unstructured":"Yarin Gal and Zoubin Ghahramani . 2016 . Dropout as a Bayesian approximation: Representing model uncertainty in deep learning . In international conference on machine learning . 1050--1059 . Yarin Gal and Zoubin Ghahramani. 2016. Dropout as a Bayesian approximation: Representing model uncertainty in deep learning. In international conference on machine learning . 1050--1059."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/1871437.1871716"},{"key":"e_1_3_2_1_16_1","unstructured":"Alex Graves. 2011. Practical variational inference for neural networks. In Advances in neural information processing systems. 2348--2356.   Alex Graves. 2011. Practical variational inference for neural networks. In Advances in neural information processing systems. 2348--2356."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2013.184"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0167-8655(03)00003-5"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2006.176"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2013.6630743"},{"key":"e_1_3_2_1_21_1","volume-title":"Adaptive Pattern Matching with Reinforcement Learning for Dynamic Graphs. In 2018 IEEE 25th International Conference on High Performance Computing (HiPC). IEEE, 92--101","author":"Kanezashi Hiroki","year":"2018","unstructured":"Hiroki Kanezashi , Toyotaro Suzumura , Dario Garcia-Gasulla , Min-hwan Oh, and Satoshi Matsuoka . 2018 . Adaptive Pattern Matching with Reinforcement Learning for Dynamic Graphs. In 2018 IEEE 25th International Conference on High Performance Computing (HiPC). IEEE, 92--101 . Hiroki Kanezashi, Toyotaro Suzumura, Dario Garcia-Gasulla, Min-hwan Oh, and Satoshi Matsuoka. 2018. Adaptive Pattern Matching with Reinforcement Learning for Dynamic Graphs. In 2018 IEEE 25th International Conference on High Performance Computing (HiPC). IEEE, 92--101."},{"key":"e_1_3_2_1_22_1","volume-title":"Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980","author":"Kingma Diederik P","year":"2014","unstructured":"Diederik P Kingma and Jimmy Ba . 2014 . Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014). Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)."},{"key":"e_1_3_2_1_23_1","unstructured":"Diederik P Kingma Tim Salimans and Max Welling. 2015. Variational dropout and the local reparameterization trick. In Advances in Neural Information Processing Systems. 2575--2583.   Diederik P Kingma Tim Salimans and Max Welling. 2015. Variational dropout and the local reparameterization trick. In Advances in Neural Information Processing Systems. 2575--2583."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1007\/s007780050006"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2008.4497422"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2012.03.040"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2008.17"},{"key":"e_1_3_2_1_28_1","volume-title":"Multiplicative normalizing flows for variational bayesian neural networks. arXiv preprint arXiv:1703.01961","author":"Louizos Christos","year":"2017","unstructured":"Christos Louizos and Max Welling . 2017. Multiplicative normalizing flows for variational bayesian neural networks. arXiv preprint arXiv:1703.01961 ( 2017 ). Christos Louizos and Max Welling. 2017. Multiplicative normalizing flows for variational bayesian neural networks. arXiv preprint arXiv:1703.01961 (2017)."},{"key":"e_1_3_2_1_29_1","volume-title":"Motor anomaly detection for unmanned aerial vehicles using reinforcement learning","author":"Lu Huimin","year":"2017","unstructured":"Huimin Lu , Yujie Li , Shenglin Mu , Dong Wang , Hyoungseop Kim , and Seiichi Serikawa . 2017. Motor anomaly detection for unmanned aerial vehicles using reinforcement learning . IEEE internet of things journal, Vol. 5 , 4 ( 2017 ), 2315--2322. Huimin Lu, Yujie Li, Shenglin Mu, Dong Wang, Hyoungseop Kim, and Seiichi Serikawa. 2017. Motor anomaly detection for unmanned aerial vehicles using reinforcement learning. IEEE internet of things journal, Vol. 5, 4 (2017), 2315--2322."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1162\/neco.1992.4.3.448"},{"key":"e_1_3_2_1_31_1","volume-title":"Lstm-based encoder-decoder for multi-sensor anomaly detection. arXiv preprint arXiv:1607.00148","author":"Malhotra Pankaj","year":"2016","unstructured":"Pankaj Malhotra , Anusha Ramakrishnan , Gaurangi Anand , Lovekesh Vig , Puneet Agarwal , and Gautam Shroff . 2016. Lstm-based encoder-decoder for multi-sensor anomaly detection. arXiv preprint arXiv:1607.00148 ( 2016 ). Pankaj Malhotra, Anusha Ramakrishnan, Gaurangi Anand, Lovekesh Vig, Puneet Agarwal, and Gautam Shroff. 2016. Lstm-based encoder-decoder for multi-sensor anomaly detection. arXiv preprint arXiv:1607.00148 (2016)."},{"key":"e_1_3_2_1_32_1","first-page":"139","article-title":"One-class SVMs for document classification","volume":"2","author":"Manevitz Larry M","year":"2001","unstructured":"Larry M Manevitz and Malik Yousef . 2001 . One-class SVMs for document classification . Journal of machine Learning research , Vol. 2 , Dec (2001), 139 -- 154 . Larry M Manevitz and Malik Yousef. 2001. One-class SVMs for document classification. Journal of machine Learning research, Vol. 2, Dec (2001), 139--154.","journal-title":"Journal of machine Learning research"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0896-6273(02)00974-1"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/TITS.2013.2262376"},{"key":"e_1_3_2_1_35_1","unstructured":"Radford M Neal. 1993. Bayesian learning via stochastic dynamics. In Advances in neural information processing systems. 475--482.   Radford M Neal. 1993. Bayesian learning via stochastic dynamics. In Advances in neural information processing systems. 475--482."},{"key":"e_1_3_2_1_36_1","volume-title":"Apprenticeship learning using inverse reinforcement learning and gradient methods. arXiv preprint arXiv:1206.5264","author":"Neu Gergely","year":"2012","unstructured":"Gergely Neu and Csaba Szepesv\u00e1ri . 2012. Apprenticeship learning using inverse reinforcement learning and gradient methods. arXiv preprint arXiv:1206.5264 ( 2012 ). Gergely Neu and Csaba Szepesv\u00e1ri. 2012. Apprenticeship learning using inverse reinforcement learning and gradient methods. arXiv preprint arXiv:1206.5264 (2012)."},{"key":"e_1_3_2_1_37_1","volume-title":"Proceedings of the 17th international conference on Machine learning. 663--670","author":"Ng Andrew Y","year":"2000","unstructured":"Andrew Y Ng , Stuart J Russell , 2000 . Algorithms for inverse reinforcement learning .. In Proceedings of the 17th international conference on Machine learning. 663--670 . Andrew Y Ng, Stuart J Russell, et almbox. 2000. Algorithms for inverse reinforcement learning.. In Proceedings of the 17th international conference on Machine learning. 663--670."},{"key":"e_1_3_2_1_38_1","volume-title":"Crowd Counting with Decomposed Uncertainty. arXiv preprint arXiv:1903.07427","author":"Olsen Peder A","year":"2019","unstructured":"Min-hwan Oh, Peder A Olsen , and Karthikeyan Natesan Ramamurthy . 2019. Crowd Counting with Decomposed Uncertainty. arXiv preprint arXiv:1903.07427 ( 2019 ). Min-hwan Oh, Peder A Olsen, and Karthikeyan Natesan Ramamurthy. 2019. Crowd Counting with Decomposed Uncertainty. arXiv preprint arXiv:1903.07427 (2019)."},{"key":"e_1_3_2_1_39_1","unstructured":"Ian Osband Charles Blundell Alexander Pritzel and Benjamin Van Roy. 2016. Deep exploration via bootstrapped DQN. In Advances in Neural Information Processing Systems. 4026--4034.   Ian Osband Charles Blundell Alexander Pritzel and Benjamin Van Roy. 2016. Deep exploration via bootstrapped DQN. In Advances in Neural Information Processing Systems. 4026--4034."},{"key":"e_1_3_2_1_40_1","volume-title":"Recurrent Neural Radio Anomaly Detection. arXiv preprint arXiv:1611.00301","author":"O'Shea Timothy J","year":"2016","unstructured":"Timothy J O'Shea , T Charles Clancy , and Robert W McGwier . 2016. Recurrent Neural Radio Anomaly Detection. arXiv preprint arXiv:1611.00301 ( 2016 ). Timothy J O'Shea, T Charles Clancy, and Robert W McGwier. 2016. Recurrent Neural Radio Anomaly Detection. arXiv preprint arXiv:1611.00301 (2016)."},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.5555\/1953048.2078195"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2008.2005599"},{"key":"e_1_3_2_1_43_1","first-page":"1","article-title":"Bayesian inverse reinforcement learning","volume":"51","author":"Ramachandran Deepak","year":"2007","unstructured":"Deepak Ramachandran and Eyal Amir . 2007 . Bayesian inverse reinforcement learning . Urbana , Vol. 51 , 61801 (2007), 1 -- 4 . Deepak Ramachandran and Eyal Amir. 2007. Bayesian inverse reinforcement learning. Urbana, Vol. 51, 61801 (2007), 1--4.","journal-title":"Urbana"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1145\/1143844.1143936"},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/ITSC.2016.7795584"},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/279943.279964"},{"key":"e_1_3_2_1_47_1","volume-title":"Deep-anomaly: Fully convolutional neural network for fast anomaly detection in crowded scenes. Computer Vision and Image Understanding","author":"Sabokrou Mohammad","year":"2018","unstructured":"Mohammad Sabokrou , Mohsen Fayyaz , Mahmood Fathy , Zahra Moayed , and Reinhard Klette . 2018 . Deep-anomaly: Fully convolutional neural network for fast anomaly detection in crowded scenes. Computer Vision and Image Understanding (2018). Mohammad Sabokrou, Mohsen Fayyaz, Mahmood Fathy, Zahra Moayed, and Reinhard Klette. 2018. Deep-anomaly: Fully convolutional neural network for fast anomaly detection in crowded scenes. Computer Vision and Image Understanding (2018)."},{"key":"e_1_3_2_1_48_1","volume-title":"Proceedings of the 32nd International Conference on Machine Learning (ICML-15)","author":"Schulman John","year":"2015","unstructured":"John Schulman , Sergey Levine , Pieter Abbeel , Michael Jordan , and Philipp Moritz . 2015 . Trust region policy optimization . In Proceedings of the 32nd International Conference on Machine Learning (ICML-15) . 1889--1897. John Schulman, Sergey Levine, Pieter Abbeel, Michael Jordan, and Philipp Moritz. 2015. Trust region policy optimization. In Proceedings of the 32nd International Conference on Machine Learning (ICML-15). 1889--1897."},{"key":"e_1_3_2_1_49_1","first-page":"035","article-title":"Semi-supervised Learning for Anomalous Trajectory Detection","volume":"1","author":"Sillito Rowland R","year":"2008","unstructured":"Rowland R Sillito and Robert B Fisher . 2008 . Semi-supervised Learning for Anomalous Trajectory Detection .. In BMVC , Vol. 1. 035 -- 031 . Rowland R Sillito and Robert B Fisher. 2008. Semi-supervised Learning for Anomalous Trajectory Detection.. In BMVC, Vol. 1. 035--1.","journal-title":"BMVC"},{"key":"e_1_3_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.5555\/2627435.2670313"},{"key":"e_1_3_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/1390156.1390286"},{"key":"e_1_3_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.3390\/ijgi7010025"},{"key":"e_1_3_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2010.5539882"},{"key":"e_1_3_2_1_54_1","volume-title":"Maximum entropy deep inverse reinforcement learning. arXiv preprint arXiv:1507.04888","author":"Wulfmeier Markus","year":"2015","unstructured":"Markus Wulfmeier , Peter Ondruska , and Ingmar Posner . 2015. Maximum entropy deep inverse reinforcement learning. arXiv preprint arXiv:1507.04888 ( 2015 ). Markus Wulfmeier, Peter Ondruska, and Ingmar Posner. 2015. Maximum entropy deep inverse reinforcement learning. arXiv preprint arXiv:1507.04888 (2015)."},{"key":"e_1_3_2_1_55_1","first-page":"32","article-title":"Geolife: A collaborative social networking service among user, location and trajectory","volume":"33","author":"Zheng Yu","year":"2010","unstructured":"Yu Zheng , Xing Xie , and Wei-Ying Ma . 2010 . Geolife: A collaborative social networking service among user, location and trajectory . IEEE Data Eng. Bull. , Vol. 33 , 2 (2010), 32 -- 39 . Yu Zheng, Xing Xie, and Wei-Ying Ma. 2010. Geolife: A collaborative social networking service among user, location and trajectory. IEEE Data Eng. Bull., Vol. 33, 2 (2010), 32--39.","journal-title":"IEEE Data Eng. Bull."},{"key":"e_1_3_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1145\/1526709.1526816"},{"key":"e_1_3_2_1_57_1","volume-title":"AAAI","volume":"8","author":"Ziebart Brian D","year":"2008","unstructured":"Brian D Ziebart , Andrew L Maas , J Andrew Bagnell , and Anind K Dey . 2008 . Maximum Entropy Inverse Reinforcement Learning .. In AAAI , Vol. 8 . Chicago, IL, USA, 1433--1438. Brian D Ziebart, Andrew L Maas, J Andrew Bagnell, and Anind K Dey. 2008. Maximum Entropy Inverse Reinforcement Learning.. In AAAI, Vol. 8. Chicago, IL, USA, 1433--1438."}],"event":{"name":"KDD '19: The 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining","location":"Anchorage AK USA","acronym":"KDD '19","sponsor":["SIGMOD ACM Special Interest Group on Management of Data","SIGKDD ACM Special Interest Group on Knowledge Discovery in Data"]},"container-title":["Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery &amp; Data Mining"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3292500.3330932","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3292500.3330932","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T00:26:03Z","timestamp":1750206363000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3292500.3330932"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,7,25]]},"references-count":57,"alternative-id":["10.1145\/3292500.3330932","10.1145\/3292500"],"URL":"https:\/\/doi.org\/10.1145\/3292500.3330932","relation":{},"subject":[],"published":{"date-parts":[[2019,7,25]]},"assertion":[{"value":"2019-07-25","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}