{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,17]],"date-time":"2026-01-17T18:29:04Z","timestamp":1768674544753,"version":"3.49.0"},"reference-count":34,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2017,12,31]],"date-time":"2017-12-31T00:00:00Z","timestamp":1514678400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Spatial Algorithms Syst."],"published-print":{"date-parts":[[2017,12,31]]},"abstract":"<jats:p>Data analytics has an ever increasing impact on tackling various societal challenges. In this article, we investigate how data from several heterogeneous online sources can be used to discover insights and make predictions about the spatial distribution of crime in large urban environments. A series of important research questions is addressed, following a purely data-driven approach and methodology. First, we examine how useful different types of data are for the task of crime levels prediction, focusing especially on how prediction accuracy can be improved by combining data from multiple information sources. To that end, we not only investigate prediction accuracy across all individual areas studied, but also examine how these predictions affect the accuracy of identified crime hotspots. Then, we look into individual features, aiming to identify and quantify the most important factors. Finally, we drill down to different crime types, elaborating on how the prediction accuracy and the importance of individual features vary across them. Our analysis involves six different datasets, from which more than 3,000 features are extracted, filtered, and used to learn models for predicting crime rates across 14 different crime categories. Our results indicate that combining data from multiple information sources can significantly improve prediction accuracy. They also highlight which features affect prediction accuracy the most, as well as for which particular crime categories the predictions are more accurate.<\/jats:p>","DOI":"10.1145\/3190345","type":"journal-article","created":{"date-parts":[[2018,4,27]],"date-time":"2018-04-27T16:00:21Z","timestamp":1524844821000},"page":"1-31","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":17,"title":["Analyzing and Predicting Spatial Crime Distribution Using Crowdsourced and Open Data"],"prefix":"10.1145","volume":"3","author":[{"given":"Alexandros","family":"Belesiotis","sequence":"first","affiliation":[{"name":"Information Management Systems Institute, R.C. Athena, Marousi, Greece"}]},{"given":"George","family":"Papadakis","sequence":"additional","affiliation":[{"name":"Department of Informatics 8 Telecommunications, University of Athens, Panepistimiopolis, Ilisia, Athens, Greece"}]},{"given":"Dimitrios","family":"Skoutas","sequence":"additional","affiliation":[{"name":"Information Management Systems Institute, R.C. Athena, Marousi, Greece"}]}],"member":"320","published-online":{"date-parts":[[2018,4,27]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/2820783.2820828"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/2663204.2663254"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1010933404324"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/1938606.1938608"},{"key":"e_1_2_1_5_1","first-page":"7","article-title":"Risk terrain modeling for spatial risk assessment","volume":"17","author":"Caplan Joel M.","year":"2015","journal-title":"Cityscape"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1080\/07418825.2010.486037"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1057\/palgrave.sj.8350066"},{"key":"e_1_2_1_8_1","first-page":"155","article-title":"Support vector regression machines","volume":"9","author":"Drucker Harris","year":"1997","journal-title":"Adv. Neural Info. Process. Syst."},{"key":"e_1_2_1_9_1","volume-title":"Mapping Crime: Understanding Hotspots. Technical Report","author":"Eck John","year":"2005"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.dss.2014.02.003"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.2307\/1271436"},{"key":"e_1_2_1_12_1","unstructured":"Da Kuang P. Jeffrey Brantingham and Andrea L. Bertozzi. 2017. Crime topic modeling. CoRR abs\/1701.01505 (2017). Da Kuang P. Jeffrey Brantingham and Andrea L. Bertozzi. 2017. Crime topic modeling. CoRR abs\/1701.01505 (2017)."},{"key":"e_1_2_1_13_1","first-page":"23","article-title":"Nonparametrics: Statistical methods based on ranks (rev. ed.) prentice-hall","volume":"292","author":"Lehmann Erich L.","year":"1998","journal-title":"Englewood Cliffs, NJ"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.2105\/AJPH.2012.300927"},{"key":"e_1_2_1_15_1","unstructured":"Huan Liu and Hiroshi Motoda. 2012. Feature Selection for Knowledge Discovery and Data Mining. Vol. 454. Springer Science & Business Media. Huan Liu and Hiroshi Motoda. 2012. Feature Selection for Knowledge Discovery and Data Mining. Vol. 454. Springer Science & Business Media."},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/1869790.1869881"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2014.2346926"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jcrimjus.2016.03.002"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1111\/j.1467-9868.2010.00740.x"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1198\/jasa.2011.ap09546"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1080\/01621459.2015.1077710"},{"key":"e_1_2_1_22_1","doi-asserted-by":"crossref","first-page":"286","DOI":"10.1111\/j.1538-4632.1995.tb00912.x","article-title":"Local spatial autocorrelation statistics: Distributional issues and an application","volume":"27","author":"Keith Ord J.","year":"1995","journal-title":"Geogr. Anal."},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.5555\/1953048.2078195"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1080\/1561426042000191305"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1002\/asi.v61:11"},{"key":"e_1_2_1_26_1","first-page":"210","article-title":"Investigating crime hotspot places and their implication to urban environmental design: A geographic visualization and data mining approach","volume":"5","author":"Tabangin Donna R.","year":"2010","journal-title":"Int. J. Hum. Soc. Sci."},{"key":"e_1_2_1_27_1","unstructured":"Bart Thomee David A. Shamma Gerald Friedland Benjamin Elizalde Karl Ni Douglas Poland Damian Borth and Li-Jia Li. 2015. The new data and new challenges in multimedia research. CoRR abs\/1503.01817 (2015). http:\/\/arxiv.org\/abs\/1503.01817 Bart Thomee David A. Shamma Gerald Friedland Benjamin Elizalde Karl Ni Douglas Poland Damian Borth and Li-Jia Li. 2015. The new data and new challenges in multimedia research. CoRR abs\/1503.01817 (2015). http:\/\/arxiv.org\/abs\/1503.01817"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/1989734.1989742"},{"key":"e_1_2_1_29_1","first-page":"163","article-title":"Fear of crime, perceived disorders and property crime: A multivariate analysis at the area level","volume":"21","author":"Tseloni Andromachi","year":"2007","journal-title":"Crime Prevention Studies"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-29047-3_28"},{"key":"e_1_2_1_31_1","doi-asserted-by":"crossref","unstructured":"Richard Keith Wortley and Lorraine Ann Mazerolle. 2013. Environmental Criminology and Crime Analysis. Willan. Richard Keith Wortley and Lorraine Ann Mazerolle. 2013. Environmental Criminology and Crime Analysis. Willan.","DOI":"10.4324\/9780203118214"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-06605-9_15"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDMW.2011.56"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/2820783.2820832"}],"container-title":["ACM Transactions on Spatial Algorithms and Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3190345","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3190345","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T01:08:08Z","timestamp":1750208888000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3190345"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,12,31]]},"references-count":34,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2017,12,31]]}},"alternative-id":["10.1145\/3190345"],"URL":"https:\/\/doi.org\/10.1145\/3190345","relation":{},"ISSN":["2374-0353","2374-0361"],"issn-type":[{"value":"2374-0353","type":"print"},{"value":"2374-0361","type":"electronic"}],"subject":[],"published":{"date-parts":[[2017,12,31]]},"assertion":[{"value":"2017-04-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2018-02-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2018-04-27","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}