{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,8,10]],"date-time":"2024-08-10T05:53:57Z","timestamp":1723269237177},"reference-count":37,"publisher":"Springer Science and Business Media LLC","issue":"2","license":[{"start":{"date-parts":[[2022,5,19]],"date-time":"2022-05-19T00:00:00Z","timestamp":1652918400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,5,19]],"date-time":"2022-05-19T00:00:00Z","timestamp":1652918400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Vienna University of Economics and Business"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["AI &amp; Soc"],"published-print":{"date-parts":[[2023,4]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>For decision making in government, it is necessary to have well-structured sources of information. In several countries, it is difficult to access government data as the information are dispersed, disconnected, and poorly structured. For this reason, this work presents a framework to gather, unify, and enrich missing person data from distributed web sources. The framework allows inserting new tasks specific to the user\u2019s domain to improve data quality. In this study, Brazilian missing person data from non-governmental organizations (NGOs) and governmental websites were collected and semantically enriched. To enhance the understanding of the gathered missing people cases, we create interpretive models using machine learning techniques to extract knowledge and to encourage the use of standards for publishing the data that are frequently ignored by organizations, hindering analysis and decision-making on data. After the collection and semantic enrichment process, there was an increase of approximately 11% in the data present in the base. Also, the mining process evidenced the disappearance and reappearance of a person in Brazil according to several factors such as age, state initiatives, skin tone, hair colors, etc.<\/jats:p>","DOI":"10.1007\/s00146-022-01456-5","type":"journal-article","created":{"date-parts":[[2022,5,19]],"date-time":"2022-05-19T08:02:43Z","timestamp":1652947363000},"page":"565-579","update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["Indexing, enriching, and understanding Brazilian missing person cases from data of distributed repositories on the web"],"prefix":"10.1007","volume":"38","author":[{"suffix":"Jr.","given":"Jor\u00e3o","family":"Gomes","sequence":"first","affiliation":[]},{"given":"Heder Soares","family":"Bernardino","sequence":"additional","affiliation":[]},{"given":"Jairo Francisco","family":"de Souza","sequence":"additional","affiliation":[]},{"given":"Enayat","family":"Rajabi","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,5,19]]},"reference":[{"issue":"2","key":"1456_CR1","first-page":"91","volume":"6","author":"UA Algemili","year":"2016","unstructured":"Algemili UA (2016) Outstanding challenges in recent open government data initiatives. Int J e-Educ e-Bus e-Manag e-Learn 6(2):91","journal-title":"Int J e-Educ e-Bus e-Manag e-Learn"},{"key":"1456_CR2","doi-asserted-by":"crossref","unstructured":"Assaf A, Troncy R, Senart A (2015) Roomba: an extensible framework to validate and build dataset profiles. In: Gandon F, Gu\u00b4eret C, Villata S, Breslin J, Faron-Zucker C, Zimmermann A (eds) The semantic web: ESWC 2015 satellite events. Springer International Publishing, Cham, pp 325\u2013339","DOI":"10.1007\/978-3-319-25639-9_46"},{"key":"1456_CR3","doi-asserted-by":"publisher","unstructured":"Beno M, Misek J, Zavoral F (2009) Agentmat: Framework for data scraping and semantization. In: 2009 third international conference on research challenges in information science. IEEE, Fez, Morocco, pp 225\u2013236. https:\/\/doi.org\/10.1109\/RCIS.2009.5089286","DOI":"10.1109\/RCIS.2009.5089286"},{"key":"1456_CR4","volume-title":"Lost from view: missing persons in the UK","author":"N Biehal","year":"2003","unstructured":"Biehal N, Mitchell F, Wade J (2003) Lost from view: missing persons in the UK. Policy Press, Bristol"},{"issue":"3","key":"1456_CR5","doi-asserted-by":"publisher","first-page":"1","DOI":"10.4018\/jswis.2009081901","volume":"5","author":"C Bizer","year":"2009","unstructured":"Bizer C, Heath T, Berners-Lee T (2009) Linked data\u2014the story so far. Int J Semant Web Inf Syst 5(3):1\u201322","journal-title":"Int J Semant Web Inf Syst"},{"issue":"5","key":"1456_CR6","doi-asserted-by":"publisher","first-page":"850","DOI":"10.1136\/amiajnl-2013-002411","volume":"21","author":"DDA Bui","year":"2014","unstructured":"Bui DDA, Zeng-Treitler Q (2014) Learning regular expressions for clinical text classification. J Am Med Inform Assoc 21(5):850\u2013857","journal-title":"J Am Med Inform Assoc"},{"key":"1456_CR7","volume-title":"Missing persons: data and analysis 2015\u20132016","author":"UMP Bureau","year":"2017","unstructured":"Bureau UMP (2017) Missing persons: data and analysis 2015\u20132016. National Crime Agency, London"},{"issue":"425","key":"1456_CR8","doi-asserted-by":"publisher","first-page":"e1","DOI":"10.1016\/j.forsciint.2019.03.032","volume":"298","author":"M Calmon","year":"2019","unstructured":"Calmon M (2019) Forensic anthropology and missing persons: a Brazilian perspective. Forensic Sci Int 298(425):e1-425.e6. https:\/\/doi.org\/10.1016\/j.forsciint.2019.03.032","journal-title":"Forensic Sci Int"},{"key":"1456_CR9","first-page":"38","volume":"11","author":"L Caraffi","year":"2017","unstructured":"Caraffi L (2017) Pessoas desaparecidas\u2014acabar com o sil\u00eancio. 11o Anu\u00e1rio Brasileiro De Seguran\u00e7a P\u00fablica 11:38\u201341","journal-title":"11o Anu\u00e1rio Brasileiro De Seguran\u00e7a P\u00fablica"},{"key":"1456_CR10","unstructured":"Cavalcanti RP (2017) \u2018Over, under and through the walls\u2019: the dynamics of public security, police-community relations and the limits of managerialism in crime control in Recife, Brazil. Ph.D. thesis, King\u2019s College London"},{"key":"1456_CR11","doi-asserted-by":"publisher","unstructured":"Chaulagain RS, Pandey S, Basnet SR, Shakya S (2017) Cloud based web scraping for big data applications. In: 2017 IEEE international conference on Smart Cloud (SmartCloud). IEEE, Columbia University New York EUA, pp 138\u2013143. https:\/\/doi.org\/10.1109\/SmartCloud.2017","DOI":"10.1109\/SmartCloud.2017"},{"key":"1456_CR12","doi-asserted-by":"publisher","first-page":"321","DOI":"10.1613\/jair.953","volume":"16","author":"NV Chawla","year":"2002","unstructured":"Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP (2002) Smote: synthetic minority over-sampling technique. J Artif Intell Res 16:321\u2013357","journal-title":"J Artif Intell Res"},{"key":"1456_CR13","doi-asserted-by":"crossref","unstructured":"Chi YL, Sung HY, Lien YY (2020) Towards the ethnic understanding of Taiwanese indigenous peoples: a mashup based on semantic web and open data. In: Rau PLP (ed) Cross-cultural design. User experience of products, services, and intelligent environments. Springer International Publishing, Cham, pp 287\u2013297","DOI":"10.1007\/978-3-030-49788-0_21"},{"key":"1456_CR14","unstructured":"Claudiano MR (2014) Mortos sem sepultura: O desaparecimento de pessoas e seus desdobramentos. PalavraCom Editora Ltda., Florian\u00f3polis, SC, Brasil. ISBN 978-85-64034-07-5"},{"issue":"1","key":"1456_CR15","doi-asserted-by":"publisher","first-page":"255","DOI":"10.1016\/j.fsigss.2009.08","volume":"2","author":"LAF da Silva","year":"2009","unstructured":"da Silva LAF, Vila\u00e7a W, Azevedo D, Majella G, Silva IF, Silva BF (2009) Missing and unidentified persons database. Forensic Sci Int Genet Suppl Ser 2(1):255\u2013257. https:\/\/doi.org\/10.1016\/j.fsigss.2009.08 (progress in forensic genetics 13)","journal-title":"Forensic Sci Int Genet Suppl Ser"},{"issue":"3","key":"1456_CR16","first-page":"326","volume":"2","author":"AG de Oliveira","year":"2017","unstructured":"de Oliveira AG, Vieira RF (2017) Volta vem viver outra vez ao meu lado: An\u00e1lise dos impacos psicol\u00f3gicos vivenciados por familiares de pessoas desaparecidas. Pretextos Revista Da Gradua\u00e7\u00e3o Em Psicologia Da PUC Minas 2(3):326\u2013344","journal-title":"Pretextos Revista Da Gradua\u00e7\u00e3o Em Psicologia Da PUC Minas"},{"issue":"3","key":"1456_CR17","first-page":"37","volume":"17","author":"U Fayyad","year":"1996","unstructured":"Fayyad U, Piatetsky-Shapiro G, Smyth P (1996) From data mining to knowledge discovery in databases. AI Mag 17(3):37\u201337","journal-title":"AI Mag"},{"issue":"1","key":"1456_CR18","doi-asserted-by":"publisher","first-page":"39","DOI":"10.1590\/S0104-93132013000100002","volume":"19","author":"LCDM Ferreira","year":"2013","unstructured":"Ferreira LCDM (2013) \u201cApenas preencher papel\u201d: reflex\u00f5es sobre registros policiais de desaparecimento de pessoa e outros documentos. Mana 19(1):39\u201368","journal-title":"Mana"},{"key":"1456_CR19","doi-asserted-by":"publisher","first-page":"154","DOI":"10.1007\/978-3-319-44944-9_14","volume-title":"Artificial intelligence applications and innovations","author":"T Gogar","year":"2016","unstructured":"Gogar T, Hubacek O, Sedivy J (2016) Deep neural networks for web page information extraction. In: Iliadis L, Maglogiannis I (eds) Artificial intelligence applications and innovations. Springer International Publishing, Cham, pp 154\u2013163"},{"key":"1456_CR20","doi-asserted-by":"publisher","unstructured":"Gomes, J. Jr, & Souza, JF. Brazilian Missing Persons, 2, https:\/\/doi.org\/10.5281\/zenodo.3820787 (2020)","DOI":"10.5281\/zenodo.3820787"},{"key":"1456_CR21","volume-title":"Data mining: concepts and techniques","author":"J Han","year":"2011","unstructured":"Han J, Pei J, Kamber M (2011) Data mining: concepts and techniques. Elsevier, Amsterdam"},{"key":"1456_CR22","unstructured":"Hassan MIA, Twinomurinzi H (2018) A systematic literature review of open government data research: Challenges, opportunities and gaps. In: 2018 open innovations conference (OI). IEEE, pp 299\u2013304"},{"key":"1456_CR23","unstructured":"House AR (2019) The search for mollie tibbetts: how social media functions in missing persons cases. Chancellor\u2019s Honors Program Projects. https:\/\/trace.tennessee.edu\/utk_chanhonoproj\/2275. Accessed 20 Jan 2021"},{"key":"1456_CR24","doi-asserted-by":"crossref","unstructured":"Hu W, Singh KK, Xiao F, Han J, Chuah CN, Lee YJ (2018) Who will share my image? Predicting the content diffusion path in online social networks. In: Proceedings of the eleventh ACM international conference on web search and data mining. ACM, pp 252\u2013260","DOI":"10.1145\/3159652.3159705"},{"key":"1456_CR25","doi-asserted-by":"publisher","unstructured":"Jadhav D, Chobe SV, Vaibhav M, Khandare L (2017) Missing person detection system in iot. In: 2017 international conference on computing, communication, control and automation (ICCUBEA), pp 1\u20136. https:\/\/doi.org\/10.1109\/ICCUBEA.2017.8463857","DOI":"10.1109\/ICCUBEA.2017.8463857"},{"issue":"3","key":"1456_CR26","first-page":"31","volume":"6","author":"K Lee","year":"2015","unstructured":"Lee K, Mahmud J, Chen J, Zhou M, Nichols J (2015) Who will retweet this? Detecting strangers from twitter to retweet information. ACM Trans Intell Syst Technol (TIST) 6(3):31","journal-title":"ACM Trans Intell Syst Technol (TIST)"},{"key":"1456_CR27","doi-asserted-by":"publisher","DOI":"10.1016\/j.cities.2020.102860","volume":"106","author":"FT Neves","year":"2020","unstructured":"Neves FT, de Castro Neto M, Aparicio M (2020) The impacts of open data initiatives on smart cities: a framework for evaluation and monitoring. Cities 106:102860","journal-title":"Cities"},{"key":"1456_CR28","doi-asserted-by":"crossref","unstructured":"Oliveira DDd (2007) Desaparecidos civis: conflitos familiares, institucionais e seguran\u00e7a p\u00fablica. Ph.D. thesis, Universidade de Bras\u00edlia","DOI":"10.1590\/S0102-69922007000300013"},{"key":"1456_CR29","volume-title":"Principles of distributed database systems","author":"MT Ozsu","year":"2011","unstructured":"Ozsu MT, Valduriez P (2011) Principles of distributed database systems. Springer Science & Business Media, Berlin"},{"key":"1456_CR30","unstructured":"Poliano F, Stern S, Trecenti J, Vendramini E (2016) Perfil de pessoas desaparecidas no estado de S\u00e3o Paulo"},{"key":"1456_CR31","volume-title":"Children who go missing from care: a participatory project with young people as peer interviewers","author":"BC Sanford","year":"2012","unstructured":"Sanford BC, Ibrahim N (2012) Children who go missing from care: a participatory project with young people as peer interviewers. National Society for the Prevention of Cruelty to Children (NSPCC), London"},{"issue":"3","key":"1456_CR32","doi-asserted-by":"publisher","first-page":"16","DOI":"10.1109\/MIS.2012.23","volume":"27","author":"N Shadbolt","year":"2012","unstructured":"Shadbolt N, O\u2019Hara K, Berners-Lee T, Gibbins N, Glaser H, Hall W et al (2012) Linked open government data: lessons from data. gov. uk. IEEE Intell Syst 27(3):16\u201324","journal-title":"IEEE Intell Syst"},{"issue":"1","key":"1456_CR33","doi-asserted-by":"publisher","first-page":"117","DOI":"10.1007\/s00146-014-0539-6","volume":"30","author":"DK Tayal","year":"2015","unstructured":"Tayal DK, Jain A, Arora S, Agarwal S, Gupta T, Tyagi N (2015) Crime detection and criminal identification in India using data mining techniques. AI Soc 30(1):117\u2013127","journal-title":"AI Soc"},{"key":"1456_CR34","doi-asserted-by":"publisher","DOI":"10.6028\/NIST.SP.811e2008","volume-title":"Use of the international system of units (si)","author":"A Thompson","year":"2008","unstructured":"Thompson A, Taylor BN (2008) Use of the international system of units (si). NIST Special Publication, Gaithersburg"},{"key":"1456_CR35","unstructured":"UK Missing Persons Unit (2019) Missing Persons Data Report 2016\/2017. National Crime Agency. 2020-11-10. https:\/\/www.nationalcrimeagency.gov.uk\/who-we-are\/publications\/304-2016-17-ukmpu-data-report-v1\/file"},{"issue":"1","key":"1456_CR36","doi-asserted-by":"publisher","first-page":"77","DOI":"10.1016\/j.compag.2009.12.006","volume":"71","author":"Y Yang","year":"2010","unstructured":"Yang Y, Wilson L, Wang J (2010) Development of an automated climatic data scraping, filtering and display system. Comput Electron Agric 71(1):77\u201387","journal-title":"Comput Electron Agric"},{"key":"1456_CR37","doi-asserted-by":"publisher","unstructured":"Yi M (2019) Exploring the quality of government open data: Comparison study of the UK, the USA and Korea. Electron Libr 37(1). https:\/\/doi.org\/10.1108\/EL-06-2018-0124","DOI":"10.1108\/EL-06-2018-0124"}],"container-title":["AI &amp; SOCIETY"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00146-022-01456-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s00146-022-01456-5\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00146-022-01456-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,30]],"date-time":"2023-06-30T06:07:06Z","timestamp":1688105226000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s00146-022-01456-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,5,19]]},"references-count":37,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2023,4]]}},"alternative-id":["1456"],"URL":"https:\/\/doi.org\/10.1007\/s00146-022-01456-5","relation":{},"ISSN":["0951-5666","1435-5655"],"issn-type":[{"value":"0951-5666","type":"print"},{"value":"1435-5655","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,5,19]]},"assertion":[{"value":"18 April 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"13 April 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"19 May 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}