{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,13]],"date-time":"2026-04-13T21:13:57Z","timestamp":1776114837328,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":38,"publisher":"ACM","license":[{"start":{"date-parts":[[2019,7,3]],"date-time":"2019-07-03T00:00:00Z","timestamp":1562112000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2019,7,3]]},"DOI":"10.1145\/3314344.3332496","type":"proceedings-article","created":{"date-parts":[[2019,7,25]],"date-time":"2019-07-25T12:34:36Z","timestamp":1564058076000},"page":"13-21","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":14,"title":["Examining the challenges in development data pipeline"],"prefix":"10.1145","author":[{"given":"Fahad","family":"Pervaiz","sequence":"first","affiliation":[{"name":"University of Washington"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Aditya","family":"Vashistha","sequence":"additional","affiliation":[{"name":"University of Washington"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Richard","family":"Anderson","sequence":"additional","affiliation":[{"name":"University of Washington"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2019,7,3]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Information systems development: methodologies, techniques and tools","author":"Avison David","unstructured":"David Avison and Guy Fitzgerald . 2003. Information systems development: methodologies, techniques and tools . McGraw Hill . David Avison and Guy Fitzgerald. 2003. Information systems development: methodologies, techniques and tools. McGraw Hill."},{"key":"e_1_3_2_1_2_1","volume-title":"International journal of epidemiology 31, 6","author":"Blakely Tony","year":"2002","unstructured":"Tony Blakely and Clare Salmond . 2002. Probabilistic record linkage and a method to calculate the positive predictive value . International journal of epidemiology 31, 6 ( 2002 ), 1246--1252. Tony Blakely and Clare Salmond. 2002. Probabilistic record linkage and a method to calculate the positive predictive value. International journal of epidemiology 31, 6 (2002), 1246--1252."},{"key":"e_1_3_2_1_3_1","volume-title":"Using thematic analysis in psychology. Qualitative research in psychology 3, 2","author":"Braun Virginia","year":"2006","unstructured":"Virginia Braun and Victoria Clarke . 2006. Using thematic analysis in psychology. Qualitative research in psychology 3, 2 ( 2006 ), 77--101. Virginia Braun and Victoria Clarke. 2006. Using thematic analysis in psychology. Qualitative research in psychology 3, 2 (2006), 77--101."},{"key":"e_1_3_2_1_4_1","volume-title":"Readings in information visualization","author":"Card Stuart K","unstructured":"Stuart K Card , Jock D Mackinlay , and Ben Shneiderman . 1999. Using vision to think . In Readings in information visualization . Morgan Kaufmann Publishers Inc ., 579--581. Stuart K Card, Jock D Mackinlay, and Ben Shneiderman. 1999. Using vision to think. In Readings in information visualization. Morgan Kaufmann Publishers Inc., 579--581."},{"key":"e_1_3_2_1_5_1","volume-title":"AAAI Spring Symposium: Artificial Intelligence for Development.","author":"Chen Kuang","year":"2010","unstructured":"Kuang Chen , Emma Brunskill , Jonathan Dick , and Prabhjot Dhadialla . 2010 . Learning to Identify Locally Actionable Health Anomalies .. In AAAI Spring Symposium: Artificial Intelligence for Development. Kuang Chen, Emma Brunskill, Jonathan Dick, and Prabhjot Dhadialla. 2010. Learning to Identify Locally Actionable Health Anomalies.. In AAAI Spring Symposium: Artificial Intelligence for Development."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2011.31"},{"key":"e_1_3_2_1_7_1","unstructured":"Kuang Chen Joseph M Hellerstein and Tapan S Parikh. 2011b. Data in the First Mile.. In CIDR. Citeseer 203--206.  Kuang Chen Joseph M Hellerstein and Tapan S Parikh. 2011b. Data in the First Mile.. In CIDR. Citeseer 203--206."},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/2160601.2160605"},{"key":"e_1_3_2_1_9_1","volume-title":"Data matching: concepts and techniques for record linkage, entity resolution, and duplicate detection","author":"Christen Peter","unstructured":"Peter Christen . 2012. Data matching: concepts and techniques for record linkage, entity resolution, and duplicate detection . Springer Science & Business Media . Peter Christen. 2012. Data matching: concepts and techniques for record linkage, entity resolution, and duplicate detection. Springer Science & Business Media."},{"key":"e_1_3_2_1_10_1","volume-title":"Exploratory data mining and data cleaning","author":"Dasu Tamraparni","unstructured":"Tamraparni Dasu and Theodore Johnson . 2003. Exploratory data mining and data cleaning . Vol. 479 . John Wiley & Sons . Tamraparni Dasu and Theodore Johnson. 2003. Exploratory data mining and data cleaning. Vol. 479. John Wiley & Sons."},{"key":"e_1_3_2_1_11_1","volume-title":"Proceedings of Graphics Interface","author":"Dell Nicola","year":"2013","unstructured":"Nicola Dell , Nathan Breit , Jacob O Wobbrock , and Gaetano Borriello . 2013 . Improving form-based data entry with image snippets . In Proceedings of Graphics Interface 2013. Canadian Information Processing Society, 157--164. Nicola Dell, Nathan Breit, Jacob O Wobbrock, and Gaetano Borriello. 2013. Improving form-based data entry with image snippets. In Proceedings of Graphics Interface 2013. Canadian Information Processing Society, 157--164."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/2675133.2675145"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.5555\/1191547.1191739"},{"key":"e_1_3_2_1_14_1","volume-title":"Managing quality: Integrating the supply chain","author":"Thomas Foster S","unstructured":"S Thomas Foster and Kunal K Ganguly . 2007. Managing quality: Integrating the supply chain . Pearson Prentice Hall Upper Saddle River , New Jersey . S Thomas Foster and Kunal K Ganguly. 2007. Managing quality: Integrating the supply chain. Pearson Prentice Hall Upper Saddle River, New Jersey."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.14778\/2367502.2367564"},{"key":"e_1_3_2_1_16_1","volume-title":"Tableau software white paper-visual thinking for business intelligence. Tableau Software","author":"Hanrahan Pat","year":"2003","unstructured":"Pat Hanrahan . 2003. Tableau software white paper-visual thinking for business intelligence. Tableau Software , Seattle, WA ( 2003 ). Pat Hanrahan. 2003. Tableau software white paper-visual thinking for business intelligence. Tableau Software, Seattle, WA (2003)."},{"key":"e_1_3_2_1_17_1","volume-title":"Quantitative data cleaning for large databases. United Nations Economic Commission for Europe (UNECE)","author":"Hellerstein Joseph M","year":"2008","unstructured":"Joseph M Hellerstein . 2008. Quantitative data cleaning for large databases. United Nations Economic Commission for Europe (UNECE) ( 2008 ). Joseph M Hellerstein. 2008. Quantitative data cleaning for large databases. United Nations Economic Commission for Europe (UNECE) (2008)."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/2939672.2945356"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/2611567"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/1978942.1979444"},{"key":"e_1_3_2_1_21_1","article-title":"DHIS2: The tool to improve health data demand and use in Kenya","volume":"8","author":"Karuri Josephine","year":"2014","unstructured":"Josephine Karuri , Peter Waiganjo , ORWA Daniel , and Ayub MANYA. 2014 . DHIS2: The tool to improve health data demand and use in Kenya . Journal of Health Informatics in Developing Countries 8 , 1 (2014). Josephine Karuri, Peter Waiganjo, ORWA Daniel, and Ayub MANYA. 2014. DHIS2: The tool to improve health data demand and use in Kenya. Journal of Health Informatics in Developing Countries 8, 1 (2014).","journal-title":"Journal of Health Informatics in Developing Countries"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1021564703268"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.5555\/648312.755367"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1080\/07317131.2012.682016"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/2442882.2442931"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0005483"},{"key":"e_1_3_2_1_27_1","volume-title":"Proceedings of the 9th Python in Science Conference","volume":"445","author":"Wes","unstructured":"Wes McKinney and others. 2010. Data structures for statistical computing in python . In Proceedings of the 9th Python in Science Conference , Vol. 445 . van der Voort S, Millman J, 51--56. Wes McKinney and others. 2010. Data structures for statistical computing in python. In Proceedings of the 9th Python in Science Conference, Vol. 445. van der Voort S, Millman J, 51--56."},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jclinepi.2006.11.021"},{"key":"e_1_3_2_1_29_1","first-page":"14","article-title":"Matching algorithms within a duplicate detection system","volume":"23","author":"Monge Alvaro E.","year":"2000","unstructured":"Alvaro E. Monge . 2000 . Matching algorithms within a duplicate detection system . IEEE Data Eng. Bull. 23 , 4 (2000), 14 -- 20 . Alvaro E. Monge. 2000. Matching algorithms within a duplicate detection system. IEEE Data Eng. Bull. 23, 4 (2000), 14--20.","journal-title":"IEEE Data Eng. Bull."},{"key":"e_1_3_2_1_30_1","volume-title":"Role development of community health workers: an examination of selection and training processes in the intervention literature. American journal of preventive medicine 37, 6","author":"O'Brien Matthew J","year":"2009","unstructured":"Matthew J O'Brien , Allison P Squires , Rebecca A Bixby , and Steven C Larson . 2009. Role development of community health workers: an examination of selection and training processes in the intervention literature. American journal of preventive medicine 37, 6 ( 2009 ), S262--S269. Matthew J O'Brien, Allison P Squires, Rebecca A Bixby, and Steven C Larson. 2009. Role development of community health workers: an examination of selection and training processes in the intervention literature. American journal of preventive medicine 37, 6 (2009), S262--S269."},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/1435417.1435433"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/1124772.1124857"},{"key":"e_1_3_2_1_33_1","unstructured":"Fahad Pervaiz Richard Anderson and Sophie Newland. Data Specification for Information Systems for the Immunization Cold Chain. (????).  Fahad Pervaiz Richard Anderson and Sophie Newland. Data Specification for Information Systems for the Immunization Cold Chain. (????)."},{"key":"e_1_3_2_1_34_1","first-page":"3","article-title":"Data cleaning: Problems and current approaches","volume":"23","author":"Rahm Erhard","year":"2000","unstructured":"Erhard Rahm and Hong Hai Do . 2000 . Data cleaning: Problems and current approaches . IEEE Data Eng. Bull. 23 , 4 (2000), 3 -- 13 . Erhard Rahm and Hong Hai Do. 2000. Data cleaning: Problems and current approaches. IEEE Data Eng. Bull. 23, 4 (2000), 3--13.","journal-title":"IEEE Data Eng. Bull."},{"key":"e_1_3_2_1_35_1","first-page":"381","article-title":"Potter's wheel: An interactive data cleaning system","volume":"1","author":"Raman Vijayshankar","year":"2001","unstructured":"Vijayshankar Raman and Joseph M Hellerstein . 2001 . Potter's wheel: An interactive data cleaning system . In VLDB , Vol. 1. 381 -- 390 . Vijayshankar Raman and Joseph M Hellerstein. 2001. Potter's wheel: An interactive data cleaning system. In VLDB, Vol. 1. 381--390.","journal-title":"VLDB"},{"key":"e_1_3_2_1_36_1","volume-title":"Benjamin A Wolfe, Darius Jazayeri, Christian Allen, Justin Miranda, Elaine Baker, Nicholas Musinguzi, and others.","author":"Seebregts Christopher J","year":"2009","unstructured":"Christopher J Seebregts , Burke W Mamlin , Paul G Biondich , Hamish SF Fraser , Benjamin A Wolfe, Darius Jazayeri, Christian Allen, Justin Miranda, Elaine Baker, Nicholas Musinguzi, and others. 2009 . The OpenMRS implementers network. International journal of medical informatics 78, 11 (2009), 711--720. Christopher J Seebregts, Burke W Mamlin, Paul G Biondich, Hamish SF Fraser, Benjamin A Wolfe, Darius Jazayeri, Christian Allen, Justin Miranda, Elaine Baker, Nicholas Musinguzi, and others. 2009. The OpenMRS implementers network. International journal of medical informatics 78, 11 (2009), 711--720."},{"key":"e_1_3_2_1_37_1","volume-title":"CommCare: Automated quality improvement to strengthen community-based health","author":"Svoronos T","year":"2010","unstructured":"T Svoronos , P Mjungu , R Dhadialla , R Luk , C Zue , J Jackson , and N Lesh . 2010. CommCare: Automated quality improvement to strengthen community-based health . Weston : D-Tree International ( 2010 ). T Svoronos, P Mjungu, R Dhadialla, R Luk, C Zue, J Jackson, and N Lesh. 2010. CommCare: Automated quality improvement to strengthen community-based health. Weston: D-Tree International (2010)."},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.18637\/jss.v059.i10"}],"event":{"name":"COMPASS '19: ACM SIGCAS Conference on Computing and Sustainable Societies","location":"Accra Ghana","acronym":"COMPASS '19","sponsor":["SIGCAS ACM Special Interest Group on Computers and Society"]},"container-title":["Proceedings of the 2nd ACM SIGCAS Conference on Computing and Sustainable Societies"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3314344.3332496","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3314344.3332496","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T23:53:29Z","timestamp":1750204409000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3314344.3332496"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,7,3]]},"references-count":38,"alternative-id":["10.1145\/3314344.3332496","10.1145\/3314344"],"URL":"https:\/\/doi.org\/10.1145\/3314344.3332496","relation":{},"subject":[],"published":{"date-parts":[[2019,7,3]]},"assertion":[{"value":"2019-07-03","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}