{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,29]],"date-time":"2026-05-29T20:52:58Z","timestamp":1780087978381,"version":"3.54.0"},"reference-count":49,"publisher":"MDPI AG","issue":"3","license":[{"start":{"date-parts":[[2022,3,15]],"date-time":"2022-03-15T00:00:00Z","timestamp":1647302400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IJGI"],"abstract":"<jats:p>Frequent and granular population data are essential for decision making. Further-more, for progress monitoring towards achieving the sustainable development goals (SDGs), data availability at global scales as well as at different disaggregated levels is required. The high population coverage of mobile cellular signals has been accelerating the generation of large-scale spatiotemporal data such as call detail record (CDR) data. This has enabled resource-scarce countries to collect digital footprints at scales and resolutions that would otherwise be impossible to achieve solely through traditional surveys. However, using such data requires multiple processes, algorithms, and considerable effort. This paper proposes a big data-analysis pipeline built exclusively on an open-source framework with our spatial enhancement library and a proposed open-source mobility analysis package called Mobipack. Mobipack consists of useful modules for mobility analysis, including data anonymization, origin\u2013destination extraction, trip extraction, zone analysis, route interpolation, and a set of mobility indicators. Several implemented use cases are presented to demonstrate the advantages and usefulness of the proposed system. In addition, we explain how a large-scale data platform that requires efficient resource allocation can be con-structed for managing data as well as how it can be used and maintained in a sustainable manner. The platform can further help to enhance the capacity of CDR data analysis, which usually requires a specific skill set and is time-consuming to implement from scratch. The proposed system is suited for baseline processing and the effective handling of CDR data; thus, it allows for improved support and on-time preparation.<\/jats:p>","DOI":"10.3390\/ijgi11030196","type":"journal-article","created":{"date-parts":[[2022,3,16]],"date-time":"2022-03-16T03:34:13Z","timestamp":1647401653000},"page":"196","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["Development of Big Data-Analysis Pipeline for Mobile Phone Data with Mobipack and Spatial Enhancement"],"prefix":"10.3390","volume":"11","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1454-1820","authenticated-orcid":false,"given":"Apichon","family":"Witayangkurn","sequence":"first","affiliation":[{"name":"School of Information, Computer, and Communication Technology (ICT), Sirindhorn International Institute of Technology, Thammasat University, Pathum Thani 12120, Thailand"},{"name":"Center for Spatial Information Science, The University of Tokyo, Chiba 277-8568, Japan"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Ayumi","family":"Arai","sequence":"additional","affiliation":[{"name":"Center for Spatial Information Science, The University of Tokyo, Chiba 277-8568, Japan"},{"name":"Spatial Data Commons, The University of Tokyo, Chiba 277-8568, Japan"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Ryosuke","family":"Shibasaki","sequence":"additional","affiliation":[{"name":"Center for Spatial Information Science, The University of Tokyo, Chiba 277-8568, Japan"},{"name":"Spatial Data Commons, The University of Tokyo, Chiba 277-8568, Japan"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2022,3,15]]},"reference":[{"key":"ref_1","first-page":"1","article-title":"Leveraging big data to support measurement of the sustainable development goals","volume":"1","author":"Gomez","year":"2017","journal-title":"SSRN Electron. J."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"E622","DOI":"10.1016\/S2589-7500(20)30193-X","article-title":"Measuring mobility to monitor travel and physical distancing interventions: A common framework for mobile phone data analysis","volume":"2","author":"Kishore","year":"2020","journal-title":"Lancet Digit. Health"},{"key":"ref_3","unstructured":"ITU (2020). Measuring Digital Development: Facts and Figures 2020, ITU Publication."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"122","DOI":"10.1016\/j.trc.2013.11.003","article-title":"Understanding monthly variability in human activity spaces: A twelve-month study using mobile phone call detail records","volume":"38","author":"Olle","year":"2014","journal-title":"Transp. Res. Part C: Emerg. Technol."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"469","DOI":"10.1016\/j.tourman.2007.05.014","article-title":"Evaluating passive mobile positioning data for tourism surveys: An Estonian case study","volume":"29","author":"Rien","year":"2008","journal-title":"Tour. Manag."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"779","DOI":"10.1038\/nature06958","article-title":"Understanding individual human mobility patterns","volume":"453","author":"Hidalgo","year":"2008","journal-title":"Nature"},{"key":"ref_7","unstructured":"United Nations (2020). The Sustainable Development Goals Report 2020, United Nations Publications."},{"key":"ref_8","unstructured":"UN Global Working Group on Big Data for Official Statistics (2019). Handbook on the Use of Mobile Phone Data for Official Statistics, United Nations Publications."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"254","DOI":"10.1016\/j.trc.2019.02.013","article-title":"Inferring dynamic origin-destination flows by transport mode using mobile phone data","volume":"101","author":"Bachir","year":"2019","journal-title":"Transp. Res. Part C Emerg. Technol."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1016\/j.tmaid.2012.12.003","article-title":"Mobile phones and malaria: Modeling human and parasite travel","volume":"11","author":"Buckee","year":"2013","journal-title":"Travel Med. Infect. Dis."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Bengtsson, L., Lu, X., Thorson, A., Garfield, R., and Schreeb, J.V. (2011). Improved response to disasters and outbreaks by tracking population movements with mobile phone network data: A post-earthquake geospatial study in Haiti. PLoS Med., 8.","DOI":"10.1371\/journal.pmed.1001083"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"E24","DOI":"10.1017\/dap.2021.21","article-title":"Guiding Principles to Maintain Public Trust in the Use of Mobile Operator Data for Policy Purposes","volume":"3","author":"Ronald","year":"2021","journal-title":"Data Policy"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"e20","DOI":"10.1017\/dap.2021.10","article-title":"Challenges and opportunities in accessing mobile phone data for COVID-19 response in developing countries","volume":"3","author":"Milusheva","year":"2021","journal-title":"Data Policy"},{"key":"ref_14","unstructured":"Ayumi, A., Witayangkurn, A., Kanasugi, H., Fan, Z., Ohira, W., Cumbane, S.P., and Shibasaki, R. (2020, January 15\u201317). Building a data ecosystem for using telecom data to inform the COVID-19 response effort. Proceedings of the 5th International Data for Policy Conference 2020, London, UK."},{"key":"ref_15","unstructured":"Flowminder (2021, August 01). FlowKit. Available online: https:\/\/github.com\/Flowminder\/FlowKit."},{"key":"ref_16","unstructured":"The COVID19 Mobility Task Force (2021, August 01). COVID-Mobile-Data. Available online: https:\/\/github.com\/worldbank\/covid-mobile-data."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"20120986","DOI":"10.1098\/rsif.2012.0986","article-title":"The impact of biases in mobile phone ownership on estimates of human mobility","volume":"10","author":"Wesolowski","year":"2013","journal-title":"J. R. Soc. Interface"},{"key":"ref_18","first-page":"145","article-title":"Is the sky falling? New technology, changing media, and the future of surveys","volume":"7","author":"Couper","year":"2013","journal-title":"Surv. Res. Methods"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"15888","DOI":"10.1073\/pnas.1408439111","article-title":"Dynamic population mapping using mobile phone data","volume":"111","author":"Deville","year":"2014","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Liu, Y., Sui, Z., Kang, C., and Gao, Y. (2014). Uncovering patterns of inter-urban trip and spatial interaction from social media check-in data. PLoS ONE, 9.","DOI":"10.1371\/journal.pone.0086026"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1080\/10630731003597306","article-title":"Using Mobile Positioning Data to Model Locations Meaningful to Users of Mobile Phones","volume":"17","author":"Rein","year":"2010","journal-title":"J. Urban Technol."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Wilson, R., Erbach-Schoenberg, E.Z., Albert, M., Power, D., Tudge, S., Gonzalez, M., and Bengtsson, L. (2016). Rapid and near real-time assessments of population displacement using mobile phone data following disasters: The 2015 Nepal earthquake. PLoS Curr., 8.","DOI":"10.1371\/currents.dis.d073fbece328e4c39087bc086d694b5c"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Liu, J., Braun, E., D\u00fcpmeier, C., Kuckertz, P., Ryberg, D.S., Robinius, M., and Hagenmeyer, V. (2019). Architectural concept and evaluation of a framework for the efficient automation of computational scientific work flows: An energy systems analysis example. Appl. Sci., 9.","DOI":"10.3390\/app9040728"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Isah, H., and Zulkernine, F. (2018, January 10\u201313). A Scalable and Robust Framework for Data Stream Ingestion. Proceedings of the 2018 IEEE International Conference on Big Data, Seattle, WA, USA.","DOI":"10.1109\/BigData.2018.8622360"},{"key":"ref_25","unstructured":"Liu, J., Braun, E., Dupmeier, C., Kuckertz, P., Ryberg, D.S., Robinius, M., and Hagenmeyer, V. (May, January 30). A Generic and Highly Scalable Framework for the Automation and Execution of Scientific Data Processing and Simulation Workflows. Proceedings of the IEEE 15th International Conference on Software Architecture, Seattle, WA, USA."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"30","DOI":"10.1007\/s13278-018-0507-0","article-title":"Review of social media analytics process and Big Data pipeline","volume":"8","author":"Sebei","year":"2018","journal-title":"Soc. Netw. Anal. Min."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Pervaiz, F., Vashistha, A., and Anderson, R. (2019, January 3\u20135). Examining the challenges in development data pipeline. Proceedings of the 2019 Conference on Computing and Sustainable Societies, Accra, Ghana.","DOI":"10.1145\/3314344.3332496"},{"key":"ref_28","unstructured":"Omidvar-Tehrani, B., and Amer-Yahia, S. (July, January 30). Data pipelines for user group analytics. Proceedings of the ACM SIGMOD International Conference on Management of Data, Amsterdam, the Netherlands."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"392","DOI":"10.1016\/j.future.2018.05.030","article-title":"PiCo: High-performance data analytics pipelines in modern C++","volume":"87","author":"Misale","year":"2018","journal-title":"Future Gener. Comput. Syst."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Aung, T., Min, H.Y., and Maw, A.H. (2018, January 18\u201320). Performance Evaluation for Real-Time Messaging System in Big Data Pipeline Architecture. Proceedings of the 2018 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, Zhengzhou, China.","DOI":"10.1109\/CyberC.2018.00047"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Yang, J., Dash, M., and Teo, S.G. (2021). PPTPF: Privacy-Preserving Trajectory Publication Framework for CDR Mobile Trajectories. ISPRS Int. J. Geo-Inf., 10.","DOI":"10.3390\/ijgi10040224"},{"key":"ref_32","first-page":"1303","article-title":"Case study: Spark GPU-enabled framework to control COVID-19 spread using cell-phone spatio-temporal data. Computers","volume":"65","author":"Abdallah","year":"2020","journal-title":"Mater. Contin."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"8239047","DOI":"10.1155\/2019\/8239047","article-title":"Applying Big Data Analytics to Monitor Tourist Flow for the Scenic Area Operation Management","volume":"2019","author":"Qin","year":"2019","journal-title":"Discret. Dyn. Nat. Soc."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Novovi\u0107, O., Brdar, S., Mesaro\u0161, M., Crnojevi\u0107, V.N., and Papadopoulos, A. (2020). Uncovering the Relationship between Human Connectivity Dynamics and Land Use. ISPRS Int. J. Geo-Inf., 9.","DOI":"10.3390\/ijgi9030140"},{"key":"ref_35","unstructured":"ITU (2017). Call Detail Record (CDR) Analysis: Republic of Guinea, ITU Report."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Shangguan, B., Yue, P., Wu, Z., and Jiang, L. (2017, January 7\u201310). Big spatial data processing with Apache Spark. Proceedings of the Sixth International Conference on Agro-Geoinformatics, Fairfax, VA, USA.","DOI":"10.1109\/Agro-Geoinformatics.2017.8047039"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Witayangkurn, A., Horanont, T., and Shibasaki, R. (2012, January 1\u20133). Performance comparisons of spatial data processing techniques for a large-scale mobile phone dataset. Proceedings of the 3rd International Conference on Computing for Geospatial Research and Applications, Washington, DC, USA.","DOI":"10.1145\/2345316.2345346"},{"key":"ref_38","unstructured":"Apache Software Foundation (2021, August 01). Hadoop. Available online: https:\/\/hadoop.apache.org."},{"key":"ref_39","unstructured":"Apache Software Foundation (2021, August 01). Spark. Available online: https:\/\/spark.apache.org."},{"key":"ref_40","unstructured":"The Mobipack Software (2021, August 01). Spatial Data Commons. Available online: https:\/\/github.com\/SpatialDataCommons."},{"key":"ref_41","unstructured":"GSMA (2014). GSMA Guidelines on the Protection of Privacy in the Use of Mobile Phone Data for Responding to the Ebola Outbreak, GSMA Guidelines."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Vanhoof, M., Lee, C., and Smoreda, Z. (2020). Performance and sensitivities of home detection on mobile phone data. Big Data Meets Survey Science 2020: A Collection of Innovative Methods, Wiley.","DOI":"10.1002\/9781118976357.ch8"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Bhandari, D.M., Witayangkurn, A., Shibasaki, R., and Rahman, M.M. (2018, January 15\u201317). Estimation of Origin-Destination using Mobile Phone Call Data: A Case Study of Greater Dhaka, Bangladesh. Proceedings of the Thirteenth International Conference on Knowledge, Information and Creativity Support Systems (KICSS), Pattaya, Thailand.","DOI":"10.1109\/KICSS45055.2018.8950620"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Kanasugi, H., Sekimoto, Y., Kurokawa, M., Watanabe, T., Muramatsu, S., and Shibasaki, R. (2013, January 18\u201322). Spatiotemporal Route Estimation Consistent with Human Mobility Using Cellular Network Data. Proceedings of the 2013 IEEE International Conference on Pervasive Computing and Communications Workshops (DERCOM Workshops), San Diego, CA, USA.","DOI":"10.1109\/PerComW.2013.6529493"},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"E9","DOI":"10.1017\/dap.2021.7","article-title":"The hidden potential of call detail records in The Gambia","volume":"3","author":"Arai","year":"2021","journal-title":"Data Policy"},{"key":"ref_46","unstructured":"ITU (2017). Call Detail Record (CDR) Analysis: Republic of Liberia, ITU Report."},{"key":"ref_47","unstructured":"ITU (2017). Call Detail Record (CDR) Analysis: Sierra Leone, ITU Report."},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Batran, M., Arai, A., Kanasugi, H., Cumbane, S.P., Grachane, C., Sekimoto, Y., and Shibasaki, R. (2018, January 11\u201314). Urban Travel Time Estimation in Greater Maputo Using Mobile Phone Big Data. Proceedings of the 2018 IEEE 20th Conference on Business Informatics (CBI), Vienna, Austria.","DOI":"10.1109\/CBI.2018.10057"},{"key":"ref_49","unstructured":"GSMA (2021). Utilising Mobile Big Data and AI to Benefit Society: Insights from the COVID-19 Response, GSMA Report."}],"container-title":["ISPRS International Journal of Geo-Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2220-9964\/11\/3\/196\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T22:36:50Z","timestamp":1760135810000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2220-9964\/11\/3\/196"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,3,15]]},"references-count":49,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2022,3]]}},"alternative-id":["ijgi11030196"],"URL":"https:\/\/doi.org\/10.3390\/ijgi11030196","relation":{},"ISSN":["2220-9964"],"issn-type":[{"value":"2220-9964","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,3,15]]}}}