{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,7]],"date-time":"2026-03-07T16:46:35Z","timestamp":1772901995271,"version":"3.50.1"},"reference-count":46,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2021,1,9]],"date-time":"2021-01-09T00:00:00Z","timestamp":1610150400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Symmetry"],"abstract":"<jats:p>According to research, generally, 2.5 quintillion bytes of data are produced every day. About 90% of the world\u2019s data has been produced in the last two years alone. The amount of data is increasing immensely. There is a fight to use and store this tremendous information effectively. HBase is the top option for storing huge data. HBase has been selected for several purposes, including its scalability, efficiency, strong consistency support, and the capacity to support a broad range of data models. This paper seeks to define, taxonomically classify, and systematically compare existing research on a broad range of storage technologies, methods, and data models based on HBase storage architecture\u2019s symmetry. We perform a systematic literature review on a number of published works proposed for HBase storage architecture. This research synthesis results in a knowledge base that helps understand which big data storage method is an effective one.<\/jats:p>","DOI":"10.3390\/sym13010109","type":"journal-article","created":{"date-parts":[[2021,1,10]],"date-time":"2021-01-10T23:03:42Z","timestamp":1610319822000},"page":"109","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":16,"title":["A Comprehensive Study of HBase Storage Architecture\u2014A Systematic Literature Review"],"prefix":"10.3390","volume":"13","author":[{"given":"Muhammad Umair","family":"Hassan","sequence":"first","affiliation":[{"name":"Department of ICT and Natural Sciences, Norwegian University of Science and Technology (NTNU), Larsg\u00e5rdsvegen 2, 6009 \u00c5lesund, Norway"}]},{"given":"Irfan","family":"Yaqoob","sequence":"additional","affiliation":[{"name":"School of Information Science and Engineering, University of Jinan, Jinan 250022, China"}]},{"given":"Sidra","family":"Zulfiqar","sequence":"additional","affiliation":[{"name":"Department of Computer Science, University of Okara, Okara 56300, Pakistan"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1252-260X","authenticated-orcid":false,"given":"Ibrahim A.","family":"Hameed","sequence":"additional","affiliation":[{"name":"Department of ICT and Natural Sciences, Norwegian University of Science and Technology (NTNU), Larsg\u00e5rdsvegen 2, 6009 \u00c5lesund, Norway"}]}],"member":"1968","published-online":{"date-parts":[[2021,1,9]]},"reference":[{"key":"ref_1","unstructured":"Coughlin, T. (2020, February 13). 2019. Available online: https:\/\/www.seagate.com\/in\/en\/our-story\/data-age-2025\/."},{"key":"ref_2","unstructured":"Morris, T. (2020, March 21). 2019. Available online: https:\/\/www.business2community.com\/big-data\/19-data-and-analytics-predictions-through-2025-02178668."},{"key":"ref_3","first-page":"85","article-title":"Research on vector spatial data storage schema based on Hadoop platform","volume":"6","author":"Zheng","year":"2013","journal-title":"Int. J. Database Theory Appl."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"1825","DOI":"10.1007\/s11227-016-1670-6","article-title":"Distributed RDF store for efficient searching billions of triples based on Hadoop","volume":"72","author":"Um","year":"2016","journal-title":"J. Supercomput."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Zhang, J., Wu, G., Hu, X., and Wu, X. (2012, January 20\u201323). A distributed cache for hadoop distributed file system in real-time cloud services. Proceedings of the 2012 ACM\/IEEE 13th International Conference on Grid Computing, Beijing, China.","DOI":"10.1109\/Grid.2012.17"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Li, M., Zhu, Z., and Chen, G. (2013, January 22\u201326). A scalable and high-efficiency discovery service using a new storage. Proceedings of the 2013 IEEE 37th Annual Computer Software and Applications Conference, Kyoto, Japan.","DOI":"10.1109\/COMPSAC.2013.125"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Kim, M., Choi, J., and Yoon, J. (2015, January 4\u20136). Development of the big data management system on national virtual power plant. Proceedings of the 2015 10th International Conference on P2P, Parallel, Grid, Cloud and Internet Computing (3PGCIC), Krakow, Poland.","DOI":"10.1109\/3PGCIC.2015.101"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"630","DOI":"10.1007\/s10766-017-0513-2","article-title":"Real-time big data stream processing using GPU with spark over hadoop ecosystem","volume":"46","author":"Rathore","year":"2018","journal-title":"Int. J. Parallel Program."},{"key":"ref_9","unstructured":"Smith, K. (2019, December 20). 2018. Available online: https:\/\/www.brandwatch.com\/blog\/facebook-statistics\/."},{"key":"ref_10","unstructured":"George, L. (2011). HBase: The Definitive Guide: Random Access to Your Planet-Size Data, O\u2019Reilly Media, Inc."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Huang, X., Wang, L., Yan, J., Deng, Z., Wang, S., and Ma, Y. (2018, January 28\u201330). Towards Building a Distributed Data Management Architecture to Integrate Multi-Sources Remote Sensing Big Data. Proceedings of the 2018 IEEE 20th International Conference on High Performance Computing and Communications; IEEE 16th International Conference on Smart City; IEEE 4th International Conference on Data Science and Systems (HPCC\/SmartCity\/DSS), Exeter, UK.","DOI":"10.1109\/HPCC\/SmartCity\/DSS.2018.00043"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Taylor, R.C. (2010). An overview of the Hadoop\/MapReduce\/HBase framework and its current applications in bioinformatics. BMC Bioinform., 11.","DOI":"10.1186\/1471-2105-11-S12-S1"},{"key":"ref_13","unstructured":"Sinha, S. (2020, September 28). HBase Tutorial: HBase Introduction and FaceBook Case Study. Available online: https:\/\/www.edureka.co\/blog\/hbase-tutorial."},{"key":"ref_14","unstructured":"Okoli, C., and Schabram, K. (2020, November 10). A Guide to Conducting a Systematic Literature Review of Information Systems Research, Available online: https:\/\/papers.ssrn.com\/sol3\/papers.cfm?abstract_id=1954824."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Zheng, Y., and Liu, C. (2016, January 23\u201324). HBase based storage system for the internet of things. Proceedings of the 2016 4th International Conference on Machinery, Materials and Computing Technology, Hangzhou, China.","DOI":"10.2991\/icmmct-16.2016.96"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13638-017-0827-1","article-title":"A novel HBase data storage in wireless sensor networks","volume":"2017","author":"Li","year":"2017","journal-title":"Eurasip J. Wirel. Commun. Netw."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Liu, B., Zhu, Y., Wang, C., Chen, Y., Huang, T., Shi, W., and Mao, Y. (2016, January 18\u201320). A versatile event-driven data model in hbase database for multi-source data of power grid. Proceedings of the 2016 IEEE International Conference on Smart Cloud (SmartCloud), New York, NY, USA.","DOI":"10.1109\/SmartCloud.2016.28"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Chen, Z., Chen, S., and Feng, X. (2016, January 13\u201315). A design of distributed storage and processing system for internet of vehicles. Proceedings of the 2016 8th International Conference on Wireless Communications & Signal Processing (WCSP), Yangzhou, China.","DOI":"10.1109\/WCSP.2016.7752671"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Liu, B., Huang, R., Huang, T., and Yan, Y. (2017, January 26\u201329). MSDB: A massive sensor data processing middleware for HBase. Proceedings of the 2017 IEEE Second International Conference on Data Science in Cyberspace (DSC), Shenzhen, China.","DOI":"10.1109\/DSC.2017.90"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"238","DOI":"10.1016\/j.procs.2018.03.071","article-title":"An improved distributed storage and query for remote sensing data","volume":"129","author":"Jing","year":"2018","journal-title":"Procedia Comput. Sci."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Gao, F., Yue, P., Wu, Z., and Zhang, M. (2017, January 7\u201310). Geospatial data storage based on HBase and MapReduce. Proceedings of the 2017 6th International Conference on Agro-Geoinformatics, Fairfax, VA, USA.","DOI":"10.1109\/Agro-Geoinformatics.2017.8047040"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"301","DOI":"10.1080\/14498596.2018.1440648","article-title":"Building an efficient storage model of spatial-temporal information based on HBase","volume":"64","author":"Wang","year":"2019","journal-title":"J. Spat. Sci."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"3657","DOI":"10.1007\/s10586-017-1253-1","article-title":"HBase storage schemas for massive spatial vector data","volume":"20","author":"Wang","year":"2017","journal-title":"Clust. Comput."},{"key":"ref_24","first-page":"012168","article-title":"Research and Implementation of Geography Information Query System Based on HBase","volume":"Volume 384","author":"Qian","year":"2019","journal-title":"IOP Conference Series: Earth and Environmental Science"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Qin, J., Ma, L., and Niu, J. (2018, January 21\u201323). Massive AIS Data Management Based on HBase and Spark. Proceedings of the 2018 3rd Asia-Pacific Conference on Intelligent Robot Systems (ACIRS), Singapore.","DOI":"10.1109\/ACIRS.2018.8467233"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Nitnaware, C., and Khan, A. (2015, January 19\u201320). A multi-dimensional data storage model for location based application on Hbase. Proceedings of the 2015 International Conference on Innovations in Information, Embedded and Communication Systems (ICIIECS), Coimbatore, India.","DOI":"10.1109\/ICIIECS.2015.7193237"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Zhang, L., Li, Q., Li, Y., and Cai, Y. (2018, January 18\u201321). A Distributed Storage Model for Healthcare Big Data Designed on HBase. Proceedings of the 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Honolulu, HI, USA.","DOI":"10.1109\/EMBC.2018.8513400"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Gui, H., Zheng, R., Ma, C., Fan, H., and Xu, L. (2016). An architecture for healthcare big data management and analysis. International Conference on Health Information Science, Springer.","DOI":"10.1007\/978-3-319-48335-1_17"},{"key":"ref_29","first-page":"2418","article-title":"A Storage Model of Equipment Data Based on HBase","volume":"713\u2013715","author":"Lei","year":"2014","journal-title":"Appl. Mech. Mater."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"324","DOI":"10.26599\/BDMA.2018.9020026","article-title":"Distributed storage system for electric power data based on hbase","volume":"1","author":"Jin","year":"2018","journal-title":"Big Data Min. Anal."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Daki, H., El Hannani, A., and Ouahmane, H. (2018, January 26\u201327). HBase-based storage system for electrical consumption forecasting in a Moroccan engineering school. Proceedings of the 2018 4th International Conference on Optimization and Applications (ICOA), Mohammedia, Morocco.","DOI":"10.1109\/ICOA.2018.8370520"},{"key":"ref_32","first-page":"739","article-title":"An HBase-based platform for massive power data storage in power system","volume":"Volume 1070","author":"Yan","year":"2015","journal-title":"Advanced Materials Research"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Zhengjun, P., and Lianfen, Z. (2018, January 20\u201322). Application and research of massive big data storage system based on HBase. Proceedings of the 2018 IEEE 3rd International Conference on Cloud Computing and Big Data Analysis (ICCCBDA), Chengdu, China.","DOI":"10.1109\/ICCCBDA.2018.8386515"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Wen, S. (2018, January 30\u201331). Efficient DNA Sequences Storage Scheme based on HBase. Proceedings of the 2018 International Conference on Mechanical, Electronic, Control and Automation Engineering (MECAE 2018), Qingdao, China.","DOI":"10.2991\/mecae-18.2018.122"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Zhuang, H., Lu, K., Li, C., Sun, M., Chen, H., and Zhou, X. (2015, January 4\u20137). Design of a more scalable database system. Proceedings of the 2015 15th IEEE\/ACM International Symposium on Cluster, Cloud and Grid Computing, Shenzhen, China.","DOI":"10.1109\/CCGrid.2015.70"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Hong, S., Cho, M., Shin, S., Um, J.H., Seon, C.N., and Song, S.K. (2016, January 3\u20136). Optimizing hbase table scheme for marketing strategy suggestion. Proceedings of the 2016 8th International Conference on Knowledge and Smart Technology (KST), Chiangmai, Thailand.","DOI":"10.1109\/KST.2016.7440532"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Saloustros, G., and Magoutis, K. (July, January 29). Rethinking HBase: Design and implementation of an elastic key-value store over log-structured local volumes. Proceedings of the 2015 14th International Symposium on Parallel and Distributed Computing, Limassol, Cyprus.","DOI":"10.1109\/ISPDC.2015.33"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Zhu, L., and Li, Y. (2015, January 11\u201313). Distributed storage and analysis of massive urban road traffic flow data based on Hadoop. Proceedings of the 2015 12th Web Information System and Application Conference (WISA), Jinan, China.","DOI":"10.1109\/WISA.2015.29"},{"key":"ref_39","first-page":"1053","article-title":"G-hbase: A high performance geographical database based on hbase","volume":"101","author":"Takasu","year":"2018","journal-title":"IEICE Trans. Inf. Syst."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Kuo, C.T., and Hon, W.K. (April, January 30). Practical index framework for efficient time-travel phrase queries on versioned documents. Proceedings of the 2016 Data Compression Conference (DCC), Snowbird, UT, USA.","DOI":"10.1109\/DCC.2016.52"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Cao, C., Wang, W., Zhang, Y., and Ma, X. (2017, January 25\u201330). Leveraging column family to improve multi-dimensional query performance in HBase. Proceedings of the 2017 IEEE 10th International Conference on Cloud Computing (CLOUD), Honolulu, CA, USA.","DOI":"10.1109\/CLOUD.2017.22"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Wu, H., Zhu, Y., Wang, C., Hou, J., Li, M., Xue, Q., and Mao, K. (2017, January 3\u20135). A performance-improved and storage-efficient secondary index for big data processing. Proceedings of the 2017 IEEE International Conference on Smart Cloud (SmartCloud), New York, NY, USA.","DOI":"10.1109\/SmartCloud.2017.32"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Chi, Y., Yang, Y., Xu, P., Li, G., and Li, S. (2018, January 9\u201312). Design and implementation of monitoring data storage and processing scheme based on distributed computing. Proceedings of the 2018 IEEE 3rd International Conference on Big Data Analysis (ICBDA), Shanghai, China.","DOI":"10.1109\/ICBDA.2018.8367678"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Xu, Y., Zou, Q., and Feng, X. (2016, January 15\u201318). Efficient and Timely Querying of Massive Trajectory Data in Internet of Vehicles. Proceedings of the 2016 IEEE International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, Physical and Social Computing (CPSCom) and IEEE Smart Data (SmartData), Chengdu, China.","DOI":"10.1109\/iThings-GreenCom-CPSCom-SmartData.2016.73"},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1016\/j.ijinfomgt.2018.08.006","article-title":"Real-time big data processing for anomaly detection: A survey","volume":"45","author":"Habeeb","year":"2019","journal-title":"Int. J. Inf. Manag."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1016\/j.bdr.2019.03.001","article-title":"Systematic review of the literature on big data in the transportation domain: Concepts and applications","volume":"17","author":"Neilson","year":"2019","journal-title":"Big Data Res."}],"container-title":["Symmetry"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2073-8994\/13\/1\/109\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T05:09:17Z","timestamp":1760159357000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2073-8994\/13\/1\/109"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,1,9]]},"references-count":46,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2021,1]]}},"alternative-id":["sym13010109"],"URL":"https:\/\/doi.org\/10.3390\/sym13010109","relation":{},"ISSN":["2073-8994"],"issn-type":[{"value":"2073-8994","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,1,9]]}}}