{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,10]],"date-time":"2026-06-10T15:43:10Z","timestamp":1781106190285,"version":"3.54.1"},"reference-count":43,"publisher":"IGI Global Scientific Publishing","issue":"3","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,7]]},"abstract":"<jats:p>Before the arrival of the Big Data era, data warehouse (DW) systems were considered the best decision support systems (DSS). DW systems have always helped organizations around the world to analyse their stored data and use it in making decisive decisions. However, analyzing and mining data of poor quality can give the wrong conclusions. Several data quality (DQ) problems can appear during a data warehouse project like missing values, duplicates values, integrity constrains issues and more. As a result, organizations around the world are more aware of the importance of data quality and invest a lot of money in order to manage data quality in the DW systems. On the other hand, with the arrival of BD, new challenges have to be considered like the need for collecting the most recent data and the ability to make real-time decisions. This article provides a survey about the exiting techniques to control the quality of the stored data in the DW systems and the new solutions proposed in the literature to face the new Big Data requirements.<\/jats:p>","DOI":"10.4018\/ijoci.2020070101","type":"journal-article","created":{"date-parts":[[2020,6,17]],"date-time":"2020-06-17T10:02:30Z","timestamp":1592388150000},"page":"1-13","source":"Crossref","is-referenced-by-count":3,"title":["Data Warehouses and Big Data"],"prefix":"10.4018","volume":"10","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7354-4920","authenticated-orcid":true,"given":"Hamid Naceur","family":"Benkhaled","sequence":"first","affiliation":[{"name":"EEDIS Laboratory, University of Djillali Liabes, Sidi Bel Abbes, Algeria"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Djamel","family":"Berrabah","sequence":"additional","affiliation":[{"name":"EEDIS Laboratory, University of Djillali Liabes, Sidi Bel Abbes, Algeria"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Faouzi","family":"Boufares","sequence":"additional","affiliation":[{"name":"LIPN Laboratory, Paris13 University, Paris, France"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"2432","reference":[{"key":"IJOCI.2020070101-0","unstructured":"Bala, M., Boussaid, O., Alimazighi, Z., & Bentayeb, F. (2014). Pfetl: vers l\u2019int\u00e9gration de donn\u00e9es massives dans les fonctionnalit\u00e9s d\u2019etl. In INFORSID (pp. 61\u201376). Academic Press."},{"key":"IJOCI.2020070101-1","doi-asserted-by":"publisher","DOI":"10.1109\/MC.2015.76"},{"key":"IJOCI.2020070101-2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-24106-7"},{"key":"IJOCI.2020070101-3","doi-asserted-by":"publisher","DOI":"10.1016\/j.inffus.2015.08.005"},{"key":"IJOCI.2020070101-4","unstructured":"Benkhaled, H. N., & Berrabah, D. (2019). Data Quality Management For Data Warehouse Systems: State Of The Art. In Proceedings of JERI 2019. Academic Press."},{"key":"IJOCI.2020070101-5","doi-asserted-by":"publisher","DOI":"10.1109\/CoDIT.2019.8820340"},{"key":"IJOCI.2020070101-6","doi-asserted-by":"publisher","DOI":"10.1007\/s10586-013-0266-7"},{"key":"IJOCI.2020070101-7","doi-asserted-by":"publisher","DOI":"10.1109\/WICT.2011.6141255"},{"key":"IJOCI.2020070101-8","unstructured":"Cisco. (2016). Global mobile data traffic forecast update, 2015\u2013 2020 white paper."},{"key":"IJOCI.2020070101-9","unstructured":"Dijcks, J. P. (2012). Oracle: Big data for the enterprise. Oracle."},{"key":"IJOCI.2020070101-10","doi-asserted-by":"publisher","DOI":"10.1145\/1651291.1651299"},{"key":"IJOCI.2020070101-11","unstructured":"Feugey. (2016), D. Ne confondez pas le big data avec un data warehouse g\u00e9ant. Retrieved from https:\/\/www.silicon.fr\/hub\/hpe-intel-hub\/ne-confondez-pas-le-big-data-avecun-data-warehouse-geant\/amp"},{"key":"IJOCI.2020070101-12","unstructured":"Geiger, J. G. (2004). Data quality management, the most critical initiative you can implement."},{"key":"IJOCI.2020070101-13","unstructured":"Geisler, S., Weber, S., & Quix, C. (2011). An ontology-based data quality framework for data stream applications. In Proceedings of the16th International Conference on Information Quality (pp. 145\u2013159). Academic Press."},{"key":"IJOCI.2020070101-14","unstructured":"Helfert, M., & Herrmann, C. (2002). Proactive data quality management for data warehouse systems. In DMDW (pp. 97\u2013106). Academic Press."},{"key":"IJOCI.2020070101-15","unstructured":"Helfert, M., Zellner, G., and Sousa, C. (2002). Data quality problems and proactive data quality management in data-warehouse-systems. In Proceedings of BITWorld. Academic Press."},{"key":"IJOCI.2020070101-16","article-title":"Building the data warehouse. QED Technical Publishing","author":"W.Inmon","year":"1992","journal-title":"Group"},{"key":"IJOCI.2020070101-17","unstructured":"Jensen, C. S. (2010). Synthesis lectures on data management."},{"key":"IJOCI.2020070101-18","doi-asserted-by":"publisher","DOI":"10.1016\/j.future.2017.02.010"},{"key":"IJOCI.2020070101-19","doi-asserted-by":"crossref","unstructured":"Kumar, V. & Thareja, R. (2013). A simplified approach for quality management in data warehouse.","DOI":"10.5121\/ijdkp.2013.3506"},{"key":"IJOCI.2020070101-20","doi-asserted-by":"publisher","DOI":"10.14778\/2367502.2367528"},{"key":"IJOCI.2020070101-21","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-59834-5"},{"key":"IJOCI.2020070101-22","unstructured":"Meehan, J., Aslantas, C., Zdonik, S., Tatbul, N., & Du, J. (2017). Data ingestion for the connected world. In Proceedings of CIDR. Academic Press."},{"key":"IJOCI.2020070101-23","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-01112-2_30"},{"key":"IJOCI.2020070101-24","doi-asserted-by":"publisher","DOI":"10.4018\/IJSITA.2017100102"},{"key":"IJOCI.2020070101-25","doi-asserted-by":"crossref","unstructured":"Palepu, R.B. & Rao, D. (2012). Meta data quality control architecture in data warehousing. International Journal of Computer Science, Engineering and Information Technology, 15\u201324.","DOI":"10.5121\/ijcseit.2012.2402"},{"key":"IJOCI.2020070101-26","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-018-6155-6"},{"key":"IJOCI.2020070101-27","doi-asserted-by":"publisher","DOI":"10.1145\/1980022.1980130"},{"issue":"4","key":"IJOCI.2020070101-28","first-page":"3","article-title":"Data cleaning: Problems and current approaches.","volume":"23","author":"E.Rahm","year":"2000","journal-title":"IEEE Data Eng. Bull."},{"key":"IJOCI.2020070101-29","unstructured":"Redmond, W. (2012). The big bang: How the big data explosion is changing the world."},{"key":"IJOCI.2020070101-30","doi-asserted-by":"publisher","DOI":"10.1109\/CTS.2013.6567202"},{"key":"IJOCI.2020070101-31","doi-asserted-by":"publisher","DOI":"10.4236\/jcc.2014.29013"},{"issue":"2","key":"IJOCI.2020070101-32","first-page":"1","article-title":"Data warehouse and big data integration.","volume":"9","author":"S. O.Salinas","year":"2017","journal-title":"Int. Journal of Comp. Sci. and Inf. Tech"},{"key":"IJOCI.2020070101-33","doi-asserted-by":"publisher","DOI":"10.1145\/1451940.1451949"},{"issue":"1","key":"IJOCI.2020070101-34","first-page":"21","article-title":"Towards implementing total data quality management in a data warehouse.","volume":"16","author":"G.Shankaranarayanan","year":"2005","journal-title":"Journal of Information Technology Management"},{"issue":"3","key":"IJOCI.2020070101-35","first-page":"41","article-title":"A descriptive classification of causes of data quality problems in data warehousing.","volume":"7","author":"R.Singh","year":"2010","journal-title":"International Journal of Computer Science Issues"},{"key":"IJOCI.2020070101-36","doi-asserted-by":"publisher","DOI":"10.1145\/1183512.1183526"},{"key":"IJOCI.2020070101-37","first-page":"307","article-title":"A UML based approach for modeling ETL processes in data warehouses.","author":"J.Trujillo","year":"2003","journal-title":"Proceedings of the International Conference on Conceptual Modeling"},{"key":"IJOCI.2020070101-38","doi-asserted-by":"crossref","unstructured":"ur Rehman, M. H., Chang, V., Batool, A., & Wah, T. Y. (2016). Big data reduction framework for value creation in sustainable enterprises. International Journal of Information Management, 36(6), 917\u2013928.","DOI":"10.1016\/j.ijinfomgt.2016.05.013"},{"key":"IJOCI.2020070101-39","doi-asserted-by":"publisher","DOI":"10.1145\/583890.583893"},{"key":"IJOCI.2020070101-40","doi-asserted-by":"publisher","DOI":"10.4236\/jcc.2016.45012"},{"key":"IJOCI.2020070101-41","unstructured":"Zaidi, H., Boufar\u00e8s, F., & Pollet, Y. (2016b). Nettoyage de donn\u00e9es guid\u00e9 par les s\u00e9mantiques inter-colonnes. In EGC (pp. 549\u2013550). Academic Press."},{"key":"IJOCI.2020070101-42","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-23781-7_5"}],"container-title":["International Journal of Organizational and Collective Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.igi-global.com\/viewtitle.aspx?TitleId=256978","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,5,6]],"date-time":"2022-05-06T10:08:13Z","timestamp":1651831693000},"score":1,"resource":{"primary":{"URL":"http:\/\/services.igi-global.com\/resolvedoi\/resolve.aspx?doi=10.4018\/IJOCI.2020070101"}},"subtitle":["How to Cope With Data Quality"],"short-title":[],"issued":{"date-parts":[[2020,7]]},"references-count":43,"journal-issue":{"issue":"3"},"URL":"https:\/\/doi.org\/10.4018\/ijoci.2020070101","relation":{},"ISSN":["1947-9344","1947-9352"],"issn-type":[{"value":"1947-9344","type":"print"},{"value":"1947-9352","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,7]]}}}