{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,30]],"date-time":"2025-12-30T08:57:23Z","timestamp":1767085043076,"version":"build-2065373602"},"reference-count":45,"publisher":"MDPI AG","issue":"8","license":[{"start":{"date-parts":[[2021,8,13]],"date-time":"2021-08-13T00:00:00Z","timestamp":1628812800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100004837","name":"Spanish Ministry of Science and Innovation","doi-asserted-by":"publisher","award":["857191"],"award-info":[{"award-number":["857191"]}],"id":[{"id":"10.13039\/501100004837","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100010675","name":"H2020 Innovation In SMEs","doi-asserted-by":"publisher","award":["ICT-11-2018-2019"],"award-info":[{"award-number":["ICT-11-2018-2019"]}],"id":[{"id":"10.13039\/100010675","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002809","name":"Generalitat de Catalunya","doi-asserted-by":"publisher","award":["2017-SGR-1414"],"award-info":[{"award-number":["2017-SGR-1414"]}],"id":[{"id":"10.13039\/501100002809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Data"],"abstract":"<jats:p>The recent great technological advance has led to a broad proliferation of Monitoring Infrastructures, which typically keep track of specific assets along time, ranging from factory machinery, device location, or even people. Gathering this data has become crucial for a wide number of applications, like exploration dashboards or Machine Learning techniques, such as Anomaly Detection. Time-Series Databases, designed to handle these data, grew in popularity, becoming the fastest-growing database type from 2019. In consequence, keeping track and mastering those rapidly evolving technologies became increasingly difficult. This paper introduces the holistic design approach followed for building NagareDB, a Time-Series database built on top of MongoDB\u2014the most popular NoSQL Database, typically discouraged in the Time-Series scenario. The goal of NagareDB is to ease the access to three of the essential resources needed to building time-dependent systems: Hardware, since it is able to work in commodity machines; Software, as it is built on top of an open-source solution; and Expert Personnel, as its foundation database is considered the most popular NoSQL DB, lowering its learning curve. Concretely, NagareDB is able to outperform MongoDB recommended implementation up to 4.7 times, when retrieving data, while also offering a stream-ingestion up to 35% faster than InfluxDB, the most popular Time-Series database. Moreover, by relaxing some requirements, NagareDB is able to reduce the disk space usage up to 40%.<\/jats:p>","DOI":"10.3390\/data6080091","type":"journal-article","created":{"date-parts":[[2021,8,13]],"date-time":"2021-08-13T05:34:46Z","timestamp":1628832886000},"page":"91","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["NagareDB: A Resource-Efficient Document-Oriented Time-Series Database"],"prefix":"10.3390","volume":"6","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-8185-3667","authenticated-orcid":false,"given":"Carlos Garcia","family":"Calatrava","sequence":"first","affiliation":[{"name":"Barcelona Supercomputing Center, Pla\u00e7a Eusebi G\u00fcell, 1\u20133, 08034 Barcelona, Spain"},{"name":"Department of Computer Architecture, Universitat Polit\u00e8cnica de Catalunya (BarcelonaTech), C. Jordi Girona, 31, 08034 Barcelona, Spain"}]},{"given":"Yolanda Becerra","family":"Fontal","sequence":"additional","affiliation":[{"name":"Barcelona Supercomputing Center, Pla\u00e7a Eusebi G\u00fcell, 1\u20133, 08034 Barcelona, Spain"},{"name":"Department of Computer Architecture, Universitat Polit\u00e8cnica de Catalunya (BarcelonaTech), C. Jordi Girona, 31, 08034 Barcelona, Spain"}]},{"given":"Fernando M.","family":"Cucchietti","sequence":"additional","affiliation":[{"name":"Barcelona Supercomputing Center, Pla\u00e7a Eusebi G\u00fcell, 1\u20133, 08034 Barcelona, Spain"}]},{"given":"Carla Div\u00ed","family":"Cuesta","sequence":"additional","affiliation":[{"name":"Barcelona Supercomputing Center, Pla\u00e7a Eusebi G\u00fcell, 1\u20133, 08034 Barcelona, Spain"}]}],"member":"1968","published-online":{"date-parts":[[2021,8,13]]},"reference":[{"key":"ref_1","unstructured":"Golonka, P., Gonzalez-Berges, M., Guzik, J., and Kulaga, R. (2017, January 8\u201313). Future archiver for CERN SCADA systems. Proceedings of the International Conference on Accelerator and Large Experimental Control Systems (ICALEPCS2017), Barcelona, Spain."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"2581","DOI":"10.1109\/TKDE.2017.2740932","article-title":"Time Series Management Systems: A Survey","volume":"29","author":"Jensen","year":"2017","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Stonebraker, M., and Cetintemel, U. (2005, January 5\u20138). \u201cOne size fits all\u201d: An idea whose time has come and gone. Proceedings of the 21st International Conference on Data Engineering (ICDE\u201905), Tokyo, Japan.","DOI":"10.1109\/ICDE.2005.1"},{"key":"ref_4","unstructured":"(2021, February 23). The DB-Engines Ranking, according to Their Popularity. Available online: https:\/\/db-engines.com\/en\/ranking."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"2151","DOI":"10.1002\/qre.2008","article-title":"How Can SMEs Benefit from Big Data? Challenges and a Path Forward","volume":"32","author":"Coleman","year":"2016","journal-title":"Qual. Reliab. Eng. Int."},{"key":"ref_6","unstructured":"Davenport, T.H., and Patil, D.J. (2021, April 12). Harvard Business Review: \u201cData Scientist: The Sexiest Job of the 21st Century\u201d. Available online: https:\/\/hbr.org\/2012\/10\/data-scientist-the-sexiest-job-of-the-21st-century."},{"key":"ref_7","unstructured":"Hajek, V., Klapka, T., and Kudibal, O. (2021, February 23). Benchmarking InfluxDB vs. MongoDB for Time Series Data, Metrics & Management. An Influxdata Technical Paper. Available online: https:\/\/www.influxdata.com\/blog\/influxdb-is-27x-faster-vs-mongodb-for-time-series-workloads\/."},{"key":"ref_8","unstructured":"Kiefer, R., and Sewrathan, A. (2021, February 23). How to Store Time-Series Data in MongoDB, and Why That\u2019s a Bad Idea. Available online: https:\/\/blog.timescale.com\/blog\/how-to-store-time-series-data-mongodb-vs-timescaledb-postgresql-a73939734016\/."},{"key":"ref_9","unstructured":"Makris, A., Tserpes, T., Spiliopoulos, G., and Anagnostopoulos, D. (2019, January 26\u201329). Performance Evaluation of MongoDB and PostgreSQL for Spatio-temporal Data. Proceedings of the International Conference on Database Theory, EDBT\/ICDT Workshops, Lisbon, Portugal."},{"key":"ref_10","unstructured":"Bader, A., Kopp, O., and Falkental, M. (2017). Survey and Comparison of Open Source Time Series Databases, Lecture Notes in Informatics (LNI), University of Stuttgart."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Patnaik, L.M., and Gill, P.S. (1985). GRDB: A general purpose relational database system. Information System, Elsevier. [10th ed.].","DOI":"10.1016\/0306-4379(85)90034-1"},{"key":"ref_12","unstructured":"Zhaofeng, Z. (2021, March 26). Key Concepts and Features of Time Series Databases. Available online: https:\/\/www.alibabacloud.com\/blog\/key-concepts-and-features-of-time-series-databases_594734."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Brockwell, P.J., and Davis, R.A. (1986). Time Series: Theory and Methods, Springerg.","DOI":"10.1007\/978-1-4899-0004-3"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Hecht, R., and Jablonski, S. (2011, January 12\u201314). NoSQL evaluation: A use case oriented survey. Proceedings of the International Conference on Cloud and Service Computing, Hong Kong, China.","DOI":"10.1109\/CSC.2011.6138544"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1109\/MC.2012.33","article-title":"Consistency Tradeoffs in Modern Distributed Database System Design: CAP is Only Part of the Story","volume":"45","author":"Abadi","year":"2012","journal-title":"IEEE Comput."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Blok, H.E., Hiemstra, D., Choenni, S., Jong, F., Blanken, H., and Apers, P. (2001, January 5\u201310). Predicting the Cost-Quality Trade-off for Information Retrieval Queries: Facilitating Database Design and Query Optimization. Proceedings of the ACM Tenth International Conference on Information and Knowledge Management, Atlanta, GA, USA.","DOI":"10.1145\/502585.502621"},{"key":"ref_17","unstructured":"Jovanovski, J., Arsov, N., Stevanoska, E., Simons, M.J., and Velinov, G. (2006, January 26\u201328). A meta-heuristic approach for RLE compression in a column store table. Proceedings of the ACM SIGMOD International Conference on Management of Data, Chicago, IL, USA."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Gupta, A., Bansal, A., and Khanduja, V. (2017, January 22\u201324). Modern Lossless Compression Techniques: Review, Comparison and Analysis. Proceedings of the 2nd IEEE International Conference on Electrical, Computer and Communication Technologies, Coimbatore, India.","DOI":"10.1109\/ICECCT.2017.8117850"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Ding, N., Gao, H., Bu, H., Ma, H., and Si, H. (2018). Multivariate-Time-Series-Driven Real-time Anomaly Detection Based on Bayesian Network. Sensors, 18.","DOI":"10.3390\/s18103367"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Mallak, A., and Fathi, M. (2021). Sensor and Component Fault Detection and Diagnosis for Hydraulic Machinery Integrating LSTM Autoencoder Detector and Diagnostic Classifiers. Sensors, 21.","DOI":"10.3390\/s21020433"},{"key":"ref_21","unstructured":"(2021, March 15). MongoDB Website. Available online: https:\/\/www.mongodb.com\/."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Parker, Z., Poe, S., and Vrbsky, S.V. (2013, January 4\u20136). Comparing NoSQL MongoDB to an SQL DB. Proceedings of the 51st ACM Southeast Conference, Savannah, GA, USA.","DOI":"10.1145\/2498328.2500047"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"10","DOI":"10.1145\/1721654.1721659","article-title":"SQL Databases v. NoSQL Databases","volume":"53","author":"Stonebraker","year":"2010","journal-title":"Commun. ACM"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Gu, Y., Wang, X., Shen, S., Wang, J., and Kim, J. (2015, January 6\u20138). Analysis of data storage mechanism in NoSQL database MongoDB. Proceedings of the IEEE International Conference on Consumer Electronics, Taipei, Taiwan.","DOI":"10.1109\/ICCE-TW.2015.7217036"},{"key":"ref_25","unstructured":"Yuhanna, N., Leganza, G., and Perdoni, R. (2019). The Forrester Wave\u2122: Big Data NoSQL, Forrester. Q1 2019 Report."},{"key":"ref_26","unstructured":"(2021, March 15). MongoDB Manual. Available online: https:\/\/docs.mongodb.com\/manual\/."},{"key":"ref_27","unstructured":"(2021, March 15). InfluxDB Website. Available online: https:\/\/www.influxdata.com\/."},{"key":"ref_28","unstructured":"Freedman, M., and Sewrathan, A. (2021, February 23). TimescaleDB vs. InfluxDB: Purpose Built Differently for Time-Series Data. Available online: https:\/\/blog.timescale.com\/blog\/timescaledb-vs-influxdb-for-time-series-data-timescale-influx-sql-nosql-36489299877."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Abadi, D.J., Madden, S.R., and Hachem, N. (2008, January 9\u201312). Column-Stores vs. Row-Stores: How Different Are They Really?. Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, Vancouver, BC, Canada.","DOI":"10.1145\/1376616.1376712"},{"key":"ref_30","unstructured":"(2021, March 22). Time Series and MongoDB: Best Practices. Available online: https:\/\/www.mongodb.com\/blog\/post\/time-series-data-and-mongodb-part-2-schema-design-best-practices."},{"key":"ref_31","unstructured":"(2021, March 26). Too Many Open Files Problem, in InfluxDB Github. Available online: https:\/\/github.com\/influxdata\/influxdb\/search?q=too+many+open+files&type=issues."},{"key":"ref_32","unstructured":"Bovet, D.P., and Cesati, M. (2005). Understanding the Linux Kernel, O\u2019Reilly Media, Inc.. [3rd ed.]."},{"key":"ref_33","unstructured":"(2021, March 26). InfluxDB Documentation. Available online: https:\/\/docs.influxdata.com\/influxdb\/v2.0\/."},{"key":"ref_34","unstructured":"Wang, N., Choi, J., Brand, D., Chen, C., and Gopalakrishnan, K. (2018, January 3\u20138). Training Deep Neural Networks with 8-Bit Floating Point Numbers. Proceedings of the 32nd International Conference on Neural Information Processing Systems, Montr\u00e9al, QC, Canada."},{"key":"ref_35","unstructured":"Gupta, S., Agrawal, A., Gopalakrishnan, K., and Narayanan, P. (2015, January 7\u20139). Deep Learning with Limited Numerical Precision. Proceedings of the 32nd International Conference on International Conference on Machine Learning, Lille, France."},{"key":"ref_36","unstructured":"(2021, July 04). Google Snappy Algorithm Github. Available online: http:\/\/google.github.io\/snappy\/."},{"key":"ref_37","unstructured":"(2021, March 26). Zstandard Benchmarking. Available online: https:\/\/facebook.github.io\/zstd\/."},{"key":"ref_38","unstructured":"(2021, March 26). InfluxDB Glossary Reference. Available online: https:\/\/docs.influxdata.com\/influxdb\/cloud\/reference\/glossary\/#precision."},{"key":"ref_39","unstructured":"(2021, March 26). Understanding Dependent Tags In Series Cardinality. Available online: https:\/\/www.influxdata.com\/blog\/tldr-influxdb-tech-tips-december-15-2016\/."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Micheloni, R., Marelli, A., and Eshghi, K. (2012). Inside Solid State Drives (SSDs), Springer.","DOI":"10.1007\/978-94-007-5146-0"},{"key":"ref_41","unstructured":"Agrawal, N., Prabhakaran, V., Wobber, T., Davis, J.D., Manasse, M., and Panigrahy, R. (2008, January 23\u201324). Design Tradeoffs for SSD Performance. Proceedings of the USENIX Annual Technical Conference, Boston, MA, USA."},{"key":"ref_42","unstructured":"(2021, July 04). InfluxDB Hardware Sizing. Available online: https:\/\/docs.influxdata.com\/influxdb\/v1.8\/guides\/hardware_sizing\/#storage-volume-and-iops."},{"key":"ref_43","unstructured":"(2021, April 12). Indexes Reference, Glossary of Concepts. Available online: https:\/\/docs.influxdata.com\/influxdb\/v1.8\/concepts\/glossary\/#field-value."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Chopade, R., and Pachghare, V. (2020). MongoDB Indexing for Performance Improvement, ICT Systems and Sustainability, Springer.","DOI":"10.1007\/978-981-15-0936-0_56"},{"key":"ref_45","unstructured":"(2021, April 12). InfluxDB Storage Engine. Available online: https:\/\/docs.influxdata.com\/influxdb\/v2.0\/reference\/internals\/storage-engine\/."}],"container-title":["Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2306-5729\/6\/8\/91\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T06:45:14Z","timestamp":1760165114000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2306-5729\/6\/8\/91"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,8,13]]},"references-count":45,"journal-issue":{"issue":"8","published-online":{"date-parts":[[2021,8]]}},"alternative-id":["data6080091"],"URL":"https:\/\/doi.org\/10.3390\/data6080091","relation":{},"ISSN":["2306-5729"],"issn-type":[{"type":"electronic","value":"2306-5729"}],"subject":[],"published":{"date-parts":[[2021,8,13]]}}}