{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,11]],"date-time":"2025-12-11T03:05:48Z","timestamp":1765422348709},"reference-count":33,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,11,18]],"date-time":"2023-11-18T00:00:00Z","timestamp":1700265600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,11,18]],"date-time":"2023-11-18T00:00:00Z","timestamp":1700265600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Big Data"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Distributed database system (DDBS) design is still an open challenge even after decades of research, especially in a dynamic network setting. Hence, to meet the demands of high-speed data gathering and for the management and preservation of huge systems, it is important to construct a distributed database for real-time data storage. Incidentally, some fragmentation schemes, such as horizontal, vertical, and hybrid, are widely used for DDBS design. At the same time, data allocation could not be done without first physically fragmenting the data because the fragmentation process is the foundation of the DDBS design. Extensive research have been conducted to develop effective solutions for DDBS design problems. But the great majority of them barely consider the RDDBS's initial design. Therefore, this work aims at proposing a clustering-based horizontal fragmentation and allocation technique to handle both the early and late stages of the DDBS design. To ensure that each operation flows into the next without any increase in complexity, fragmentation and allocation are done simultaneously. With this approach, the main goals are to minimize communication expenses, response time, and irrelevant data access. Most importantly, it has been observed that the proposed approach may effectively expand RDDBS performance by simultaneously fragmenting and assigning various relations. Through simulations and experiments on synthetic and real databases, we demonstrate the viability of our strategy and how it considerably lowers communication costs for typical access patterns at both the early and late stages of design.<\/jats:p>","DOI":"10.1186\/s40537-023-00849-7","type":"journal-article","created":{"date-parts":[[2023,11,18]],"date-time":"2023-11-18T17:02:12Z","timestamp":1700326932000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["On hierarchical clustering-based approach for RDDBS design"],"prefix":"10.1186","volume":"10","author":[{"given":"Hassan I.","family":"Abdalla","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ali A.","family":"Amer","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sri Devi","family":"Ravana","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2023,11,18]]},"reference":[{"key":"849_CR1","doi-asserted-by":"crossref","unstructured":"Ortiz-Ballona AO, Rodr\u00edguez-Mazahua L, L\u00f3pez-Chau A, Abud-Figueroa MA, Romero-Torres C, Castro-Medina F. A brief review of vertical fragmentation methods considering multimedia databases and content-based queries. In: International conference on software process improvement. Cham: Springer; 2021, October. p. 55\u201368).","DOI":"10.1007\/978-3-030-89909-7_5"},{"issue":"1","key":"849_CR2","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3150223","volume":"51","author":"D Nashat","year":"2018","unstructured":"Nashat D, Amer AA. A comprehensive taxonomy of fragmentation and allocation techniques in distributed database design. ACM Comput Surv (CSUR). 2018;51(1):1\u201325.","journal-title":"ACM Comput Surv (CSUR)"},{"key":"849_CR3","doi-asserted-by":"crossref","unstructured":"Castillo-Garc\u00eda A, Rodr\u00edguez-Mazahua L, Castro-Medina F, Olivares-Zepahua BA, Abud-Figueroa MA. A review of horizontal fragmentation methods considering multimedia data and dynamic access patterns. In International conference on software process improvement. Cham: Springer; 2021, October\u200f. p. 69\u201382.","DOI":"10.1007\/978-3-030-89909-7_6"},{"issue":"1","key":"849_CR4","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s40537-019-0178-3","volume":"6","author":"S Mazumdar","year":"2019","unstructured":"Mazumdar S, Seybold D, Kritikos K, Verginadis Y. A survey on data storage and placement methodologies for Cloud-Big Data ecosystem. J Big Data. 2019;6(1):1\u201337.","journal-title":"J Big Data"},{"issue":"1","key":"849_CR5","doi-asserted-by":"publisher","first-page":"27","DOI":"10.1186\/s40537-017-0087-2","volume":"4","author":"C Sreedhar","year":"2017","unstructured":"Sreedhar C, Kasiviswanath N, Chenna Reddy P. Clustering large datasets using K-means modified inter and intra clustering (KM-I2C) in Hadoop. J Big Data. 2017;4(1):27.","journal-title":"J Big Data"},{"issue":"1","key":"849_CR6","first-page":"8","volume":"7","author":"AAC Fauzi","year":"2021","unstructured":"Fauzi AAC, Rahman WFWA, Fauzi A, Weigelt F. Managing fragmented database in distributed database environment. J Math Comput Sci. 2021;7(1):8\u201314.","journal-title":"J Math Comput Sci"},{"issue":"1","key":"849_CR7","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.heliyon.2020.e03172","volume":"6","author":"AA Amer","year":"2020","unstructured":"Amer AA, Mohamed MH, Al-Asri K. ASGOP: an aggregated similarity-based greedy-oriented approach for relational DDBSs design. Heliyon. 2020;6(1):1.","journal-title":"Heliyon"},{"issue":"12","key":"849_CR8","doi-asserted-by":"publisher","first-page":"e00487","DOI":"10.1016\/j.heliyon.2017.e00487","volume":"3","author":"AA Amer","year":"2017","unstructured":"Amer AA, Sewisy AA, Elgendy TM. An optimized approach for simultaneous horizontal data fragmentation and allocation in Distributed Database Systems (DDBSs). Heliyon. 2017;3(12):e00487.","journal-title":"Heliyon"},{"key":"849_CR9","doi-asserted-by":"crossref","unstructured":"Aggarwal M, Bajaj SB, Jaglan V. Performance analysis of degree of redundancy for replication in distributed database system. In: 2022 1st international conference on informatics (ICI), New York: IEEE; 2022, April. p. 176\u201380.\u200f","DOI":"10.1109\/ICI53355.2022.9786886"},{"key":"849_CR10","doi-asserted-by":"publisher","first-page":"864","DOI":"10.1016\/j.ins.2022.09.003","volume":"612","author":"YF Ge","year":"2022","unstructured":"Ge YF, Zhan ZH, Cao J, Wang H, Zhang Y, Lai KK, Zhang J. DSGA: a distributed segment-based genetic algorithm for multi-objective outsourced database partitioning. Inf Sci. 2022;612:864\u201386.","journal-title":"Inf Sci"},{"key":"849_CR11","doi-asserted-by":"crossref","unstructured":"Singh A, Khehra BS, Mavi BS. Simplified-BBO for non-redundant allocation of data in distributed database design. In: 2021 IEEE international midwest symposium on circuits and systems (MWSCAS). New York: IEEE; 2021, August. p. 544\u20138.\u200f","DOI":"10.1109\/MWSCAS47672.2021.9531836"},{"issue":"2","key":"849_CR12","first-page":"184","volume":"86","author":"N Lotfi","year":"2021","unstructured":"Lotfi N, Tamouk J. A hybrid method based on SA and VNS algorithms for solving DAP in DDS. Comput Sci J Moldova. 2021;86(2):184\u2013205.","journal-title":"Comput Sci J Moldova"},{"issue":"9","key":"849_CR13","doi-asserted-by":"publisher","first-page":"2461","DOI":"10.37624\/IJERT\/13.9.2020.2461-2473","volume":"13","author":"A Singh","year":"2020","unstructured":"Singh A. SBBO based replicated data allocation approach for distributed database design. Int J Eng Res Technol. 2020;13(9):2461\u201373.","journal-title":"Int J Eng Res Technol"},{"key":"849_CR14","first-page":"1","volume":"2023","author":"M Chen","year":"2023","unstructured":"Chen M, An W, Liu Y, Dong C, Xu X, Han B, Zhang P. Modeling and performance analysis of single-server database over quasi-static rayleigh fading channel. IEEE Trans Veh Technol. 2023;2023:1.","journal-title":"IEEE Trans Veh Technol"},{"key":"849_CR15","first-page":"290","volume":"2022","author":"ZJ Ahmed","year":"2022","unstructured":"Ahmed ZJ, Alluhaibi ST. Hybrid data fragmentation using genetic killer whale optimization-based clustering model. J Pharmaceut Neg Results. 2022;2022:290\u20138.","journal-title":"J Pharmaceut Neg Results"},{"key":"849_CR16","doi-asserted-by":"crossref","unstructured":"Che Fauzi AA, Noraziah A, Mohd WMBW, Amer A, Herawan T. Managing fragmented database replication for Mygrants using binary vote assignment on cloud quorum. In: Applied mechanics and materials, vol. 490. Trans Tech Publications Ltd; 2014. p. 1342\u20136.","DOI":"10.4028\/www.scientific.net\/AMM.490-491.1342"},{"key":"849_CR17","doi-asserted-by":"crossref","unstructured":"Castro-Medina F, Rodr\u00edguez-Mazahua L, Abud-Figueroa MA, Romero-Torres C, Reyes-Hern\u00e1ndez L\u00c1, Alor-Hern\u00e1ndez G. Application of data fragmentation and replication methods in the cloud: a review. In: 2019 international conference on electronics, communications and computers (CONIELECOMP). New York: IEEE; 2019, February. p. 47\u201354\u200f.","DOI":"10.1109\/CONIELECOMP.2019.8673249"},{"issue":"06","key":"849_CR18","doi-asserted-by":"publisher","first-page":"1126","DOI":"10.1109\/TLA.2020.9099751","volume":"18","author":"F Castro-Medina","year":"2020","unstructured":"Castro-Medina F, Rodriguez-Mazahua L, L\u00f3pez-Chau A, Abud-Figueroa MA, Alor-Hern\u00e1ndez G. FRAGMENT: a web application for database fragmentation, allocation and replication over a cloud environment. IEEE Lat Am Trans. 2020;18(06):1126\u201334.","journal-title":"IEEE Lat Am Trans"},{"key":"849_CR19","doi-asserted-by":"crossref","unstructured":"Castro-Medina F, Rodr\u00edguez-Mazahua L, L\u00f3pez-Chau A, Alor-Hern\u00e1ndez G, Ju\u00e1rez-Mart\u00ednez U, S\u00e1nchez-Ram\u00edrez C. An improvement to FRAGMENT: a web application for database fragmentation, allocation, and replication over a cloud environment. In: Proceedings of 6th international congress on information and communication technology. Singapore: Springer; 2022. p. 685\u201396.","DOI":"10.1007\/978-981-16-1781-2_60"},{"key":"849_CR20","doi-asserted-by":"crossref","unstructured":"Tatarnikova TM, Arkhiptsev ED. Determine the number of distributed Big Data storage replicas. In: 2023 XXVI international conference on soft computing and measurements (SCM). New York: IEEE; 2023, May. p. 223\u20136.","DOI":"10.1109\/SCM58628.2023.10159087"},{"key":"849_CR21","first-page":"3","volume-title":"Handbook on decision making: volume 3: trends and challenges in intelligent decision support systems","author":"AO Ortiz-Ballona","year":"2022","unstructured":"Ortiz-Ballona AO, Rodr\u00edguez-Mazahua L, L\u00f3pez-Chau A, Castro-Medina F, Abud-Figueroa MA, Rodr\u00edguez-Mazahua N. A vertical fragmentation method for multimedia databases considering content-based queries. In: Handbook on decision making: volume 3: trends and challenges in intelligent decision support systems. Cham: Springer; 2022. p. 3\u201323."},{"key":"849_CR22","first-page":"1","volume":"2023","author":"A Ali","year":"2023","unstructured":"Ali A, Naeem S, Anam S, Ahmed MM. A state of art survey for Big Data processing and NoSQL database architecture. Int J Comput Digit Syst. 2023;2023:1.","journal-title":"Int J Comput Digit Syst"},{"key":"849_CR23","doi-asserted-by":"crossref","unstructured":"Yang Y, Sun J. Distributed database design and performance Tuing. In: 2023 IEEE 13th international conference on electronics information and emergency communication (ICEIEC). New York: IEEE; 2023, July. p. 1\u20133.","DOI":"10.1109\/ICEIEC58029.2023.10199204"},{"key":"849_CR24","doi-asserted-by":"publisher","unstructured":"Harikumar S, Ramachandran R. Hybridized fragmentation of very large databases using clustering. In: IEEE signal processing, informatics, communication and energy systems (SPICES); 2015. p. 1\u20135. https:\/\/doi.org\/10.1109\/SPICES.2015.7091488.","DOI":"10.1109\/SPICES.2015.7091488"},{"key":"849_CR25","first-page":"1","volume":"2022","author":"M Aggarwal","year":"2022","unstructured":"Aggarwal M, Bajaj SB, Jaglan V. An improved Vogel\u2019s approximation method (IVAM) for fragment allocation and replication in distributed database systems. Indian J Comput Sci Eng (IJCSE). 2022;2022:1.","journal-title":"Indian J Comput Sci Eng (IJCSE)"},{"key":"849_CR26","first-page":"1","volume":"2023","author":"M Mahi","year":"2023","unstructured":"Mahi M, Baykan OK, Kodaz H. A new approach based on greedy minimizing algorithm for solving data allocation problem. Soft Comput. 2023;2023:1\u201320.","journal-title":"Soft Comput"},{"key":"849_CR27","doi-asserted-by":"crossref","unstructured":"Abdalla HI. A brief comparison of K-means and agglomerative hierarchical clustering algorithms on small datasets. In: International conference on wireless communications, networking and applications. Singapore: Springer; 2022. p. 623\u201332.","DOI":"10.1007\/978-981-19-2456-9_64"},{"key":"849_CR28","first-page":"1","volume":"2022","author":"P Di Sanzo","year":"2022","unstructured":"Di Sanzo P, Quaglia F. On the effects of transaction data access patterns on performance in lock-based concurrency control. IEEE Trans Comput. 2022;2022:1.","journal-title":"IEEE Trans Comput"},{"key":"849_CR29","first-page":"1","volume":"2023","author":"YF Ge","year":"2023","unstructured":"Ge YF, Wang H, Bertino E, Zhan ZH, Cao J, Zhang Y, Zhang J. Evolutionary dynamic database partitioning optimization for privacy and utility. IEEE Trans Dependable Secure Comput. 2023;2023:1.","journal-title":"IEEE Trans Dependable Secure Comput"},{"key":"849_CR30","first-page":"11","volume":"2005","author":"G Mathiason","year":"2005","unstructured":"Mathiason G, Andler SF, Jagszent D. Virtual full replication by static segmentation for multiple properties of data objects. Proc RTIS. 2005;2005:11\u20138.","journal-title":"Proc RTIS"},{"issue":"1","key":"849_CR31","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s40537-017-0087-2","volume":"4","author":"C Sreedhar","year":"2017","unstructured":"Sreedhar C, Kasiviswanath N, Chenna Reddy P. Clustering large datasets using K-means modified inter and intra clustering (KM-I2C) in Hadoop. J Big Data. 2017;4(1):1\u201319.","journal-title":"J Big Data"},{"issue":"1","key":"849_CR32","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s40537-022-00563-w","volume":"9","author":"SIM Mosharraf","year":"2022","unstructured":"Mosharraf SIM, Adnan MA. Improving lookup and query execution performance in distributed Big Data systems using Cuckoo Filter. J Big Data. 2022;9(1):1\u201330.","journal-title":"J Big Data"},{"issue":"1","key":"849_CR33","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1186\/s40537-020-00306-9","volume":"7","author":"AA Amer","year":"2020","unstructured":"Amer AA. On K-means clustering-based approach for DDBSs design. J Big Data. 2020;7(1):31.","journal-title":"J Big Data"}],"container-title":["Journal of Big Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-023-00849-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s40537-023-00849-7\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-023-00849-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,18]],"date-time":"2023-11-18T17:40:56Z","timestamp":1700329256000},"score":1,"resource":{"primary":{"URL":"https:\/\/journalofbigdata.springeropen.com\/articles\/10.1186\/s40537-023-00849-7"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,11,18]]},"references-count":33,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,12]]}},"alternative-id":["849"],"URL":"https:\/\/doi.org\/10.1186\/s40537-023-00849-7","relation":{},"ISSN":["2196-1115"],"issn-type":[{"value":"2196-1115","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,11,18]]},"assertion":[{"value":"18 June 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"28 October 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 November 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"172"}}