{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,6]],"date-time":"2026-01-06T15:35:03Z","timestamp":1767713703616,"version":"build-2065373602"},"reference-count":54,"publisher":"MDPI AG","issue":"8","license":[{"start":{"date-parts":[[2021,8,4]],"date-time":"2021-08-04T00:00:00Z","timestamp":1628035200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Future Internet"],"abstract":"<jats:p>A deep understanding about a field of research is valuable for academic researchers. In addition to technical knowledge, this includes knowledge about subareas, open research questions, and social communities (networks) of individuals and organizations within a given field. With bibliometric analyses, researchers can acquire quantitatively valuable knowledge about a research area by using bibliographic information on academic publications provided by bibliographic data providers. Bibliometric analyses include the calculation of bibliometric networks to describe affiliations or similarities of bibliometric entities (e.g., authors) and group them into clusters representing subareas or communities. Calculating and visualizing bibliometric networks is a nontrivial and time-consuming data science task that requires highly skilled individuals. In addition to domain knowledge, researchers must often provide statistical knowledge and programming skills or use software tools having limited functionality and usability. In this paper, we present the ambalytics bibliometric platform, which reduces the complexity of bibliometric network analysis and the visualization of results. It accompanies users through the process of bibliometric analysis and eliminates the need for individuals to have programming skills and statistical knowledge, while preserving advanced functionality, such as algorithm parameterization, for experts. As a proof-of-concept, and as an example of bibliometric analyses outcomes, the calculation of research fronts networks based on a hybrid similarity approach is shown. Being designed to scale, ambalytics makes use of distributed systems concepts and technologies. It is based on the microservice architecture concept and uses the Kubernetes framework for orchestration. This paper presents the initial building block of a comprehensive bibliometric analysis platform called ambalytics, which aims at a high usability for users as well as scalability.<\/jats:p>","DOI":"10.3390\/fi13080203","type":"journal-article","created":{"date-parts":[[2021,8,4]],"date-time":"2021-08-04T03:57:41Z","timestamp":1628049461000},"page":"203","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":9,"title":["Ambalytics: A Scalable and Distributed System Architecture Concept for Bibliometric Network Analyses"],"prefix":"10.3390","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8998-0890","authenticated-orcid":false,"given":"Klaus","family":"Kammerer","sequence":"first","affiliation":[{"name":"Institute of Clinical Epidemiology and Biometry, University of W\u00fcrzburg, 97080 W\u00fcrzburg, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3805-5456","authenticated-orcid":false,"given":"Manuel","family":"G\u00f6ster","sequence":"additional","affiliation":[{"name":"Institute of Clinical Epidemiology and Biometry, University of W\u00fcrzburg, 97080 W\u00fcrzburg, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2536-4153","authenticated-orcid":false,"given":"Manfred","family":"Reichert","sequence":"additional","affiliation":[{"name":"Institute of Databases and Information Systems, Ulm University, 89081 Ulm, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1522-785X","authenticated-orcid":false,"given":"R\u00fcdiger","family":"Pryss","sequence":"additional","affiliation":[{"name":"Institute of Clinical Epidemiology and Biometry, University of W\u00fcrzburg, 97080 W\u00fcrzburg, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2021,8,4]]},"reference":[{"key":"ref_1","unstructured":"Havemann, F. (2009). Einf\u00fchrung in die Bibliometrie, Gesellschaft f\u00fcr Wissenschaftsforschung."},{"key":"ref_2","unstructured":"Ozdemir, S. (2016). Principles of Data Science, Packt Publishing."},{"key":"ref_3","unstructured":"G\u00f6ster, M. (2020). Citarics\u2014A Microservice Platform for Bibliometric Network Analysis and Visualization. [Master\u2019s Thesis, Ulm University]."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"959","DOI":"10.1016\/j.joi.2017.08.007","article-title":"bibliometrix: An R-tool for Comprehensive Science Mapping Analysis","volume":"11","author":"Aria","year":"2017","journal-title":"J. Informetr."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"802","DOI":"10.1016\/j.joi.2014.07.006","article-title":"CitNetExplorer: A New Software Tool for Analyzing and Visualizing Citation Networks","volume":"8","author":"Waltman","year":"2014","journal-title":"J. Informetr."},{"key":"ref_6","unstructured":"Van Eck, N.J., and Waltman, L. (2021, June 11). VOSviewer Manual. Available online: https:\/\/www.vosviewer.com\/documentation\/Manual_VOSviewer_1.6.8.pdf."},{"key":"ref_7","first-page":"9","article-title":"How to Use Bibexcel for Various Types of Bibliometric Analysis","volume":"Volume 5","author":"Persson","year":"2009","journal-title":"Celebrating Scholarly Communication Studies: A Festschrift for Olle Persson at his 60th Birthday"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Knutas, A., Hajikhani, A., Salminen, J., Ikonen, J., and Porras, J. (2015, January 25\u201326). Cloud-Based Bibliometric Analysis Service for Systematic Mapping Studies. Proceedings of the 16th International Conference on Computer Systems and Technologies, Dublin, Ireland.","DOI":"10.1145\/2812428.2812442"},{"key":"ref_9","unstructured":"Zammit, A., Penza, K., Haddod, F., Abela, C., and Azzopardi, J. (2017, January 28). ACE: Big Data Approach to Scientific Collaboration Patterns Analysis. Proceedings of the Scientometrics and Enabling Decentralised Scholarly Communication, Portoro\u017e, Slovenia."},{"key":"ref_10","unstructured":"Cyberinfrastructure for Network Science Center, Indiana University at Bloomington (2021, May 06). Sci2 Tool. Available online: https:\/\/sci2.cns.iu.edu\/."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Sinha, A., Shen, Z., Song, Y., Ma, H., Eide, D., Hsu, B.J., and Wang, K. (2015, January 18\u201322). An Overview of Microsoft Academic Service (MAS) and Applications. Proceedings of the 24th International Conference on World Wide Web, Florence, Italy.","DOI":"10.1145\/2740908.2742839"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"438","DOI":"10.1016\/j.datak.2008.05.001","article-title":"Change Patterns and Change Support Features\u2014Enhancing Flexibility in Process-aware Information Systems","volume":"66","author":"Weber","year":"2008","journal-title":"Data Knowl. Eng."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Hoppenstedt, B., Pryss, R., Stelzer, B., Meyer-Br\u00f6tz, F., Kammerer, K., Tre\u00df, A., and Reichert, M. (2018). Techniques and Emerging Trends for State of the Art Equipment Maintenance Systems\u2014A Bibliometric Analysis. Appl. Sci., 8.","DOI":"10.3390\/app8060916"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"1809","DOI":"10.1007\/s11192-015-1645-z","article-title":"The Bibliometric Analysis of Scholarly Production: How Great is the Impact?","volume":"105","author":"Ellegaard","year":"2015","journal-title":"Scientometrics"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Manning, C.D., Raghavan, P., and Sch\u00fctze, H. (2008). An Introduction to Information Retrieval, Cambridge University Press.","DOI":"10.1017\/CBO9780511809071"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1016\/j.techfore.2015.06.008","article-title":"Combining the Scenario Technique With Bibliometrics for Technology Foresight: The Case of Personalized Medicine","volume":"98","author":"Stelzer","year":"2015","journal-title":"Technol. Forecast. Soc. Chang."},{"key":"ref_17","unstructured":"Meyer-Br\u00f6tz, F. (2019). A Bibliometric Technique for Quantitative Technology Foresight. [Ph.D. Thesis, Universit\u00e4t Ulm]."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"510","DOI":"10.1126\/science.149.3683.510","article-title":"Networks of Scientific Papers","volume":"149","author":"Price","year":"1965","journal-title":"Science"},{"key":"ref_19","unstructured":"Tokunaga, T., and Makoto, I. (1994). Text Categorization Based on Weighted Inverse Document Frequency, Special Interest Groups and Information Process Society of Japan."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"1307","DOI":"10.1007\/s11192-017-2366-2","article-title":"Experimental Evaluation of Parameter Settings in Calculation of Hybrid Similarities: Effects of First- and Second-order Similarity, Edge Cutting, and Weighting Factors","volume":"111","author":"Schiebel","year":"2017","journal-title":"Scientometrics"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"513","DOI":"10.1016\/0306-4573(88)90021-0","article-title":"Term-Weighting Approaches in Automatic Text Retrieval","volume":"24","author":"Salton","year":"1988","journal-title":"Inf. Process. Manag."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"297","DOI":"10.1007\/s11192-011-0347-4","article-title":"Using \u2018Core Documents\u2019 for the Representation of Clusters and Topics","volume":"88","author":"Thijs","year":"2011","journal-title":"Scientometrics"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"24","DOI":"10.1109\/2945.841119","article-title":"Graph Visualization and Navigation in Information Visualization: A Survey","volume":"6","author":"Herman","year":"2000","journal-title":"IEEE Trans. Vis. Comput. Graph."},{"key":"ref_24","unstructured":"Cabena, P., Hadjinian, P., Stadler, R., Verhees, J., and Zanasi, A. (1998). Discovering Data Mining: From Concept to Implementation, Prentice-Hall, Inc."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Everitt, B., Landau, S., Leese, M., and Stahl, D. (2011). Cluster Analysis, Wiley.","DOI":"10.1002\/9780470977811"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Guidotti, R., and Coscia, M. (2018). On the Equivalence Between Community Discovery and Clustering. Smart Objects and Technologies for Social Good, Springer.","DOI":"10.1007\/978-3-319-76111-4_34"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1016\/j.physrep.2009.11.002","article-title":"Community Detection in Graphs","volume":"486","author":"Fortunato","year":"2010","journal-title":"Phys. Rep."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"P10008","DOI":"10.1088\/1742-5468\/2008\/10\/P10008","article-title":"Fast Unfolding of Communities in Large Networks","volume":"2008","author":"Blondel","year":"2008","journal-title":"J. Stat. Mech. Theory Exp."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"1382","DOI":"10.1002\/asi.21525","article-title":"Science Mapping Software tools: Review, Analysis, and Cooperative Study Among Tools","volume":"62","author":"Cobo","year":"2011","journal-title":"J. Am. Soc. Inf. Sci. Technol."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"943","DOI":"10.1097\/01.CCM.0000206112.32673.D4","article-title":"Implementation of an evidence-based \u201cstandard operating procedure\u201d and outcome in septic shock","volume":"34","author":"Kortgen","year":"2006","journal-title":"Crit. Care Med."},{"key":"ref_31","unstructured":"Stellman, A., and Greene, J. (2006). Applied Software Project Management, O\u2019Reilly."},{"key":"ref_32","unstructured":"Richards, M., and Ford, N. (2020). Fundamentals of Software Architecture: An Engineering Approach, O\u2019Reilly."},{"key":"ref_33","unstructured":"Brown, S. (2021, May 06). Software Architecture for Developers. Available online: http:\/\/static.codingthearchitecture.com\/sddconf2014-software-architecture-for-developers-extract.pdf."},{"key":"ref_34","unstructured":"Linux Foundation (2021, May 06). Kubernetes. Available online: https:\/\/kubernetes.io\/."},{"key":"ref_35","unstructured":"Linux Foundation (2021, May 06). Cloud Native Computing Foundation. Available online: https:\/\/www.cncf.io\/."},{"key":"ref_36","unstructured":"Linux Foundation (2021, May 06). Kubernetes Documentation. Available online: https:\/\/kubernetes.io\/docs\/."},{"key":"ref_37","unstructured":"Ushio, T. (2021, May 06). Kubernetes in Three Diagrams. Available online: https:\/\/medium.com\/@tsuyoshiushio\/kubernetes-in-three-diagrams-6aba8432541c."},{"key":"ref_38","unstructured":"Matsuda, K., and Lea, R. (2013). WebGL Programming Guide: Interactive 3D Graphics Programming with WebGL, Addison-Wesley."},{"key":"ref_39","unstructured":"Moral Mu\u00f1oz, J.A., Herrera Viedma, E., Santisteban Espejo, A., and Cobo, M.J. (2021, July 07). Software Tools for Conducting Bibliometric Analysis in Science: An up-to-Date Review. Available online: http:\/\/hdl.handle.net\/10498\/22857."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"371","DOI":"10.1007\/s11192-017-2247-8","article-title":"Citation Analysis with Microsoft Academic","volume":"111","author":"Hug","year":"2017","journal-title":"Scientometrics"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1162\/qss_a_00112","article-title":"Large-scale Comparison of Bibliographic Data Sources: Scopus, Web of Science, Dimensions, Crossref, and Microsoft Academic","volume":"2","author":"Visser","year":"2021","journal-title":"Quant. Sci. Stud."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Vicknair, C., Macias, M., Zhao, Z., Nan, X., Chen, Y., and Wilkins, D. (2010, January 15\u201317). A Comparison of a Graph Database and a Relational Database: A Data Provenance Perspective. Proceedings of the 48th Annual Southeast Regional Conference, Oxford, MS, USA.","DOI":"10.1145\/1900008.1900067"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Meyer, U., and Sanders, P. (2003). Algorithms for Memory Hierarchies: Advanced Lectures, Springer Science & Business Media.","DOI":"10.1007\/3-540-36574-5"},{"key":"ref_44","unstructured":"Apache Software Foundation (2021, May 06). Apache Spark. Available online: https:\/\/spark.apache.org\/."},{"key":"ref_45","unstructured":"White, C. (2021, May 06). Why Not Airflow?. Available online: https:\/\/medium.com\/the-prefect-blog\/why-not-airflow-4cfa423299c4."},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Aldinucci, M., Danelutto, M., Kilpatrick, P., Meneghin, M., and Torquati, M. (2012). An Efficient Unbounded Lock-free Queue for Multi-core Systems. European Conference on Parallel Processing, Springer.","DOI":"10.1007\/978-3-642-32820-6_65"},{"key":"ref_47","first-page":"382","article-title":"Science Mapping and Visualization Tools used for Bibliometric and Scientometric Studies: A Comparative Study","volume":"6","author":"Bankar","year":"2019","journal-title":"J. Adv. Libr. Sci."},{"key":"ref_48","unstructured":"Synnestvedt, M.B., Chen, C., and Holmes, J.H. (2005, January 22\u201326). CiteSpace II: Visualization and Knowledge Discovery in Bibliographic Databases. Proceedings of the AMIA Annual Symposium Proceedings. American Medical Informatics Association, Washington, DC, USA."},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"1609","DOI":"10.1002\/asi.22688","article-title":"SciMAT: A New Science Mapping Analysis Software Tool","volume":"63","author":"Cobo","year":"2012","journal-title":"J. Am. Soc. Inf. Sci. Technol."},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"943","DOI":"10.1007\/s11192-011-0482-y","article-title":"Mapping Scientific Institutions","volume":"89","author":"Grauwin","year":"2011","journal-title":"Scientometrics"},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"176","DOI":"10.1016\/j.joi.2016.12.005","article-title":"Introducing metaknowledge: Software for Computational Research in Information Science, Network Analysis, and Science of Science","volume":"11","author":"McLevey","year":"2017","journal-title":"J. Informetr."},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Roberts, R.J. (2021, May 06). PubMed Central: The GenBank of the Published Literature. Available online: https:\/\/www.pnas.org\/content\/98\/2\/381.full.","DOI":"10.1073\/pnas.98.2.381"},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Ammar, W., Groeneveld, D., Bhagavatula, C., Beltagy, I., Crawford, M., Downey, D., Dunkelberger, J., Elgohary, A., Feldman, S., and Ha, V. (2018). Construction of the Literature Graph in Semantic Scholar. arXiv.","DOI":"10.18653\/v1\/N18-3011"},{"key":"ref_54","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1016\/S0378-8733(03)00009-1","article-title":"Friends and Neighbors on the Web","volume":"25","author":"Adamic","year":"2003","journal-title":"Soc. Netw."}],"container-title":["Future Internet"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-5903\/13\/8\/203\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T06:40:07Z","timestamp":1760164807000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-5903\/13\/8\/203"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,8,4]]},"references-count":54,"journal-issue":{"issue":"8","published-online":{"date-parts":[[2021,8]]}},"alternative-id":["fi13080203"],"URL":"https:\/\/doi.org\/10.3390\/fi13080203","relation":{},"ISSN":["1999-5903"],"issn-type":[{"type":"electronic","value":"1999-5903"}],"subject":[],"published":{"date-parts":[[2021,8,4]]}}}