{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,12]],"date-time":"2025-10-12T03:31:38Z","timestamp":1760239898125,"version":"build-2065373602"},"reference-count":40,"publisher":"MDPI AG","issue":"2","license":[{"start":{"date-parts":[[2019,2,3]],"date-time":"2019-02-03T00:00:00Z","timestamp":1549152000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61876190"],"award-info":[{"award-number":["61876190"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004735","name":"Natural Science Foundation of Hunan Province","doi-asserted-by":"publisher","award":["2018JJ2535"],"award-info":[{"award-number":["2018JJ2535"]}],"id":[{"id":"10.13039\/501100004735","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Symmetry"],"abstract":"<jats:p>In recent years, open-source software (OSS) development has grown, with many developers around the world working on different OSS projects. A variety of open-source software ecosystems have emerged, for instance, GitHub, StackOverflow, and SourceForge. One of the most typical social-programming and code-hosting sites, GitHub, has amassed numerous open-source-software projects and developers in the same virtual collaboration platform. Since GitHub itself is a large open-source community, it hosts a collection of software projects that are developed together and coevolve. The great challenge here is how to identify the relationship between these projects, i.e., project relevance. Software-ecosystem identification is the basis of other studies in the ecosystem. Therefore, how to extract useful information in GitHub and identify software ecosystems is particularly important, and it is also a research area in symmetry. In this paper, a Topic-based Project Knowledge Metrics Framework (TPKMF) is proposed. By collecting the multisource dataset of an open-source ecosystem, project-relevance analysis of the open-source software is carried out on the basis of software-ecosystem identification. Then, we used our Spectral Clustering algorithm based on Core Project (CP-SC) to identify software-ecosystem projects and further identify software ecosystems. We verified that most software ecosystems usually contain a core software project, and most other projects are associated with it. Furthermore, we analyzed the characteristics of the ecosystem, and we also found that interactive information has greater impact on project relevance. Finally, we summarize the Topic-based Project Knowledge Metrics Framework.<\/jats:p>","DOI":"10.3390\/sym11020182","type":"journal-article","created":{"date-parts":[[2019,2,5]],"date-time":"2019-02-05T11:31:07Z","timestamp":1549366267000},"page":"182","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["Identification-Method Research for Open-Source Software Ecosystems"],"prefix":"10.3390","volume":"11","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5525-904X","authenticated-orcid":false,"given":"Zhifang","family":"Liao","sequence":"first","affiliation":[{"name":"School of Software, Central South University, Changsha 410075, China"}]},{"given":"Ningwei","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Software, Central South University, Changsha 410075, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1282-5176","authenticated-orcid":false,"given":"Shengzong","family":"Liu","sequence":"additional","affiliation":[{"name":"Department of Information Management, Hunan University of Finance and Economics, Changsha 410075, China"}]},{"given":"Yan","family":"Zhang","sequence":"additional","affiliation":[{"name":"Department of Computing, School of Computing, Engineering and Built Environment, Glasgow Caledonian University, Glasgow G4 0BA, UK"}]},{"given":"Hui","family":"Liu","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Missouri State University, Springfield, MO 65897, USA"}]},{"given":"Qi","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Software, Central South University, Changsha 410075, China"}]}],"member":"1968","published-online":{"date-parts":[[2019,2,3]]},"reference":[{"key":"ref_1","first-page":"42","article-title":"Open source software and its eco-systems: Today and tommorow","volume":"34","author":"Jin","year":"2016","journal-title":"Sci. Technol. Rev."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Lungu, M.F. (2009). Reverse Engineering Software Ecosystems. [Ph.D. Thesis, Universit\u00e0 della Svizzera italiana].","DOI":"10.1109\/ICSM.2008.4658096"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"7173","DOI":"10.1109\/ACCESS.2017.2682323","article-title":"A Systematic Mapping Study of Software Development with GitHub","volume":"5","author":"Cosentino","year":"2017","journal-title":"IEEE Access"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Lungu, M., Robbes, R., and Lanza, M. (2010, January 20\u201324). Recovering inter-project dependencies in software ecosystems. Proceedings of the IEEE\/ACM International Conference on Automated Software Engineering, Antwerp, Belgium.","DOI":"10.1145\/1858996.1859058"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Ossher, J., Bajracharya, S., and Lopes, C. (2010, January 2\u20133). Automated dependency resolution for open source software. Proceedings of the 2010 7th IEEE Working Conference on Mining Software Repositories (MSR 2010), Cape Town, South Africa.","DOI":"10.1109\/MSR.2010.5463346"},{"key":"ref_6","unstructured":"Brand, M.V.D., Serebrenik, A., and Businge, J. (2012, January 23\u201328). Survival of Eclipse third-party plug-ins. Proceedings of the 2012 28th IEEE International Conference on Software Maintenance (ICSM), Trento, Italy."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Mockus, A. (2009, January 16\u201317). Amassing and indexing a large sample of version control systems: Towards the census of public source code history. Proceedings of the 2009 6th IEEE International Working Conference on Mining Software Repositories (2009), Vancouver, BC, Canada.","DOI":"10.1109\/MSR.2009.5069476"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"262","DOI":"10.1007\/s10664-008-9100-x","article-title":"Macrolevel software evolution: A case study of a large software compilation","volume":"14","author":"Robles","year":"2009","journal-title":"Empir. Softw. Eng."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"1275","DOI":"10.1007\/s10664-014-9325-9","article-title":"How the Apache community upgrades dependencies: An evolutionary study","volume":"20","author":"Bavota","year":"2015","journal-title":"Empir. Softw. Eng."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"German, D.M., Gonzlezbarahona, J.M., and Robles, G. (2007, January 28\u201331). A Model to Understand the Building and Running Inter-Dependencies of Software. Proceedings of the 14th Working Conference on Reverse Engineering (WCRE 2007), Vancouver, BC, Canada.","DOI":"10.1109\/WCRE.2007.5"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Blincoe, K., Harrison, F., and Damian, D. (2015, January 16\u201317). Ecosystems in GitHub and a Method for Ecosystem Identification Using Reference Coupling. Proceedings of the 2015 IEEE\/ACM 12th Working Conference on Mining Software Repositories, Florence, Italy.","DOI":"10.1109\/MSR.2015.26"},{"key":"ref_12","first-page":"993","article-title":"Latent dirichlet allocation","volume":"3","author":"Blei","year":"2003","journal-title":"J. Mach. Learn. Res. Arch."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Rachman, Z.A., and Maharani, W. (2013, January 20\u201322). The analysis and implementation of degree centrality in weighted graph in Social Network Analysis. Proceedings of the 2013 International Conference of Information and Communication Technology (ICoICT), Bandung, Indonesia.","DOI":"10.1109\/ICoICT.2013.6574552"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"199","DOI":"10.1038\/nbt.1522","article-title":"Dynamic modularity in protein interaction networks predicts breast cancer outcome","volume":"27","author":"Taylor","year":"2009","journal-title":"Nat. BioTechnol."},{"key":"ref_15","unstructured":"Mcdaid, A.F., Greene, D., and Hurley, N. (arXiv, 2011). Normalized Mutual Information to evaluate overlapping community finding algorithms, arXiv."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"733","DOI":"10.1002\/cyto.a.20896","article-title":"Quantifying colocalization by correlation: The Pearson correlation coefficient is superior to the Mander\u2019s overlap coefficient","volume":"77","author":"Adler","year":"2010","journal-title":"Cytometry Part A"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"8577","DOI":"10.1073\/pnas.0601602103","article-title":"Modularity and community structure in networks","volume":"103","author":"Newman","year":"2006","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"ref_18","unstructured":"Kuncheva, L.I., and Hadjitodorov, S.T. (2004, January 10\u201313). Using diversity in cluster ensembles. Proceedings of the 2004 IEEE International Conference on Systems, Man and Cybernetics, Hague, The Netherlands."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"S186","DOI":"10.1098\/rsbl.2003.0057","article-title":"The emergent properties of a dolphin social network","volume":"270","author":"Lusseau","year":"2003","journal-title":"Proc. Biol. Sci."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"7821","DOI":"10.1073\/pnas.122653799","article-title":"Community structure in social and biological networks","volume":"99","author":"Girvan","year":"2002","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Jansen, S., Finkelstein, A., and Brinkkemper, S. (2009, January 16\u201324). A sense of community: A research agenda for software ecosystems. Proceedings of the 2009 31st International Conference on Software Engineering, Vancouver, BC, Canada.","DOI":"10.1109\/ICSE-COMPANION.2009.5070978"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Bosch, J., and Bosch-Sijtsema, P.M. (2010). Softwares Product Lines, Global Development and Ecosystems: Collaboration in Software Engineering, Springer.","DOI":"10.1007\/978-3-642-10294-3_4"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"German, D.M., Adams, B., and Hassan, A.E. (2013, January 5\u20138). The Evolution of the R Software Ecosystem. Proceedings of the 2013 17th European Conference on Software Maintenance and Reengineering, Genova, Italy.","DOI":"10.1109\/CSMR.2013.33"},{"key":"ref_24","unstructured":"Matragkas, N., Williams, J.R., Kolovos, D.S., and Paige, R.F. (June, January 31). Analysing the \u2019biodiversity\u2019 of open source ecosystems: The GitHub case. Proceedings of the 11th Working Conference on Mining Software Repositories, Hyderabad, India."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Thomas, D.R., Beresford, A.R., and Rice, A. (2015, January 12\u201315). Security Metrics for the Android Ecosystem. Proceedings of the 5th Annual ACM CCS Workshop on Security and Privacy in Smartphones and Mobile Devices, Denver, CO, USA.","DOI":"10.1145\/2808117.2808118"},{"key":"ref_26","unstructured":"Manikas, K., and Hansen, K.M. (2013, January 11\u201314). Reviewing the Health of Software Ecosystems\u2014A Conceptual Framework Proposal. Proceedings of the 5th International Workshop on Software Ecosystems, Potsdam, Germany."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Viljainen, M., and Kauppinen, M. (2011). Software Ecosystems: A Set of Management Practices for Platform Integrators in the Telecom Industry, Springer.","DOI":"10.1007\/978-3-642-21544-5_4"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"1479","DOI":"10.1016\/j.jss.2012.03.033","article-title":"Understanding the role of licenses and evolution in open architecture software ecosystems","volume":"85","author":"Scacchi","year":"2012","journal-title":"J. Syst. Softw."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Liao, Z., Deng, L., Fan, X., Zhang, Y., Liu, H., Qi, X., and Zhou, Y. (2018). Empirical Research on the Evaluation Model and Method of Sustainability of the Open Source Ecosystem. Symmetry, 10.","DOI":"10.3390\/sym10120747"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"24003","DOI":"10.1109\/ACCESS.2018.2810295","article-title":"Exploring the Characteristics of Issue-related Behaviors in GitHub Using Visualization Techniques","volume":"6","author":"Liao","year":"2018","journal-title":"IEEE Access"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Thung, F., Bissyande, T.F., Lo, D., and Jiang, L. (2013, January 5\u20138). Network Structure of Social Coding in GitHub. Proceedings of the 2013 17th European Conference on Software Maintenance and Reengineering, Genova, Italy.","DOI":"10.1109\/CSMR.2013.41"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Decan, A., Mens, T., Claes, M., and Grosjean, P. (2016, January 14\u201318). When GitHub Meets CRAN: An Analysis of Inter-Repository Package Dependency Problems. Proceedings of the 2016 IEEE 23rd International Conference on Software Analysis, Evolution, and Reengineering, Osaka, Japan.","DOI":"10.1109\/SANER.2016.12"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Liao, Z., Zhao, B., Liu, S., Jin, H., He, D., Yang, L., Zhang, Y., and Wu, J. (2017). A Prediction Model of the Project Life-Span in Open Source Software Ecosystem. Mob. Netw. Appl.","DOI":"10.1007\/s11036-018-0993-3"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1088\/1742-5468\/2008\/10\/P10008","article-title":"Fast unfolding of communities in large networks","volume":"2008","author":"Blondel","year":"2008","journal-title":"J. Stat. Mech."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Yu, Y., Yin, G., Wang, H., and Wang, T. (2014, January 17\u201319). Exploring the patterns of social behavior in GitHub. Proceedings of the 1st International Workshop on Crowd-based Software Development Methods and Technologies, Hong Kong, China.","DOI":"10.1145\/2666539.2666571"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1214\/14-AOS1290","article-title":"Robust and computationally feasible community detection in the presence of arbitrary outlier nodes","volume":"43","author":"Cai","year":"2015","journal-title":"Ann. Stat."},{"key":"ref_37","first-page":"784321","article-title":"Identification of core-periphery structure in networks","volume":"91","author":"Xiao","year":"2014","journal-title":"Phys. Rev. E Stat. Nonlinear Soft Matter Phys."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"088701","DOI":"10.1103\/PhysRevLett.115.088701","article-title":"Generalized Communities in Networks","volume":"115","author":"Newman","year":"2015","journal-title":"Phys. Rev. Lett."},{"key":"ref_39","unstructured":"Chen, Y., Kamath, G., Suh, C., and Tse, D. (2016, January 19\u201324). Community Recovery in Graphs with Locality. Proceedings of the 33rd International Conference on Machine Learning, New York, NY, USA."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"026129","DOI":"10.1103\/PhysRevE.80.026129","article-title":"Detecting network communities by propagating labels under constraints","volume":"80","author":"Barber","year":"2009","journal-title":"Phys. Rev. E Stat. Nonlinear Soft Matter Phys."}],"container-title":["Symmetry"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2073-8994\/11\/2\/182\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T12:30:54Z","timestamp":1760185854000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2073-8994\/11\/2\/182"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,2,3]]},"references-count":40,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2019,2]]}},"alternative-id":["sym11020182"],"URL":"https:\/\/doi.org\/10.3390\/sym11020182","relation":{},"ISSN":["2073-8994"],"issn-type":[{"type":"electronic","value":"2073-8994"}],"subject":[],"published":{"date-parts":[[2019,2,3]]}}}