{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,29]],"date-time":"2026-01-29T23:39:33Z","timestamp":1769729973157,"version":"3.49.0"},"reference-count":78,"publisher":"Association for Computing Machinery (ACM)","issue":"8","license":[{"start":{"date-parts":[[2024,11,23]],"date-time":"2024-11-23T00:00:00Z","timestamp":1732320000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["62141210, 61977014, 61902329, 62002125, and 61802164"],"award-info":[{"award-number":["62141210, 61977014, 61902329, 62002125, and 61802164"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100012226","name":"Fundamental Research Funds for the Central Universities","doi-asserted-by":"crossref","award":["N2217005"],"award-info":[{"award-number":["N2217005"]}],"id":[{"id":"10.13039\/501100012226","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Novel Software Technology, Nanjing University","award":["KFKT2021B01 and (B16009)"],"award-info":[{"award-number":["KFKT2021B01 and (B16009)"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Softw. Eng. Methodol."],"published-print":{"date-parts":[[2024,11,30]]},"abstract":"<jats:p>\n            Existing software clustering techniques tend to ignore prior knowledge from domain experts, leading to results (suggested big-bang remodularization actions) that cannot be acceptable to developers. Incorporating domain experts knowledge or constraints during clustering ensures the obtained modularization aligns with developers\u2019 perspectives, enhancing software quality. However, manual review by knowledgeable domain experts for constraint generation is time-consuming and labor-intensive. In this article, we propose an evolution-aware constraint derivation approach,\n            <jats:sc>Escort<\/jats:sc>\n            , which automatically derives clustering constraints based on the evolutionary history from the analyzed software. Specifically,\n            <jats:sc>Escort<\/jats:sc>\n            can serve as an alternative approach to derive implicit and explicit constraints in situations where domain experts are absent. In the subsequent constrained clustering process,\n            <jats:sc>Escort<\/jats:sc>\n            can be considered as a framework to help supplement and enhance various unconstrained clustering techniques to improve their accuracy and reliability. We evaluate\n            <jats:sc>Escort<\/jats:sc>\n            based on both quantitative and qualitative analysis. In quantitative validation,\n            <jats:sc>Escort<\/jats:sc>\n            , using generated clustering constraints, outperforms seven classic unconstrained clustering techniques. Qualitatively, a survey with developers from five IT companies indicates that 89% agree with\n            <jats:sc>Escort<\/jats:sc>\n            \u2019s clustering constraints. We also evaluate the utility of refactoring suggestions from our constrained clustering approach, with 54% acknowledged by project developers, either implemented or planned for future releases.\n          <\/jats:p>","DOI":"10.1145\/3676960","type":"journal-article","created":{"date-parts":[[2024,7,8]],"date-time":"2024-07-08T14:11:20Z","timestamp":1720447880000},"page":"1-43","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["Evolution-Aware Constraint Derivation Approach for Software Remodularization"],"prefix":"10.1145","volume":"33","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6465-3295","authenticated-orcid":false,"given":"Fanyi","family":"Meng","sequence":"first","affiliation":[{"name":"Northeastern University, Shenyang, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8645-4326","authenticated-orcid":false,"given":"Ying","family":"Wang","sequence":"additional","affiliation":[{"name":"Northeastern University, Shenyang, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1164-0049","authenticated-orcid":false,"given":"Chun Yong","family":"Chong","sequence":"additional","affiliation":[{"name":"Monash University Malaysia, Bandar Sunway, Subang Jaya, Malaysia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8024-1781","authenticated-orcid":false,"given":"Hai","family":"Yu","sequence":"additional","affiliation":[{"name":"Northeastern University, Shenyang, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3422-5585","authenticated-orcid":false,"given":"Zhiliang","family":"Zhu","sequence":"additional","affiliation":[{"name":"Northeastern University, Shenyang, China"}]}],"member":"320","published-online":{"date-parts":[[2024,11,23]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"crossref","first-page":"1440","DOI":"10.1145\/2851613.2851781","volume-title":"Proceedings of the 31st ACM Symposium on Applied Computing","author":"Alt\u0269n\u0269s\u0269k Metin","year":"2016","unstructured":"Metin Alt\u0269n\u0269s\u0269k and Hasan S\u00f6zer. 2016. Automated procedure clustering for reverse engineering PL\/SQL programs. In Proceedings of the 31st ACM Symposium on Applied Computing ACM, 1440\u20131445."},{"issue":"2","key":"e_1_3_2_3_2","doi-asserted-by":"crossref","first-page":"150","DOI":"10.1109\/TSE.2005.25","article-title":"Information-theoretic software clustering","volume":"31","author":"Andritsos Periklis","year":"2005","unstructured":"Periklis Andritsos and Vassilios Tzerpos. 2005. Information-theoretic software clustering. IEEE Trans. Softw. Eng. 31, 2 (2005), 150\u2013165.","journal-title":"IEEE Trans. Softw. Eng."},{"issue":"9","key":"e_1_3_2_4_2","doi-asserted-by":"crossref","first-page":"3247","DOI":"10.1109\/TPAMI.2020.2979699","article-title":"Semi-supervised clustering with constraints of different types from multiple information sources","volume":"43","author":"Bai Liang","year":"2021","unstructured":"Liang Bai, JiYe Liang, and Fuyuan Cao. 2021. Semi-supervised clustering with constraints of different types from multiple information sources. IEEE Trans. Pattern Anal. Mach. Intell. 43, 9 (2021), 3247\u20133258.","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"e_1_3_2_5_2","volume-title":"Software Architecture in Practice","author":"Bass Len","year":"2013","unstructured":"Len Bass, Paul Clements, and Rick Kazman. 2013. Software Architecture in Practice. Addison-Wesley."},{"key":"e_1_3_2_6_2","first-page":"333","volume-title":"Proceedings of the SIAM International Conference on Data Mining","author":"Basu Sugato","year":"2004","unstructured":"Sugato Basu, Arindam Banerjee, and Raymond J. Mooney. 2004. Active semi-supervision for pairwise constrained clustering. In Proceedings of the SIAM International Conference on Data Mining. SIAM, 333\u2013344."},{"key":"e_1_3_2_7_2","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1007\/978-3-642-33119-0_7","volume-title":"Proceedings of the 4th International Symposium Search Based Software Engineering","author":"Bavota Gabriele","year":"2012","unstructured":"Gabriele Bavota, Filomena Carnevale, Andrea De Lucia, Massimiliano Di Penta, and Rocco Oliveto. 2012. Putting the developer in-the-loop: An interactive GA for software re-modularization. In Proceedings of the 4th International Symposium Search Based Software Engineering. Springer, Berlin, 75\u201389."},{"key":"e_1_3_2_8_2","doi-asserted-by":"crossref","first-page":"901","DOI":"10.1007\/s10664-012-9226-8","article-title":"Using structural and semantic measures to improve software modularization","volume":"18","author":"Bavota Gabriele","year":"2013","unstructured":"Gabriele Bavota, Andrea De Lucia, Andrian Marcus, and Rocco Oliveto. 2013. Using structural and semantic measures to improve software modularization. Empirical Softw. Eng. 18 (2013), 901\u2013932.","journal-title":"Empirical Softw. Eng."},{"issue":"5","key":"e_1_3_2_9_2","doi-asserted-by":"crossref","first-page":"970","DOI":"10.1007\/s10664-012-9225-9","article-title":"On the impact of software evolution on software clustering","volume":"18","author":"Beck Fabian","year":"2013","unstructured":"Fabian Beck and Stephan Diehl. 2013. On the impact of software evolution on software clustering. Empirical Softw. Eng. 18, 5 (2013), 970\u20131004.","journal-title":"Empirical Softw. Eng."},{"key":"e_1_3_2_10_2","first-page":"1","volume-title":"Proceedings of the 24th International Conference on Program Comprehension","author":"Beck Fabian","year":"2016","unstructured":"Fabian Beck, Jan Melcher, and Daniel Weiskopf. 2016. Identifying modularization patterns by visual comparison of multiple hierarchies. In Proceedings of the 24th International Conference on Program Comprehension. IEEE, 1\u201310."},{"issue":"3","key":"e_1_3_2_11_2","doi-asserted-by":"crossref","first-page":"1146","DOI":"10.1007\/s10664-016-9466-0","article-title":"A large-scale study of architectural evolution in open-source software systems","volume":"22","author":"Behnamghader Pooyan","year":"2017","unstructured":"Pooyan Behnamghader, Duc M. Le, Joshua Garcia, Daniel Link, and Nenad Medvidovic. 2017. A large-scale study of architectural evolution in open-source software systems. Empirical Softw. Eng. 22, 3 (2017), 1146\u20131193.","journal-title":"Empirical Softw. Eng."},{"issue":"3","key":"e_1_3_2_12_2","first-page":"24:1","article-title":"Using cohesion and coupling for software remodularization: Is it enough?","volume":"25","author":"Candela Ivan","year":"2016","unstructured":"Ivan Candela, Gabriele Bavota, Barbara Russo, and Rocco Oliveto. 2016. Using cohesion and coupling for software remodularization: Is it enough? ACM Trans. Softw. Eng. Methodol. 25, 3 (2016), 24:1\u201324:28.","journal-title":"ACM Trans. Softw. Eng. Methodol"},{"key":"e_1_3_2_13_2","doi-asserted-by":"crossref","first-page":"380","DOI":"10.1016\/j.procs.2015.06.044","article-title":"Search-based object-oriented software re-structuring with structural coupling strength","volume":"54","author":"Chhabra Jitender K.","year":"2015","unstructured":"Jitender K. Chhabra. 2015. Search-based object-oriented software re-structuring with structural coupling strength. Procedia Comput. Sci. 54 (2015), 380\u2013389.","journal-title":"Procedia Comput. Sci."},{"key":"e_1_3_2_14_2","doi-asserted-by":"crossref","first-page":"96","DOI":"10.1016\/j.infsof.2016.09.011","article-title":"Improving modular structure of software system using structural and lexical dependency","volume":"82","author":"Chhabra Jitender K.","year":"2017","unstructured":"Jitender K. Chhabra. 2017. Improving modular structure of software system using structural and lexical dependency. Inf. Softw. Technol. 82 (2017), 96\u2013120.","journal-title":"Inf. Softw. Technol."},{"key":"e_1_3_2_15_2","doi-asserted-by":"crossref","first-page":"28","DOI":"10.1016\/j.jss.2015.08.014","article-title":"Analyzing maintainability and reliability of object-oriented software using weighted complex network","volume":"110","author":"Chong Chun Y.","year":"2015","unstructured":"Chun Y. Chong and Sai P. Lee. 2015. Analyzing maintainability and reliability of object-oriented software using weighted complex network. J. Syst. Softw. 110 (2015), 28\u201353.","journal-title":"J. Syst. Softw."},{"key":"e_1_3_2_16_2","doi-asserted-by":"crossref","first-page":"28","DOI":"10.1016\/j.jss.2017.08.017","article-title":"Automatic clustering constraints derivation from object-oriented software using weighted complex network with graph theory analysis","volume":"133","author":"Chong Chun Y.","year":"2017","unstructured":"Chun Y. Chong and Sai P. Lee. 2017. Automatic clustering constraints derivation from object-oriented software using weighted complex network with graph theory analysis. J. Syst. Softw. 133 (2017), 28\u201353.","journal-title":"J. Syst. Softw."},{"key":"e_1_3_2_17_2","first-page":"11","article-title":"Efficient software clustering technique using an adaptive and preventive dendrogram cutting approach","volume":"55","author":"Chong Chun Y.","year":"2013","unstructured":"Chun Y. Chong, Sai P. Lee, and Teck C. Ling. 2013. Efficient software clustering technique using an adaptive and preventive dendrogram cutting approach. Inf. Softw. Technol. 55, 11 (2013), 1994\u20132012.","journal-title":"Inf. Softw. Technol."},{"issue":"11","key":"e_1_3_2_18_2","doi-asserted-by":"crossref","first-page":"1245","DOI":"10.1016\/S0045-7825(01)00323-1","article-title":"Theoretical and numerical constraint-handling techniques used with evolutionary algorithms: A survey of the state of the art","volume":"191","author":"Coello Carlos A. C.","year":"2002","unstructured":"Carlos A. C. Coello. 2002. Theoretical and numerical constraint-handling techniques used with evolutionary algorithms: A survey of the state of the art. Comput. Meth. Appl. Mech. Eng. 191, 11\u201312 (2002), 1245\u20131287.","journal-title":"Comput. Meth. Appl. Mech. Eng."},{"issue":"8","key":"e_1_3_2_19_2","doi-asserted-by":"crossref","first-page":"8529","DOI":"10.1109\/TKDE.2022.3206330","article-title":"Semi-supervised EEG clustering with multiple constraints","volume":"35","author":"Dai Chenglong","year":"2023","unstructured":"Chenglong Dai, Jia Wu, Jessica J. M. Monaghan, Guanghui Li, Hao Peng, Stefanie I. Becker, and David McAlpine. 2023. Semi-supervised EEG clustering with multiple constraints. IEEE Trans. Knowl. Data Eng. 35, 8 (2023), 8529\u20138544.","journal-title":"IEEE Trans. Knowl. Data Eng."},{"issue":"6","key":"e_1_3_2_20_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/s00500-015-1643-3","article-title":"Automatic constraints generation for semisupervised clustering: experiences with documents classification","volume":"20","author":"Diaz-Valenzuela I.","year":"2016","unstructured":"I. Diaz-Valenzuela, V. Loia, M. J. Martin-Bautista, S. Senatore, and M. A. Vila. 2016. Automatic constraints generation for semisupervised clustering: experiences with documents classification. Soft Comput. 20, 6 (2016), 1\u201311.","journal-title":"Soft Comput."},{"issue":"4","key":"e_1_3_2_21_2","doi-asserted-by":"crossref","first-page":"573","DOI":"10.1109\/TSE.2009.19","article-title":"Software architecture reconstruction: A process-oriented taxonomy","volume":"35","author":"Ducasse Stephane","year":"2009","unstructured":"Stephane Ducasse and Damien Pollet. 2009. Software architecture reconstruction: A process-oriented taxonomy. IEEE Trans. Softw. Eng. 35, 4 (2009), 573\u2013591.","journal-title":"IEEE Trans. Softw. Eng."},{"issue":"3","key":"e_1_3_2_22_2","doi-asserted-by":"crossref","first-page":"32","DOI":"10.1080\/01969727308546046","article-title":"A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters","volume":"3","author":"Dunn J. C.","year":"1973","unstructured":"J. C. Dunn. 1973. A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters. J. Cybern. 3, 3 (1973), 32\u201357.","journal-title":"J. Cybern."},{"key":"e_1_3_2_23_2","volume-title":"Refactoring: Improving the Design of Existing Code.","author":"Fowler Martin","year":"1999","unstructured":"Martin Fowler, Kent Beck, John Brant, William Opdyke, and Don Roberts. 1999. Refactoring: Improving the Design of Existing Code. Addison-Wesley."},{"issue":"6","key":"e_1_3_2_24_2","doi-asserted-by":"crossref","first-page":"1565","DOI":"10.1109\/TFUZZ.2008.2005692","article-title":"Fuzzy clustering and aggregation of relational data with instance-level constraints","volume":"16","author":"Frigui Hichem","year":"2008","unstructured":"Hichem Frigui and Cheul Hwang. 2008. Fuzzy clustering and aggregation of relational data with instance-level constraints. IEEE Trans. Fuzzy Syst. 16, 6 (2008), 1565\u20131581.","journal-title":"IEEE Trans. Fuzzy Syst."},{"key":"e_1_3_2_25_2","first-page":"486","volume-title":"Proceedings of the 28th IEEE\/ACM International Conference on Automated Software Engineering","author":"Garcia Joshua","year":"2013","unstructured":"Joshua Garcia, Igor Ivkovic, and Nenad Medvidovic. 2013. A comparative analysis of software architecture recovery techniques. In Proceedings of the 28th IEEE\/ACM International Conference on Automated Software Engineering. IEEE, 486\u2013496."},{"key":"e_1_3_2_26_2","doi-asserted-by":"crossref","first-page":"441","DOI":"10.1016\/j.ins.2022.05.035","article-title":"Semi-supervised clustering with inaccurate pairwise annotations","volume":"607","author":"Gribel Daniel","year":"2022","unstructured":"Daniel Gribel, Michel Gendreau, and Thibaut Vidal. 2022. Semi-supervised clustering with inaccurate pairwise annotations. Inf. Sci. 607 (2022), 441\u2013457.","journal-title":"Inf. Sci."},{"key":"e_1_3_2_27_2","first-page":"466","volume-title":"Proceedings of the 29th IEEE International Conference on Software Maintenance and Evolution","author":"Hall Mathew","year":"2014","unstructured":"Mathew Hall, Muhammad A. Khojaye, Neil Walkinshaw, and Phil McMinn. 2014. Establishing the source code disruption caused by automated remodularisation tools. In Proceedings of the 29th IEEE International Conference on Software Maintenance and Evolution. IEEE, 466\u2013470."},{"key":"e_1_3_2_28_2","first-page":"472","volume-title":"Proceedings of the 28th International Conference on Software Maintenance","author":"Hall Mathew","year":"2012","unstructured":"Mathew Hall, Neil Walkinshaw, and Phil McMinn. 2012. Supervised software modularisation. In Proceedings of the 28th International Conference on Software Maintenance. IEEE, 472\u2013481."},{"issue":"7","key":"e_1_3_2_29_2","doi-asserted-by":"crossref","first-page":"613","DOI":"10.1109\/TSE.2017.2786222","article-title":"Effectively incorporating expert knowledge in automated software remodularisation","volume":"44","author":"Hall Mathew","year":"2018","unstructured":"Mathew Hall, Neil Walkinshaw, and Phil McMinn. 2018. Effectively incorporating expert knowledge in automated software remodularisation. IEEE Trans. Softw. Eng. 44, 7 (2018), 613\u2013630.","journal-title":"IEEE Trans. Softw. Eng."},{"key":"e_1_3_2_30_2","first-page":"463","volume-title":"Proceedings of the IEEE International Conference on Fuzzy Systems","author":"Havens Timothy C.","year":"2011","unstructured":"Timothy C. Havens, Radha Chitta, Anil K. Jain, and Rong Jin. 2011. Speedup of fuzzy and possibilistic kernel c-means for large-scale clustering. In Proceedings of the IEEE International Conference on Fuzzy Systems. IEEE, 463\u2013470."},{"key":"e_1_3_2_31_2","doi-asserted-by":"crossref","first-page":"96","DOI":"10.1016\/j.ins.2016.01.030","article-title":"A similarity-based modularization quality measure for software module clustering problems","volume":"342","author":"Huang Jinhuang","year":"2016","unstructured":"Jinhuang Huang and Jing Liu. 2016. A similarity-based modularization quality measure for software module clustering problems. Inf. Sci. 342 (2016), 96\u2013110.","journal-title":"Inf. Sci."},{"key":"e_1_3_2_32_2","doi-asserted-by":"crossref","first-page":"178","DOI":"10.1016\/j.ins.2022.11.139","article-title":"K-means clustering algorithms: A comprehensive review, variants analysis, and advances in the era of big data","volume":"622","author":"Ikotun Abiodun M.","year":"2023","unstructured":"Abiodun M. Ikotun, Absalom E. Ezugwu, Laith Abualigah, Belal Abuhaija, and Jia Heming. 2023. K-means clustering algorithms: A comprehensive review, variants analysis, and advances in the era of big data. Inf. Sci. 622 (2023), 178\u2013210.","journal-title":"Inf. Sci."},{"issue":"5","key":"e_1_3_2_33_2","first-page":"5244","article-title":"Semi-supervised clustering under a \u201ccompact-cluster\u201d assumption","volume":"35","author":"Jiang Zhen","year":"2022","unstructured":"Zhen Jiang, Yongzhao Zhan, Qirong Mao, and Yang Du. 2022. Semi-supervised clustering under a \u201ccompact-cluster\u201d assumption. IEEE Trans. Knowl. Data Eng. 35, 5 (2022), 5244\u20135256.","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"e_1_3_2_34_2","first-page":"67","volume-title":"Proceedings of the IEEE\/ACM 41st ENRE: A Tool Framework for Extensible eNtity Relation Extraction","author":"Jin Wuxia","year":"2019","unstructured":"Wuxia Jin, Yuanfang Cai, Rick Kazman, Qinghua Zheng, Di Cui, and Ting Liu. 2019. ENRE: A tool framework for extensible eNtity relation extraction. In Proceedings of the IEEE\/ACM 41st ENRE: A Tool Framework for Extensible eNtity Relation Extraction. IEEE, 67\u201370."},{"key":"e_1_3_2_35_2","first-page":"222","volume-title":"Proceedings of the 10th IEEE International Workshop on Future Trends of Distributed Computing Systems","author":"Kang Dazhou","year":"2004","unstructured":"Dazhou Kang, Baowen Xu, Jianjiang Lu, and W. C. Chu. 2004. A complexity measure for ontology based on UML. In Proceedings of the 10th IEEE International Workshop on Future Trends of Distributed Computing Systems. IEEE, 222\u2013228."},{"issue":"4","key":"e_1_3_2_36_2","doi-asserted-by":"crossref","first-page":"402","DOI":"10.1049\/iet-sen.2019.0138","article-title":"New internal metric for software clustering algorithms validity","volume":"14","author":"Kargar M.","year":"2020","unstructured":"M. Kargar, A. Isazadeh, and H. Izadkhah. 2020. New internal metric for software clustering algorithms validity. IET Softw. 14, 4 (2020), 402\u2013410.","journal-title":"IET Softw."},{"key":"e_1_3_2_37_2","first-page":"307","volume-title":"Proceedings of the 19th International Conference on Machine Learning","author":"Klein Dan","year":"2002","unstructured":"Dan Klein, Sepandar D. Kamvar, and Christopher D. Manning. 2002. From instance-level constraints to space-level constraints: Making the most of prior knowledge in data clustering. In Proceedings of the 19th International Conference on Machine Learning. Morgan Kaufmann Publishers Inc., 307\u2013314."},{"key":"e_1_3_2_38_2","first-page":"462","volume-title":"Proceedings of the 28th International Conference on Software Maintenance","author":"Kobayashi Kenichi","year":"2012","unstructured":"Kenichi Kobayashi, Manabu Kamimura, Koki Kato, Keisuke Yano, and Akihiko Matsuo. 2012. Feature-gathering dependency-based software clustering using Dedication and Modularity. In Proceedings of the 28th International Conference on Software Maintenance. IEEE, 462\u2013471."},{"issue":"6","key":"e_1_3_2_39_2","doi-asserted-by":"crossref","first-page":"42","DOI":"10.1109\/52.469759","article-title":"The  \\(4+1\\)  View Model of architecture","volume":"12","author":"Kruchten P. B.","year":"1995","unstructured":"P. B. Kruchten. 1995. The \\(4+1\\) View Model of architecture. IEEE Softw. 12, 6 (1995), 42\u201350.","journal-title":"IEEE Softw."},{"issue":"6","key":"e_1_3_2_40_2","doi-asserted-by":"crossref","first-page":"18","DOI":"10.1109\/MS.2012.167","article-title":"Technical debt: From metaphor to theory and practice","volume":"29","author":"Kruchten Philippe","year":"2012","unstructured":"Philippe Kruchten, Robert L. Nord, and Ipek Ozkaya. 2012. Technical debt: From metaphor to theory and practice. IEEE Softw. 29, 6 (2012), 18\u201321.","journal-title":"IEEE Softw."},{"issue":"1","key":"e_1_3_2_41_2","doi-asserted-by":"crossref","first-page":"340","DOI":"10.1109\/TKDE.2020.2982387","article-title":"Label propagated nonnegative matrix factorization for clustering","volume":"34","author":"Lan Long","year":"2022","unstructured":"Long Lan, Tongliang Liu, Xiang Zhang, Chuanfu Xu, and Zhigang Luo. 2022. Label propagated nonnegative matrix factorization for clustering. IEEE Trans. Knowl. Data Eng. 34, 1 (2022), 340\u2013351.","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"e_1_3_2_42_2","first-page":"1060","volume-title":"Proc. IEEE","volume":"68","author":"Lehman Meir M.","year":"1980","unstructured":"Meir M. Lehman. 1980. Programs, life cycles, and laws of software evolution. Proc. IEEE 68, 9 (1980), 1060\u20131076."},{"issue":"4","key":"e_1_3_2_43_2","doi-asserted-by":"crossref","first-page":"1980","DOI":"10.1109\/TKDE.2020.2997938","article-title":"SCHAIN-IRAM: An efficient and effective semi-supervised clustering algorithm for attributed heterogeneous information networks","volume":"34","author":"Li Xiang","year":"2022","unstructured":"Xiang Li, Yao Wu, Martin Ester, Ben Kao, Xin Wang, and Yudian Zheng. 2022. SCHAIN-IRAM: An efficient and effective semi-supervised clustering algorithm for attributed heterogeneous information networks. IEEE Trans. Knowl. Data Eng. 34, 4 (2022), 1980\u20131992.","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"e_1_3_2_44_2","first-page":"1809","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","author":"Li Yeqing","year":"2016","unstructured":"Yeqing Li, Junzhou Huang, and Wei Liu. 2016. Scalable sequential spectral clustering. In Proceedings of the AAAI Conference on Artificial Intelligence. AAAI, 1809\u20131815."},{"issue":"2","key":"e_1_3_2_45_2","doi-asserted-by":"crossref","first-page":"159","DOI":"10.1109\/TSE.2017.2671865","article-title":"Measuring the impact of code dependencies on software architecture recovery techniques","volume":"44","author":"Lutellier Thibaud","year":"2018","unstructured":"Thibaud Lutellier, Devin Chollak, Joshua Garcia, Lin Tan, Derek Rayside, Nenad Medvidovi\u0107, and Robert Kroeger. 2018. Measuring the impact of code dependencies on software architecture recovery techniques. IEEE Trans. Softw. Eng. 44, 2 (2018), 159\u2013181.","journal-title":"IEEE Trans. Softw. Eng."},{"key":"e_1_3_2_46_2","first-page":"315","volume-title":"Proceedings of the International Conference on Software Maintenance","author":"Mahdavi K.","year":"2003","unstructured":"K. Mahdavi, M. Harman, and R. M. Hierons. 2003. A multiple hill climbing approach to software module clustering. In Proceedings of the International Conference on Software Maintenance. IEEE, 315\u2013324."},{"issue":"4","key":"e_1_3_2_47_2","doi-asserted-by":"crossref","first-page":"1695","DOI":"10.1007\/s10664-016-9473-1","article-title":"Semantic topic models for source code analysis","volume":"22","author":"Mahmoud A.","year":"2017","unstructured":"A. Mahmoud and G. Bradshaw. 2017. Semantic topic models for source code analysis. Empirical Softw. Eng. 22, 4 (2017), 1695\u20132000.","journal-title":"Empirical Softw. Eng."},{"key":"e_1_3_2_48_2","first-page":"45","volume-title":"Proceedings of the 6th International Workshop on Program Comprehension","author":"Mancoridis S.","year":"1998","unstructured":"S. Mancoridis, B. S. Mitchell, C. Rorres, Y. Chen, and E. R. Gansner. 1998. Using automatic clustering to produce high-level system organizations of source code. In Proceedings of the 6th International Workshop on Program Comprehension. IEEE, 45\u201352."},{"key":"e_1_3_2_49_2","first-page":"11","article-title":"Hierarchical clustering for software architecture recovery","volume":"33","author":"Maqbool Onaiza","year":"2007","unstructured":"Onaiza Maqbool and Haroon Babri. 2007. Hierarchical clustering for software architecture recovery. IEEE Trans. Softw. Eng. 33, 11 (2007), 759\u2013780.","journal-title":"IEEE Trans. Softw. Eng."},{"issue":"2","key":"e_1_3_2_50_2","doi-asserted-by":"crossref","first-page":"287","DOI":"10.1109\/TSE.2007.70768","article-title":"Using the conceptual cohesion of classes for fault prediction in object-oriented systems","volume":"34","author":"Marcus Andrian","year":"2008","unstructured":"Andrian Marcus, Denys Poshyvanyk, and Rudolf Ferenc. 2008. Using the conceptual cohesion of classes for fault prediction in object-oriented systems. IEEE Trans. Softw. Eng. 34, 2 (2008), 287\u2013300.","journal-title":"IEEE Trans. Softw. Eng."},{"key":"e_1_3_2_51_2","doi-asserted-by":"crossref","first-page":"237","DOI":"10.1016\/j.infsof.2015.07.005","article-title":"Investigating architectural technical debt accumulation and refactoring over time: A multiple-case study","volume":"67","author":"Martini Antonio","year":"2015","unstructured":"Antonio Martini, Jan Bosch, and Michel Chaudron. 2015. Investigating architectural technical debt accumulation and refactoring over time: A multiple-case study. Inf. Softw. Technol. 67 (2015), 237\u2013253.","journal-title":"Inf. Softw. Technol."},{"issue":"3","key":"e_1_3_2_52_2","doi-asserted-by":"crossref","first-page":"193","DOI":"10.1109\/TSE.2006.31","article-title":"On the automatic modularization of software systems using the Bunch tool","volume":"32","author":"Mitchell B. S.","year":"2006","unstructured":"B. S. Mitchell and S. Mancoridis. 2006. On the automatic modularization of software systems using the Bunch tool. IEEE Trans. Softw. Eng. 32, 3 (2006), 193\u2013208.","journal-title":"IEEE Trans. Softw. Eng."},{"key":"e_1_3_2_53_2","doi-asserted-by":"crossref","first-page":"252","DOI":"10.1016\/j.infsof.2018.09.001","article-title":"A new algorithm for software clustering considering the knowledge of dependency between artifacts in the source code","volume":"105","author":"Mohammadi Sina","year":"2019","unstructured":"Sina Mohammadi and Habib Izadkhah. 2019. A new algorithm for software clustering considering the knowledge of dependency between artifacts in the source code. Inf. Softw. Technol. 105 (2019), 252\u2013256.","journal-title":"Inf. Softw. Technol."},{"issue":"2","key":"e_1_3_2_54_2","first-page":"1","article-title":"Finding and evaluating community structure in networks","volume":"69","author":"Newman M. E. J.","year":"2003","unstructured":"M. E. J. Newman and M. Girvan. 2003. Finding and evaluating community structure in networks. Phys. Rev. E 69, 2 (2003), 1\u201316.","journal-title":"Phys. Rev. E"},{"key":"e_1_3_2_55_2","doi-asserted-by":"crossref","first-page":"119203","DOI":"10.1016\/j.ins.2023.119203","article-title":"Semi-supervised classifier ensemble model for high-dimensional data","volume":"643","author":"Niu Xufeng","year":"2023","unstructured":"Xufeng Niu and Wenping Ma. 2023. Semi-supervised classifier ensemble model for high-dimensional data. Inf. Sci. 643 (2023), 119203.","journal-title":"Inf. Sci."},{"issue":"3","key":"e_1_3_2_56_2","first-page":"1","article-title":"Breaking bad? Semantic versioning and impact of breaking changes in Maven Central","volume":"27","author":"Ochoa Lina","year":"2022","unstructured":"Lina Ochoa, Thomas Degueule, Jean-R\u00e9my Falleri, and Jurgen Vinju. 2022. Breaking bad? Semantic versioning and impact of breaking changes in Maven Central. Empirical Softw. Eng. 27, 3 (2022), 1\u201342.","journal-title":"Empirical Softw. Eng."},{"issue":"4","key":"e_1_3_2_57_2","doi-asserted-by":"crossref","first-page":"106275","DOI":"10.1016\/j.infsof.2020.106275","article-title":"A survey on the practical use of UML for different software architecture viewpoints","volume":"121","author":"Ozkaya M.","year":"2020","unstructured":"M. Ozkaya and F. Erata. 2020. A survey on the practical use of UML for different software architecture viewpoints. Inf. Softw. Technol. 121, 4 (2020), 106275.","journal-title":"Inf. Softw. Technol."},{"issue":"3","key":"e_1_3_2_58_2","doi-asserted-by":"crossref","first-page":"394","DOI":"10.1109\/TEVC.2017.2691281","article-title":"An empirical study of cohesion and coupling: Balancing optimization and disruption","volume":"22","author":"Paixao M.","year":"2017","unstructured":"M. Paixao, M. Harman, Y. Zhang, and Y. Yu. 2017. An empirical study of cohesion and coupling: Balancing optimization and disruption. IEEE Trans. Evol. Comput. 22, 3 (2017), 394\u2013414.","journal-title":"IEEE Trans. Evol. Comput."},{"issue":"3","key":"e_1_3_2_59_2","doi-asserted-by":"crossref","first-page":"370","DOI":"10.1109\/91.413225","article-title":"On cluster validity for the fuzzy c-means model","volume":"3","author":"Pal Nikhil R.","year":"1995","unstructured":"Nikhil R. Pal and James C. Bezdek. 1995. On cluster validity for the fuzzy c-means model. IEEE Trans. Fuzzy Syst. 3, 3 (1995), 370\u2013379.","journal-title":"IEEE Trans. Fuzzy Syst."},{"issue":"10","key":"e_1_3_2_60_2","doi-asserted-by":"crossref","first-page":"2272","DOI":"10.1109\/TSE.2019.2946357","article-title":"ElementRank: Ranking java software classes and packages using a multilayer complex network-based approach","volume":"47","author":"Pan Weifeng","year":"2021","unstructured":"Weifeng Pan, Hua Ming, Carl Chang, Zijiang Yang, and Dae-Kyoo Kim. 2021. ElementRank: Ranking java software classes and packages using a multilayer complex network-based approach. IEEE Trans. Softw. Eng. 47, 10 (2021), 2272\u20132295.","journal-title":"IEEE Trans. Softw. Eng."},{"key":"e_1_3_2_61_2","first-page":"27","volume-title":"Proceedings of the 13th Eighth European Conference on Software Maintenance and Reengineering","author":"Patel Chiragkumar","year":"2009","unstructured":"Chiragkumar Patel, Abdelwahab Hamou-Lhadj, and Juergen Rilling. 2009. Software clustering using dynamic analysis and static dependencies. In Proceedings of the 13th Eighth European Conference on Software Maintenance and Reengineering. IEEE, 27\u201336."},{"key":"e_1_3_2_62_2","doi-asserted-by":"crossref","first-page":"106469","DOI":"10.1016\/j.infsof.2020.106469","article-title":"A graph-based clustering algorithm for software systems modularization","volume":"133","author":"Pourasghar Babak","year":"2021","unstructured":"Babak Pourasghar, Habib Izadkhah, Ayaz Isazadeh, and Shahriar Lotfi. 2021. A graph-based clustering algorithm for software systems modularization. Inf. Softw. Technol. 133 (2021), 106469.","journal-title":"Inf. Softw. Technol."},{"issue":"2","key":"e_1_3_2_63_2","doi-asserted-by":"crossref","first-page":"264","DOI":"10.1109\/TSE.2010.26","article-title":"Software module clustering as a multi-objective search problem","volume":"37","author":"Praditwong Kata","year":"2011","unstructured":"Kata Praditwong, Mark Harman, and Xin Yao. 2011. Software module clustering as a multi-objective search problem. IEEE Trans. Softw. Eng. 37, 2 (2011), 264\u2013282.","journal-title":"IEEE Trans. Softw. Eng."},{"key":"e_1_3_2_64_2","volume-title":"The Analytic Hierarchy Process: Planning, Priority Setting, Resources Allocation","author":"Saaty Thomas L.","year":"1980","unstructured":"Thomas L. Saaty. 1980. The Analytic Hierarchy Process: Planning, Priority Setting, Resources Allocation. McGraw-Hill, New York."},{"key":"e_1_3_2_65_2","first-page":"1","volume-title":"Proceedings of the 15th Turkish National Software Engineering Symposium","author":"Saydemir Abdullah","year":"2021","unstructured":"Abdullah Saydemir, Muhammed E. Simitcioglu, and Hasan Sozer. 2021. On the use of evolutionary coupling for software architecture recovery. In Proceedings of the 15th Turkish National Software Engineering Symposium. IEEE, 1\u20136."},{"key":"e_1_3_2_66_2","doi-asserted-by":"crossref","first-page":"111162","DOI":"10.1016\/j.jss.2021.111162","article-title":"E-SC4R: Explaining software clustering for remodularisation","volume":"186","author":"Tan Alvin J. J.","year":"2022","unstructured":"Alvin J. J. Tan, Chun Y. Chong, and Aldeida Aleti. 2022. E-SC4R: Explaining software clustering for remodularisation. J. Syst. Softw. 186 (2022), 111162.","journal-title":"J. Syst. Softw."},{"key":"e_1_3_2_67_2","first-page":"326","volume-title":"Proceedings of the 45th International Conference on Software Engineering: Companion","author":"Tan Alvin J. J.","year":"2023","unstructured":"Alvin J. J. Tan, Chun Y. Chong, and Aldeida Aleti. 2023. Closing the loop for software remodularisation - REARRANGE: An effort estimation approach for software clustering-based remodularisation. In Proceedings of the 45th International Conference on Software Engineering: Companion. IEEE, 326\u2013327."},{"key":"e_1_3_2_68_2","doi-asserted-by":"crossref","first-page":"957","DOI":"10.1109\/TIP.2024.3354106","article-title":"Compound weakly supervised clustering","volume":"33","author":"Tao Hong","year":"2024","unstructured":"Hong Tao, Jiacheng Jiang, Chenping Hou, Tingjin Luo, Ruidong Fan, and Jing Zhang. 2024. Compound weakly supervised clustering. IEEE Trans. Image Process. 33 (2024), 957\u2013971.","journal-title":"IEEE Trans. Image Process"},{"issue":"4","key":"e_1_3_2_69_2","doi-asserted-by":"crossref","first-page":"1451","DOI":"10.1109\/TSE.2020.3022212","article-title":"A fast clustering algorithm for modularization of large-scale software systems","volume":"48","author":"Teymourian Navid","year":"2022","unstructured":"Navid Teymourian, Habib Izadkhah, and Ayaz Isazadeh. 2022. A fast clustering algorithm for modularization of large-scale software systems. IEEE Trans. Softw. Eng. 48, 4 (2022), 1451\u20131462.","journal-title":"IEEE Trans. Softw. Eng."},{"issue":"3","key":"e_1_3_2_70_2","doi-asserted-by":"crossref","first-page":"930","DOI":"10.1109\/TSE.2020.3007722","article-title":"RefactoringMiner 2.0","volume":"48","author":"Tsantalis Nikolaos","year":"2022","unstructured":"Nikolaos Tsantalis, Ameya Ketkar, and Danny Dig. 2022. RefactoringMiner 2.0. IEEE Trans. Softw. Eng. 48, 3 (2022), 930\u2013950.","journal-title":"IEEE Trans. Softw. Eng."},{"key":"e_1_3_2_71_2","first-page":"577","volume-title":"Proceedings of the 18th International Conference on Machine Learning","author":"Wagstaff Kiri","year":"2001","unstructured":"Kiri Wagstaff, Claire Cardie, Seth Rogers, and Stefan Schr\u00f6dl. 2001. Constrained K-Means Clustering with Background Knowledge. In Proceedings of the 18th International Conference on Machine Learning. Morgan Kaufmann Publishers Inc., 577\u2013584."},{"issue":"3","key":"e_1_3_2_72_2","doi-asserted-by":"crossref","first-page":"202","DOI":"10.1109\/TSE.2017.2679752","article-title":"Automatic software refactoring via weighted clustering in method-level networks","volume":"44","author":"Wang Ying","year":"2018","unstructured":"Ying Wang, Hai Yu, Zhiliang Zhu, Wei Zhang, and Yuli Zhao. 2018. Automatic software refactoring via weighted clustering in method-level networks. IEEE Trans. Softw. Eng. 44, 3 (2018), 202\u2013236.","journal-title":"IEEE Trans. Softw. Eng."},{"issue":"6","key":"e_1_3_2_73_2","doi-asserted-by":"crossref","first-page":"1336","DOI":"10.1109\/TKDE.2012.51","article-title":"Nonnegative matrix factorization: A comprehensive review","volume":"25","author":"Wang Yu-Xiong","year":"2012","unstructured":"Yu-Xiong Wang and Yu-Jin Zhang. 2012. Nonnegative matrix factorization: A comprehensive review. IEEE Trans. Knowl. Data Eng. 25, 6 (2012), 1336\u20131353.","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"e_1_3_2_74_2","first-page":"227","volume-title":"Proceedings of the 12th IEEE International Workshop on Program Comprehension","author":"Wen Zhihua","year":"2003","unstructured":"Zhihua Wen and Vassilios Tzerpos. 2003. An optimal algorithm for MoJo distance. In Proceedings of the 12th IEEE International Workshop on Program Comprehension. IEEE, 227\u2013235."},{"key":"e_1_3_2_75_2","doi-asserted-by":"crossref","first-page":"54","DOI":"10.1145\/1101908.1101919","volume-title":"Proceedings of the IEEE\/ACM 20th International Conference on Automated Software Engineering","author":"Xing Zhenchang","year":"2005","unstructured":"Zhenchang Xing and Eleni Stroulia. 2005. UMLDiff: An algorithm for object-oriented design differencing. In Proceedings of the IEEE\/ACM 20th International Conference on Automated Software Engineering. ACM, 54\u201365."},{"issue":"1","key":"e_1_3_2_76_2","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1109\/TKDE.2013.22","article-title":"Active learning of constraints for semi-supervised clustering","volume":"26","author":"Xiong Sicheng","year":"2014","unstructured":"Sicheng Xiong, Javad Azimi, and Xiaoli Z. Fern. 2014. Active learning of constraints for semi-supervised clustering. IEEE Trans. Knowl. Data Eng. 26, 1 (2014), 43\u201354.","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"e_1_3_2_77_2","doi-asserted-by":"crossref","first-page":"111349","DOI":"10.1016\/j.jss.2022.111349","article-title":"An efficient heuristic algorithm for software module clustering optimization","volume":"190","author":"Yuste Javier","year":"2022","unstructured":"Javier Yuste, Abraham Duarte, and Eduardo G. Pardo. 2022. An efficient heuristic algorithm for software module clustering optimization. J. Syst. Softw. 190 (2022), 111349.","journal-title":"J. Syst. Softw."},{"key":"e_1_3_2_78_2","first-page":"304","volume-title":"Proceedings of the 7th IEEE International Conference on Software Engineering and Service Science","author":"Zhong Linhui","year":"2016","unstructured":"Linhui Zhong, Liangbo Xue, Nengwei Zhang, Jing Xia, and Jun Chen. 2016. A tool to support software clustering using the software evolution information. In Proceedings of the 7th IEEE International Conference on Software Engineering and Service Science. IEEE, 304\u2013307."},{"key":"e_1_3_2_79_2","first-page":"12","article-title":"User review-based change file localization for mobile applications","volume":"47","author":"Zhou Yu","year":"2021","unstructured":"Yu Zhou, Yanqi Su, Taolue Chen, Zhiqiu Huang, Harald C. Gall, and Sebastiano Panichella. 2021. User review-based change file localization for mobile applications. IEEE Trans. Softw. Eng. 47, 12 (2021), 2755\u20132770.","journal-title":"IEEE Trans. Softw. Eng."}],"container-title":["ACM Transactions on Software Engineering and Methodology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3676960","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3676960","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T01:19:13Z","timestamp":1750295953000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3676960"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,11,23]]},"references-count":78,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2024,11,30]]}},"alternative-id":["10.1145\/3676960"],"URL":"https:\/\/doi.org\/10.1145\/3676960","relation":{},"ISSN":["1049-331X","1557-7392"],"issn-type":[{"value":"1049-331X","type":"print"},{"value":"1557-7392","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,11,23]]},"assertion":[{"value":"2024-01-03","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-06-20","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-11-23","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}