{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,7]],"date-time":"2026-04-07T05:43:00Z","timestamp":1775540580166,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":72,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,9,19]],"date-time":"2022-09-19T00:00:00Z","timestamp":1663545600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,9,19]]},"DOI":"10.1145\/3544902.3546243","type":"proceedings-article","created":{"date-parts":[[2022,9,7]],"date-time":"2022-09-07T04:07:45Z","timestamp":1662523665000},"page":"125-136","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":10,"title":["Identifying Source Code File Experts"],"prefix":"10.1145","author":[{"given":"Ot\u00e1vio","family":"Cury","sequence":"first","affiliation":[{"name":"Federal University of Piaui, Brazil"}]},{"given":"Guilherme","family":"Avelino","sequence":"additional","affiliation":[{"name":"Federal University of Piaui, Brazil"}]},{"given":"Pedro","family":"Santos Neto","sequence":"additional","affiliation":[{"name":"Federal University of Piaui, Brazil"}]},{"given":"Ricardo","family":"Britto","sequence":"additional","affiliation":[{"name":"Blekinge Institute of Technology, Sweden"}]},{"given":"Marco","family":"T\u00falio Valente","sequence":"additional","affiliation":[{"name":"Federal University of Minas Gerais, Brazil"}]}],"member":"320","published-online":{"date-parts":[[2022,9,19]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/1370750.1370780"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/1134285.1134336"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/MSR.2007.7"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPC.2016.7503718"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-57735-7_15"},{"key":"e_1_3_2_1_6_1","volume-title":"Measuring and analyzing code authorship in 1 + 118 open source projects. Science of Computer Programming 176 (may","author":"Avelino Guilherme","year":"2019","unstructured":"Guilherme Avelino , Leonardo Passos , Andre Hora , and Marco\u00a0Tulio Valente . 2019. Measuring and analyzing code authorship in 1 + 118 open source projects. Science of Computer Programming 176 (may 2019 ), 14\u201332. Guilherme Avelino, Leonardo Passos, Andre Hora, and Marco\u00a0Tulio Valente. 2019. Measuring and analyzing code authorship in 1 + 118 open source projects. Science of Computer Programming 176 (may 2019), 14\u201332."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/MS.2018.185140155"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1016\/B978-0-12-809633-8.20349-X"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/2025113.2025119"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jss.2018.09.016"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/MSR.2007.14"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/2393596.2393647"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/3196321.3196345"},{"key":"e_1_3_2_1_14_1","unstructured":"Marc Claesen and Bart De\u00a0Moor. 2015. Hyperparameter search in machine learning. arXiv preprint arXiv:1502.02127(2015).  Marc Claesen and Bart De\u00a0Moor. 2015. Hyperparameter search in machine learning. arXiv preprint arXiv:1502.02127(2015)."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/SEAA.2016.18"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/2950290.2950339"},{"key":"e_1_3_2_1_17_1","volume-title":"Recommending Participants for Collaborative Merge Sessions","author":"Souza Costa Catarina","year":"2019","unstructured":"Catarina de\u00a0 Souza Costa , Jose\u00a0Jair Figueiredo , Joao\u00a0Felipe Pimentel , Anita Sarma , and Leonardo Gresta\u00a0Paulino Murta . 2019. Recommending Participants for Collaborative Merge Sessions . IEEE Transactions on Software Engineering( 2019 ), 1\u20131. https:\/\/doi.org\/10.1109\/TSE.2019.2917191 10.1109\/TSE.2019.2917191 Catarina de\u00a0Souza Costa, Jose\u00a0Jair Figueiredo, Joao\u00a0Felipe Pimentel, Anita Sarma, and Leonardo Gresta\u00a0Paulino Murta. 2019. Recommending Participants for Collaborative Merge Sessions. IEEE Transactions on Software Engineering(2019), 1\u20131. https:\/\/doi.org\/10.1109\/TSE.2019.2917191"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/SANER.2015.7081851"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/3275245.3275250"},{"key":"e_1_3_2_1_20_1","unstructured":"Hermann Ebbinghaus. 1885. \u00dcber das ged\u00e4chtnis: untersuchungen zur experimentellen psychologie. Duncker & Humblot.  Hermann Ebbinghaus. 1885. \u00dcber das ged\u00e4chtnis: untersuchungen zur experimentellen psychologie. Duncker & Humblot."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPC.2017.35"},{"key":"e_1_3_2_1_22_1","unstructured":"Jerome\u00a0H Friedman. 2001. Greedy function approximation: a gradient boosting machine. Annals of statistics(2001) 1189\u20131232.  Jerome\u00a0H Friedman. 2001. Greedy function approximation: a gradient boosting machine. Annals of statistics(2001) 1189\u20131232."},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/1287624.1287673"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/2512207"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/IWPSE.2005.21"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/2970276.2970306"},{"key":"e_1_3_2_1_27_1","volume-title":"The elements of statistical learning: data mining, inference, and prediction","author":"Hastie Trevor","unstructured":"Trevor Hastie , Robert Tibshirani , and Jerome Friedman . 2009. The elements of statistical learning: data mining, inference, and prediction . Springer Science & Business Media . Trevor Hastie, Robert Tibshirani, and Jerome Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction. Springer Science & Business Media."},{"key":"e_1_3_2_1_28_1","volume-title":"Mining the history of synchronous changes to refine code ownership. In 2009 6th ieee international working conference on mining software repositories","author":"Hattori Lile","unstructured":"Lile Hattori and Michele Lanza . 2009. Mining the history of synchronous changes to refine code ownership. In 2009 6th ieee international working conference on mining software repositories . IEEE , 141\u2013150. Lile Hattori and Michele Lanza. 2009. Mining the history of synchronous changes to refine code ownership. In 2009 6th ieee international working conference on mining software repositories. IEEE, 141\u2013150."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/1810295.1810339"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/302405.302455"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/2970276.2970358"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1002\/0471722146"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/2597008.2597147"},{"key":"e_1_3_2_1_34_1","volume-title":"Simp\u00f3sio Brasileiro de Qualidade de Software - SBQS","author":"Ibiapina Irvayne","year":"2017","unstructured":"Irvayne M.\u00a0S. Ibiapina , F.\u00a0V.\u00a0 M. Alves , Werney A.\u00a0L. Lira , Gleison\u00a0 A. Silva , and Pedro A . \u00a0S. Neto. 2017. Infer\u00eancia da Familiaridade de C\u00f3digo por Meio da Minera\u00e7\u00e3o de Reposit\u00f3rios de Software . Simp\u00f3sio Brasileiro de Qualidade de Software - SBQS ( 2017 ). Irvayne M.\u00a0S. Ibiapina, F.\u00a0V.\u00a0M. Alves, Werney A.\u00a0L. Lira, Gleison\u00a0A. Silva, and Pedro A.\u00a0S. Neto. 2017. Infer\u00eancia da Familiaridade de C\u00f3digo por Meio da Minera\u00e7\u00e3o de Reposit\u00f3rios de Software. Simp\u00f3sio Brasileiro de Qualidade de Software - SBQS (2017)."},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"crossref","unstructured":"Elgun Jabrayilzade Mikhail Evtikhiev Eray T\u00fcz\u00fcn and Vladimir Kovalenko. 2022. Bus Factor In Practice. arXiv preprint arXiv:2202.01523(2022).  Elgun Jabrayilzade Mikhail Evtikhiev Eray T\u00fcz\u00fcn and Vladimir Kovalenko. 2022. Bus Factor In Practice. arXiv preprint arXiv:2202.01523(2022).","DOI":"10.1109\/ICSE-SEIP55303.2022.9793985"},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.5555\/3138884.3139034"},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1002\/smr.530"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICSM.2008.4658064"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPC.2009.5090056"},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/3180155.3180215"},{"key":"e_1_3_2_1_42_1","volume-title":"Matthew Wiener","author":"Liaw Andy","year":"2002","unstructured":"Andy Liaw , Matthew Wiener , 2002 . Classification and regression by randomForest. R news 2, 3 (2002), 18\u201322. Andy Liaw, Matthew Wiener, 2002. Classification and regression by randomForest. R news 2, 3 (2002), 18\u201322."},{"key":"e_1_3_2_1_43_1","volume-title":"Guidelines for conducting surveys in software engineering v. 1.1","author":"Linaker Johan","year":"2015","unstructured":"Johan Linaker , Sardar\u00a0Muhammad Sulaman , Martin H\u00f6st , and Rafael\u00a0Maiani de Mello . 2015. Guidelines for conducting surveys in software engineering v. 1.1 . Lund University ( 2015 ). Johan Linaker, Sardar\u00a0Muhammad Sulaman, Martin H\u00f6st, and Rafael\u00a0Maiani de Mello. 2015. Guidelines for conducting surveys in software engineering v. 1.1. Lund University (2015)."},{"key":"e_1_3_2_1_44_1","unstructured":"Werney Ayala\u00a0Luz Lira. 2016. Um m\u00e9todo para infer\u00eancia da familiaridade de c\u00f3digo em projetos de software. Master\u2019s thesis. Universidade Federal do Piau\u00ed Teresina.  Werney Ayala\u00a0Luz Lira. 2016. Um m\u00e9todo para infer\u00eancia da familiaridade de c\u00f3digo em projetos de software. Master\u2019s thesis. Universidade Federal do Piau\u00ed Teresina."},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1145\/3133909"},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/358916.358994"},{"key":"e_1_3_2_1_47_1","first-page":"397","article-title":"The learning-curve sampling method applied to model-based clustering","author":"Meek Christopher","year":"2002","unstructured":"Christopher Meek , Bo Thiesson , and David Heckerman . 2002 . The learning-curve sampling method applied to model-based clustering . Journal of Machine Learning Research 2 , Feb (2002), 397 \u2013 418 . Christopher Meek, Bo Thiesson, and David Heckerman. 2002. The learning-curve sampling method applied to model-based clustering. Journal of Machine Learning Research 2, Feb (2002), 397\u2013418.","journal-title":"Journal of Machine Learning Research 2"},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/MSR.2007.27"},{"key":"e_1_3_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICSE.2002.1007994"},{"key":"e_1_3_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1109\/MSR.2019.00054"},{"key":"e_1_3_2_1_51_1","volume-title":"A guided tour to approximate string matching. ACM computing surveys (CSUR) 33, 1","author":"Navarro Gonzalo","year":"2001","unstructured":"Gonzalo Navarro . 2001. A guided tour to approximate string matching. ACM computing surveys (CSUR) 33, 1 ( 2001 ), 31\u201388. Gonzalo Navarro. 2001. A guided tour to approximate string matching. ACM computing surveys (CSUR) 33, 1 (2001), 31\u201388."},{"key":"e_1_3_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10664-018-9622-9"},{"key":"e_1_3_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1145\/3364641.3364648"},{"key":"e_1_3_2_1_54_1","volume-title":"Biostatistics primer: part 2. Nutrition in clinical practice 23, 1","author":"Overholser R","year":"2008","unstructured":"Brian\u00a0 R Overholser and Kevin\u00a0 M Sowinski . 2008. Biostatistics primer: part 2. Nutrition in clinical practice 23, 1 ( 2008 ), 76\u201384. Brian\u00a0R Overholser and Kevin\u00a0M Sowinski. 2008. Biostatistics primer: part 2. Nutrition in clinical practice 23, 1 (2008), 76\u201384."},{"key":"e_1_3_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1145\/2597073.2597113"},{"key":"e_1_3_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.4249\/scholarpedia.1883"},{"key":"e_1_3_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1145\/1985793.1985860"},{"key":"e_1_3_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1145\/2593882.2593893"},{"key":"e_1_3_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10664-020-09875-y"},{"key":"e_1_3_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1145\/2635868.2635922"},{"key":"e_1_3_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.1145\/3186411.3186418"},{"key":"e_1_3_2_1_62_1","doi-asserted-by":"crossref","unstructured":"Ali Sajedi-Badashian and Eleni Stroulia. 2020. Guidelines for evaluating bug-assignment research. Journal of Software: Evolution and Process(2020) e2250.  Ali Sajedi-Badashian and Eleni Stroulia. 2020. Guidelines for evaluating bug-assignment research. Journal of Software: Evolution and Process(2020) e2250.","DOI":"10.1002\/smr.2250"},{"key":"e_1_3_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1213\/ANE.0000000000002864"},{"key":"e_1_3_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.1145\/3345629.3345637"},{"key":"e_1_3_2_1_65_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.infsof.2020.106455"},{"key":"e_1_3_2_1_66_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jss.2017.09.021"},{"key":"e_1_3_2_1_67_1","doi-asserted-by":"publisher","DOI":"10.1145\/2593702.2593705"},{"key":"e_1_3_2_1_68_1","doi-asserted-by":"publisher","DOI":"10.1145\/2884781.2884852"},{"key":"e_1_3_2_1_69_1","unstructured":"Gary\u00a0M Weiss and Foster Provost. 2001. The effect of class distribution on classifier learning: an empirical study. (2001).  Gary\u00a0M Weiss and Foster Provost. 2001. The effect of class distribution on classifier learning: an empirical study. (2001)."},{"key":"e_1_3_2_1_71_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4615-4625-2"},{"key":"e_1_3_2_1_72_1","doi-asserted-by":"publisher","DOI":"10.1145\/2804360.2804366"},{"key":"e_1_3_2_1_73_1","volume-title":"Proceedings of the 20th international conference on machine learning (ICML-03)","author":"Yu Lei","year":"2003","unstructured":"Lei Yu and Huan Liu . 2003 . Feature selection for high-dimensional data: A fast correlation-based filter solution . In Proceedings of the 20th international conference on machine learning (ICML-03) . 856\u2013863. Lei Yu and Huan Liu. 2003. Feature selection for high-dimensional data: A fast correlation-based filter solution. In Proceedings of the 20th international conference on machine learning (ICML-03). 856\u2013863."},{"key":"e_1_3_2_1_74_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2018.08.191"}],"event":{"name":"ESEM '22: ACM \/ IEEE International Symposium on Empirical Software Engineering and Measurement","location":"Helsinki Finland","acronym":"ESEM '22","sponsor":["SIGSOFT ACM Special Interest Group on Software Engineering"]},"container-title":["Proceedings of the 16th ACM \/ IEEE International Symposium on Empirical Software Engineering and Measurement"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3544902.3546243","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3544902.3546243","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:00:01Z","timestamp":1750186801000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3544902.3546243"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,9,19]]},"references-count":72,"alternative-id":["10.1145\/3544902.3546243","10.1145\/3544902"],"URL":"https:\/\/doi.org\/10.1145\/3544902.3546243","relation":{},"subject":[],"published":{"date-parts":[[2022,9,19]]},"assertion":[{"value":"2022-09-19","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}