{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,3]],"date-time":"2026-03-03T17:14:30Z","timestamp":1772558070051,"version":"3.50.1"},"reference-count":26,"publisher":"MDPI AG","issue":"3","license":[{"start":{"date-parts":[[2026,3,3]],"date-time":"2026-03-03T00:00:00Z","timestamp":1772496000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100010801","name":"Xunta de Galicia","doi-asserted-by":"publisher","award":["GPC-ED431B 2024\/26"],"award-info":[{"award-number":["GPC-ED431B 2024\/26"]}],"id":[{"id":"10.13039\/501100010801","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Information"],"abstract":"<jats:p>Scientific collaboration is increasingly needed to address complex research challenges, yet identifying promising partners in the absence of prior co-authorship remains difficult. We present a decision-support pipeline for discovering researchers who have not previously worked together and whose collaboration is unlikely to emerge without deliberate intervention or institutional incentives. The approach leverages document-level semantic representations to estimate proximity between publications, aggregates these similarities at the author level, and surfaces collaboration opportunities that are not evident from the co-authorship graph. To support interpretation by decision makers, a separate LLM module proposes potential joint research directions, which are subsequently annotated with multi-label fields of study. We evaluate the pipeline through an institutional case study, analyzing 7531 publications from 2009 to 2024 using retrospective, temporally shifted windows. While only a small fraction of suggested pairs materialized spontaneously in subsequent periods, the collaborations that do emerge exhibit strong semantic alignment with the computed recommendations (high cosine similarity) and substantial thematic overlap. These results indicate that semantic proximity can act as an early indicator of latent complementarity between researchers without prior ties, supporting intentional institutional mediation and complementing topology-driven approaches that predict links under passive evolution.<\/jats:p>","DOI":"10.3390\/info17030254","type":"journal-article","created":{"date-parts":[[2026,3,3]],"date-time":"2026-03-03T15:22:21Z","timestamp":1772551341000},"page":"254","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Unlikely Pairs: A Decision-Support Recommendation Pipeline for Discovering Semantically Plausible Research Collaborations"],"prefix":"10.3390","volume":"17","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6027-9854","authenticated-orcid":false,"given":"Jorge","family":"Gal\u00e1n-Mena","sequence":"first","affiliation":[{"name":"AtlanTTic Research Center for Telecommunication Technologies, University of Vigo, 36310 Vigo, Spain"},{"name":"Economics and Business Management Faculty, Pontificia Universidad Cat\u00f3lica del Ecuador, Quito 170143, Ecuador"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4802-607X","authenticated-orcid":false,"given":"Mart\u00edn","family":"L\u00f3pez-Nores","sequence":"additional","affiliation":[{"name":"AtlanTTic Research Center for Telecommunication Technologies, University of Vigo, 36310 Vigo, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Daniel","family":"Pulla-S\u00e1nchez","sequence":"additional","affiliation":[{"name":"GI-IATa, UNESCO Chair on Support Technologies for Educational Inclusion, Universidad Polit\u00e9cnica Salesiana, Cuenca 010105, Ecuador"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3327-2347","authenticated-orcid":false,"given":"Luis Fernando","family":"Guerrero-V\u00e1squez","sequence":"additional","affiliation":[{"name":"GI-IATa, UNESCO Chair on Support Technologies for Educational Inclusion, Universidad Polit\u00e9cnica Salesiana, Cuenca 010105, Ecuador"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3687-3220","authenticated-orcid":false,"given":"Juan Pablo","family":"Salgado-Guerrero","sequence":"additional","affiliation":[{"name":"Economics and Business Management Faculty, Pontificia Universidad Cat\u00f3lica del Ecuador, Quito 170143, Ecuador"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2026,3,3]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"1079","DOI":"10.1002\/asi.20371","article-title":"Learning and knowledge networks in interdisciplinary collaborations","volume":"57","author":"Haythornthwaite","year":"2006","journal-title":"J. Am. Soc. Inf. Sci. Technol."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Pedersen, D.B. (2015). Collaborative Knowledge: The future of the academy in the knowledge-based economy. On the Facilitation of the Academy, Brill.","DOI":"10.1007\/978-94-6209-974-6_5"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"1019","DOI":"10.1002\/asi.20591","article-title":"The Link-Prediction Problem for Social Networks","volume":"58","author":"Kleinberg","year":"2007","journal-title":"J. Am. Soc. Inf. Sci. Technol."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"1150","DOI":"10.1016\/j.physa.2010.11.027","article-title":"Link prediction in complex networks: A survey","volume":"390","author":"Zhou","year":"2011","journal-title":"Phys. A Stat. Mech. Its Appl."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"575","DOI":"10.1007\/s11192-006-0170-5","article-title":"Collaboration uncovered: Exploring the adequacy of measuring university-industry collaboration through co-authorship and funding","volume":"69","author":"Lundberg","year":"2006","journal-title":"Scientometrics"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"2524491","DOI":"10.1155\/2022\/2524491","article-title":"Analyzing interdisciplinary research using Co-authorship networks","volume":"2022","author":"Ullah","year":"2022","journal-title":"Complexity"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"173","DOI":"10.1007\/s11192-009-0104-0","article-title":"Assessing public\u2013private research collaboration: Is it possible to compare university performance?","volume":"84","author":"Abramo","year":"2010","journal-title":"Scientometrics"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1093\/biohorizons\/hzp012","article-title":"Measuring interdisciplinary research: Analysis of co-authorship for research staff at the University of York","volume":"2","author":"Bellanca","year":"2009","journal-title":"Biosci. Horizons"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1016\/j.procs.2017.03.005","article-title":"Capturing the collaboration intensity of research institutions using social network analysis","volume":"106","author":"Schlattmann","year":"2017","journal-title":"Procedia Comput. Sci."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"54","DOI":"10.1016\/j.patrec.2021.01.007","article-title":"Analyzing and visualizing scientific research collaboration network with core node evaluation and community detection based on network embedding","volume":"144","author":"Zhao","year":"2021","journal-title":"Pattern Recognit. Lett."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"591","DOI":"10.1016\/j.future.2017.12.038","article-title":"Modeling of cross-disciplinary collaboration for potential field discovery and recommendation based on scholarly big data","volume":"87","author":"Liang","year":"2018","journal-title":"Future Gener. Comput. Syst."},{"key":"ref_12","first-page":"79","article-title":"Analysis on cross-regional scientific research collaboration model","volume":"45","author":"Ye","year":"2019","journal-title":"J. Libr. Sci. China"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Hoang, D.T., Tran, V.C., Nguyen, T.T., Nguyen, N.T., and Hwang, D. (2017). A consensus-based method to enhance a recommendation system for research collaboration. Proceedings of the Asian Conference on Intelligent Information and Database Systems, Springer.","DOI":"10.1007\/978-3-319-54472-4_17"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Nguyen, T.T., Nguyen, N.T., Hoang, D.T., and Tran, V.C. (2020). Predicting Research Collaboration Trends Based on the Similarity of Publications and Relationship of Scientists. Proceedings of the Asian Conference on Intelligent Information and Database Systems, Springer.","DOI":"10.1007\/978-3-030-41964-6_2"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Ye, G., Wei, J., Tan, Q., Wu, C., Song, X., and Li, S. (2024). Academic collaboration recommendation based on graph neural network and multi-attribute embedding. J. Inf. Sci.","DOI":"10.1177\/01655515241287635"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Zhao, M., Zhang, X., Qin, H., Ma, X., Sun, H., and Sang, Y. (2023). Partner Recommendation Based on Scholar Embedding Method. Proceedings of the 2023 7th International Conference on Communication and Information Systems (ICCIS), IEEE.","DOI":"10.1109\/ICCIS59958.2023.10453669"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Guerra, J., Quan, W., Li, K., Ahumada, L., Winston, F., and Desai, B. (2018). Scosy: A biomedical collaboration recommendation system. Proceedings of the 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), IEEE.","DOI":"10.1109\/EMBC.2018.8513268"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"116209","DOI":"10.1016\/j.eswa.2021.116209","article-title":"Semantic and explainable research-related recommendation system based on semi-supervised methodology using BERT and LDA models","volume":"190","author":"Yang","year":"2022","journal-title":"Expert Syst. Appl."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Wu, M., Zhang, Y., Lu, J., Lin, H., and Grosser, M. (2020). Recommending scientific collaborators: Bibliometric networks for medical research entities. Proceedings of the Developments of Artificial Intelligence Technologies in Computation and Robotics: Proceedings of the 14th International FLINS Conference (FLINS 2020), World Scientific.","DOI":"10.1142\/9789811223334_0058"},{"key":"ref_20","unstructured":"Priem, J., Piwowar, H., and Orr, R. (2022). OpenAlex: A fully-open index of scholarly works, authors, venues, institutions, and concepts. arXiv."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Singh, A., D\u2019Arcy, M., Cohan, A., Downey, D., and Feldman, S. (2022). SciRepEval: A Multi-Format Benchmark for Scientific Document Representations. Proceedings of the Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics.","DOI":"10.18653\/v1\/2023.emnlp-main.338"},{"key":"ref_22","unstructured":"Kinney, R., Anastasiades, C., Authur, R., Beltagy, I., Bragg, J., Buraczynski, A., Cachola, I., Candra, S., Chandrasekhar, Y., and Cohan, A. (2025). The Semantic Scholar Open Data Platform. arXiv."},{"key":"ref_23","unstructured":"European Commission (2026, February 26). ERA Country Report 2023: Spain. European Research Area Platform. Available online: https:\/\/european-research-area.ec.europa.eu\/country-report-spain."},{"key":"ref_24","unstructured":"European Commission (2026, February 26). ERA Country Report 2024: Spain. European Research Area Platform. Available online: https:\/\/european-research-area.ec.europa.eu\/documents\/country-report-spain."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"7731","DOI":"10.1038\/s41598-020-64351-3","article-title":"Mixing Patterns in Interdisciplinary Co-Authorship Networks at Multiple Scales","volume":"10","author":"Feng","year":"2020","journal-title":"Sci. Rep."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"18585","DOI":"10.1038\/s41598-022-21821-0","article-title":"The Inverted U-Shaped Relationship between Knowledge Diversity of Researchers and Societal Impact","volume":"12","author":"Wang","year":"2022","journal-title":"Sci. Rep."}],"container-title":["Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2078-2489\/17\/3\/254\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,3,3]],"date-time":"2026-03-03T15:49:22Z","timestamp":1772552962000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2078-2489\/17\/3\/254"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,3,3]]},"references-count":26,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2026,3]]}},"alternative-id":["info17030254"],"URL":"https:\/\/doi.org\/10.3390\/info17030254","relation":{},"ISSN":["2078-2489"],"issn-type":[{"value":"2078-2489","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,3,3]]}}}