{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,6]],"date-time":"2025-12-06T17:15:13Z","timestamp":1765041313354},"reference-count":22,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2022,11,15]],"date-time":"2022-11-15T00:00:00Z","timestamp":1668470400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,11,15]],"date-time":"2022-11-15T00:00:00Z","timestamp":1668470400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Umea University"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Appl Netw Sci"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Correlation networks derived from multivariate data appear in many applications across the sciences. These networks are usually dense and require sparsification to detect meaningful structure. However, current methods for sparsifying correlation networks struggle with balancing overfitting and underfitting. We propose a module-based cross-validation procedure to threshold these networks, making modular structure an integral part of the thresholding. We illustrate our approach using synthetic and real data and find that its ability to recover a planted partition has a step-like dependence on the number of data samples. The reward for sampling more varies non-linearly with the number of samples, with minimal gains after a critical point. A comparison with the well-established WGCNA method shows that our approach allows for revealing more modular structure in the data used here.<\/jats:p>","DOI":"10.1007\/s41109-022-00516-5","type":"journal-article","created":{"date-parts":[[2022,11,15]],"date-time":"2022-11-15T16:30:37Z","timestamp":1668529837000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["Cross-validation of correlation networks using modular structure"],"prefix":"10.1007","volume":"7","author":[{"given":"Magnus","family":"Neuman","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Viktor","family":"Jonsson","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Joaqu\u00edn","family":"Calatayud","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Martin","family":"Rosvall","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2022,11,15]]},"reference":[{"issue":"2","key":"516_CR1","doi-asserted-by":"publisher","first-page":"343","DOI":"10.1038\/ismej.2011.119","volume":"6","author":"A Barber\u00e1n","year":"2012","unstructured":"Barber\u00e1n A, Bates ST, Casamayor EO, Fierer N (2012) Using network analysis to explore co-occurrence patterns in soil microbial communities. ISME J 6(2):343\u2013351. https:\/\/doi.org\/10.1038\/ismej.2011.119","journal-title":"ISME J"},{"issue":"3","key":"516_CR2","doi-asserted-by":"publisher","first-page":"186","DOI":"10.1038\/nrn2575","volume":"10","author":"E Bullmore","year":"2009","unstructured":"Bullmore E, Sporns O (2009) Complex brain networks: graph theoretical analysis of structural and functional systems. Nat Rev Neurosci 10(3):186\u2013198. https:\/\/doi.org\/10.1038\/nrn2575","journal-title":"Nat Rev Neurosci"},{"key":"516_CR3","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevE.100.052308","volume":"100","author":"J Calatayud","year":"2019","unstructured":"Calatayud J, Bernardo-Madrid R, Neuman M, Rojas A, Rosvall M (2019) Exploring the solution landscape enables more reliable network community detection. Phys Rev E 100:052308. https:\/\/doi.org\/10.1103\/PhysRevE.100.052308","journal-title":"Phys Rev E"},{"issue":"1","key":"516_CR4","doi-asserted-by":"publisher","first-page":"40","DOI":"10.1038\/s41559-019-1053-5","volume":"4","author":"J Calatayud","year":"2020","unstructured":"Calatayud J, Andivia E, Escudero A, Meli\u00e1n CJ, Bernardo-Madrid R, Stoffel M, Aponte C, Medina NG, Molina-Venegas R, Arnan X et al (2020) Positive associations among rare species and their persistence in ecological assemblages. Nat. Ecol Evol 4(1):40\u201345","journal-title":"Nat. Ecol Evol"},{"key":"516_CR5","doi-asserted-by":"publisher","first-page":"68","DOI":"10.1016\/j.neuroimage.2019.02.039","volume":"194","author":"O Civier","year":"2019","unstructured":"Civier O, Smith RE, Yeh C-H, Connelly A, Calamante F (2019) Is removal of weak connections necessary for graph-theoretical analysis of dense weighted structural connectomes from diffusion mri? Neuroimage 194:68\u201381. https:\/\/doi.org\/10.1016\/j.neuroimage.2019.02.039","journal-title":"Neuroimage"},{"issue":"1","key":"516_CR6","doi-asserted-by":"publisher","first-page":"3033","DOI":"10.1038\/s41467-018-05516-7","volume":"9","author":"FT de Vries","year":"2018","unstructured":"de Vries FT, Griffiths RI, Bailey M, Craig H, Girlanda M, Gweon HS, Hallin S, Kaisermann A, Keith AM, Kretzschmar M, Lemanceau P, Lumini E, Mason KE, Oliver A, Ostle N, Prosser JI, Thion C, Thomson B, Bardgett RD (2018) Soil bacterial networks are less stable under drought than fungal networks. Nat Commun 9(1):3033. https:\/\/doi.org\/10.1038\/s41467-018-05516-7","journal-title":"Nat Commun"},{"key":"516_CR7","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevLett.107.065701","volume":"107","author":"A Decelle","year":"2011","unstructured":"Decelle A, Krzakala F, Moore C, Zdeborov\u00e1 L (2011) Inference and phase transitions in the detection of modules in sparse networks. Phys Rev Lett 107:065701. https:\/\/doi.org\/10.1103\/PhysRevLett.107.065701","journal-title":"Phys Rev Lett"},{"key":"516_CR8","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevE.93.012304","volume":"93","author":"N Dianati","year":"2016","unstructured":"Dianati N (2016) Unwinding the hairball graph: Pruning algorithms for weighted complex networks. Phys Rev E 93:012304. https:\/\/doi.org\/10.1103\/PhysRevE.93.012304","journal-title":"Phys Rev E"},{"key":"516_CR9","doi-asserted-by":"publisher","DOI":"10.3390\/a10040112","author":"D Edler","year":"2017","unstructured":"Edler D, Bohlin L, Rosvall M (2017) Mapping higher-order network flows in memory and multilayer networks with infomap. Algorithms. https:\/\/doi.org\/10.3390\/a10040112","journal-title":"Algorithms"},{"issue":"3","key":"516_CR10","doi-asserted-by":"publisher","first-page":"432","DOI":"10.1093\/biostatistics\/kxm045","volume":"9","author":"J Friedman","year":"2008","unstructured":"Friedman J, Hastie T, Tibshirani R (2008) Sparse inverse covariance estimation with the graphical lasso. Biostatistics 9(3):432\u2013441. https:\/\/doi.org\/10.1093\/biostatistics\/kxm045","journal-title":"Biostatistics"},{"issue":"7028","key":"516_CR11","doi-asserted-by":"publisher","first-page":"895","DOI":"10.1038\/nature03288","volume":"433","author":"R Guimera","year":"2005","unstructured":"Guimera R, Nunes Amaral LA (2005) Functional cartography of complex metabolic networks. Nature 433(7028):895\u2013900","journal-title":"Nature"},{"issue":"12","key":"516_CR12","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13059-014-0550-8","volume":"15","author":"MI Love","year":"2014","unstructured":"Love MI, Huber W, Anders S (2014) Moderated estimation of fold change and dispersion for rna-seq data with deseq2. Genome Biol 15(12):1\u201321","journal-title":"Genome Biol"},{"issue":"8","key":"516_CR13","doi-asserted-by":"publisher","first-page":"796","DOI":"10.1038\/nmeth.2016","volume":"9","author":"D Marbach","year":"2012","unstructured":"Marbach D, Costello JC, K\u00fcffner R, Vega NM, Prill RJ, Camacho DM, Allison KR, Aderhold A, Bonneau R, Chen Y, Collins JJ, Cordero F, Crane M, Dondelinger F, Drton M, Esposito R, Foygel R, de la Fuente A, Gertheiss J, Geurts P, Greenfield A, Grzegorczyk M, Haury A-C, Holmes B, Hothorn T, Husmeier D, Huynh-Thu VA, Irrthum A, Kellis M, Karlebach G, L\u00e8bre S, De Leo V, Madar A, Mani S, Mordelet F, Ostrer H, Ouyang Z, Pandya R, Petri T, Pinna A, Poultney CS, Rezny S, Ruskin HJ, Saeys Y, Shamir R, S\u00eerbu A, Song M, Soranzo N, Statnikov A, Stolovitzky G, Vega N, Vera-Licona P, Vert J-P, Visconti A, Wang H, Wehenkel L, Windhager L, Zhang Y, Zimmer R, Consortium TD (2012) Wisdom of crowds for robust gene network inference. Nat Methods 9(8):796\u2013804. https:\/\/doi.org\/10.1038\/nmeth.2016","journal-title":"Nat Methods"},{"issue":"3","key":"516_CR14","doi-asserted-by":"publisher","first-page":"1436","DOI":"10.1214\/009053606000000281","volume":"34","author":"N Meinshausen","year":"2006","unstructured":"Meinshausen N, B\u00fchlmann P (2006) High-dimensional graphs and variable selection with the Lasso. Ann Stat 34(3):1436\u20131462. https:\/\/doi.org\/10.1214\/009053606000000281","journal-title":"Ann Stat"},{"issue":"4","key":"516_CR15","doi-asserted-by":"publisher","first-page":"417","DOI":"10.1038\/nmeth.4197","volume":"14","author":"R Patro","year":"2017","unstructured":"Patro R, Duggal G, Love MI, Irizarry RA, Kingsford C (2017) Salmon provides fast and bias-aware quantification of transcript expression. Nat Methods 14(4):417\u2013419","journal-title":"Nat Methods"},{"issue":"4","key":"516_CR16","doi-asserted-by":"publisher","first-page":"1118","DOI":"10.1073\/pnas.0706851105","volume":"105","author":"M Rosvall","year":"2008","unstructured":"Rosvall M, Bergstrom CT (2008) Maps of random walks on complex networks reveal community structure. Proc Natl Acad Sci 105(4):1118\u20131123. https:\/\/doi.org\/10.1073\/pnas.0706851105 (www.pnas.org\/content\/105\/4\/1118.full.pdf)","journal-title":"Proc Natl Acad Sci"},{"issue":"1","key":"516_CR17","doi-asserted-by":"publisher","first-page":"13","DOI":"10.1140\/epjst\/e2010-01179-1","volume":"178","author":"M Rosvall","year":"2009","unstructured":"Rosvall M, Axelsson D, Bergstrom CT (2009) The map equation. Eur Phys J Spec Top 178(1):13\u201323. https:\/\/doi.org\/10.1140\/epjst\/e2010-01179-1","journal-title":"Eur Phys J Spec Top"},{"issue":"16","key":"516_CR18","doi-asserted-by":"publisher","first-page":"6483","DOI":"10.1073\/pnas.0808904106","volume":"106","author":"M\u00c1 Serrano","year":"2009","unstructured":"Serrano M\u00c1, Bogu\u00f1\u00e1 M, Vespignani A (2009) Extracting the multiscale backbone of complex weighted networks. Proc Natl Acad Sci 106(16):6483\u20136488. https:\/\/doi.org\/10.1073\/pnas.0808904106","journal-title":"Proc Natl Acad Sci"},{"key":"516_CR19","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevE.102.012302","volume":"102","author":"J Smiljani\u0107","year":"2020","unstructured":"Smiljani\u0107 J, Edler D, Rosvall M (2020) Mapping flows on sparse networks with missing links. Phys Rev E 102:012302. https:\/\/doi.org\/10.1103\/PhysRevE.102.012302","journal-title":"Phys Rev E"},{"issue":"30","key":"516_CR20","doi-asserted-by":"publisher","first-page":"10421","DOI":"10.1073\/pnas.0500298102","volume":"102","author":"M Tumminello","year":"2005","unstructured":"Tumminello M, Aste T, Matteo TD, Mantegna RN (2005) A tool for filtering information in complex systems. Proc Natl Acad Sci 102(30):10421\u201310426. https:\/\/doi.org\/10.1073\/pnas.0500298102","journal-title":"Proc Natl Acad Sci"},{"key":"516_CR21","doi-asserted-by":"publisher","first-page":"53","DOI":"10.1016\/j.jtbi.2014.03.040","volume":"362","author":"YXR Wang","year":"2014","unstructured":"Wang YXR, Huang H (2014) Review on statistical methods for gene network reconstruction using expression data. J Theor Biol 362:53\u201361. https:\/\/doi.org\/10.1016\/j.jtbi.2014.03.040","journal-title":"J Theor Biol"},{"key":"516_CR22","doi-asserted-by":"publisher","first-page":"17","DOI":"10.2202\/1544-6115.1128","volume":"4","author":"B Zhang","year":"2005","unstructured":"Zhang B, Horvath S (2005) A general framework for weighted gene co-expression network analysis. Stat Appl Genet Mol Biol 4:17. https:\/\/doi.org\/10.2202\/1544-6115.1128","journal-title":"Stat Appl Genet Mol Biol"}],"container-title":["Applied Network Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s41109-022-00516-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s41109-022-00516-5\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s41109-022-00516-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,11,16]],"date-time":"2022-11-16T06:35:55Z","timestamp":1668580555000},"score":1,"resource":{"primary":{"URL":"https:\/\/appliednetsci.springeropen.com\/articles\/10.1007\/s41109-022-00516-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,11,15]]},"references-count":22,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2022,12]]}},"alternative-id":["516"],"URL":"https:\/\/doi.org\/10.1007\/s41109-022-00516-5","relation":{},"ISSN":["2364-8228"],"issn-type":[{"value":"2364-8228","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,11,15]]},"assertion":[{"value":"15 September 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 November 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"15 November 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"75"}}