{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,2]],"date-time":"2026-03-02T03:58:45Z","timestamp":1772423925319,"version":"3.50.1"},"reference-count":48,"publisher":"Oxford University Press (OUP)","issue":"3","license":[{"start":{"date-parts":[[2020,8,11]],"date-time":"2020-08-11T00:00:00Z","timestamp":1597104000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100004794","name":"Centre National de la Recherche Scientifique","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100004794","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Federal University of Toulouse","award":["2015-230-CIF-D-DRD"],"award-info":[{"award-number":["2015-230-CIF-D-DRD"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,5,20]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>High-throughput data generated by new biotechnologies require specific and adapted statistical treatment in order to be efficiently used in biological studies. In this article, we propose a powerful framework to manage and analyse multi-omics heterogeneous data to carry out an integrative analysis. We have illustrated this using the mixOmics package for R software as it specifically addresses data integration issues. Our work also aims at applying the most recent functionalities of mixOmics to real datasets. Although multi-block integrative methodologies exist, we hope to encourage a more widespread use of such approaches in an operational framework by biologists. We have used natural populations of the model plant Arabidopsis thaliana in this work, but the framework proposed is not limited to this plant and can be deployed whatever the organisms of interest and the biological question may be. Four omics datasets (phenomics, metabolomics, cell wall proteomics and transcriptomics) were collected, analysed and integrated to study the cell wall plasticity of plants exposed to sub-optimal temperature growth conditions. The methodologies presented here start from basic univariate statistics leading to multi-block integration analysis. We have also highlighted the fact that each method, either unsupervised or supervised, is associated with one biological issue. Using this powerful framework enabled us to arrive at novel conclusions on the biological system, which would not have been possible using standard statistical approaches.<\/jats:p>","DOI":"10.1093\/bib\/bbaa166","type":"journal-article","created":{"date-parts":[[2020,7,3]],"date-time":"2020-07-03T15:35:03Z","timestamp":1593790503000},"source":"Crossref","is-referenced-by-count":16,"title":["A powerful framework for an integrative study with heterogeneous omics data: from univariate statistics to multi-block analysis"],"prefix":"10.1093","volume":"22","author":[{"given":"Harold","family":"Durufl\u00e9","sequence":"first","affiliation":[{"name":"INRAE"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Merwann","family":"Selmani","sequence":"additional","affiliation":[{"name":"Laboratoire de Recherche en Sciences V\u00e9g\u00e9tales and the Institut de Math\u00e9matiques de Toulouse"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Philippe","family":"Ranocha","sequence":"additional","affiliation":[{"name":"CNRS, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Elisabeth","family":"Jamet","sequence":"additional","affiliation":[{"name":"CNRS and studies plant cell walls"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Christophe","family":"Dunand","sequence":"additional","affiliation":[{"name":"Toulouse University III-Paul Sabatier"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"S\u00e9bastien","family":"D\u00e9jean","sequence":"additional","affiliation":[{"name":"Institut de Math\u00e9matiques, Toulouse University"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2020,8,11]]},"reference":[{"key":"2021052110394319500_ref1","first-page":"325","article-title":"A review on machine learning principles for multi-view biological data integration","volume":"19","author":"Li","year":"2016","journal-title":"Brief Bioinform"},{"key":"2021052110394319500_ref2","doi-asserted-by":"crossref","first-page":"764","DOI":"10.1111\/tpj.13485","article-title":"Integrated omics analysis of specialized metabolism in medicinal plants","volume":"90","author":"Rai","year":"2017","journal-title":"Plant J"},{"key":"2021052110394319500_ref3","first-page":"137","article-title":"Experimental design to make the most of microarray studies","volume":"224","author":"Kerr","year":"2003","journal-title":"Methods Mol Biol"},{"key":"2021052110394319500_ref4","doi-asserted-by":"crossref","first-page":"317","DOI":"10.1016\/B978-0-12-801309-0.00013-6","volume-title":"Breeding Oilseed Crops for Sustainable Production","author":"Zargar","year":"2016"},{"key":"2021052110394319500_ref5","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1016\/j.pbi.2015.12.010","article-title":"More effort - more results: recent advances in integrative \u2018omics\u2019 data analysis","volume":"30","author":"Rajasundaram","year":"2016","journal-title":"Curr Opin Plant Biol"},{"key":"2021052110394319500_ref6","doi-asserted-by":"crossref","first-page":"183","DOI":"10.1016\/j.plantsci.2017.07.015","article-title":"Cell wall modifications of two Arabidopsis thaliana ecotypes, Col and Sha, in response to sub-optimal growth conditions: an integrative study","volume":"263","author":"Durufl\u00e9","year":"2017","journal-title":"Plant Sci"},{"key":"2021052110394319500_ref7","doi-asserted-by":"crossref","first-page":"505","DOI":"10.1186\/1471-2164-10-505","article-title":"Cell wall biogenesis of Arabidopsis thaliana elongating cells: transcriptomics complements proteomics","volume":"10","author":"Jamet","year":"2009","journal-title":"BMC Genomics"},{"key":"2021052110394319500_ref8","doi-asserted-by":"crossref","first-page":"3966","DOI":"10.1016\/j.febslet.2009.10.036","article-title":"Correlation of mRNA and protein in complex biological samples","volume":"583","author":"Maier","year":"2009","journal-title":"FEBS Lett"},{"key":"2021052110394319500_ref9","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1002789","article-title":"Teaching the fundamentals of biological data integration using classroom games","volume":"8","author":"Schneider","year":"2012","journal-title":"PLoS Comput Biol"},{"key":"2021052110394319500_ref10","doi-asserted-by":"crossref","first-page":"3055","DOI":"10.1093\/bioinformatics\/bty1054","article-title":"DIABLO: an integrative approach for identifying key molecular drivers from multi-omic assays","volume":"35","author":"Singh","year":"2019","journal-title":"Bioinformatics"},{"key":"2021052110394319500_ref11","doi-asserted-by":"crossref","first-page":"569","DOI":"10.1093\/biostatistics\/kxu001","article-title":"Variable selection for generalized canonical correlation analysis","volume":"15","author":"Tenenhaus","year":"2014","journal-title":"Biostatistics"},{"key":"2021052110394319500_ref12","doi-asserted-by":"crossref","first-page":"1269","DOI":"10.1093\/bib\/bbx167","article-title":"Multi-omics integration-a comparison of unsupervised clustering methodologies","volume":"20","author":"Tini","year":"2018","journal-title":"Brief Bioinform"},{"key":"2021052110394319500_ref13","article-title":"Evaluation of integrative clustering methods for the analysis of multi-omics data","volume":"21","author":"Chauvel","year":"2019","journal-title":"Brief Bioinform"},{"key":"2021052110394319500_ref14","article-title":"Clustering and variable selection evaluation of 13 unsupervised methods for multi-omics data integration","author":"Pierre-Jean","year":"2019","journal-title":"Brief Bioinform"},{"key":"2021052110394319500_ref15","volume-title":"R: A Language and Environment for Statistical Computing","author":"R Core Team","year":"2018"},{"key":"2021052110394319500_ref16","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1005752","article-title":"mixOmics: an R package for \u2018omics feature selection and multiple data integration","volume":"13","author":"Rohart","year":"2017","journal-title":"PLoS Comput Biol"},{"key":"2021052110394319500_ref17","doi-asserted-by":"crossref","first-page":"19","DOI":"10.1186\/1756-0381-5-19","article-title":"Insightful graphical outputs to explore relationships between two \u2018omics\u2019 data sets","volume":"5","author":"Gonz\u00e1lez","year":"2013","journal-title":"BioData Min"},{"key":"2021052110394319500_ref18","doi-asserted-by":"crossref","DOI":"10.1016\/j.dib.2019.104318","article-title":"Phenotyping and cell wall polysaccharide composition dataset of five arabidopsis ecotypes grown at optimal or sub-optimal temperatures","volume":"25","author":"Durufl\u00e9","year":"2019","journal-title":"Data Brief"},{"key":"2021052110394319500_ref19","doi-asserted-by":"crossref","DOI":"10.1016\/j.dib.2019.104581","article-title":"Transcriptomic and cell wall proteomic datasets of rosettes and floral stems from five Arabidopsis thaliana ecotypes grown at optimal or sub-optimal temperature","volume":"27","author":"Durufl\u00e9","year":"2019","journal-title":"Data Brief"},{"key":"2021052110394319500_ref20","doi-asserted-by":"crossref","first-page":"462","DOI":"10.1038\/nclimate2958","article-title":"Observations of climate change among subsistence-oriented communities around the world","volume":"6","author":"Savo","year":"2016","journal-title":"Nat Clim Chang"},{"key":"2021052110394319500_ref21","doi-asserted-by":"crossref","first-page":"64","DOI":"10.1016\/j.ydbio.2016.07.023","article-title":"Plant developmental responses to climate change","volume":"419","author":"Gray","year":"2016","journal-title":"Dev Biol"},{"key":"2021052110394319500_ref22","doi-asserted-by":"crossref","first-page":"125","DOI":"10.1046\/j.1365-2699.2002.00647.x","article-title":"Biogeography of Arabidopsis thaliana (L.) Heynh. (Brassicaceae)","volume":"29","author":"Hoffmann","year":"2002","journal-title":"J Biogeogr"},{"key":"2021052110394319500_ref23","article-title":"Crop breeding: turning a lawn into a field","volume":"3","author":"Sibout","year":"2017","journal-title":"Nat Plants"},{"key":"2021052110394319500_ref24","doi-asserted-by":"crossref","first-page":"430","DOI":"10.3389\/fpls.2019.00430","article-title":"Phenotypic trait variation as a response to altitude-related constraints in Arabidopsis populations","volume":"10","author":"Durufl\u00e9","year":"2019","journal-title":"Front Plant Sci"},{"key":"2021052110394319500_ref25","doi-asserted-by":"crossref","first-page":"1105","DOI":"10.1016\/j.carres.2011.04.014","article-title":"Comparative study of the cell wall composition of broccoli, carrot, and tomato: structural characterization of the extractable pectins and hemicelluloses","volume":"346","author":"Houben","year":"2011","journal-title":"Carbohydr Res"},{"key":"2021052110394319500_ref26","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18637\/jss.v059.i10","article-title":"Tidy data","volume":"59","author":"Wickham","year":"2014","journal-title":"J Stat Softw"},{"key":"2021052110394319500_ref27","author":"Swalin","year":"2018"},{"key":"2021052110394319500_ref28","doi-asserted-by":"crossref","first-page":"402","DOI":"10.1186\/s12859-016-1273-5","article-title":"Handling missing rows in multi-omics data integration: multiple imputation in multiple factor analysis framework","volume":"17","author":"Voillet","year":"2016","journal-title":"BMC Bioinformatics"},{"key":"2021052110394319500_ref29","doi-asserted-by":"crossref","first-page":"R80","DOI":"10.1186\/gb-2004-5-10-r80","article-title":"Bioconductor: open software development for computational biology and bioinformatics","volume":"5","author":"Gentleman","year":"2004","journal-title":"Genome Biol"},{"key":"2021052110394319500_ref30","doi-asserted-by":"crossref","first-page":"2855","DOI":"10.1093\/bioinformatics\/btp515","article-title":"integrOmics: an R package to unravel relationships between two omics datasets","volume":"25","author":"L\u00ea Cao","year":"2009","journal-title":"Bioinformatics"},{"key":"2021052110394319500_ref31","author":"Husson","year":"2019"},{"key":"2021052110394319500_ref32","doi-asserted-by":"crossref","first-page":"3255","DOI":"10.1016\/j.csda.2007.09.023","article-title":"Multiple factor analysis and clustering of a mixture of quantitative, categorical and frequency data","volume":"52","author":"B\u00e9cue-Bertaut","year":"2008","journal-title":"Comput Stat Data Anal"},{"key":"2021052110394319500_ref33","first-page":"31","article-title":"Une nouvelle proposition, l\u2019analyse discriminante multitableaux: Statis-lda","volume":"154","author":"Sabatier","year":"2013","journal-title":"J SFdS"},{"key":"2021052110394319500_ref34","doi-asserted-by":"crossref","DOI":"10.15252\/msb.20178124","article-title":"Multi-Omics factor analysis\u2014a framework for unsupervised integration of multi-omics data sets","volume":"14","author":"Argelaguet","year":"2018","journal-title":"Mol Syst Biol"},{"key":"2021052110394319500_ref35","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1005871","article-title":"Ten simple rules for biologists learning to program","volume":"14","author":"Carey","year":"2018","journal-title":"PLoS Comput Biol"},{"key":"2021052110394319500_ref36","volume-title":"Handbook of Biological Statistics","author":"McDonald","year":"2009","edition":"3rd"},{"key":"2021052110394319500_ref37","doi-asserted-by":"crossref","first-page":"316","DOI":"10.1198\/000313002533","article-title":"Corrgrams: exploratory displays for correlation matrices","volume":"56","author":"Friendly","year":"2002","journal-title":"Am Stat"},{"key":"2021052110394319500_ref38","doi-asserted-by":"crossref","first-page":"178","DOI":"10.1080\/00031305.1996.10474371","article-title":"A graphical display of large correlation matrices","volume":"50","author":"Murdoch","year":"1996","journal-title":"Am Stat"},{"key":"2021052110394319500_ref39","first-page":"213","volume-title":"Multivariate Analysis (Probability and Mathematical Statistics)","author":"Mardia","year":"1979"},{"key":"2021052110394319500_ref40","doi-asserted-by":"crossref","DOI":"10.1186\/1471-2105-12-253","article-title":"Sparse PLS discriminant analysis: biologically relevant feature selection and graphical displays for multiclass problems","volume":"12","author":"L\u00ea Cao","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"2021052110394319500_ref41","doi-asserted-by":"crossref","first-page":"109","DOI":"10.1016\/S0169-7439(01)00155-1","article-title":"PLS-regression: a basic tool of chemometrics","volume":"58","author":"Wold","year":"2001","journal-title":"Chemom Intell Lab Syst"},{"key":"2021052110394319500_ref42","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1111\/j.2517-6161.1996.tb02080.x","article-title":"Regression shrinkage and selection via the lasso","volume":"58","author":"Tibshirani","year":"1996","journal-title":"J R Stat Soc Ser B"},{"key":"2021052110394319500_ref43","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-319-24277-4","volume-title":"ggplot2: Elegant Graphics for Data Analysis","author":"Wickham","year":"2016"},{"key":"2021052110394319500_ref44","author":"Wei","year":"2016"},{"key":"2021052110394319500_ref45","doi-asserted-by":"crossref","first-page":"12182","DOI":"10.1073\/pnas.220392197","article-title":"Discovering functional relationships between RNA expression and chemotherapeutic susceptibility using relevance networks","volume":"97","author":"Butte","year":"2000","journal-title":"Proc Natl Acad Sci U S A"},{"key":"2021052110394319500_ref46","first-page":"199","article-title":"Relevance network between chemosensitivity and transcriptome in human hepatoma cells","volume":"2","author":"Moriyama","year":"2003","journal-title":"Mol Cancer Ther"},{"key":"2021052110394319500_ref47","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-84882-969-5","volume-title":"Regression: Linear Models in Statistics","author":"Bingham","year":"2010"},{"key":"2021052110394319500_ref48","doi-asserted-by":"crossref","first-page":"163","DOI":"10.2174\/978160805092511101010163","volume-title":"Omics and Plant Abiotic Stress Tolerance","author":"Chawla","year":"2011"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/22\/3\/bbaa166\/37966089\/bbaa166.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/22\/3\/bbaa166\/37966089\/bbaa166.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,9]],"date-time":"2024-08-09T01:24:59Z","timestamp":1723166699000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbaa166\/5890507"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,8,11]]},"references-count":48,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2021,5,20]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbaa166","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/357921","asserted-by":"object"}]},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,5]]},"published":{"date-parts":[[2020,8,11]]},"article-number":"bbaa166"}}