{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,23]],"date-time":"2025-10-23T11:12:48Z","timestamp":1761217968855,"version":"3.37.3"},"reference-count":45,"publisher":"Oxford University Press (OUP)","issue":"23","license":[{"start":{"date-parts":[[2017,7,14]],"date-time":"2017-07-14T00:00:00Z","timestamp":1499990400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/about_us\/legal\/notices"}],"funder":[{"DOI":"10.13039\/501100001665","name":"Agence Nationale de la Recherche","doi-asserted-by":"publisher","award":["ANR-11-INBS-0010"],"award-info":[{"award-number":["ANR-11-INBS-0010"]}],"id":[{"id":"10.13039\/501100001665","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2017,12,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Flow Injection Analysis coupled to High-Resolution Mass Spectrometry (FIA-HRMS) is a promising approach for high-throughput metabolomics. FIA-HRMS data, however, cannot be preprocessed with current software tools which rely on liquid chromatography separation, or handle low resolution data only.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We thus developed the proFIA package, which implements a suite of innovative algorithms to preprocess FIA-HRMS raw files, and generates the table of peak intensities. The workflow consists of 3 steps: (i) noise estimation, peak detection and quantification, (ii) peak grouping across samples and (iii) missing value imputation. In addition, we have implemented a new indicator to quantify the potential alteration of the feature peak shape due to matrix effect. The preprocessing is fast (less than 15\u2009s per file), and the value of the main parameters (ppm and dmz) can be easily inferred from the mass resolution of the instrument. Application to two metabolomics datasets (including spiked serum samples) showed high precision (96%) and recall (98%) compared with manual integration. These results demonstrate that proFIA achieves very efficient and robust detection and quantification of FIA-HRMS data, and opens new opportunities for high-throughput phenotyping.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The proFIA software (as well as the plasFIA dataset) is available as an R package on the Bioconductor repository (http:\/\/bioconductor.org\/packages\/proFIA), and as a Galaxy module on the Main Toolshed (https:\/\/toolshed.g2.bx.psu.edu), and on the Workflow4Metabolomics online infrastructure (http:\/\/workflow4metabolomics.org).<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btx458","type":"journal-article","created":{"date-parts":[[2017,7,12]],"date-time":"2017-07-12T11:20:01Z","timestamp":1499858401000},"page":"3767-3775","source":"Crossref","is-referenced-by-count":16,"title":["<i>proFIA<\/i>: a data preprocessing workflow for flow injection analysis coupled to high-resolution mass spectrometry"],"prefix":"10.1093","volume":"33","author":[{"given":"Alexis","family":"Delabri\u00e8re","sequence":"first","affiliation":[{"name":"CEA, LIST, Laboratory for Data Analysis and Systems' Intelligence, MetaboHUB, Gif-Sur-Yvette, France"}]},{"given":"Ulli M","family":"Hohenester","sequence":"additional","affiliation":[{"name":"CEA, DRF\/JOLIOT\/SPI, Laboratoire d'Etude du M\u00e9tabolisme des M\u00e9dicaments, MetaboHUB, Gif-Sur-Yvette, France"}]},{"given":"Benoit","family":"Colsch","sequence":"additional","affiliation":[{"name":"CEA, DRF\/JOLIOT\/SPI, Laboratoire d'Etude du M\u00e9tabolisme des M\u00e9dicaments, MetaboHUB, Gif-Sur-Yvette, France"}]},{"given":"Christophe","family":"Junot","sequence":"additional","affiliation":[{"name":"CEA, DRF\/JOLIOT\/SPI, Laboratoire d'Etude du M\u00e9tabolisme des M\u00e9dicaments, MetaboHUB, Gif-Sur-Yvette, France"}]},{"given":"Fran\u00e7ois","family":"Fenaille","sequence":"additional","affiliation":[{"name":"CEA, DRF\/JOLIOT\/SPI, Laboratoire d'Etude du M\u00e9tabolisme des M\u00e9dicaments, MetaboHUB, Gif-Sur-Yvette, France"}]},{"given":"Etienne A","family":"Th\u00e9venot","sequence":"additional","affiliation":[{"name":"CEA, LIST, Laboratory for Data Analysis and Systems' Intelligence, MetaboHUB, Gif-Sur-Yvette, France"}]}],"member":"286","published-online":{"date-parts":[[2017,7,14]]},"reference":[{"key":"2023020207010844000_btx458-B1","doi-asserted-by":"crossref","first-page":"692","DOI":"10.1038\/nbt823","article-title":"High-throughput classification of yeast mutants for functional genomics using metabolic footprinting","volume":"21","author":"Allen","year":"2003","journal-title":"Nat. Biotechnol"},{"key":"2023020207010844000_btx458-B2","doi-asserted-by":"crossref","first-page":"731","DOI":"10.1002\/pmic.200700694","article-title":"Comparative LC-MS: a landscape of peaks and valleys","volume":"8","author":"America","year":"2008","journal-title":"Proteomics"},{"key":"2023020207010844000_btx458-B3","doi-asserted-by":"crossref","first-page":"159","DOI":"10.1016\/B978-0-12-397922-3.00004-6","volume-title":"Metabolomics Coming of Age with Its Technological Diversity","author":"Boudah","year":"2013","edition":"1st edn."},{"key":"2023020207010844000_btx458-B4","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1016\/j.chemolab.2011.03.010","article-title":"Algorithms and tools for the preprocessing of LC-MS metabolomics data","volume":"108","author":"Castillo","year":"2011","journal-title":"Chemometr. Intell. Lab"},{"key":"2023020207010844000_btx458-B5","doi-asserted-by":"crossref","first-page":"918","DOI":"10.1038\/nbt.2377","article-title":"A cross-platform toolkit for mass spectrometry and proteomics","volume":"30","author":"Chambers","year":"2012","journal-title":"Nat. Biotechnol"},{"key":"2023020207010844000_btx458-B6","doi-asserted-by":"crossref","first-page":"51","DOI":"10.1002\/mas.20108","article-title":"Mass spectrometry-based metabolomics","volume":"26","author":"Dettmer","year":"2007","journal-title":"Mass Spectrom. Rev"},{"key":"2023020207010844000_btx458-B7","doi-asserted-by":"crossref","first-page":"93","DOI":"10.1007\/s11306-016-1030-9","article-title":"Non-targeted UHPLC-MS metabolomic data processing methods: a comparative investigation of normalisation, missing value imputation, transformation and scaling","volume":"12","author":"Di Guida","year":"2016","journal-title":"Metabolomics"},{"key":"2023020207010844000_btx458-B8","doi-asserted-by":"crossref","first-page":"4","DOI":"10.1007\/s11306-012-0449-x","article-title":"Flow infusion electrospray ionisation mass spectrometry for high throughput, non-targeted metabolite fingerprinting: a review","volume":"9","author":"Draper","year":"2013","journal-title":"Metabolomics"},{"year":"2016","author":"Elzhov","key":"2023020207010844000_btx458-B9"},{"key":"2023020207010844000_btx458-B10","doi-asserted-by":"crossref","first-page":"4885","DOI":"10.1021\/ac970095w","article-title":"A predictive model for matrix and analyte effects in electrospray ionization of singly-charged ionic analytes","volume":"69","author":"Enke","year":"1997","journal-title":"Anal. Chem"},{"key":"2023020207010844000_btx458-B11","doi-asserted-by":"crossref","first-page":"446","DOI":"10.1038\/nprot.2007.511","article-title":"Preprocessing, classification modeling and feature selection using flow injection electrospray mass spectrometry metabolite fingerprint data","volume":"3","author":"Enot","year":"2008","journal-title":"Nat. Protoc"},{"key":"2023020207010844000_btx458-B12","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1023\/A:1013713905833","article-title":"Metabolomics \u2013 the link between genotypes and phenotypes","volume":"48","author":"Fiehn","year":"2002","journal-title":"Plant Mol. Biol"},{"key":"2023020207010844000_btx458-B13","doi-asserted-by":"crossref","first-page":"7074","DOI":"10.1021\/ac201267k","article-title":"High-throughput, accurate mass metabolome profiling of cellular extracts by flow injection-time-of-flight mass spectrometry","volume":"83","author":"Fuhrer","year":"2011","journal-title":"Anal. Chem"},{"key":"2023020207010844000_btx458-B14","doi-asserted-by":"crossref","first-page":"R80.","DOI":"10.1186\/gb-2004-5-10-r80","article-title":"Bioconductor: open software development for computational biology and bioinformatics","volume":"5","author":"Gentleman","year":"2004","journal-title":"Genome Biol"},{"key":"2023020207010844000_btx458-B15","doi-asserted-by":"crossref","first-page":"1493","DOI":"10.1093\/bioinformatics\/btu813","article-title":"Workflow4Metabolomics: a collaborative research infrastructure for computational metabolomics","volume":"31","author":"Giacomoni","year":"2015","journal-title":"Bioinformatics"},{"key":"2023020207010844000_btx458-B16","doi-asserted-by":"crossref","first-page":"433","DOI":"10.3390\/metabo4020433","article-title":"Influence of missing values substitutes on multivariate analysis of metabolomics data","volume":"4","author":"Gromski","year":"2014","journal-title":"Metabolites"},{"key":"2023020207010844000_btx458-B17","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1007\/s11306-017-1179-x","article-title":"An innovative chemometric method for processing direct introduction high resolution mass spectrometry metabolomic data: independent component-discriminant analysis (IC-DA)","volume":"13","author":"Habchi","year":"2017","journal-title":"Metabolomics"},{"key":"2023020207010844000_btx458-B18","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1016\/S0166-526X(08)00601-6","volume-title":"Advances in Flow Injection Analysis and Related Techniques","author":"Hansen","year":"2008","edition":"1st edn."},{"key":"2023020207010844000_btx458-B19","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1007\/s11306-011-0366-4","article-title":"Missing values in mass spectrometry based metabolomics: an undervalued step in the data processing pipeline","volume":"8","author":"Hrydziuszko","year":"2012","journal-title":"Metabolomics"},{"key":"2023020207010844000_btx458-B20","doi-asserted-by":"crossref","first-page":"634","DOI":"10.1093\/bioinformatics\/btk039","article-title":"MZmine: toolbox for processing and visualization of mass spectrometry based molecular profile data","volume":"22","author":"Katajamaa","year":"2006","journal-title":"Bioinformatics"},{"key":"2023020207010844000_btx458-B21","doi-asserted-by":"crossref","first-page":"36","DOI":"10.1016\/0003-2670(94)00574-6","article-title":"Mathematical modelling of flow-injection systems","volume":"308","author":"Kolev","year":"1995","journal-title":"Anal. Chim. Acta"},{"key":"2023020207010844000_btx458-B22","volume-title":"IUPAC. Compendium of Chemical Terminology","author":"McNaught","year":"1997","edition":"2nd edn."},{"key":"2023020207010844000_btx458-B23","doi-asserted-by":"crossref","first-page":"3291","DOI":"10.1021\/ac7024915","article-title":"Direct introduction of biological samples into a LTQ-orbitrap hybrid mass spectrometer as a tool for fast metabolome analysis","volume":"80","author":"Madalinski","year":"2008","journal-title":"Anal. Chem"},{"key":"2023020207010844000_btx458-B24","first-page":"105","volume-title":"Proceedings of the 1977 Dundee Conference on Numerical Analysis","author":"Mor\u00e9","year":"1978"},{"year":"2017","author":"Morgan","key":"2023020207010844000_btx458-B25"},{"key":"2023020207010844000_btx458-B26","doi-asserted-by":"crossref","first-page":"11866","DOI":"10.1021\/ac402567w","article-title":"Quantitative mass spectrometry independence from matrix effects and detector saturation achieved by flow injection analysis with real-time infinite dilution","volume":"85","author":"Nanita","year":"2013","journal-title":"Anal. Chem"},{"key":"2023020207010844000_btx458-B27","doi-asserted-by":"crossref","first-page":"1054","DOI":"10.1038\/4551054a","article-title":"Systems biology: Metabonomics","volume":"455","author":"Nicholson","year":"2008","journal-title":"Nature"},{"year":"2017","author":"Core Team","key":"2023020207010844000_btx458-B28"},{"key":"2023020207010844000_btx458-B29","doi-asserted-by":"crossref","first-page":"6429","DOI":"10.1021\/ac300829f","article-title":"Annotation of the human adult urinary metabolome and metabolite identification using ultra high performance liquid chromatography coupled to a linear quadrupole ion trap-orbitrap mass spectrometer","volume":"84","author":"Roux","year":"2012","journal-title":"Anal. Chem"},{"key":"2023020207010844000_btx458-B30","doi-asserted-by":"crossref","first-page":"475","DOI":"10.1039\/AN9901500475","article-title":"Flow injection analysis and chromatography: twins or siblings?","volume":"115","author":"Ruzicka","year":"1990","journal-title":"Analyst"},{"key":"2023020207010844000_btx458-B31","doi-asserted-by":"crossref","first-page":"187","DOI":"10.1038\/nmeth.4103","article-title":"Nontargeted in vitro metabolomics for high-throughput identification of novel enzymes in Escherichia coli","volume":"14","author":"Sevin","year":"2017","journal-title":"Nat. Methods"},{"key":"2023020207010844000_btx458-B32","doi-asserted-by":"crossref","first-page":"114","DOI":"10.1186\/s12859-017-1547-6","article-title":"Distribution based nearest neighbors imputation for truncated high dimensional data with applications to pre-clinical and clinical metabolomics studies","volume":"18","author":"Shah","year":"2017","journal-title":"BMC Bioinformatics"},{"key":"2023020207010844000_btx458-B33","doi-asserted-by":"crossref","first-page":"779","DOI":"10.1021\/ac051437y","article-title":"XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification","volume":"78","author":"Smith","year":"2006","journal-title":"Anal. Chem"},{"key":"2023020207010844000_btx458-B34","doi-asserted-by":"crossref","first-page":"112","DOI":"10.1093\/bioinformatics\/btr597","article-title":"MissForest \u2013 non-parametric missing value imputation for mixed-type data","volume":"28","author":"Stekhoven","year":"2012","journal-title":"Bioinformatics"},{"key":"2023020207010844000_btx458-B35","doi-asserted-by":"crossref","first-page":"163.","DOI":"10.1186\/1471-2105-9-163","article-title":"OpenMS \u2013 an open-source software framework for mass spectrometry","volume":"9","author":"Sturm","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023020207010844000_btx458-B36","doi-asserted-by":"crossref","first-page":"504.","DOI":"10.1186\/1471-2105-9-504","article-title":"Highly sensitive feature detection for high resolution LC\/MS","volume":"9","author":"Tautenhahn","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023020207010844000_btx458-B37","doi-asserted-by":"crossref","first-page":"328","DOI":"10.1016\/j.clinbiochem.2004.11.007","article-title":"Matrix effects: the achilles heel of quantitative high-performance liquid chromatography \u2018electrospray\u2019 tandem mass spectrometry","volume":"38","author":"Taylor","year":"2005","journal-title":"Clin. Biochem"},{"key":"2023020207010844000_btx458-B38","doi-asserted-by":"crossref","first-page":"3322","DOI":"10.1021\/acs.jproteome.5b00354","article-title":"Analysis of the human adult urinary metabolome variations with age, body mass index and gender by implementing a comprehensive workflow for univariate and OPLS statistical analyses","volume":"14","author":"Th\u00e9venot","year":"2015","journal-title":"J. Proteome Res"},{"key":"2023020207010844000_btx458-B39","doi-asserted-by":"crossref","first-page":"2085","DOI":"10.1039\/C5AN02522B","article-title":"Recent advances in flow injection analysis","volume":"141","author":"Trojanowicz","year":"2016","journal-title":"Analyst"},{"key":"2023020207010844000_btx458-B40","doi-asserted-by":"crossref","first-page":"491","DOI":"10.1002\/mas.20298","article-title":"An overview of matrix effects in liquid chromatography-mass spectrometry","volume":"30","author":"Trufelli","year":"2011","journal-title":"Mass Spectrom. Rev"},{"key":"2023020207010844000_btx458-B41","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1016\/j.aca.2014.08.007","article-title":"Characterization of heteroscedastic measurement noise in the absence of replicates","volume":"847","author":"Wentzell","year":"2014","journal-title":"Anal. Chim. Acta"},{"key":"2023020207010844000_btx458-B42","doi-asserted-by":"crossref","first-page":"160018.","DOI":"10.1038\/sdata.2016.18","article-title":"The FAIR guiding principles for scientific data management and stewardship","volume":"3","author":"Wilkinson","year":"2016","journal-title":"Sci. Data"},{"key":"2023020207010844000_btx458-B43","doi-asserted-by":"crossref","first-page":"D521","DOI":"10.1093\/nar\/gkl923","article-title":"HMDB: the Human Metabolome Database","volume":"35","author":"Wishart","year":"2007","journal-title":"Nucl. Acids Res"},{"key":"2023020207010844000_btx458-B44","doi-asserted-by":"crossref","first-page":"473","DOI":"10.1038\/nrd.2016.32","article-title":"Emerging applications of metabolomics in drug discovery and precision medicine","volume":"15","author":"Wishart","year":"2016","journal-title":"Nat. Rev. Drug Discov"},{"key":"2023020207010844000_btx458-B46","doi-asserted-by":"crossref","first-page":"2907","DOI":"10.1039\/C4AN02294G","article-title":"A short review of applications of liquid chromatography mass spectrometry based metabolomics techniques to the analysis of human urine","volume":"140","author":"Zhang","year":"2015","journal-title":"Analyst"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/33\/23\/3767\/49042150\/bioinformatics_33_23_3767.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/33\/23\/3767\/49042150\/bioinformatics_33_23_3767.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T07:02:13Z","timestamp":1675321333000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/33\/23\/3767\/3965327"}},"subtitle":[],"editor":[{"given":"Oliver","family":"Stegle","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2017,7,14]]},"references-count":45,"journal-issue":{"issue":"23","published-print":{"date-parts":[[2017,12,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btx458","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2017,12,1]]},"published":{"date-parts":[[2017,7,14]]}}}