{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,28]],"date-time":"2026-03-28T03:55:11Z","timestamp":1774670111962,"version":"3.50.1"},"reference-count":34,"publisher":"Oxford University Press (OUP)","issue":"3","license":[{"start":{"date-parts":[[2024,3,4]],"date-time":"2024-03-04T00:00:00Z","timestamp":1709510400000},"content-version":"vor","delay-in-days":3,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62373200"],"award-info":[{"award-number":["62373200"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61973174"],"award-info":[{"award-number":["61973174"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100006606","name":"Natural Science Foundation of Tianjin City","doi-asserted-by":"publisher","award":["21JCZDJC00140"],"award-info":[{"award-number":["21JCZDJC00140"]}],"id":[{"id":"10.13039\/501100006606","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,3,4]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Removal of batch effect between multiple datasets from different experimental platforms has become an urgent problem, since single-cell RNA sequencing (scRNA-seq) techniques developed rapidly. Although there have been some methods for this problem, most of them still face the challenge of under-correction or over-correction. Specifically, handling batch effect in highly nonlinear scRNA-seq data requires a more powerful model to address under-correction. In the meantime, some previous methods focus too much on removing difference between batches, which may disturb the biological signal heterogeneity of datasets generated from different experiments, thereby leading to over-correction.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>In this article, we propose a novel multi-layer adaptation autoencoder with dual-channel framework to address the under-correction and over-correction problems in batch effect removal, which is called BERMAD and can achieve better results of scRNA-seq data integration and joint analysis. First, we design a multi-layer adaptation architecture to model distribution difference between batches from different feature granularities. The distribution matching on various layers of autoencoder with different feature dimensions can result in more accurate batch correction outcome. Second, we propose a dual-channel framework, where the deep autoencoder processing each single dataset is independently trained. Hence, the heterogeneous information that is not shared between different batches can be retained more completely, which can alleviate over-correction. Comprehensive experiments on multiple scRNA-seq datasets demonstrate the effectiveness and superiority of our method over the state-of-the-art methods.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The code implemented in Python and the data used for experiments have been released on GitHub (https:\/\/github.com\/zhanglabNKU\/BERMAD) and Zenodo (https:\/\/zenodo.org\/records\/10695073) with detailed instructions.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btae127","type":"journal-article","created":{"date-parts":[[2024,3,5]],"date-time":"2024-03-05T09:02:35Z","timestamp":1709629355000},"source":"Crossref","is-referenced-by-count":7,"title":["BERMAD: batch effect removal for single-cell RNA-seq data using a multi-layer adaptation autoencoder with dual-channel framework"],"prefix":"10.1093","volume":"40","author":[{"given":"Xiangxin","family":"Zhan","sequence":"first","affiliation":[{"name":"Department of Intelligence Engineering, College of Artificial Intelligence, Nankai University , Tianjin 300350, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7667-881X","authenticated-orcid":false,"given":"Yanbin","family":"Yin","sequence":"additional","affiliation":[{"name":"Department of Food Science and Technology, University of Nebraska \u2013 Lincoln , Lincoln, NE 68588, United States"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8498-3451","authenticated-orcid":false,"given":"Han","family":"Zhang","sequence":"additional","affiliation":[{"name":"Department of Intelligence Engineering, College of Artificial Intelligence, Nankai University , Tianjin 300350, China"}]}],"member":"286","published-online":{"date-parts":[[2024,3,4]]},"reference":[{"key":"2024031520360350200_btae127-B1","doi-asserted-by":"crossref","first-page":"346","DOI":"10.1016\/j.cels.2016.08.011","article-title":"A single-cell transcriptomic map of the human and mouse pancreas reveals inter-and intra-cell population structure","volume":"3","author":"Baron","year":"2016","journal-title":"Cell Syst"},{"key":"2024031520360350200_btae127-B2","doi-asserted-by":"crossref","first-page":"411","DOI":"10.1038\/nbt.4096","article-title":"Integrating single-cell transcriptomic data across different conditions, technologies, and species","volume":"36","author":"Butler","year":"2018","journal-title":"Nat Biotechnol"},{"key":"2024031520360350200_btae127-B3","doi-asserted-by":"crossref","first-page":"884","DOI":"10.1038\/s41467-018-03282-0","article-title":"Characterizing the replicability of cell types defined by single cell rna-sequencing data using metaneighbor","volume":"9","author":"Crow","year":"2018","journal-title":"Nat Commun"},{"key":"2024031520360350200_btae127-B4","doi-asserted-by":"crossref","first-page":"e80","DOI":"10.1093\/nar\/gkab380","article-title":"Integrating multiple references for single-cell assignment","volume":"49","author":"Duan","year":"2021","journal-title":"Nucleic Acids Res"},{"key":"2024031520360350200_btae127-B5","first-page":"877","author":"Farahani","year":"2021"},{"key":"2024031520360350200_btae127-B6","first-page":"1180","author":"Ganin","year":"2015"},{"key":"2024031520360350200_btae127-B7","first-page":"898","author":"Ghifary","year":"2014"},{"key":"2024031520360350200_btae127-B8","article-title":"Optimal kernel choice for large-scale two-sample tests","volume":"25","author":"Gretton","year":"2012","journal-title":"Adv Neural Inf Process Syst"},{"key":"2024031520360350200_btae127-B9","doi-asserted-by":"crossref","first-page":"421","DOI":"10.1038\/nbt.4091","article-title":"Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors","volume":"36","author":"Haghverdi","year":"2018","journal-title":"Nat Biotechnol"},{"key":"2024031520360350200_btae127-B10","doi-asserted-by":"crossref","first-page":"685","DOI":"10.1038\/s41587-019-0113-3","article-title":"Efficient integration of heterogeneous single-cell transcriptomes using scanorama","volume":"37","author":"Hie","year":"2019","journal-title":"Nat Biotechnol"},{"key":"2024031520360350200_btae127-B11","doi-asserted-by":"crossref","first-page":"89","DOI":"10.1038\/nbt.4042","article-title":"Multiplexed droplet single-cell RNA-sequencing using natural genetic variation","volume":"36","author":"Kang","year":"2018","journal-title":"Nat Biotechnol"},{"key":"2024031520360350200_btae127-B12","doi-asserted-by":"crossref","first-page":"723","DOI":"10.1038\/s41592-021-01171-x","article-title":"The triumphs and limitations of computational methods for scRNA-seq","volume":"18","author":"Kharchenko","year":"2021","journal-title":"Nat Methods"},{"key":"2024031520360350200_btae127-B13","doi-asserted-by":"crossref","first-page":"1289","DOI":"10.1038\/s41592-019-0619-0","article-title":"Fast, sensitive and accurate integration of single-cell data with harmony","volume":"16","author":"Korsunsky","year":"2019","journal-title":"Nat Methods"},{"key":"2024031520360350200_btae127-B14","first-page":"97","author":"Long","year":"2015"},{"key":"2024031520360350200_btae127-B15","first-page":"2208","author":"Long","year":"2017"},{"key":"2024031520360350200_btae127-B16","doi-asserted-by":"crossref","first-page":"1053","DOI":"10.1038\/s41592-018-0229-2","article-title":"Deep generative modeling for single-cell transcriptomics","volume":"15","author":"Lopez","year":"2018","journal-title":"Nat Methods"},{"key":"2024031520360350200_btae127-B17","volume-title":"Further MNN algorithm development","author":"Lun","year":"2019"},{"key":"2024031520360350200_btae127-B18","doi-asserted-by":"crossref","first-page":"385","DOI":"10.1016\/j.cels.2016.09.002","article-title":"A single-cell transcriptome atlas of the human pancreas","volume":"3","author":"Muraro","year":"2016","journal-title":"Cell Syst"},{"key":"2024031520360350200_btae127-B19","doi-asserted-by":"crossref","first-page":"964","DOI":"10.1093\/bioinformatics\/btz625","article-title":"BBKNN: fast batch alignment of single cell transcriptomes","volume":"36","author":"Pola\u0144ski","year":"2020","journal-title":"Bioinformatics"},{"key":"2024031520360350200_btae127-B20","doi-asserted-by":"crossref","first-page":"721","DOI":"10.1007\/s10994-021-06043-1","article-title":"Embedding to reference t-SNE space addresses batch effects in single-cell classification","volume":"112","author":"Poli\u010dar","year":"2023","journal-title":"Mach Learn"},{"key":"2024031520360350200_btae127-B21","doi-asserted-by":"crossref","first-page":"284","DOI":"10.1038\/s41467-017-02554-5","article-title":"A general and flexible method for signal extraction from single-cell RNA-seq data","volume":"9","author":"Risso","year":"2018","journal-title":"Nat Commun"},{"key":"2024031520360350200_btae127-B22","doi-asserted-by":"crossref","first-page":"801","DOI":"10.1109\/TPAMI.2018.2814042","article-title":"Beyond sharing weights for deep domain adaptation","volume":"41","author":"Rozantsev","year":"2018","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"2024031520360350200_btae127-B23","doi-asserted-by":"crossref","first-page":"593","DOI":"10.1016\/j.cmet.2016.08.020","article-title":"Single-cell transcriptome profiling of human pancreatic islets in health and type 2 diabetes","volume":"24","author":"Segerstolpe","year":"2016","journal-title":"Cell Metab"},{"key":"2024031520360350200_btae127-B24","doi-asserted-by":"crossref","first-page":"265","DOI":"10.1016\/S1046-2023(03)00155-5","article-title":"Normalization of cDNA microarray data","volume":"31","author":"Smyth","year":"2003","journal-title":"Methods"},{"key":"2024031520360350200_btae127-B25","doi-asserted-by":"crossref","first-page":"1888","DOI":"10.1016\/j.cell.2019.05.031","article-title":"Comprehensive integration of single-cell data","volume":"177","author":"Stuart","year":"2019","journal-title":"Cell"},{"key":"2024031520360350200_btae127-B26","doi-asserted-by":"crossref","first-page":"12","DOI":"10.1186\/s13059-019-1850-9","article-title":"A benchmark of batch-effect correction methods for single-cell RNA sequencing data","volume":"21","author":"Tran","year":"2020","journal-title":"Genome Biol"},{"key":"2024031520360350200_btae127-B27","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1186\/s13059-021-02280-8","article-title":"imap: integration of multiple single-cell datasets by adversarial paired transfer networks","volume":"22","author":"Wang","year":"2021","journal-title":"Genome Biol"},{"key":"2024031520360350200_btae127-B28","doi-asserted-by":"crossref","first-page":"165","DOI":"10.1186\/s13059-019-1764-6","article-title":"Bermuda: a novel deep transfer learning method for single-cell RNA sequencing batch correction reveals hidden high-resolution cellular subtypes","volume":"20","author":"Wang","year":"2019","journal-title":"Genome Biol"},{"key":"2024031520360350200_btae127-B29","doi-asserted-by":"crossref","first-page":"1295","DOI":"10.1093\/bioinformatics\/btab821","article-title":"HDMC: a novel deep learning-based framework for removing batch effects in single-cell RNA-seq data","volume":"38","author":"Wang","year":"2022","journal-title":"Bioinformatics"},{"key":"2024031520360350200_btae127-B30","first-page":"5085","author":"Ying","year":"2018"},{"key":"2024031520360350200_btae127-B31","first-page":"27","article-title":"How transferable are features in deep neural networks?","author":"Yosinski","year":"2014","journal-title":"Adv Neural Inf Process Syst"},{"key":"2024031520360350200_btae127-B32","doi-asserted-by":"crossref","first-page":"174","DOI":"10.1186\/s13059-017-1305-0","article-title":"Splatter: simulation of single-cell RNA sequencing data","volume":"18","author":"Zappia","year":"2017","journal-title":"Genome Biol"},{"key":"2024031520360350200_btae127-B33","doi-asserted-by":"crossref","first-page":"430","DOI":"10.1007\/s11704-018-8052-6","article-title":"A survey of autoencoder-based recommender systems","volume":"14","author":"Zhang","year":"2020","journal-title":"Front Comput Sci"},{"key":"2024031520360350200_btae127-B34","doi-asserted-by":"crossref","first-page":"3488","DOI":"10.1093\/bioinformatics\/btac350","article-title":"webscst: an interactive web application for single-cell RNA-sequencing data and spatial transcriptomic data integration","volume":"38","author":"Zhang","year":"2022","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btae127\/56833204\/btae127.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/3\/btae127\/56990739\/btae127.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/3\/btae127\/56990739\/btae127.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,3,15]],"date-time":"2024-03-15T20:36:37Z","timestamp":1710534997000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btae127\/7619104"}},"subtitle":[],"editor":[{"given":"Anthony","family":"Mathelier","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2024,3,1]]},"references-count":34,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2024,3,4]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btae127","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2024,3,1]]},"published":{"date-parts":[[2024,3,1]]},"article-number":"btae127"}}