{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,24]],"date-time":"2026-03-24T22:26:44Z","timestamp":1774391204907,"version":"3.50.1"},"reference-count":49,"publisher":"Oxford University Press (OUP)","issue":"1","license":[{"start":{"date-parts":[[2024,12,18]],"date-time":"2024-12-18T00:00:00Z","timestamp":1734480000000},"content-version":"vor","delay-in-days":26,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62372156"],"award-info":[{"award-number":["62372156"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Innovative Research Team of Henan Polytechnic University","award":["T2021-3"],"award-info":[{"award-number":["T2021-3"]}]},{"name":"Henan Provincial Department of Science and Technology Research Project","award":["232102211046"],"award-info":[{"award-number":["232102211046"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,11,22]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Gene polymorphism originates from single-nucleotide polymorphisms (SNPs), and the analysis and study of SNPs are of great significance in the field of biogenetics. The haplotype, which consists of the sequence of SNP loci, carries more genetic information than a single SNP. Haplotype assembly plays a significant role in understanding gene function, diagnosing complex diseases, and pinpointing species genes. We propose a novel method, DeepHapNet, for haplotype assembly through the clustering of reads and learning correlations between read pairs. We employ a sequence model called Retentive Network (RetNet), which utilizes a multiscale retention mechanism to extract read features and learn the global relationships among them. Based on the feature representation of reads learned from the RetNet model, the clustering process of reads is implemented using the SpectralNet model, and, finally, haplotypes are constructed based on the read clusters. Experiments with simulated and real datasets show that the method performs well in the haplotype assembly problem of diploid and polyploid based on either long or short reads. The code implementation of DeepHapNet and the processing scripts for experimental data are publicly available at https:\/\/github.com\/wjj6666\/DeepHapNet.<\/jats:p>","DOI":"10.1093\/bib\/bbae656","type":"journal-article","created":{"date-parts":[[2024,12,18]],"date-time":"2024-12-18T06:08:56Z","timestamp":1734502136000},"source":"Crossref","is-referenced-by-count":1,"title":["DeepHapNet: a haplotype assembly method based on RetNet and deep spectral clustering"],"prefix":"10.1093","volume":"26","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6841-9893","authenticated-orcid":false,"given":"Junwei","family":"Luo","sequence":"first","affiliation":[{"name":"School of Software, Henan Polytechnic University , Century Road 2001, Jiaozuo 454003,","place":["China"]}]},{"given":"Jiaojiao","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Software, Henan Polytechnic University , Century Road 2001, Jiaozuo 454003,","place":["China"]}]},{"given":"Jingjing","family":"Wei","sequence":"additional","affiliation":[{"name":"College of Chemical and Environmental Engineering, Anyang Institute of Technology , West Section of Huanghe Avenue, Anyang 455000,","place":["China"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6246-7242","authenticated-orcid":false,"given":"Chaokun","family":"Yan","sequence":"additional","affiliation":[{"name":"School of Computer and Information Engineering, Henan University , North Section of Jinming Avenue, Kaifeng 475001,","place":["China"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5066-5372","authenticated-orcid":false,"given":"Huimin","family":"Luo","sequence":"additional","affiliation":[{"name":"School of Computer and Information Engineering, Henan University , North Section of Jinming Avenue, Kaifeng 475001,","place":["China"]}]}],"member":"286","published-online":{"date-parts":[[2024,12,17]]},"reference":[{"key":"2024121806084756600_ref1","doi-asserted-by":"publisher","first-page":"489","DOI":"10.1126\/science.1059431","article-title":"Haplotype variation and linkage disequilibrium in 313 human genes","volume":"293","author":"Stephens","year":"2001","journal-title":"Science"},{"key":"2024121806084756600_ref2","doi-asserted-by":"publisher","first-page":"342","DOI":"10.1038\/s41588-022-01015-0","article-title":"Chromosome-scale and haplotype-resolved genome assembly of a tetraploid potato cultivar","volume":"54","author":"Sun","year":"2022","journal-title":"Nat Genet"},{"key":"2024121806084756600_ref3","doi-asserted-by":"publisher","first-page":"1168547","DOI":"10.3389\/fpls.2023.1168547","article-title":"Genomic prediction with haplotype blocks in wheat","volume":"14","author":"Difabachew","year":"2023","journal-title":"Front Plant Sci"},{"key":"2024121806084756600_ref4","doi-asserted-by":"publisher","first-page":"uhad002","DOI":"10.1093\/hr\/uhad002","article-title":"High-quality haplotype-resolved genome assembly of cultivated octoploid strawberry","volume":"10","author":"Mao","year":"2023","journal-title":"Hortic Res"},{"key":"2024121806084756600_ref5","doi-asserted-by":"publisher","first-page":"73","DOI":"10.1038\/s41588-021-00971-3","article-title":"Two divergent haplotypes from a highly heterozygous lychee genome suggest independent domestication events for early and late-maturing cultivars","volume":"54","author":"Hu","year":"2022","journal-title":"Nat Genet"},{"key":"2024121806084756600_ref6","doi-asserted-by":"publisher","first-page":"638","DOI":"10.1016\/j.jmoldx.2024.04.002","article-title":"Targeted linked-read sequencing for direct haplotype phasing of parental GJB2\/SLC26A4 alleles: a universal and dependable noninvasive prenatal diagnosis method applied to autosomal recessive nonsyndromic hearing loss in at-risk families","volume":"26","author":"Gao","year":"2024","journal-title":"J Mol Diagn"},{"key":"2024121806084756600_ref7","doi-asserted-by":"publisher","first-page":"137ra76","DOI":"10.1126\/scitranslmed.3004323","article-title":"Noninvasive whole-genome sequencing of a human fetus","volume":"4","author":"Kitzman","year":"2012","journal-title":"Sci Transl Med"},{"key":"2024121806084756600_ref8","doi-asserted-by":"publisher","DOI":"10.1093\/bib\/bbaa280","article-title":"Evaluation of consensus strategies for haplotype phasing","volume":"22","author":"Al Bkhetan","year":"2021","journal-title":"Brief Bioinform"},{"key":"2024121806084756600_ref9","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1038\/nrg2626","article-title":"Sequencing technologies - the next generation","volume":"11","author":"Metzker","year":"2010","journal-title":"Nat Rev Genet"},{"key":"2024121806084756600_ref10","doi-asserted-by":"publisher","first-page":"12065","DOI":"10.1038\/ncomms12065","article-title":"Long-read sequencing and de novo assembly of a Chinese genome","volume":"7","author":"Shi","year":"2016","journal-title":"Nat Commun"},{"key":"2024121806084756600_ref11","doi-asserted-by":"publisher","first-page":"243","DOI":"10.1038\/nature20098","article-title":"De novo assembly and phasing of a Korean human genome","volume":"538","author":"Seo","year":"2016","journal-title":"Nature"},{"key":"2024121806084756600_ref12","doi-asserted-by":"publisher","first-page":"1155","DOI":"10.1038\/s41587-019-0217-9","article-title":"Accurate circular consensus long-read sequencing improves variant detection and assembly of a human genome","volume":"37","author":"Wenger","year":"2019","journal-title":"Nat Biotechnol"},{"key":"2024121806084756600_ref13","doi-asserted-by":"publisher","first-page":"S4","DOI":"10.1186\/1471-2164-13-S2-S4","article-title":"Haplotype and minimum-chimerism consensus determination using short sequence data","volume":"13","author":"O'Neil","year":"2012","journal-title":"BMC Genomics"},{"key":"2024121806084756600_ref14","first-page":"182","volume-title":"Proceedings of the 9th Annual European Symposium on Algorithms","author":"Lancia","year":"2001"},{"key":"2024121806084756600_ref15","doi-asserted-by":"publisher","first-page":"2041","DOI":"10.1093\/nar\/gkr1042","article-title":"Fosmid-based whole genome haplotyping of a HapMap trio child: evaluation of single individual Haplotyping techniques","volume":"40","author":"Duitama","year":"2012","journal-title":"Nucleic Acids Res"},{"key":"2024121806084756600_ref16","doi-asserted-by":"publisher","first-page":"23","DOI":"10.1093\/bib\/3.1.23","article-title":"Algorithmic strategies for the SNP haplotype assembly problem","volume":"3","author":"Lippert","year":"2002","journal-title":"Brief Bioinform"},{"key":"2024121806084756600_ref17","doi-asserted-by":"publisher","first-page":"i153","DOI":"10.1093\/bioinformatics\/btn298","article-title":"HapCUT: an efficient and accurate algorithm for the haplotype assembly problem","volume":"24","author":"Bansal","year":"2008","journal-title":"Bioinformatics"},{"key":"2024121806084756600_ref18","doi-asserted-by":"publisher","first-page":"801","DOI":"10.1101\/gr.213462.116","article-title":"HapCUT2: robust and accurate haplotype assembly for diverse sequencing technologies","volume":"27","author":"Edge","year":"2017","journal-title":"Genome Res"},{"key":"2024121806084756600_ref19","doi-asserted-by":"publisher","first-page":"498","DOI":"10.1089\/cmb.2014.0157","article-title":"WhatsHap: weighted haplotype assembly for future-generation sequencing reads","volume":"22","author":"Patterson","year":"2015","journal-title":"J Comput Biol"},{"key":"2024121806084756600_ref20","doi-asserted-by":"publisher","first-page":"e1003502","DOI":"10.1371\/journal.pcbi.1003502","article-title":"HapTree: a novel Bayesian framework for single individual polyplotyping using NGS data","volume":"10","author":"Berger","year":"2014","journal-title":"PLoS Comput Biol"},{"key":"2024121806084756600_ref21","doi-asserted-by":"publisher","first-page":"e1007843","DOI":"10.1371\/journal.pcbi.1007843","article-title":"Ranbow: a fast and accurate method for polyploid haplotype reconstruction","volume":"16","author":"Moeinzadeh","year":"2020","journal-title":"PLoS Comput Biol"},{"key":"2024121806084756600_ref22","doi-asserted-by":"publisher","first-page":"3735","DOI":"10.1093\/bioinformatics\/btw537","article-title":"H-PoP and H-PoPG: heuristic partitioning algorithms for single individual haplotyping of polyploids","volume":"32","author":"Xie","year":"2016","journal-title":"Bioinformatics"},{"key":"2024121806084756600_ref23","doi-asserted-by":"publisher","first-page":"719","DOI":"10.1609\/aaai.v34i01.5414","article-title":"A graph auto-encoder for haplotype assembly and viral Quasispecies reconstruction","volume":"34","author":"Ke","year":"2020","journal-title":"Proceedings of the AAAI Conference on Artificial Intelligence"},{"key":"2024121806084756600_ref24","volume-title":"Thirty-fourth Conference on Neural Information Processing Systems (NeurIPS)","author":"Ke","year":"2020"},{"key":"2024121806084756600_ref25","doi-asserted-by":"publisher","DOI":"10.1093\/bioadv\/vbad169","article-title":"XHap: haplotype assembly using long-distance read correlations learned by transformers","volume":"3","author":"Consul","year":"2023","journal-title":"Bioinform Adv"},{"key":"2024121806084756600_ref26","article-title":"Attention is all you need","volume":"30","author":"Vaswani","year":"2017","journal-title":"Adv Neural Inf Proces Syst"},{"key":"2024121806084756600_ref27","doi-asserted-by":"publisher","first-page":"299","DOI":"10.1186\/s13059-021-02512-x","article-title":"Phasebook: haplotype-aware de novo assembly of diploid genomes from long reads","volume":"22","author":"Luo","year":"2021","journal-title":"Genome Biol"},{"key":"2024121806084756600_ref28","doi-asserted-by":"publisher","first-page":"2385","DOI":"10.1093\/bioinformatics\/btz942","article-title":"A haplotype-aware de novo assembly of related individuals using pedigree sequence graph","volume":"36","author":"Garg","year":"2020","journal-title":"Bioinformatics"},{"key":"2024121806084756600_ref29","doi-asserted-by":"publisher","first-page":"833","DOI":"10.1038\/s41477-019-0487-8","article-title":"Assembly of allele-aware, chromosomal-scale autopolyploid genomes based on hi-C data","volume":"5","author":"Zhang","year":"2019","journal-title":"Nature Plants"},{"key":"2024121806084756600_ref30","doi-asserted-by":"publisher","first-page":"1050","DOI":"10.1038\/nmeth.4035","article-title":"Phased diploid genome assembly with single-molecule real-time sequencing","volume":"13","author":"Chin","year":"2016","journal-title":"Nat Methods"},{"key":"2024121806084756600_ref31","doi-asserted-by":"publisher","first-page":"309","DOI":"10.1038\/s41587-020-0711-0","article-title":"Chromosome-scale, haplotype-resolved assembly of human genomes","volume":"39","author":"Garg","year":"2021","journal-title":"Nat Biotechnol"},{"key":"2024121806084756600_ref32","doi-asserted-by":"publisher","first-page":"170","DOI":"10.1038\/s41592-020-01056-5","article-title":"Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm","volume":"18","author":"Cheng","year":"2021","journal-title":"Nat Methods"},{"key":"2024121806084756600_ref33","doi-asserted-by":"publisher","first-page":"1291","DOI":"10.1101\/gr.263566.120","article-title":"HiCanu: accurate assembly of segmental duplications, satellites, and allelic variants from high-fidelity long reads","volume":"30","author":"Nurk","year":"2020","journal-title":"Genome Res"},{"key":"2024121806084756600_ref34","doi-asserted-by":"publisher","first-page":"722","DOI":"10.1101\/gr.215087.116","article-title":"Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation","volume":"27","author":"Koren","year":"2017","journal-title":"Genome Res"},{"key":"2024121806084756600_ref35","first-page":"abs\/2307.08621","article-title":"Retentive network: a successor to transformer for large language models","author":"Sun","year":"2023","journal-title":"CoRR"},{"key":"2024121806084756600_ref36","doi-asserted-by":"publisher","first-page":"1754","DOI":"10.1093\/bioinformatics\/btp324","article-title":"Fast and accurate short read alignment with burrows-wheeler transform","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2024121806084756600_ref37","first-page":"1026","volume-title":"Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV)","author":"He","year":"2015"},{"key":"2024121806084756600_ref38","first-page":"539","article-title":"Learning a similarity metric discriminatively, with application to face verification","volume-title":"Computer Vision and Pattern Recognition, 2005 CVPR 2005 IEEE Computer Society Conference","author":"Chopra","year":"2005"},{"key":"2024121806084756600_ref39","doi-asserted-by":"publisher","first-page":"129","DOI":"10.1109\/TIT.1982.1056489","article-title":"Least squares quantization in PCM","volume":"28","author":"Lloyd","year":"1982","journal-title":"IEEE Trans Inf Theory"},{"key":"2024121806084756600_ref40","doi-asserted-by":"publisher","first-page":"671","DOI":"10.1126\/science.220.4598.671","article-title":"Optimization by simulated annealing","volume":"220","author":"Kirkpatrick","year":"1983","journal-title":"Science"},{"key":"2024121806084756600_ref41","first-page":"1735","volume-title":"Computer Vision and Pattern Recognition, 2006 IEEE Computer Society Conference","author":"Hadsell","year":"2006"},{"key":"2024121806084756600_ref42","volume-title":"Adam: A Method for Stochastic Optimization [M]","author":"Kingma","year":"2014"},{"key":"2024121806084756600_ref43","doi-asserted-by":"publisher","first-page":"1624","DOI":"10.1101\/gr.2204604","article-title":"Haplotype and missing data inference In nuclear families","volume":"14","author":"Lin","year":"2004","journal-title":"Genome Res"},{"key":"2024121806084756600_ref44","doi-asserted-by":"publisher","first-page":"593","DOI":"10.1093\/bioinformatics\/btr708","article-title":"ART: a next-generation sequencing read simulator","volume":"28","author":"Huang","year":"2012","journal-title":"Bioinformatics"},{"key":"2024121806084756600_ref45","doi-asserted-by":"publisher","first-page":"387","DOI":"10.1093\/bib\/bbw126","article-title":"Exploiting next-generation sequencing to solve the haplotyping puzzle in polyploids: a simulation study","volume":"19","author":"Motazedi","year":"2018","journal-title":"Brief Bioinform"},{"key":"2024121806084756600_ref46","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0062355","article-title":"A next-generation sequencing method for genotyping-by-sequencing of highly heterozygous autotetraploid potato","volume":"8","author":"Uitdewilligen","year":"2013","journal-title":"PLoS One"},{"key":"2024121806084756600_ref47","doi-asserted-by":"publisher","first-page":"3864","DOI":"10.1093\/bioinformatics\/bty442","article-title":"TriPoly: haplotype estimation for polyploids using sequencing data of related individuals","volume":"34","author":"Motazedi","year":"2018","journal-title":"Bioinformatics"},{"key":"2024121806084756600_ref48","doi-asserted-by":"publisher","first-page":"589","DOI":"10.1093\/bioinformatics\/btaa835","article-title":"PBSIM2: a simulator for long-read sequencers with a novel generative model of quality scores","volume":"37","author":"Ono","year":"2021","journal-title":"Bioinformatics"},{"key":"2024121806084756600_ref49","doi-asserted-by":"publisher","first-page":"100128","DOI":"10.1016\/j.xgen.2022.100128","article-title":"Benchmarking challenging small variants with linked and long reads","volume":"2","author":"Wagner","year":"2022","journal-title":"Cell Genom"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/26\/1\/bbae656\/61218290\/bbae656.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/26\/1\/bbae656\/61218290\/bbae656.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,12,18]],"date-time":"2024-12-18T06:09:08Z","timestamp":1734502148000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbae656\/7926917"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,11,22]]},"references-count":49,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2024,11,22]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbae656","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2025,1]]},"published":{"date-parts":[[2024,11,22]]},"article-number":"bbae656"}}