{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,21]],"date-time":"2026-02-21T08:32:09Z","timestamp":1771662729168,"version":"3.50.1"},"reference-count":33,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2021,12,1]],"date-time":"2021-12-01T00:00:00Z","timestamp":1638316800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,12,4]],"date-time":"2021-12-04T00:00:00Z","timestamp":1638576000000},"content-version":"vor","delay-in-days":3,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BioData Mining"],"published-print":{"date-parts":[[2021,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                <jats:title>Background<\/jats:title>\n                <jats:p>With the increase in the size of genomic datasets describing variability in populations, extracting relevant information becomes increasingly useful as well as complex. Recently, computational methodologies such as Supervised Machine Learning and specifically Convolutional Neural Networks have been proposed to make inferences on demographic and adaptive processes using genomic data. Even though it was already shown to be powerful and efficient in different fields of investigation, Supervised Machine Learning has still to be explored as to unfold its enormous potential in evolutionary genomics.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Results<\/jats:title>\n                <jats:p>The paper proposes a method based on Supervised Machine Learning for classifying genomic data, represented as windows of genomic sequences from a sample of individuals belonging to the same population. A Convolutional Neural Network is used to test whether a genomic window shows the signature of natural selection. Training performed on simulated data show that the proposed model can accurately predict neutral and selection processes on portions of genomes taken from real populations with almost 90% accuracy.<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/s13040-021-00280-9","type":"journal-article","created":{"date-parts":[[2021,12,4]],"date-time":"2021-12-04T09:02:48Z","timestamp":1638608568000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":10,"title":["Identification of natural selection in genomic data with deep convolutional neural network"],"prefix":"10.1186","volume":"14","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-3352-5260","authenticated-orcid":false,"given":"Arnaud","family":"Nguembang Fadja","sequence":"first","affiliation":[]},{"given":"Fabrizio","family":"Riguzzi","sequence":"additional","affiliation":[]},{"given":"Giorgio","family":"Bertorelle","sequence":"additional","affiliation":[]},{"given":"Emiliano","family":"Trucchi","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2021,12,4]]},"reference":[{"issue":"3","key":"280_CR1","doi-asserted-by":"publisher","first-page":"1007","DOI":"10.1534\/genetics.119.302581","volume":"213","author":"V Buffalo","year":"2019","unstructured":"Buffalo V, Coop G. The linked selection signature of rapid adaptation in temporal genomic data. Genetics. 2019; 213(3):1007\u201345.","journal-title":"Genetics"},{"issue":"3","key":"280_CR2","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41559-017-0077","volume":"1","author":"M L\u00e4ssig","year":"2017","unstructured":"L\u00e4ssig M, Mustonen V, Walczak AM. Predicting evolution. Nat Ecol Evol. 2017; 1(3):1\u20139.","journal-title":"Nat Ecol Evol"},{"key":"280_CR3","doi-asserted-by":"publisher","first-page":"197","DOI":"10.1146\/annurev.genet.39.073003.112420","volume":"39","author":"R Nielsen","year":"2005","unstructured":"Nielsen R. Molecular signatures of natural selection. Annu Rev Genet. 2005; 39:197\u2013218.","journal-title":"Annu Rev Genet"},{"issue":"12","key":"280_CR4","doi-asserted-by":"publisher","first-page":"883","DOI":"10.1016\/j.cub.2008.04.074","volume":"18","author":"R Blekhman","year":"2008","unstructured":"Blekhman R, Man O, Herrmann L, Boyko AR, Indap A, Kosiol C, Bustamante CD, Teshima KM, Przeworski M. Natural selection on genes that underlie human disease susceptibility. Curr Biol. 2008; 18(12):883\u20139.","journal-title":"Curr Biol"},{"issue":"2","key":"280_CR5","doi-asserted-by":"publisher","first-page":"123","DOI":"10.1038\/s41477-021-00848-7","volume":"7","author":"E Trucchi","year":"2021","unstructured":"Trucchi E, Benazzo A, Lari M, Iob A, Vai S, Nanni L, Bellucci E, Bitocchi E, Raffini F, Xu C, et al.Ancient genomes reveal early andean farmers selected common beans while preserving diversity. Nat Plants. 2021; 7(2):123\u20138.","journal-title":"Nat Plants"},{"issue":"6","key":"280_CR6","doi-asserted-by":"publisher","first-page":"1997","DOI":"10.1093\/bib\/bby064","volume":"20","author":"C Horscroft","year":"2019","unstructured":"Horscroft C, Ennis S, Pengelly RJ, Sluckin TJ, Collins A. Sequencing era methods for identifying signatures of selection in the genome. Brief Bioinforma. 2019; 20(6):1997\u20132008.","journal-title":"Brief Bioinforma"},{"issue":"1","key":"280_CR7","doi-asserted-by":"publisher","first-page":"98","DOI":"10.1186\/s12915-017-0434-y","volume":"15","author":"TR Booker","year":"2017","unstructured":"Booker TR, Jackson BC, Keightley PD. Detecting positive selection in the genome. BMC Biol. 2017; 15(1):98.","journal-title":"BMC Biol"},{"issue":"7","key":"280_CR8","doi-asserted-by":"publisher","first-page":"1002195","DOI":"10.1371\/journal.pbio.1002195","volume":"13","author":"ZD Stephens","year":"2015","unstructured":"Stephens ZD, Lee SY, Faghri F, Campbell RH, Zhai C, Efron MJ, Iyer R, Schatz MC, Sinha S, Robinson GE. Big data: astronomical or genomical?PLoS Biol. 2015; 13(7):1002195.","journal-title":"PLoS Biol"},{"key":"280_CR9","doi-asserted-by":"publisher","first-page":"1466","DOI":"10.1016\/j.csbj.2020.06.017","volume":"18","author":"L Koumakis","year":"2020","unstructured":"Koumakis L. Deep learning models in genomics; are we there yet?Comput Struct Biotechnol J. 2020; 18:1466\u201373.","journal-title":"Comput Struct Biotechnol J"},{"issue":"7","key":"280_CR10","doi-asserted-by":"publisher","first-page":"389","DOI":"10.1038\/s41576-019-0122-6","volume":"20","author":"G Eraslan","year":"2019","unstructured":"Eraslan G, Avsec \u017e, Gagneur J, Theis FJ. Deep learning: new computational modelling techniques for genomics. Nat Rev Genet. 2019; 20(7):389\u2013403.","journal-title":"Nat Rev Genet"},{"issue":"1","key":"280_CR11","doi-asserted-by":"publisher","first-page":"12","DOI":"10.1038\/s41588-018-0295-5","volume":"51","author":"J Zou","year":"2019","unstructured":"Zou J, Huss M, Abid A, Mohammadi P, Torkamani A, Telenti A. A primer on deep learning in genomics. Nat Genet. 2019; 51(1):12\u20138.","journal-title":"Nat Genet"},{"issue":"9","key":"280_CR12","doi-asserted-by":"publisher","first-page":"337","DOI":"10.1186\/s12859-019-2927-x","volume":"20","author":"L Torada","year":"2019","unstructured":"Torada L, Lorenzon L, Beddis A, Isildak U, Pattini L, Mathieson S, Fumagalli M. Imagene: a convolutional neural network to quantify natural selection from genomic data. BMC Bioinformatics. 2019; 20(9):337.","journal-title":"BMC Bioinformatics"},{"issue":"1994","key":"280_CR13","first-page":"1","volume":"13","author":"D Michie","year":"1994","unstructured":"Michie D, Spiegelhalter DJ, Taylor C, et al.Machine learning. Neural Stat Classif. 1994; 13(1994):1\u2013298.","journal-title":"Neural Stat Classif"},{"issue":"7553","key":"280_CR14","doi-asserted-by":"publisher","first-page":"436","DOI":"10.1038\/nature14539","volume":"521","author":"Y LeCun","year":"2015","unstructured":"LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. 2015; 521(7553):436\u201344.","journal-title":"Nature"},{"key":"280_CR15","volume-title":"Machine Learning: a Probabilistic Perspective","author":"KP Murphy","year":"2012","unstructured":"Murphy KP. Machine Learning: a Probabilistic Perspective. Cambridge: The MIT Press; 2012."},{"key":"280_CR16","volume-title":"2017 International Conference on Engineering and Technology (ICET)","author":"S Albawi","year":"2017","unstructured":"Albawi S, Mohammed TA, Al-Zawi S. Understanding of a convolutional neural network. In: 2017 International Conference on Engineering and Technology (ICET). Antalya: IEEE: 2017. p. 1\u20136."},{"issue":"1","key":"280_CR17","doi-asserted-by":"publisher","first-page":"9","DOI":"10.1134\/S1054661816010065","volume":"26","author":"P Druzhkov","year":"2016","unstructured":"Druzhkov P, Kustikova V. A survey of deep learning methods and software tools for image classification and object detection. Patt Recogn Image Anal. 2016; 26(1):9\u201315.","journal-title":"Patt Recogn Image Anal"},{"key":"280_CR18","volume-title":"RiCeRcA@ AI* IA","author":"AN Fadja","year":"2018","unstructured":"Fadja AN, Lamma E, Riguzzi F, et al.Vision inspection with neural networks. In: RiCeRcA@ AI* IA. Aachen: CEUR Workshop Proceedings: 2018."},{"key":"280_CR19","doi-asserted-by":"crossref","unstructured":"Bottou L. Stochastic gradient descent tricks. In: Neural Networks: Tricks of the Trade. Springer: 2012. p. 421\u201336.","DOI":"10.1007\/978-3-642-35289-8_25"},{"issue":"3","key":"280_CR20","doi-asserted-by":"publisher","first-page":"505","DOI":"10.1109\/72.286925","volume":"5","author":"V Phansalkar","year":"1994","unstructured":"Phansalkar V, Sastry P. Analysis of the back-propagation algorithm with momentum. IEEE Trans Neural Netw. 1994; 5(3):505\u20136.","journal-title":"IEEE Trans Neural Netw"},{"key":"280_CR21","volume-title":"2017 IEEE 56th Annual Conference on Decision and Control (CDC)","author":"S Khirirat","year":"2017","unstructured":"Khirirat S, Feyzmahdavian HR, Johansson M. Mini-batch gradient descent: Faster convergence under data sparsity. In: 2017 IEEE 56th Annual Conference on Decision and Control (CDC). Melbourne: IEEE: 2017. p. 2880\u20137."},{"issue":"1","key":"280_CR22","first-page":"1929","volume":"15","author":"N Srivastava","year":"2014","unstructured":"Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R. Dropout: a simple way to prevent neural networks from overfitting. J Mach Learn Res. 2014; 15(1):1929\u201358.","journal-title":"J Mach Learn Res"},{"key":"280_CR23","volume-title":"Deep Learning, vol. 1","author":"I Goodfellow","year":"2016","unstructured":"Goodfellow I, Bengio Y, Courville A, Bengio Y. Deep Learning, vol. 1. Cambridge: MIT press; 2016."},{"issue":"2","key":"280_CR24","doi-asserted-by":"publisher","first-page":"337","DOI":"10.1093\/bioinformatics\/18.2.337","volume":"18","author":"RR Hudson","year":"2002","unstructured":"Hudson RR. Generating samples under a wright\u2013fisher neutral model of genetic variation. Bioinformatics. 2002; 18(2):337\u20138.","journal-title":"Bioinformatics"},{"issue":"16","key":"280_CR25","doi-asserted-by":"publisher","first-page":"2064","DOI":"10.1093\/bioinformatics\/btq322","volume":"26","author":"G Ewing","year":"2010","unstructured":"Ewing G, Hermisson J. MSMS: a coalescent simulation program including recombination, demographic structure and selection at a single locus. Bioinformatics. 2010; 26(16):2064\u20135. https:\/\/doi.org\/10.1093\/bioinformatics\/btq322.","journal-title":"Bioinformatics"},{"issue":"12","key":"280_CR26","doi-asserted-by":"publisher","first-page":"3511","DOI":"10.1093\/gbe\/evv228","volume":"7","author":"DR Schrider","year":"2015","unstructured":"Schrider DR, Kern AD. Inferring selective constraint from population genomic data suggests recent regulatory turnover in the human brain. Genome Biol Evol. 2015; 7(12):3511\u201328.","journal-title":"Genome Biol Evol"},{"issue":"7","key":"280_CR27","doi-asserted-by":"publisher","first-page":"1673","DOI":"10.1093\/molbev\/msq053","volume":"27","author":"AD Kern","year":"2010","unstructured":"Kern AD, Haussler D. A population genetic hidden markov model for detecting genomic regions under selection. Mol Biol Evol. 2010; 27(7):1673\u201385.","journal-title":"Mol Biol Evol"},{"issue":"4","key":"280_CR28","doi-asserted-by":"publisher","first-page":"301","DOI":"10.1016\/j.tig.2017.12.005","volume":"34","author":"DR Schrider","year":"2018","unstructured":"Schrider DR, Kern AD. Supervised machine learning for population genetics: a new paradigm. Trends Genet. 2018; 34(4):301\u2013312.","journal-title":"Trends Genet"},{"issue":"3","key":"280_CR29","doi-asserted-by":"publisher","first-page":"235","DOI":"10.1016\/0304-4149(82)90011-4","volume":"13","author":"JFC Kingman","year":"1982","unstructured":"Kingman JFC. The coalescent. Stoch Process Appl. 1982; 13(3):235\u201348.","journal-title":"Stoch Process Appl"},{"key":"280_CR30","unstructured":"Sutskever I, Martens J, Dahl G, Hinton G. On the importance of initialization and momentum in deep learning. In: Int Conf Mach Learn: 2013. p. 1139\u201347."},{"key":"280_CR31","volume-title":"Icml","author":"AM Saxe","year":"2011","unstructured":"Saxe AM, Koh PW, Chen Z, Bhand M, Suresh B, Ng AY. On random weights and unsupervised feature learning. In: Icml. New York: ACM: 2011."},{"issue":"7063","key":"280_CR32","first-page":"04226","volume":"437","author":"D Altshuler","year":"2005","unstructured":"Altshuler D, Donnelly P, Consortium IH, et al.A haplotype map of the human genome. Nature. 2005; 437(7063):04226.","journal-title":"Nature"},{"issue":"13","key":"280_CR33","doi-asserted-by":"publisher","first-page":"1431","DOI":"10.1002\/1873-3468.13447","volume":"593","author":"M Szpak","year":"2019","unstructured":"Szpak M, Xue Y, Ayub Q, Tyler-Smith C. How well do we understand the basis of classic selective sweeps in humans?. FEBS Lett. 2019; 593(13):1431\u201348.","journal-title":"FEBS Lett"}],"container-title":["BioData Mining"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13040-021-00280-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13040-021-00280-9\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13040-021-00280-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,12,4]],"date-time":"2021-12-04T21:35:59Z","timestamp":1638653759000},"score":1,"resource":{"primary":{"URL":"https:\/\/biodatamining.biomedcentral.com\/articles\/10.1186\/s13040-021-00280-9"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,12]]},"references-count":33,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,12]]}},"alternative-id":["280"],"URL":"https:\/\/doi.org\/10.1186\/s13040-021-00280-9","relation":{},"ISSN":["1756-0381"],"issn-type":[{"value":"1756-0381","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,12]]},"assertion":[{"value":"15 June 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"25 October 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"4 December 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"51"}}