{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T04:25:33Z","timestamp":1772166333601,"version":"3.50.1"},"reference-count":42,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2021,2,15]],"date-time":"2021-02-15T00:00:00Z","timestamp":1613347200000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"},{"start":{"date-parts":[[2021,2,15]],"date-time":"2021-02-15T00:00:00Z","timestamp":1613347200000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Open access funding provided by the Qatar National Library"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BioData Mining"],"published-print":{"date-parts":[[2021,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Background<\/jats:title>\n                    <jats:p>The data explosion caused by unprecedented advancements in the field of genomics is constantly challenging the conventional methods used in the interpretation of the human genome. The demand for robust algorithms over the recent years has brought huge success in the field of Deep Learning (DL) in solving many difficult tasks in image, speech and natural language processing by automating the manual process of architecture design. This has been fueled through the development of new DL architectures. Yet genomics possesses unique challenges that requires customization and development of new DL models.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Methods<\/jats:title>\n                    <jats:p>We proposed a new model, DASSI, by adapting a differential architecture search method and applying it to the Splice Site (SS) recognition task on DNA sequences to discover new high-performance convolutional architectures in an automated manner. We evaluated the discovered model against state-of-the-art tools to classify true and false SS in Homo sapiens (Human), Arabidopsis thaliana (Plant), Caenorhabditis elegans (Worm) and Drosophila melanogaster (Fly).<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>Our experimental evaluation demonstrated that the discovered architecture outperformed baseline models and fixed architectures and showed competitive results against state-of-the-art models used in classification of splice sites. The proposed model - DASSI has a compact architecture and showed very good results on a transfer learning task. The benchmarking experiments of execution time and precision on architecture search and evaluation process showed better performance on recently available GPUs making it feasible to adopt architecture search based methods on large datasets.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Conclusions<\/jats:title>\n                    <jats:p>We proposed the use of differential architecture search method (DASSI) to perform SS classification on raw DNA sequences, and discovered new neural network models with low number of tunable parameters and competitive performance compared with manually engineered architectures. We have extensively benchmarked DASSI model with other state-of-the-art models and assessed its computational efficiency. The results have shown a high potential of using automated architecture search mechanism for solving various problems in the field of genomics.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1186\/s13040-021-00237-y","type":"journal-article","created":{"date-parts":[[2021,2,17]],"date-time":"2021-02-17T09:16:29Z","timestamp":1613553389000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["DASSI: differential architecture search for splice identification from DNA sequences"],"prefix":"10.1186","volume":"14","author":[{"given":"Shabir","family":"Moosa","sequence":"first","affiliation":[]},{"given":"Prof. Abbes","family":"Amira","sequence":"additional","affiliation":[]},{"given":"Dr. Sabri","family":"Boughorbel","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2021,2,15]]},"reference":[{"key":"237_CR1","doi-asserted-by":"publisher","first-page":"4308","DOI":"10.1038\/ncomms5308","volume":"5","author":"P Baldi","year":"2014","unstructured":"Baldi P, Sadowski P, Whiteson D. Searching for exotic particles in high-energy physics with deep learning. Nat Commun. 2014; 5:4308.","journal-title":"Nat Commun"},{"issue":"16","key":"237_CR2","doi-asserted-by":"publisher","first-page":"1291","DOI":"10.1002\/jcc.24764","volume":"38","author":"G Goh","year":"2017","unstructured":"Goh G, Hodas N, Vishnu A. Deep learning for computational chemistry. J Comput Chem. 2017; 38(16):1291\u2013307.","journal-title":"J Comput Chem"},{"issue":"7639","key":"237_CR3","doi-asserted-by":"publisher","first-page":"115","DOI":"10.1038\/nature21056","volume":"542","author":"A Esteva","year":"2017","unstructured":"Esteva A, Kuprel B, Novoa R, Ko J, Swetter S, Blau H, Thrun S. Dermatologist-level classification of skin cancer with deep neural networks. Nat. 2017; 542(7639):115.","journal-title":"Nat"},{"key":"237_CR4","unstructured":"Liu H, Simonyan K, Vinyals O, Fernando C, Kavukcuoglu K. Hierarchical representations for efficient architecture search. 2017. Preprint at https:\/\/arxiv.org\/abs\/1711.00436."},{"issue":"1","key":"237_CR5","doi-asserted-by":"publisher","first-page":"4780","DOI":"10.1609\/aaai.v33i01.33014780","volume":"33","author":"E Real","year":"2019","unstructured":"Real E, Aggarwal A, Huang Y, Le Q. Regularized evolution for image classifier architecture search. Proceedings of the Thirty-Third AAAI conference on artificial intelligence. 2019; 33(1):4780\u20134789.","journal-title":"Proceedings of the Thirty-Third AAAI conference on artificial intelligence"},{"key":"237_CR6","unstructured":"Zoph B, Le QV. Neural architecture search with reinforcement learning. 2016. Preprint at https:\/\/arxiv.org\/abs\/1611.01578."},{"key":"237_CR7","doi-asserted-by":"crossref","unstructured":"Zoph B, Vasudevan V, Shlens J, Le Q. Learning transferable architectures for scalable image recognition. In: Proceedings of the Thirty-First IEEE conference on computer vision and pattern recognition: 18-22 June 2018. Utah: 2017. p. 8697\u20138710.","DOI":"10.1109\/CVPR.2018.00907"},{"issue":"1","key":"237_CR8","doi-asserted-by":"publisher","first-page":"161","DOI":"10.1145\/321992.322002","volume":"24","author":"P Summers","year":"1977","unstructured":"Summers P. A methodology for lisp program construction from examples. J ACM (JACM). 1977; 24(1):161\u201375.","journal-title":"J ACM (JACM)"},{"key":"237_CR9","unstructured":"Baker B, Gupta O, Raskar R, Naik N. Accelerating neural architecture search using performance prediction. 2017. Preprint at https:\/\/arxiv.org\/abs\/1705.10823."},{"key":"237_CR10","unstructured":"Brock A, Lim T, Ritchie JM, Weston N. Smash: one-shot model architecture search through hypernetworks. 2017. Preprint at https:\/\/arxiv.org\/abs\/1708.05344."},{"key":"237_CR11","unstructured":"Pham H, Guan M, Zoph B, Le Q, Dean J. Efficient neural architecture search via parameters sharing. In: Proceedings of the Thirty-Fifth International Conference on Machine Learning: 10-15 July. Stockholm: 2018. p. 4095\u20134104."},{"key":"237_CR12","unstructured":"Liu H, Simonyan K, Yang Y. Darts: Differentiable architecture search. 2018. Preprint at https:\/\/arxiv.org\/abs\/1806.09055."},{"key":"237_CR13","unstructured":"Lee B, Lee T, Na B, Yoon S. DNA-level splice junction prediction using deep recurrent neural networks. 2015. Preprint at https:\/\/arxiv.org\/abs\/1512.05135."},{"issue":"14","key":"237_CR14","doi-asserted-by":"publisher","first-page":"4570","DOI":"10.1093\/nar\/gkq211","volume":"38","author":"K Au","year":"2010","unstructured":"Au K, Jiang H, Lin L, Xing Y, Wong W. Detection of splice junctions from paired-end rna-seq data by splicemap. Nucleic Acids Res. 2010; 38(14):4570\u20138.","journal-title":"Nucleic Acids Res"},{"issue":"9","key":"237_CR15","doi-asserted-by":"publisher","first-page":"1105","DOI":"10.1093\/bioinformatics\/btp120","volume":"25","author":"C Trapnell","year":"2009","unstructured":"Trapnell C, Pachter L, Salzberg S. Tophat: discovering splice junctions with rna-seq. Bioinforma. 2009; 25(9):1105\u201311.","journal-title":"Bioinforma"},{"issue":"5","key":"237_CR16","first-page":"1","volume":"7","author":"AK Baten","year":"2006","unstructured":"Baten AK, Chang BC, Halgamuge SK, Li J. Splice site identification using probabilistic parameters and svm classification. BMC Bioinformatics BioMed Central. 2006; 7(5):1\u201315.","journal-title":"BMC Bioinformatics BioMed Central"},{"issue":"1","key":"237_CR17","doi-asserted-by":"publisher","first-page":"16","DOI":"10.1186\/s13015-016-0078-4","volume":"11","author":"P Meher","year":"2016","unstructured":"Meher P, Sahu T, Rao A, Wahi S. Identification of donor splice sites using support vector machine: a computational approach based on positional, compositional and dependency features. Algorithm Mol Biol. 2016; 11(1):16.","journal-title":"Algorithm Mol Biol"},{"issue":"1","key":"237_CR18","doi-asserted-by":"publisher","first-page":"73","DOI":"10.1016\/j.eswa.2005.09.052","volume":"30","author":"Y Zhang","year":"2006","unstructured":"Zhang Y, Chu C-H, Chen Y, Zha H, Ji X. Splice site prediction using support vector machines with a bayes kernel. Expert Syst Appl. 2006; 30(1):73\u201381.","journal-title":"Expert Syst Appl"},{"key":"237_CR19","doi-asserted-by":"crossref","unstructured":"Wei D, Zhuang W, Jiang Q, Wei Y. A new classification method for human gene splice site prediction In: He J, Liu X, Krupinski EA, Xu G, editors. Health Information Science. Springer: 2012. p. 121\u201330.","DOI":"10.1007\/978-3-642-29361-0_16"},{"key":"237_CR20","doi-asserted-by":"publisher","first-page":"159","DOI":"10.1016\/j.compbiolchem.2018.02.005","volume":"73","author":"E Pashaei","year":"2018","unstructured":"Pashaei E, Aydin N. Markovian encoding models in human splice site recognition using svm. Comput Biol Chem. 2018; 73:159\u201370.","journal-title":"Comput Biol Chem"},{"key":"237_CR21","volume-title":"Proceedings of the Sixth International Conference on Computer and Knowledge Engineering (ICCKE): 20-21 October 2016","author":"E Pashaei","year":"2016","unstructured":"Pashaei E, Yilmaz A, Aydin N. A combined SVM and Markov model approach for splice site identification. In: Proceedings of the Sixth International Conference on Computer and Knowledge Engineering (ICCKE): 20-21 October 2016. Mashhad: IEEE: 2016. p. 200\u20134."},{"issue":"1","key":"237_CR22","doi-asserted-by":"publisher","first-page":"4","DOI":"10.1186\/s13040-016-0086-4","volume":"9","author":"P Meher","year":"2016","unstructured":"Meher P, Sahu T, Rao A. Prediction of donor splice sites using random forest with a new sequence encoding approach. BioData Min. 2016; 9(1):4.","journal-title":"BioData Min"},{"issue":"1","key":"237_CR23","doi-asserted-by":"publisher","first-page":"141","DOI":"10.1007\/s12553-016-0157-z","volume":"7","author":"E Pashaei","year":"2017","unstructured":"Pashaei E, Ozen M, Aydin N. Splice site identification in human genome using random forest. Health Technol. 2017; 7(1):141\u201352.","journal-title":"Health Technol"},{"key":"237_CR24","volume-title":"Proceedings of the Fourteenth Mediterranean Conference on Medical and Biological Engineering and Computing: 31 March-2 April 2016","author":"E Pashaei","year":"2016","unstructured":"Pashaei E, Ozen M, Aydin N. Random forest in splice site prediction of human genome. In: Proceedings of the Fourteenth Mediterranean Conference on Medical and Biological Engineering and Computing: 31 March-2 April 2016. Paphos: Springer: 2016. p. 518\u201323."},{"issue":"04","key":"237_CR25","doi-asserted-by":"publisher","first-page":"527","DOI":"10.1142\/S0218126607003885","volume":"16","author":"H Lopes","year":"2007","unstructured":"Lopes H, Erig Lima C, Murata N. A configware approach for high-speed parallel analysis of genomic data. J Circ Syst Comput. 2007; 16(04):527\u201340.","journal-title":"J Circ Syst Comput"},{"issue":"7","key":"237_CR26","doi-asserted-by":"publisher","first-page":"99982","DOI":"10.1371\/journal.pone.0099982","volume":"9","author":"U Kamath","year":"2014","unstructured":"Kamath U, De Jong K, Shehu A. Effective automated feature construction and selection for classification of biological sequences. PloS one. 2014; 9(7):99982.","journal-title":"PloS one"},{"issue":"4","key":"237_CR27","doi-asserted-by":"publisher","first-page":"2771","DOI":"10.1016\/j.eswa.2009.09.014","volume":"37","author":"Q Zhang","year":"2010","unstructured":"Zhang Q, Peng Q, Zhang Q, Yan Y, Li K, Li J. Splice sites prediction of human genome using length-variable markov model and feature selection. Expert Syst Appl. 2010; 37(4):2771\u201382.","journal-title":"Expert Syst Appl"},{"key":"237_CR28","volume-title":"Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics (SMC): 9-12 October 2016","author":"E Pashaei","year":"2016","unstructured":"Pashaei E, Yilmaz A, Ozen M, Aydin N. Prediction of splice site using AdaBoost with a new sequence encoding approach. In: Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics (SMC): 9-12 October 2016. Budapest: IEEE: 2016. p. 3853\u20133858."},{"key":"237_CR29","volume-title":"Proceedings of the 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC):16-20 August 2016","author":"E Pashaei","year":"2016","unstructured":"Pashaei E, Yilmaz A, Ozen M, Aydin N. A novel method for splice sites prediction using sequence component and hidden markov model. In: Proceedings of the 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC):16-20 August 2016. Florida: IEEE: 2016. p. 3076\u20139."},{"key":"237_CR30","volume-title":"Proceedings of the IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI):24-27 February 2016","author":"E Pashaei","year":"2016","unstructured":"Pashaei E, Ozen M, Aydin N. Splice sites prediction of human genome using AdaBoost. In: Proceedings of the IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI):24-27 February 2016. Las Vegas: IEEE: 2016. p. 300\u20133."},{"key":"237_CR31","volume-title":"Proceedings of the International Conference on Computer Science and Engineering (UBMK):5-7 July 2017","author":"E Pashaei","year":"2017","unstructured":"Pashaei E, Aydin N. Frequency difference based DNA encoding methods in human splice site recognition. In: Proceedings of the International Conference on Computer Science and Engineering (UBMK):5-7 July 2017. London: IEEE: 2017. p. 586\u201391."},{"key":"237_CR32","volume-title":"Proceedings of the Fifth International Meeting on Computational Intelligence Methods for Bioinformatics and Biostatistics:3-4 October 2008","author":"T Ryen","year":"2008","unstructured":"Ryen T, Eftes T, Kjosmoen T, Ruoff P, et al. Splice site prediction using artificial neural networks. In: Proceedings of the Fifth International Meeting on Computational Intelligence Methods for Bioinformatics and Biostatistics:3-4 October 2008. Berlin: Springer: 2008. p. 102\u201313."},{"key":"237_CR33","doi-asserted-by":"crossref","unstructured":"Elsousy R, Kathiresan N, Boughorbel S. On the depth of deep learning models for splice site identification. bioRxiv,. 2018:380667.","DOI":"10.1101\/380667"},{"key":"237_CR34","doi-asserted-by":"publisher","first-page":"32958","DOI":"10.1109\/ACCESS.2018.2848847","volume":"6","author":"X Du","year":"2018","unstructured":"Du X, Yao Y, Diao Y, Zhu H, Zhang Y, Li S. Deepss: Exploring splice site motif through convolutional neural network directly from dna sequence. IEEE Access. 2018; 6:32958\u201378.","journal-title":"IEEE Access"},{"key":"237_CR35","doi-asserted-by":"crossref","first-page":"100035","DOI":"10.1016\/j.gene.2020.100035","volume":"5","author":"S Albaradei","year":"2020","unstructured":"Albaradei S, Magana-Mora A, Thafar M, Uludag M, Bajic VB, Gojobori T, Magbubah E, Jankovic BR. Splice2Deep: An ensemble of deep convolutional neural networks for improved splice site prediction in genomic DNA. Gene: X. 2020; 5:100035.","journal-title":"Gene: X"},{"issue":"23","key":"237_CR36","doi-asserted-by":"publisher","first-page":"652","DOI":"10.1186\/s12859-019-3306-3","volume":"20","author":"R Wang","year":"2019","unstructured":"Wang R, Wang Z, Wang J, Li S. Splicefinder: ab initio prediction of splice sites using convolutional neural network. BMC Bioinforma. 2019; 20(23):652.","journal-title":"BMC Bioinforma"},{"issue":"8","key":"237_CR37","doi-asserted-by":"publisher","first-page":"954","DOI":"10.1089\/cmb.2018.0041","volume":"25","author":"T Naito","year":"2018","unstructured":"Naito T. Human splice-site prediction with deep neural networks. J Comput Biol. 2018; 25(8):954\u201361.","journal-title":"J Comput Biol"},{"key":"237_CR38","unstructured":"Kothen-Hill ST, Zviran A, Schulman RC, Deochand S, Gaiti F, Maloney D, Huang K, Liao W, Robine N, Omans ND, Landau D. Deep learning mutation prediction enables early stage lung cancer detection in liquid biopsy. In: Proceedings of the Sixth International Conference on Learning Representations: 30 April-3 May 2018. Vancouver: 2018."},{"key":"237_CR39","unstructured":"Lee T, Yoon S. Boosted categorical restricted Boltzmann machine for computational prediction of splice junctions. In: Proceedings of the Thirty-Second International conference on machine learning: 6-11 July 2015. France: 2015. p. 2483\u201392."},{"key":"237_CR40","volume-title":"Proceedings of the Seventh ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics: 2-5 October 2016","author":"B Lee","year":"2016","unstructured":"Lee B, Baek J, Park S, Yoon S. deepTarget: end-to-end learning framework for microRNA target prediction using deep recurrent neural networks. In: Proceedings of the Seventh ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics: 2-5 October 2016. Seattle: ACM: 2016. p. 434\u201342."},{"issue":"1","key":"237_CR41","doi-asserted-by":"publisher","first-page":"8222","DOI":"10.1038\/s41598-017-08523-8","volume":"7","author":"Z-C Xu","year":"2017","unstructured":"Xu Z-C, Wang P, Qiu W-R, Xiao X. iss-pc: Identifying splicing sites via physical-chemical properties using deep sparse auto-encoder. Sci Rep. 2017; 7(1):8222.","journal-title":"Sci Rep"},{"key":"237_CR42","doi-asserted-by":"crossref","unstructured":"Sonnenburg S, Schweikert G, Philips P, Behr J, R\u00e4tsch G. Accurate splice site prediction using support vector machines. In: BMC Bioinforma, vol. 8. Springer: 2007. p. 7.","DOI":"10.1186\/1471-2105-8-S10-S7"}],"container-title":["BioData Mining"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s13040-021-00237-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/s13040-021-00237-y\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s13040-021-00237-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,17]],"date-time":"2022-12-17T15:54:06Z","timestamp":1671292446000},"score":1,"resource":{"primary":{"URL":"https:\/\/biodatamining.biomedcentral.com\/articles\/10.1186\/s13040-021-00237-y"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,2,15]]},"references-count":42,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,12]]}},"alternative-id":["237"],"URL":"https:\/\/doi.org\/10.1186\/s13040-021-00237-y","relation":{"has-preprint":[{"id-type":"doi","id":"10.21203\/rs.3.rs-39558\/v2","asserted-by":"object"},{"id-type":"doi","id":"10.21203\/rs.3.rs-39558\/v1","asserted-by":"object"},{"id-type":"doi","id":"10.21203\/rs.3.rs-39558\/v3","asserted-by":"object"}]},"ISSN":["1756-0381"],"issn-type":[{"value":"1756-0381","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,2,15]]},"assertion":[{"value":"1 July 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 January 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"15 February 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Not applicable","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"15"}}