{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,22]],"date-time":"2025-11-22T11:16:10Z","timestamp":1763810170288,"version":"3.37.3"},"reference-count":34,"publisher":"Oxford University Press (OUP)","issue":"8","license":[{"start":{"date-parts":[[2020,1,6]],"date-time":"2020-01-06T00:00:00Z","timestamp":1578268800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["61872395","U1611265"],"award-info":[{"award-number":["61872395","U1611265"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100003453","name":"Natural Science Foundation of Guangdong","doi-asserted-by":"publisher","award":["2018A030313285"],"award-info":[{"award-number":["2018A030313285"]}],"id":[{"id":"10.13039\/501100003453","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Pearl River Nova Program of Guangzhou","award":["201710010044"],"award-info":[{"award-number":["201710010044"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,4,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Polyadenylation plays a regulatory role in transcription. The recognition of polyadenylation signal (PAS) motif sequence is an important step in polyadenylation. In the past few years, some statistical machine learning-based and deep learning-based methods have been proposed for PAS identification. Although these methods predict PAS with success, there is room for their improvement on PAS identification.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>In this study, we proposed a deep neural network-based computational method, called SANPolyA, for identifying PAS in human and mouse genomes. SANPolyA requires no manually crafted sequence features. We compared our method SANPolyA with several previous PAS identification methods on several PAS benchmark datasets. Our results showed that SANPolyA outperforms the state-of-art methods. SANPolyA also showed good performance on leave-one-motif-out evaluation.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>https:\/\/github.com\/yuht4\/SANPolyA.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btz970","type":"journal-article","created":{"date-parts":[[2020,1,1]],"date-time":"2020-01-01T12:09:17Z","timestamp":1577880557000},"page":"2393-2400","source":"Crossref","is-referenced-by-count":23,"title":["SANPolyA: a deep learning method for identifying Poly(A) signals"],"prefix":"10.1093","volume":"36","author":[{"given":"Haitao","family":"Yu","sequence":"first","affiliation":[{"name":"School of Data and Computer Science"}]},{"given":"Zhiming","family":"Dai","sequence":"additional","affiliation":[{"name":"School of Data and Computer Science"},{"name":"Guangdong Province Key Laboratory of Big Data Analysis and Processing, Sun Yat-Sen University , Guangzhou 510006, China"}]}],"member":"286","published-online":{"date-parts":[[2020,1,6]]},"reference":[{"key":"2023013110170055200_btz970-B1","doi-asserted-by":"crossref","first-page":"646","DOI":"10.1186\/1471-2164-11-646","article-title":"POLYAR, a new computer program for prediction of poly(A) sites in human sequences","volume":"11","author":"Akhtar","year":"2010","journal-title":"BMC Genomics"},{"key":"2023013110170055200_btz970-B2","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1016\/j.ymeth.2019.04.001","article-title":"Hybrid model for efficient prediction of poly(A) signals in human genomic DNA","volume":"166","author":"Albalawi","year":"2019","journal-title":"Methods"},{"key":"2023013110170055200_btz970-B3","doi-asserted-by":"crossref","first-page":"4577","DOI":"10.1093\/bioinformatics\/btz283","article-title":"DeepPASTA: deep neural network based polyadenylation site analysis","volume":"35","author":"Arefeen","year":"2019","journal-title":"Bioinformatics"},{"key":"2023013110170055200_btz970-B4","doi-asserted-by":"crossref","first-page":"1001","DOI":"10.1101\/gr.10.7.1001","article-title":"Patterns of variant polyadenylation signal usage in human genes","volume":"10","author":"Beaudoing","year":"2000","journal-title":"Genome Res"},{"key":"2023013110170055200_btz970-B5","doi-asserted-by":"crossref","first-page":"2614","DOI":"10.1093\/nar\/23.14.2614","article-title":"Cleavage site determinants in the mammalian polyadenylation signal","volume":"23","author":"Chen","year":"1995","journal-title":"Nucleic Acids Res"},{"key":"2023013110170055200_btz970-B6","doi-asserted-by":"crossref","first-page":"2796","DOI":"10.1093\/bioinformatics\/btz015","article-title":"i6mA-Pred: identifying DNA N6-methyladenine sites in the rice genome","volume":"35","author":"Chen","year":"2019","journal-title":"Bioinformatics"},{"year":"2015","author":"Chollet","key":"2023013110170055200_btz970-B7"},{"year":"2015","author":"Clevert","key":"2023013110170055200_btz970-B8"},{"key":"2023013110170055200_btz970-B9","doi-asserted-by":"crossref","first-page":"1188","DOI":"10.1101\/gr.849004","article-title":"WebLogo: a sequence logo generator","volume":"14","author":"Crooks","year":"2004","journal-title":"Genome Res"},{"key":"2023013110170055200_btz970-B10","doi-asserted-by":"crossref","first-page":"3150","DOI":"10.1093\/bioinformatics\/bts565","article-title":"CD-HIT: accelerated for clustering the next-generation sequencing data","volume":"28","author":"Fu","year":"2012","journal-title":"Bioinformatics"},{"key":"2023013110170055200_btz970-B11","doi-asserted-by":"crossref","first-page":"1760","DOI":"10.1101\/gr.135350.111","article-title":"GENCODE: the reference human genome annotation for The ENCODE Project","volume":"22","author":"Harrow","year":"2012","journal-title":"Genome Res"},{"key":"2023013110170055200_btz970-B12","doi-asserted-by":"crossref","first-page":"1485","DOI":"10.1261\/rna.2107305","article-title":"Bioinformatic identification of candidate cis-regulatory elements involved in human mRNA polyadenylation","volume":"11","author":"Hu","year":"2005","journal-title":"RNA"},{"key":"2023013110170055200_btz970-B13","doi-asserted-by":"crossref","first-page":"220","DOI":"10.1186\/1471-2164-9-220","article-title":"Arabidopsis mRNA polyadenylation machinery: comprehensive analysis of protein-protein interactions and gene expression profiling","volume":"9","author":"Hunt","year":"2008","journal-title":"BMC Genomics"},{"key":"2023013110170055200_btz970-B14","doi-asserted-by":"crossref","first-page":"1125","DOI":"10.1093\/bioinformatics\/bty752","article-title":"DeepGSR: an optimized deep-learning structure for the recognition of genomic signals and regions","volume":"35","author":"Kalkatawi","year":"2019","journal-title":"Bioinformatics"},{"key":"2023013110170055200_btz970-B15","doi-asserted-by":"crossref","first-page":"127","DOI":"10.1093\/bioinformatics\/btr602","article-title":"Dragon PolyA Spotter: predictor of poly(A) motifs within human genomic DNA sequences","volume":"28","author":"Kalkatawi","year":"2012","journal-title":"Bioinformatics"},{"year":"2016","author":"Lei Ba","key":"2023013110170055200_btz970-B16"},{"key":"2023013110170055200_btz970-B17","doi-asserted-by":"crossref","first-page":"2889","DOI":"10.1093\/bioinformatics\/bty211","article-title":"Inference of the human polyadenylation code","volume":"34","author":"Leung","year":"2018","journal-title":"Bioinformatics"},{"key":"2023013110170055200_btz970-B18","doi-asserted-by":"crossref","first-page":"8460","DOI":"10.1093\/nar\/gks637","article-title":"An in-depth map of polyadenylation sites in cancer","volume":"40","author":"Lin","year":"2012","journal-title":"Nucleic Acids Res"},{"key":"2023013110170055200_btz970-B19","first-page":"84","article-title":"An in-silico method for prediction of polyadenylation signals in human sequences","volume":"14","author":"Liu","year":"2003","journal-title":"Genome Inform"},{"key":"2023013110170055200_btz970-B20","doi-asserted-by":"crossref","first-page":"620","DOI":"10.1186\/s12864-017-4033-7","article-title":"Omni-PolyA: a method and tool for accurate recognition of Poly(A) signals in human genomic DNA","volume":"18","author":"Magana-Mora","year":"2017","journal-title":"BMC Genomics"},{"key":"2023013110170055200_btz970-B21","doi-asserted-by":"crossref","first-page":"135","DOI":"10.1016\/S0097-8485(96)80015-5","article-title":"Detection of RNA polymerase II promoters and polyadenylation sites in human DNA sequence","volume":"20","author":"Matis","year":"1996","journal-title":"Comput. Chem"},{"key":"2023013110170055200_btz970-B22","doi-asserted-by":"crossref","first-page":"2320","DOI":"10.1093\/bioinformatics\/btl394","article-title":"Prediction of mRNA polyadenylation sites by support vector machine","volume":"22","author":"Miura","year":"2006","journal-title":"Bioinformatics"},{"key":"2023013110170055200_btz970-B23","doi-asserted-by":"crossref","first-page":"1770","DOI":"10.1101\/gad.17268411","article-title":"Ending the message: poly(A) signals then and now","volume":"25","author":"Proudfoot","year":"2011","journal-title":"Genes Dev"},{"key":"2023013110170055200_btz970-B24","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1093\/bioinformatics\/13.1.23","article-title":"Recognition of 3' -processing sites of human mRNA precursors","volume":"13","author":"Salamov","year":"1997","journal-title":"Bioinformatics"},{"key":"2023013110170055200_btz970-B25","doi-asserted-by":"crossref","first-page":"3150","DOI":"10.1093\/nar\/gkn158","article-title":"Genome level analysis of rice mRNA 3'-end processing signals and alternative polyadenylation","volume":"36","author":"Shen","year":"2008","journal-title":"Nucleic Acids Res"},{"key":"2023013110170055200_btz970-B26","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1016\/S0378-1119(99)00104-3","article-title":"Detection of polyadenylation signals in human DNA sequences","volume":"231","author":"Tabaska","year":"1999","journal-title":"Gene"},{"key":"2023013110170055200_btz970-B27","doi-asserted-by":"crossref","first-page":"201","DOI":"10.1093\/nar\/gki158","article-title":"A large-scale analysis of mRNA polyadenylation of human and mouse genes","volume":"33","author":"Tian","year":"2005","journal-title":"Nucleic Acids Res"},{"key":"2023013110170055200_btz970-B28","doi-asserted-by":"crossref","first-page":"1000","DOI":"10.1093\/nar\/28.4.1000","article-title":"Statistical analysis of yeast genomic downstream sequences reveals putative polyadenylation signals","volume":"28","author":"van Helden","year":"2000","journal-title":"Nucleic Acids Res"},{"year":"2017","author":"Vaswani","key":"2023013110170055200_btz970-B29"},{"key":"2023013110170055200_btz970-B30","doi-asserted-by":"crossref","first-page":"D315","DOI":"10.1093\/nar\/gkx1000","article-title":"PolyA_DB 3 catalogs cleavage and polyadenylation sites identified by deep sequencing in multiple genomes","volume":"46","author":"Wang","year":"2018","journal-title":"Nucleic Acids Res"},{"first-page":"3","year":"2018","author":"Wu","key":"2023013110170055200_btz970-B31"},{"key":"2023013110170055200_btz970-B32","doi-asserted-by":"crossref","first-page":"2371","DOI":"10.1093\/bioinformatics\/bty991","article-title":"DeeReCT-PolyA: a robust and generic deep learning method for PAS identification","volume":"35","author":"Xia","year":"2018","journal-title":"Bioinformatics"},{"key":"2023013110170055200_btz970-B33","doi-asserted-by":"crossref","first-page":"i316","DOI":"10.1093\/bioinformatics\/btt218","article-title":"Poly(A) motif prediction using spectral latent features from human DNA sequences","volume":"29","author":"Xie","year":"2013","journal-title":"Bioinformatics"},{"key":"2023013110170055200_btz970-B34","doi-asserted-by":"crossref","first-page":"1375","DOI":"10.1093\/nar\/gkg241","article-title":"Downstream elements of mammalian pre-mRNA polyadenylation signals: primary, secondary and higher-order structures","volume":"31","author":"Zarudnaya","year":"2003","journal-title":"Nucleic Acids Res"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btz970\/31911507\/btz970.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/8\/2393\/48984266\/bioinformatics_36_8_2393.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/8\/2393\/48984266\/bioinformatics_36_8_2393.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,31]],"date-time":"2023-01-31T20:22:20Z","timestamp":1675196540000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/36\/8\/2393\/5697087"}},"subtitle":[],"editor":[{"given":"Alfonso","family":"Valencia","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2020,1,6]]},"references-count":34,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2020,4,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btz970","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2020,4,15]]},"published":{"date-parts":[[2020,1,6]]}}}