{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T11:17:28Z","timestamp":1772191048781,"version":"3.50.1"},"reference-count":73,"publisher":"Oxford University Press (OUP)","issue":"4","license":[{"start":{"date-parts":[[2024,5,14]],"date-time":"2024-05-14T00:00:00Z","timestamp":1715644800000},"content-version":"vor","delay-in-days":1017,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0\/"},{"start":{"date-parts":[[2021,8,1]],"date-time":"2021-08-01T00:00:00Z","timestamp":1627776000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.elsevier.com\/tdm\/userlicense\/1.0\/"},{"start":{"date-parts":[[2021,9,8]],"date-time":"2021-09-08T00:00:00Z","timestamp":1631059200000},"content-version":"vor","delay-in-days":38,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100012166","name":"National Key R&D Program of China","doi-asserted-by":"publisher","award":["2016YFC0901702"],"award-info":[{"award-number":["2016YFC0901702"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100012166","name":"National Key R&D Program of China","doi-asserted-by":"publisher","award":["2017YFC0907503"],"award-info":[{"award-number":["2017YFC0907503"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100012166","name":"National Key R&D Program of China","doi-asserted-by":"publisher","award":["2016YFC0901002"],"award-info":[{"award-number":["2016YFC0901002"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100012166","name":"National Key R&D Program of 
China","doi-asserted-by":"publisher","award":["2018YFA0106901"],"award-info":[{"award-number":["2018YFA0106901"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["81902519"],"award-info":[{"award-number":["81902519"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["91940306"],"award-info":[{"award-number":["91940306"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["31871294"],"award-info":[{"award-number":["31871294"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["31701117"],"award-info":[{"award-number":["31701117"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["31970647"],"award-info":[{"award-number":["31970647"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Strategic Priority Research Program of Chinese Academy of Sciences","award":["XDB38040300"],"award-info":[{"award-number":["XDB38040300"]}]},{"name":"13th Five-year Informatization Plan of Chinese Academy of Sciences","award":["XXH13505-05"],"award-info":[{"award-number":["XXH13505-05"]}]},{"name":"Special Investigation on Science and Technology Basic Resources, Ministry of Science and Technology, 
China","award":["2019FY100102"],"award-info":[{"award-number":["2019FY100102"]}]}],"content-domain":{"domain":["elsevier.com","sciencedirect.com"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,8,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Small proteins specifically refer to proteins consisting of fewer than 100 amino acids translated from small open reading frames (sORFs), which were usually missed in previous genome annotation. The significance of small proteins has been revealed in recent years, along with the discovery of their diverse functions. However, systematic annotation of small proteins is still insufficient. SmProt was specially developed to provide valuable information on small proteins for the scientific community. Here we present an update of SmProt, which emphasizes the reliability of translated sORFs, genetic variants in translated sORFs, disease-specific sORF translation events or sequences, and a remarkably increased data volume. More components such as non-ATG translation initiation, function, and new sources are also included. SmProt incorporated 638,958 unique small proteins curated from 3,165,229 primary records, which were computationally predicted from 419 ribosome profiling (Ribo-seq) datasets or collected from literature and other sources from 370 cell lines or tissues in 8 species (Homo sapiens, Mus musculus, Rattus norvegicus, Drosophila melanogaster, Danio rerio, Saccharomyces cerevisiae, Caenorhabditis elegans, and Escherichia coli). In addition, small protein families identified from human microbiomes were also collected. 
All datasets in SmProt are freely accessible and available for browsing, searching, and bulk download at http:\/\/bigdata.ibp.ac.cn\/SmProt\/.<\/jats:p>","DOI":"10.1016\/j.gpb.2021.09.002","type":"journal-article","created":{"date-parts":[[2021,9,15]],"date-time":"2021-09-15T11:59:43Z","timestamp":1631707183000},"page":"602-610","update-policy":"https:\/\/doi.org\/10.1016\/elsevier_cm_policy","source":"Crossref","is-referenced-by-count":79,"title":["SmProt: A Reliable Repository with Comprehensive Annotation of Small Proteins Identified from Ribosome Profiling"],"prefix":"10.1093","volume":"19","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5256-6696","authenticated-orcid":false,"given":"Yanyan","family":"Li","sequence":"first","affiliation":[{"name":"College of Life Sciences, University of Chinese Academy of Sciences , Beijing 100049 , China"},{"name":"Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences , Beijing 100101 , China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7409-3092","authenticated-orcid":false,"given":"Honghong","family":"Zhou","sequence":"additional","affiliation":[{"name":"Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences , Beijing 100101 , China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0633-2984","authenticated-orcid":false,"given":"Xiaomin","family":"Chen","sequence":"additional","affiliation":[{"name":"Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences , Beijing 100101 , China"},{"name":"University of Chinese Academy of Sciences , Beijing 100049 , China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4936-8407","authenticated-orcid":false,"given":"Yu","family":"Zheng","sequence":"additional","affiliation":[{"name":"College of Life Sciences, University of Chinese Academy of Sciences , Beijing 100049 , China"},{"name":"Key 
Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences , Beijing 100101 , China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6790-5259","authenticated-orcid":false,"given":"Quan","family":"Kang","sequence":"additional","affiliation":[{"name":"Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences , Beijing 100101 , China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0082-0730","authenticated-orcid":false,"given":"Di","family":"Hao","sequence":"additional","affiliation":[{"name":"Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences , Beijing 100101 , China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3601-0150","authenticated-orcid":false,"given":"Lili","family":"Zhang","sequence":"additional","affiliation":[{"name":"Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences , Beijing 100101 , China"},{"name":"University of Chinese Academy of Sciences , Beijing 100049 , China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2967-7704","authenticated-orcid":false,"given":"Tingrui","family":"Song","sequence":"additional","affiliation":[{"name":"Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences , Beijing 100101 , China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9944-0345","authenticated-orcid":false,"given":"Huaxia","family":"Luo","sequence":"additional","affiliation":[{"name":"Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences , Beijing 100101 , China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1384-4176","authenticated-orcid":false,"given":"Yajing","family":"Hao","sequence":"additional","affiliation":[{"name":"Department of Cellular and Molecular Medicine, 
University of California , San Diego, La Jolla, CA 92093 , USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6049-8347","authenticated-orcid":false,"given":"Runsheng","family":"Chen","sequence":"additional","affiliation":[{"name":"Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences , Beijing 100101 , China"},{"name":"University of Chinese Academy of Sciences , Beijing 100049 , China"},{"name":"Guangdong Geneway Decoding Bio-Tech Co. Ltd , Foshan 528316 , China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9303-1639","authenticated-orcid":false,"given":"Peng","family":"Zhang","sequence":"additional","affiliation":[{"name":"Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences , Beijing 100101 , China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7294-0865","authenticated-orcid":false,"given":"Shunmin","family":"He","sequence":"additional","affiliation":[{"name":"College of Life Sciences, University of Chinese Academy of Sciences , Beijing 100049 , China"},{"name":"Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences , Beijing 100101 , China"}]}],"member":"286","published-online":{"date-parts":[[2021,9,15]]},"reference":[{"key":"2024051404095927800_b0005","doi-asserted-by":"crossref","first-page":"768","DOI":"10.1101\/gr.7.8.768","article-title":"Small open reading frames: beautiful needles in the haystack","volume":"7","author":"Basrai","year":"1997","journal-title":"Genome Res"},{"key":"2024051404095927800_b0010","doi-asserted-by":"crossref","DOI":"10.1016\/j.cell.2019.07.016","article-title":"Large-scale analyses of human microbiomes reveal thousands of small, novel 
genes","volume":"178","author":"Sberro","year":"2019","journal-title":"Cell"},{"key":"2024051404095927800_b0015","doi-asserted-by":"crossref","first-page":"981","DOI":"10.1002\/embj.201488411","article-title":"Identification of small ORFs in vertebrates using ribosome footprinting and evolutionary conservation","volume":"33","author":"Bazzini","year":"2014","journal-title":"EMBO J"},{"key":"2024051404095927800_b0020","doi-asserted-by":"crossref","first-page":"1858","DOI":"10.1016\/j.celrep.2014.05.023","article-title":"Translation of small open reading frames within unannotated RNA transcripts in Saccharomyces cerevisiae","volume":"7","author":"Smith","year":"2014","journal-title":"Cell Rep"},{"key":"2024051404095927800_b0025","doi-asserted-by":"crossref","DOI":"10.1016\/j.cell.2019.05.010","article-title":"The translational landscape of the human heart","volume":"178","author":"van Heesch","year":"2019","journal-title":"Cell"},{"key":"2024051404095927800_b0030","doi-asserted-by":"crossref","first-page":"7507","DOI":"10.1073\/pnas.0810916106","article-title":"Upstream open reading frames cause widespread reduction of protein expression and are polymorphic among humans","volume":"106","author":"Calvo","year":"2009","journal-title":"Proc Natl Acad Sci U S A"},{"key":"2024051404095927800_b0035","doi-asserted-by":"crossref","first-page":"1295","DOI":"10.3389\/fphar.2018.01295","article-title":"Peptides\/proteins encoded by non-coding RNA: a novel resource bank for drug targets and biomarkers","volume":"9","author":"Zhu","year":"2018","journal-title":"Front Pharmacol"},{"key":"2024051404095927800_b0040","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.yexcr.2017.10.010","article-title":"Translation of noncoding RNAs: focus on lncRNAs, pri-miRNAs, and circRNAs","volume":"361","author":"Li","year":"2017","journal-title":"Exp Cell 
Res"},{"key":"2024051404095927800_b0045","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1166\/jpsp.2017.1070","article-title":"Decoding of non-coding DNA and non-coding RNA: pri-micro RNA-encoded novel peptides regulate migration of cancer cells","volume":"3","author":"Fang","year":"2017","journal-title":"J Pharm Sci Pharmacol"},{"key":"2024051404095927800_b0050","doi-asserted-by":"crossref","first-page":"206","DOI":"10.3390\/genes8080206","article-title":"Viral infection identifies micropeptides differentially regulated in smORF-containing lncRNAs","volume":"8","author":"Razooky","year":"2017","journal-title":"Genes (Basel)"},{"key":"2024051404095927800_b0055","doi-asserted-by":"crossref","DOI":"10.1016\/j.molcel.2017.09.015","article-title":"A peptide encoded by a putative lncRNA HOXB-AS3 suppresses colon cancer growth","volume":"68","author":"Huang","year":"2017","journal-title":"Mol Cell"},{"key":"2024051404095927800_b0060","doi-asserted-by":"crossref","first-page":"4475","DOI":"10.1038\/s41467-018-06862-2","article-title":"A peptide encoded by circular form of LINC-PINT suppresses oncogenic transcriptional elongation in glioblastoma","volume":"9","author":"Zhang","year":"2018","journal-title":"Nat Commun"},{"key":"2024051404095927800_b0065","doi-asserted-by":"crossref","first-page":"575","DOI":"10.1038\/nrm.2017.58","article-title":"Classification and function of small open reading frames","volume":"18","author":"Couso","year":"2017","journal-title":"Nat Rev Mol Cell Biol"},{"key":"2024051404095927800_b0070","doi-asserted-by":"crossref","first-page":"2116","DOI":"10.1016\/j.celrep.2017.08.014","article-title":"Loss of Apela peptide in mice causes low penetrance embryonic lethality and defects in early mesodermal derivatives","volume":"20","author":"Freyer","year":"2017","journal-title":"Cell Rep"},{"key":"2024051404095927800_b0075","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pbio.0050106","article-title":"Peptides encoded by short ORFs 
control development and define a new eukaryotic gene family","volume":"5","author":"Galindo","year":"2007","journal-title":"PLoS Biol"},{"key":"2024051404095927800_b0080","doi-asserted-by":"crossref","first-page":"456","DOI":"10.1038\/nature01627","article-title":"Humanin peptide suppresses apoptosis by interfering with Bax activation","volume":"423","author":"Guo","year":"2003","journal-title":"Nature"},{"key":"2024051404095927800_b0085","doi-asserted-by":"crossref","first-page":"595","DOI":"10.1016\/j.cell.2015.01.009","article-title":"A micropeptide encoded by a putative long noncoding RNA regulates muscle performance","volume":"160","author":"Anderson","year":"2015","journal-title":"Cell"},{"key":"2024051404095927800_b0090","doi-asserted-by":"crossref","first-page":"1061","DOI":"10.2174\/0929866523666160719124712","article-title":"Proline-rich antimicrobial peptides optimized for binding to Escherichia coli chaperone DnaK","volume":"23","author":"Knappe","year":"2016","journal-title":"Protein Pept Lett"},{"key":"2024051404095927800_b0095","doi-asserted-by":"crossref","first-page":"228","DOI":"10.1038\/ng.276","article-title":"Loss-of-function mutations of an inhibitory upstream ORF in the human hairless transcript cause Marie Unna hereditary hypotrichosis","volume":"41","author":"Wen","year":"2009","journal-title":"Nat Genet"},{"key":"2024051404095927800_b0100","doi-asserted-by":"crossref","first-page":"51","DOI":"10.1038\/s41467-017-02495-z","article-title":"C9ORF72 GGGGCC repeat-associated non-AUG translation is upregulated by stress through eIF2alpha phosphorylation","volume":"9","author":"Cheng","year":"2018","journal-title":"Nat Commun"},{"key":"2024051404095927800_b0105","doi-asserted-by":"crossref","DOI":"10.1002\/pmic.201700038","article-title":"Small but mighty: functional peptides encoded by small ORFs in 
plants","volume":"18","author":"Hsu","year":"2018","journal-title":"Proteomics"},{"key":"2024051404095927800_b0110","doi-asserted-by":"crossref","first-page":"218","DOI":"10.1126\/science.1168978","article-title":"Genome-wide analysis in vivo of translation with nucleotide resolution using ribosome profiling","volume":"324","author":"Ingolia","year":"2009","journal-title":"Science"},{"key":"2024051404095927800_b0115","doi-asserted-by":"crossref","first-page":"789","DOI":"10.1016\/j.cell.2011.10.002","article-title":"Ribosome profiling of mouse embryonic stem cells reveals the complexity and dynamics of mammalian proteomes","volume":"147","author":"Ingolia","year":"2011","journal-title":"Cell"},{"key":"2024051404095927800_b0120","doi-asserted-by":"crossref","first-page":"1509","DOI":"10.1126\/science.1216974","article-title":"Translation goes global","volume":"334","author":"Weiss","year":"2011","journal-title":"Science"},{"key":"2024051404095927800_b0125","doi-asserted-by":"crossref","first-page":"209","DOI":"10.1038\/nchembio.304","article-title":"Inhibition of eukaryotic translation elongation by cycloheximide and lactimidomycin","volume":"6","author":"Schneider-Poetsch","year":"2010","journal-title":"Nat Chem Biol"},{"key":"2024051404095927800_b0130","doi-asserted-by":"crossref","first-page":"728","DOI":"10.1016\/j.tig.2017.08.003","article-title":"Beyond read-counts: Ribo-seq data analysis to understand the functions of the transcriptome","volume":"33","author":"Calviello","year":"2017","journal-title":"Trends Genet"},{"key":"2024051404095927800_b0135","doi-asserted-by":"crossref","first-page":"1534","DOI":"10.1038\/nprot.2012.086","article-title":"The ribosome profiling strategy for monitoring translation in vivo by deep sequencing of ribosome-protected mRNA fragments","volume":"7","author":"Ingolia","year":"2012","journal-title":"Nat 
Protoc"},{"key":"2024051404095927800_b0140","doi-asserted-by":"crossref","first-page":"E2424","DOI":"10.1073\/pnas.1207846109","article-title":"Global mapping of translation initiation sites in mammalian cells at single-nucleotide resolution","volume":"109","author":"Lee","year":"2012","journal-title":"Proc Natl Acad Sci U S A"},{"key":"2024051404095927800_b0145","doi-asserted-by":"crossref","first-page":"491","DOI":"10.1007\/s00438-005-1152-7","article-title":"The role of alternative translation start sites in the generation of human protein diversity","volume":"273","author":"Kochetov","year":"2005","journal-title":"Mol Genet Genomics"},{"key":"2024051404095927800_b0150","doi-asserted-by":"crossref","first-page":"1000","DOI":"10.1074\/mcp.M600297-MCP200","article-title":"Diversity of translation start sites may define increased complexity of the human short ORFeome","volume":"6","author":"Oyama","year":"2007","journal-title":"Mol Cell Proteomics"},{"key":"2024051404095927800_b0155","doi-asserted-by":"crossref","first-page":"165","DOI":"10.1038\/nmeth.3688","article-title":"Detecting actively translated open reading frames in ribosome profiling data","volume":"13","author":"Calviello","year":"2016","journal-title":"Nat Methods"},{"key":"2024051404095927800_b0160","doi-asserted-by":"crossref","first-page":"816","DOI":"10.1016\/j.molcel.2015.11.013","article-title":"A regression-based analysis of ribosome-profiling data reveals a conserved complexity to mammalian translation","volume":"60","author":"Fields","year":"2015","journal-title":"Mol Cell"},{"key":"2024051404095927800_b0165","doi-asserted-by":"crossref","DOI":"10.7554\/eLife.08890","article-title":"Many lncRNAs, 5\u2032 UTRs, and pseudogenes are translated and some are likely to express functional 
proteins","volume":"4","author":"Ji","year":"2015","journal-title":"eLife"},{"key":"2024051404095927800_b0170","doi-asserted-by":"crossref","first-page":"1749","DOI":"10.1038\/s41467-017-01981-8","article-title":"Genome-wide identification and differential analysis of translational initiation","volume":"8","author":"Zhang","year":"2017","journal-title":"Nat Commun"},{"key":"2024051404095927800_b0175","first-page":"2960","article-title":"Bayesian prediction of RNA translation from ribosome profiling","volume":"45","author":"Malone","year":"2017","journal-title":"Nucleic Acids Res"},{"key":"2024051404095927800_b0180","doi-asserted-by":"crossref","DOI":"10.7554\/eLife.13328","article-title":"Thousands of novel translated open reading frames in humans inferred by ribosome footprint profiling","volume":"5","author":"Raj","year":"2016","journal-title":"eLife"},{"key":"2024051404095927800_b0185","doi-asserted-by":"crossref","first-page":"482","DOI":"10.1186\/s12859-016-1355-4","article-title":"SPECtre: a spectral coherence\u2013based classifier of actively translated transcripts from ribosome profiling sequence data","volume":"17","author":"Chun","year":"2016","journal-title":"BMC Bioinformatics"},{"key":"2024051404095927800_b0190","doi-asserted-by":"crossref","DOI":"10.1093\/nar\/gku1283","article-title":"PROTEOFORMER: deep proteome coverage through ribosome profiling and MS integration","volume":"43","author":"Crappe","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2024051404095927800_b0195","doi-asserted-by":"crossref","first-page":"1382","DOI":"10.1093\/nar\/gkh305","article-title":"5\u2019-Untranslated regions with multiple upstream AUG codons can support low-level translation via leaky scanning and reinitiation","volume":"32","author":"Wang","year":"2004","journal-title":"Nucleic Acids Res"},{"key":"2024051404095927800_b0200","doi-asserted-by":"crossref","first-page":"5880","DOI":"10.1093\/nar\/gku204","article-title":"Fail-safe mechanism of GCN4 
translational control\u2013uORF2 promotes reinitiation by analogous mechanism to uORF1 and thus secures its key role in GCN4 expression","volume":"42","author":"Guni\u0161ov\u00e1","year":"2014","journal-title":"Nucleic Acids Res"},{"key":"2024051404095927800_b0205","doi-asserted-by":"crossref","first-page":"455","DOI":"10.1126\/science.1249749","article-title":"Ribosome stalling induced by mutation of a CNS-specific tRNA causes neurodegeneration","volume":"345","author":"Ishimura","year":"2014","journal-title":"Science"},{"key":"2024051404095927800_b0210","doi-asserted-by":"crossref","first-page":"2523","DOI":"10.1038\/s41467-019-10717-9","article-title":"Characterising the loss-of-function impact of 5\u2019 untranslated region variants in 15,708 individuals","volume":"11","author":"Whiffin","year":"2020","journal-title":"Nat Commun"},{"key":"2024051404095927800_b0215","first-page":"636","article-title":"SmProt: a database of small proteins encoded by annotated coding and non-coding RNA loci","volume":"19","author":"Hao","year":"2018","journal-title":"Brief Bioinform"},{"key":"2024051404095927800_b0220","doi-asserted-by":"crossref","first-page":"D991","DOI":"10.1093\/nar\/gks1193","article-title":"NCBI GEO: archive for functional genomics data sets\u2013update","volume":"41","author":"Barrett","year":"2013","journal-title":"Nucleic Acids Res"},{"key":"2024051404095927800_b0225","doi-asserted-by":"crossref","first-page":"D36","DOI":"10.1093\/nar\/gkx1125","article-title":"The European Nucleotide Archive in 2017","volume":"46","author":"Silvester","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"2024051404095927800_b0230","doi-asserted-by":"crossref","first-page":"D754","DOI":"10.1093\/nar\/gkx1098","article-title":"Ensembl 2018","volume":"46","author":"Zerbino","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"2024051404095927800_b0235","doi-asserted-by":"crossref","first-page":"D766","DOI":"10.1093\/nar\/gky955","article-title":"GENCODE 
reference annotation for the human and mouse genomes","volume":"47","author":"Frankish","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2024051404095927800_b0240","doi-asserted-by":"crossref","first-page":"10","DOI":"10.14806\/ej.17.1.200","article-title":"Cutadapt removes adapter sequences from high-throughput sequencing reads","volume":"17","author":"Martin","year":"2011","journal-title":"EMBnet J"},{"key":"2024051404095927800_b0245","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1093\/bioinformatics\/bts635","article-title":"STAR: ultrafast universal RNA-seq aligner","volume":"29","author":"Dobin","year":"2013","journal-title":"Bioinformatics"},{"key":"2024051404095927800_b0250","doi-asserted-by":"crossref","first-page":"D175","DOI":"10.1093\/nar\/gky1043","article-title":"piRBase: a comprehensive database of piRNA sequences","volume":"47","author":"Wang","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2024051404095927800_b0255","first-page":"201178","author":"Poplin","year":"2018","journal-title":"Scaling accurate genetic variant discovery to tens of thousands of samples"},{"key":"2024051404095927800_b0260","doi-asserted-by":"crossref","first-page":"11.10.1\u221233","DOI":"10.1002\/0471250953.bi1110s43","article-title":"From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline","volume":"43","author":"Van der Auwera","year":"2013","journal-title":"Curr Protoc Bioinformatics"},{"key":"2024051404095927800_b0265","doi-asserted-by":"crossref","first-page":"491","DOI":"10.1038\/ng.806","article-title":"A framework for variation discovery and genotyping using next-generation DNA sequencing data","volume":"43","author":"DePristo","year":"2011","journal-title":"Nat Genet"},{"key":"2024051404095927800_b0270","doi-asserted-by":"crossref","first-page":"1297","DOI":"10.1101\/gr.107524.110","article-title":"The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA 
sequencing data","volume":"20","author":"McKenna","year":"2010","journal-title":"Genome Res"},{"key":"2024051404095927800_b0275","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1038\/nature15394","article-title":"An integrated map of structural variation in 2,504 human genomes","volume":"526","author":"Sudmant","year":"2015","journal-title":"Nature"},{"key":"2024051404095927800_b0280","doi-asserted-by":"crossref","first-page":"106","DOI":"10.1038\/s41586-019-1793-z","article-title":"The GenomeAsia 100K Project enables genetic discoveries across Asia","volume":"576","author":"GenomeAsia100K Consortium","year":"2019","journal-title":"Nature"},{"key":"2024051404095927800_b0285","doi-asserted-by":"crossref","first-page":"290","DOI":"10.1038\/s41586-021-03205-y","article-title":"Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program","volume":"590","author":"Taliun","year":"2021","journal-title":"Nature"},{"key":"2024051404095927800_b0290","doi-asserted-by":"crossref","first-page":"434","DOI":"10.1038\/s41586-020-2308-7","article-title":"The mutational constraint spectrum quantified from variation in 141,456 humans","volume":"581","author":"Karczewski","year":"2020","journal-title":"Nature"},{"key":"2024051404095927800_b0295","doi-asserted-by":"crossref","DOI":"10.1016\/j.celrep.2021.110017","article-title":"NyuWa Genome resource: a deep whole-genome sequencing-based variation profile and reference panel for the Chinese population","volume":"37","author":"Zhang","year":"2021","journal-title":"Cell Rep"},{"key":"2024051404095927800_b0300","doi-asserted-by":"crossref","first-page":"122","DOI":"10.1186\/s13059-016-0974-4","article-title":"The Ensembl Variant Effect Predictor","volume":"17","author":"McLaren","year":"2016","journal-title":"Genome Biol"},{"key":"2024051404095927800_b0305","doi-asserted-by":"crossref","first-page":"1171","DOI":"10.1093\/bioinformatics\/btaa783","article-title":"Annotating high-impact 5\u2019untranslated region variants 
with the UTRannotator","volume":"37","author":"Zhang","year":"2021","journal-title":"Bioinformatics"},{"key":"2024051404095927800_b0310","doi-asserted-by":"crossref","first-page":"1236","DOI":"10.1093\/bioinformatics\/btu031","article-title":"InterProScan 5: genome-scale protein function classification","volume":"30","author":"Jones","year":"2014","journal-title":"Bioinformatics"},{"key":"2024051404095927800_b0315","doi-asserted-by":"crossref","first-page":"D204","DOI":"10.1093\/nar\/gku989","article-title":"UniProt: a hub for protein information","volume":"43","author":"UniProt Consortium","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2024051404095927800_b0320","doi-asserted-by":"crossref","first-page":"i275","DOI":"10.1093\/bioinformatics\/btr209","article-title":"PhyloCSF: a comparative genomics method to distinguish protein coding and non-coding regions","volume":"27","author":"Lin","year":"2011","journal-title":"Bioinformatics"},{"key":"2024051404095927800_b0325","doi-asserted-by":"crossref","first-page":"D170","DOI":"10.1093\/nar\/gkm1011","article-title":"NONCODE v2.0: decoding the non-coding","volume":"36","author":"He","year":"2008","journal-title":"Nucleic Acids Res"},{"key":"2024051404095927800_b0330","doi-asserted-by":"crossref","first-page":"D221","DOI":"10.1093\/nar\/gkx1031","article-title":"Consensus coding sequence (CCDS) database: a standardized set of human and mouse protein-coding regions supported by expert curation","volume":"46","author":"Pujar","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"2024051404095927800_b0335","doi-asserted-by":"crossref","first-page":"D506","DOI":"10.1093\/nar\/gky1049","article-title":"UniProt: a worldwide hub of protein knowledge","volume":"47","author":"UniProt Consortium","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2024051404095927800_b0340","doi-asserted-by":"crossref","first-page":"D853","DOI":"10.1093\/nar\/gky1095","article-title":"The UCSC Genome Browser database: 2019 
update","volume":"47","author":"Haeussler","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2024051404095927800_b0345","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1186\/s12859-016-1458-y","article-title":"ARA-PEPs: a repository of putative sORF-encoded peptides in Arabidopsis thaliana","volume":"18","author":"Hazarika","year":"2017","journal-title":"BMC Bioinformatics"},{"key":"2024051404095927800_b0350","doi-asserted-by":"crossref","first-page":"2158","DOI":"10.1111\/pbi.13389","article-title":"PsORF: a database of small ORFs in plants","volume":"18","author":"Chen","year":"2020","journal-title":"Plant Biotechnol J"},{"key":"2024051404095927800_b0355","doi-asserted-by":"crossref","first-page":"D497","DOI":"10.1093\/nar\/gkx1130","article-title":"An update on sORFs.org: a repository of small ORFs identified by ribosome profiling","volume":"46","author":"Olexiouk","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"2024051404095927800_b0360","doi-asserted-by":"crossref","first-page":"D177","DOI":"10.1093\/nar\/gkw1062","article-title":"The neXtProt knowledgebase on human proteins: 2017 update","volume":"45","author":"Gaudet","year":"2017","journal-title":"Nucleic Acids Res"},{"key":"2024051404095927800_b0365","first-page":"D403","article-title":"OpenProt: a more comprehensive guide to explore eukaryotic coding potential and proteomes","volume":"47","author":"Brunet","year":"2019","journal-title":"Nucleic Acids Res"}],"container-title":["Genomics, Proteomics &amp; 
Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/api.elsevier.com\/content\/article\/PII:S1672022921001807?httpAccept=text\/xml","content-type":"text\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/api.elsevier.com\/content\/article\/PII:S1672022921001807?httpAccept=text\/plain","content-type":"text\/plain","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/academic.oup.com\/gpb\/article-pdf\/19\/4\/602\/57581945\/gpb_19_4_602.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/gpb\/article-pdf\/19\/4\/602\/57581945\/gpb_19_4_602.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,5,14]],"date-time":"2024-05-14T00:46:16Z","timestamp":1715647576000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/gpb\/article\/19\/4\/602\/7230393"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,8,1]]},"references-count":73,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2021,8,1]]}},"URL":"https:\/\/doi.org\/10.1016\/j.gpb.2021.09.002","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2021.04.29.441405","asserted-by":"object"}]},"ISSN":["1672-0229","2210-3244"],"issn-type":[{"value":"1672-0229","type":"print"},{"value":"2210-3244","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,8]]},"published":{"date-parts":[[2021,8,1]]},"assertion":[{"value":"Elsevier","name":"publisher","label":"This article is maintained by"},{"value":"SmProt: A Reliable Repository with Comprehensive Annotation of Small Proteins Identified from Ribosome Profiling","name":"articletitle","label":"Article Title"},{"value":"Genomics, Proteomics & Bioinformatics","name":"journaltitle","label":"Journal 
Title"},{"value":"https:\/\/doi.org\/10.1016\/j.gpb.2021.09.002","name":"articlelink","label":"CrossRef DOI link to publisher maintained version"},{"value":"article","name":"content_type","label":"Content Type"},{"value":"\u00a9 2021 The Authors. Published by Elsevier B.V. and Science Press on behalf of Beijing Institute of Genomics, Chinese Academy of Sciences \/ China National Center for Bioinformation and Genetics Society of China.","name":"copyright","label":"Copyright"}]}}