{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,6,28]],"date-time":"2023-06-28T17:33:16Z","timestamp":1687973596983},"reference-count":12,"publisher":"Oxford University Press (OUP)","issue":"13","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2016,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Speed, accuracy and robustness of building protein fragment library have important implications in de novo protein structure prediction since fragment-based methods are one of the most successful approaches in template-free modeling (FM). Majority of the existing fragment detection methods rely on database-driven search strategies to identify candidate fragments, which are inherently time-consuming and often hinder the possibility to locate longer fragments due to the limited sizes of databases. Also, it is difficult to alleviate the effect of noisy sequence-based predicted features such as secondary structures on the quality of fragment.<\/jats:p>\n               <jats:p>Results: Here, we present FRAGSION, a database-free method to efficiently generate protein fragment library by sampling from an Input\u2013Output Hidden Markov Model. FRAGSION offers some unique features compared to existing approaches in that it (i) is lightning-fast, consuming only few seconds of CPU time to generate fragment library for a protein of typical length (300 residues); (ii) can generate dynamic-size fragments of any length (even for the whole protein sequence) and (iii) offers ways to handle noise in predicted secondary structure during fragment sampling. On a FM dataset from the most recent Critical Assessment of Structure Prediction, we demonstrate that FGRAGSION provides advantages over the state-of-the-art fragment picking protocol of ROSETTA suite by speeding up computation by several orders of magnitude while achieving comparable performance in fragment quality.<\/jats:p>\n               <jats:p>Availability and implementation: Source code and executable versions of FRAGSION for Linux and MacOS is freely available to non-commercial users at http:\/\/sysbio.rnet.missouri.edu\/FRAGSION\/. It is bundled with a manual and example data.<\/jats:p>\n               <jats:p>Contact: \u00a0chengji@missouri.edu<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btw067","type":"journal-article","created":{"date-parts":[[2016,2,20]],"date-time":"2016-02-20T01:33:13Z","timestamp":1455931993000},"page":"2059-2061","source":"Crossref","is-referenced-by-count":7,"title":["FRAGSION: ultra-fast protein fragment library generation by IOHMM sampling"],"prefix":"10.1093","volume":"32","author":[{"given":"Debswapna","family":"Bhattacharya","sequence":"first","affiliation":[{"name":"1 Department of Computer Science,"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Badri","family":"Adhikari","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science,"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jilong","family":"Li","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science,"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jianlin","family":"Cheng","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science,"},{"name":"2 Informatics Institute and"},{"name":"3 C. Bond Life Science Center, University of Missouri, Columbia, MO 65211, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2016,2,18]]},"reference":[{"key":"2023020112334590100_btw067-B1","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1093\/nar\/28.1.235","article-title":"The protein data bank","volume":"28","author":"Berman","year":"2000","journal-title":"Nucleic Acids Res"},{"key":"2023020112334590100_btw067-B2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/srep16332","article-title":"De novo protein conformational sampling using a probabilistic graphical model","volume":"5","author":"Bhattacharya","year":"2015","journal-title":"Sci. Rep"},{"key":"2023020112334590100_btw067-B3","doi-asserted-by":"crossref","first-page":"8932","DOI":"10.1073\/pnas.0801715105","article-title":"A generative, probabilistic model of local protein structure","volume":"105","author":"Boomsma","year":"2008","journal-title":"Proc. Natl. Acad. Sci. U. S. A"},{"key":"2023020112334590100_btw067-B4","doi-asserted-by":"crossref","first-page":"e23294","DOI":"10.1371\/journal.pone.0023294","article-title":"Generalized fragment picking in Rosetta: design, protocols and applications","volume":"6","author":"Gront","year":"2011","journal-title":"PloS One"},{"key":"2023020112334590100_btw067-B5","doi-asserted-by":"crossref","first-page":"1121","DOI":"10.1371\/journal.pcbi.0020131","article-title":"Sampling realistic protein conformations using local structural bias","volume":"2","author":"Hamelryck","year":"2006","journal-title":"PLoS Comput. Biol"},{"key":"2023020112334590100_btw067-B6","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1006\/jmbi.1999.3091","article-title":"Protein secondary structure prediction based on position-specific scoring matrices","volume":"292","author":"Jones","year":"1999","journal-title":"J. Mol. Biol"},{"key":"2023020112334590100_btw067-B7","doi-asserted-by":"crossref","first-page":"3110","DOI":"10.1093\/bioinformatics\/btr541","article-title":"HHfrag: HMM-based fragment detection using HHpred","volume":"27","author":"Kalev","year":"2011","journal-title":"Bioinformatics"},{"key":"2023020112334590100_btw067-B8","doi-asserted-by":"crossref","first-page":"278","DOI":"10.1002\/bip.10262","article-title":"Protein decoy assembly using short fragments under geometric constraints","volume":"68","author":"Kolodny","year":"2003","journal-title":"Biopolymers"},{"key":"2023020112334590100_btw067-B9","volume-title":"Directional Statistics","author":"Mardia","year":"2009"},{"key":"2023020112334590100_btw067-B10","doi-asserted-by":"crossref","first-page":"505","DOI":"10.1111\/j.1541-0420.2006.00682.x","article-title":"Protein bioinformatics and mixtures of bivariate von Mises distributions for angular data","volume":"63","author":"Mardia","year":"2007","journal-title":"Biometrics"},{"key":"2023020112334590100_btw067-B11","doi-asserted-by":"crossref","first-page":"209","DOI":"10.1006\/jmbi.1997.0959","article-title":"Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions","volume":"268","author":"Simons","year":"1997","journal-title":"J. Mol. Biol"},{"key":"2023020112334590100_btw067-B12","doi-asserted-by":"crossref","first-page":"702","DOI":"10.1002\/prot.20264","article-title":"Scoring function for automated assessment of protein structure template quality","volume":"57","author":"Zhang","year":"2004","journal-title":"Proteins: Struct. Funct. Bioinf"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/32\/13\/2059\/49019952\/bioinformatics_32_13_2059.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/32\/13\/2059\/49019952\/bioinformatics_32_13_2059.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,1]],"date-time":"2023-02-01T22:45:10Z","timestamp":1675291510000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/32\/13\/2059\/1742802"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,2,18]]},"references-count":12,"journal-issue":{"issue":"13","published-print":{"date-parts":[[2016,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btw067","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2016,7,1]]},"published":{"date-parts":[[2016,2,18]]}}}