{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,16]],"date-time":"2026-01-16T17:41:52Z","timestamp":1768585312807,"version":"3.49.0"},"reference-count":42,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2024,3,15]],"date-time":"2024-03-15T00:00:00Z","timestamp":1710460800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Bioinform."],"abstract":"<jats:p>Recent advancements in contact map-based protein three-dimensional (3D) structure prediction have been driven by the evolution of deep learning algorithms. However, the gap in accessible software tools for novices in this domain remains a significant challenge. This study introduces GoFold, a novel, standalone graphical user interface (GUI) designed for beginners to perform contact map overlap (CMO) problems for better template selection. Unlike existing tools that cater more to research needs or assume foundational knowledge, GoFold offers an intuitive, user-friendly platform with comprehensive tutorials. It stands out in its ability to visually represent the CMO problem, allowing users to input proteins in various formats and explore the CMO problem. The educational value of GoFold is demonstrated through benchmarking against the state-of-the-art contact map overlap method, map_align, using two datasets: PSICOV and CAMEO. GoFold exhibits superior performance in terms of TM-score and Z-score metrics across diverse qualities of contact maps and target difficulties. Notably, GoFold runs efficiently on personal computers without any third-party dependencies, thereby making it accessible to the general public for promoting citizen science. The tool is freely available for download for macOS, Linux, and Windows.<jats:xref><jats:sup>1<\/jats:sup><\/jats:xref><\/jats:p>","DOI":"10.3389\/fbinf.2024.1358550","type":"journal-article","created":{"date-parts":[[2024,3,15]],"date-time":"2024-03-15T04:56:25Z","timestamp":1710478585000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":4,"title":["An interactive visualization tool for educational outreach in protein contact map overlap analysis"],"prefix":"10.3389","volume":"4","author":[{"given":"Kevan","family":"Baker","sequence":"first","affiliation":[]},{"given":"Nathaniel","family":"Hughes","sequence":"additional","affiliation":[]},{"given":"Sutanu","family":"Bhattacharya","sequence":"additional","affiliation":[]}],"member":"1965","published-online":{"date-parts":[[2024,3,15]]},"reference":[{"key":"B1","doi-asserted-by":"publisher","first-page":"1100","DOI":"10.1002\/prot.25787","article-title":"A further leap of improvement in tertiary structure prediction in CASP13 prompts new routes for future assessments","volume":"87","author":"Abriata","year":"2019","journal-title":"Proteins Struct. Funct. Bioinforma."},{"key":"B2","doi-asserted-by":"publisher","first-page":"e66856","DOI":"10.1371\/journal.pone.0066856","article-title":"The PyRosetta toolkit: a graphical user interface for the Rosetta software suite","volume":"8","author":"Adolf-Bryfogle","year":"2013","journal-title":"PLOS ONE"},{"key":"B3","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.cbpa.2021.04.005","article-title":"Machine learning in protein structure prediction","volume":"65","author":"AlQuraishi","year":"2021","journal-title":"Curr. Opin. Chem. Biol."},{"key":"B4","doi-asserted-by":"publisher","first-page":"235","DOI":"10.1093\/nar\/28.1.235","article-title":"The protein data bank","volume":"28","author":"Berman","year":"2000","journal-title":"Nucleic Acids Res."},{"key":"B5","article-title":"New computational and data-driven methods for protein homology modeling","author":"Bhattacharya","year":"2021","journal-title":"Doctoral dissertation, Auburn University"},{"key":"B6","doi-asserted-by":"crossref","DOI":"10.1145\/3307339.3343243","article-title":"Contact-assisted protein threading: an evolving new direction","author":"Bhattacharya","year":""},{"key":"B7","doi-asserted-by":"publisher","first-page":"596","DOI":"10.1002\/prot.25684","article-title":"Does inclusion of residue-residue contact information boost protein threading?","volume":"87","author":"Bhattacharya","year":"","journal-title":"Proteins Struct. Funct. Bioinforma."},{"key":"B8","doi-asserted-by":"publisher","first-page":"2908","DOI":"10.1038\/s41598-020-59834-2","article-title":"Evaluating the significance of contact maps in low-homology protein modeling using contact-assisted threading","volume":"10","author":"Bhattacharya","year":"2020","journal-title":"Sci. Rep."},{"key":"B9","doi-asserted-by":"publisher","first-page":"579","DOI":"10.1002\/prot.26254","article-title":"DisCovER: distance- and orientation-based covariational threading for weakly homologous proteins","volume":"90","author":"Bhattacharya","year":"2022","journal-title":"Proteins Struct. Funct. Bioinforma."},{"key":"B10","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-0716-2974-1_3","article-title":"Contact-Assisted threading in low-homology protein modeling","volume-title":"Homology modeling: methods and protocols","author":"Bhattacharya","year":"2023"},{"key":"B11","doi-asserted-by":"publisher","first-page":"2684","DOI":"10.1093\/bioinformatics\/btx217","article-title":"EigenTHREADER: analogous protein fold recognition by efficient contact map threading","volume":"33","author":"Buchan","year":"2017","journal-title":"Bioinformatics"},{"key":"B12","doi-asserted-by":"publisher","first-page":"506","DOI":"10.1093\/bioinformatics\/btaa833","article-title":"Fold recognition by scoring protein maps using the congruence coefficient","volume":"37","author":"Di Lena","year":"2021","journal-title":"Bioinformatics"},{"key":"B13","doi-asserted-by":"publisher","first-page":"2250","DOI":"10.1093\/bioinformatics\/btq402","article-title":"Fast overlapping of protein contact maps by alignment of eigenvectors","volume":"26","author":"Di Lena","year":"2010","journal-title":"Bioinformatics"},{"key":"B14","doi-asserted-by":"publisher","first-page":"1378","DOI":"10.1002\/prot.25815","article-title":"Introducing \u201cbest single template\u201d models as reference baseline for the Continuous Automated Model Evaluation (CAMEO)","volume":"87","author":"Haas","year":"2019","journal-title":"Proteins Struct. Funct. Bioinforma."},{"key":"B15","doi-asserted-by":"publisher","first-page":"1165","DOI":"10.1002\/prot.25697","article-title":"Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13","volume":"87","author":"Hou","year":"2019","journal-title":"Proteins Struct. Funct. Bioinforma."},{"key":"B16","doi-asserted-by":"publisher","first-page":"184","DOI":"10.1093\/bioinformatics\/btr638","article-title":"PSICOV: precise structural contact prediction using sparse inverse covariance estimation on large multiple sequence alignments","volume":"28","author":"Jones","year":"2012","journal-title":"Bioinformatics"},{"key":"B17","doi-asserted-by":"publisher","first-page":"999","DOI":"10.1093\/bioinformatics\/btu791","article-title":"MetaPSICOV: combining coevolution methods for accurate prediction of contacts and long range hydrogen bonding in proteins","volume":"31","author":"Jones","year":"2015","journal-title":"Bioinformatics"},{"key":"B18","doi-asserted-by":"publisher","first-page":"583","DOI":"10.1038\/s41586-021-03819-2","article-title":"Highly accurate protein structure prediction with AlphaFold","volume":"596","author":"Jumper","year":"2021","journal-title":"Nature"},{"key":"B19","doi-asserted-by":"publisher","first-page":"2765","DOI":"10.1093\/bioinformatics\/btx283","article-title":"Foldit Standalone: a video game-derived protein structure manipulation interface using Rosetta","volume":"33","author":"Kleffner","year":"2017","journal-title":"Bioinformatics"},{"key":"B20","doi-asserted-by":"publisher","first-page":"1987","DOI":"10.1002\/prot.26231","article-title":"Modeling SARS-CoV-2 proteins in the CASP-commons experiment","volume":"89","author":"Kryshtafovych","year":"2021","journal-title":"Proteins Struct. Funct. Bioinforma."},{"key":"B21","first-page":"545","article-title":"Rosetta3","volume-title":"Methods in enzymology","author":"Leaver-Fay","year":"2011"},{"key":"B22","doi-asserted-by":"publisher","first-page":"i59","DOI":"10.1093\/bioinformatics\/bts213","article-title":"A conditional neural fields model for protein threading","volume":"28","author":"Ma","year":"2012","journal-title":"Bioinformatics"},{"key":"B23","doi-asserted-by":"publisher","first-page":"1259","DOI":"10.1093\/bioinformatics\/btu020","article-title":"GR-Align: fast and flexible alignment of protein 3D structures using graphlet degree similarity","volume":"30","author":"Malod-Dognin","year":"2014","journal-title":"Bioinformatics"},{"key":"B24","doi-asserted-by":"publisher","first-page":"e0243331","DOI":"10.1371\/journal.pone.0243331","article-title":"PolyFold: an interactive visual simulator for distance-based protein folding","volume":"15","author":"McGehee","year":"2020","journal-title":"PLOS ONE"},{"key":"B25","doi-asserted-by":"publisher","first-page":"679","DOI":"10.1038\/s41592-022-01488-1","article-title":"ColabFold: making protein folding accessible to all","volume":"19","author":"Mirdita","year":"2022","journal-title":"Nat. Methods"},{"key":"B26","doi-asserted-by":"publisher","first-page":"294","DOI":"10.1126\/science.aah4043","article-title":"Protein structure determination using metagenome sequence data","volume":"355","author":"Ovchinnikov","year":"2017","journal-title":"Science"},{"key":"B27","doi-asserted-by":"publisher","first-page":"194","DOI":"10.1016\/j.sbi.2021.01.007","article-title":"Deep learning techniques have significantly impacted protein structure prediction and protein design","volume":"68","author":"Pearce","year":"2021","journal-title":"Curr. Opin. Struct. Biol."},{"key":"B28","doi-asserted-by":"publisher","first-page":"1605","DOI":"10.1002\/jcc.20084","article-title":"UCSF Chimera\u2014a visualization system for exploratory research and analysis","volume":"25","author":"Pettersen","year":"2004","journal-title":"J. Comput. Chem."},{"key":"B29","doi-asserted-by":"publisher","first-page":"725","DOI":"10.1038\/nprot.2010.5","article-title":"I-TASSER: a unified platform for automated protein structure and function prediction","volume":"5","author":"Roy","year":"2010","journal-title":"Nat. Protoc."},{"key":"B30","doi-asserted-by":"publisher","first-page":"4023","DOI":"10.1093\/bioinformatics\/btv492","article-title":"InteractiveROSETTA: a graphical user interface for the PyRosetta protein modeling suite","volume":"31","author":"Schenkelberg","year":"2015","journal-title":"Bioinformatics"},{"key":"B31","doi-asserted-by":"publisher","first-page":"1141","DOI":"10.1002\/prot.25834","article-title":"Protein structure prediction using multiple deep neural networks in the 13th Critical Assessment of Protein Structure Prediction (CASP13)","volume":"87","author":"Senior","year":"2019","journal-title":"Proteins Struct. Funct. Bioinforma."},{"key":"B32","doi-asserted-by":"publisher","first-page":"195","DOI":"10.1016\/0022-2836(81)90087-5","article-title":"Identification of common molecular subsequences","volume":"147","author":"Smith","year":"1981","journal-title":"J. Mol. Biol."},{"key":"B33","doi-asserted-by":"publisher","first-page":"654","DOI":"10.1110\/ps.8.3.654","article-title":"Protein structure comparison using iterated double dynamic programming","volume":"8","author":"Taylor","year":"1999","journal-title":"Protein Sci."},{"key":"B34","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4939-0366-5_1","article-title":"Protein structure modeling with MODELLER","volume-title":"Protein structure prediction","author":"Webb","year":"2014"},{"key":"B35","doi-asserted-by":"publisher","first-page":"1069","DOI":"10.1002\/prot.25810","article-title":"Analysis of distance-based protein structure prediction by deep learning in CASP13","volume":"87","author":"Xu","year":"2019","journal-title":"Proteins Struct. Funct. Bioinforma."},{"key":"B36","doi-asserted-by":"publisher","first-page":"889","DOI":"10.1093\/bioinformatics\/btq066","article-title":"How significant is a protein structure similarity with TM-score = 0.5?","volume":"26","author":"Xu","year":"2010","journal-title":"Bioinformatics"},{"key":"B37","doi-asserted-by":"publisher","first-page":"1496","DOI":"10.1073\/pnas.1914677117","article-title":"Improved protein structure prediction using predicted interresidue orientations","volume":"117","author":"Yang","year":"2020","journal-title":"Proc. Natl. Acad. Sci."},{"key":"B38","doi-asserted-by":"publisher","first-page":"7","DOI":"10.1038\/nmeth.3213","article-title":"The I-TASSER Suite: protein structure and function prediction","volume":"12","author":"Yang","year":"2015","journal-title":"Nat. Methods"},{"key":"B39","doi-asserted-by":"publisher","first-page":"2076","DOI":"10.1093\/bioinformatics\/btr350","article-title":"Improving protein fold recognition and template-based modeling by employing probabilistic-based matching between predicted one-dimensional structural properties of query and corresponding native properties of templates","volume":"27","author":"Yang","year":"2011","journal-title":"Bioinformatics"},{"key":"B40","doi-asserted-by":"publisher","first-page":"878","DOI":"10.1186\/s12864-020-07249-8","article-title":"Template-based prediction of protein structure with deep learning","volume":"21","author":"Zhang","year":"2020","journal-title":"BMC Genomics"},{"key":"B41","doi-asserted-by":"publisher","first-page":"2302","DOI":"10.1093\/nar\/gki524","article-title":"TM-align: a protein structure alignment algorithm based on the TM-score","volume":"33","author":"Zhang","year":"2005","journal-title":"Nucleic Acids Res."},{"key":"B42","doi-asserted-by":"publisher","first-page":"e1007411","DOI":"10.1371\/journal.pcbi.1007411","article-title":"Detecting distant-homology protein structures by aligning deep neural-network based contact maps","volume":"15","author":"Zheng","year":"2019","journal-title":"PLOS Comput. Biol."}],"container-title":["Frontiers in Bioinformatics"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fbinf.2024.1358550\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,3,18]],"date-time":"2024-03-18T08:17:06Z","timestamp":1710749826000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fbinf.2024.1358550\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,3,15]]},"references-count":42,"alternative-id":["10.3389\/fbinf.2024.1358550"],"URL":"https:\/\/doi.org\/10.3389\/fbinf.2024.1358550","relation":{},"ISSN":["2673-7647"],"issn-type":[{"value":"2673-7647","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,3,15]]},"article-number":"1358550"}}