{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2022,6,7]],"date-time":"2022-06-07T06:49:49Z","timestamp":1654584589032},"reference-count":27,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Source Code Biol Med"],"published-print":{"date-parts":[[2007,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Rapidly accumulating genome sequence data from multiple species offer powerful opportunities for the detection of DNA sequence evolution. Phylogenetic tree construction and codon-based tests for natural selection are the prevailing tools used to detect functionally important evolutionary change in protein coding sequences. These analyses often require multiple DNA sequence alignments that maintain the correct reading frame for each collection of putative orthologous sequences. Since this feature is not available in most alignment tools, codon reading frames often must be checked manually before evolutionary analyses can commence.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>Here we report an online codon-preserved alignment tool (OCPAT) that generates multiple sequence alignments automatically from the coding sequences of any list of human gene IDs and their putative orthologs from genomes of other vertebrate tetrapods. OCPAT is programmed to extract putative orthologous genes from genomes and to align the orthologs with the reading frame maintained in all species. OCPAT also optimizes the alignment by trimming the most variable alignment regions at the 5' and 3' ends of each gene. The resulting output of alignments is returned in several formats, which facilitates further molecular evolutionary analyses by appropriate available software. Alignments are generally robust and reliable, retaining the correct reading frame. The tool can serve as the first step for comparative genomic analyses of protein-coding gene sequences including phylogenetic tree reconstruction and detection of natural selection. We aligned 20,658 human RefSeq mRNAs using OCPAT. Most alignments are missing sequence(s) from at least one species; however, functional annotation clustering of the ~1700 transcripts that were alignable to all species shows that genes involved in multi-subunit protein complexes are highly conserved.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusion<\/jats:title>\n            <jats:p>The OCPAT program facilitates large-scale evolutionary and phylogenetic analyses of entire biological processes, pathways, and diseases.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1751-0473-2-5","type":"journal-article","created":{"date-parts":[[2007,9,18]],"date-time":"2007-09-18T18:13:58Z","timestamp":1190139238000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":10,"title":["OCPAT: an online codon-preserved alignment tool for evolutionary genomic analysis of protein coding sequences"],"prefix":"10.1186","volume":"2","author":[{"given":"Guozhen","family":"Liu","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Monica","family":"Uddin","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Munirul","family":"Islam","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Morris","family":"Goodman","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lawrence I","family":"Grossman","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Roberto","family":"Romero","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Derek E","family":"Wildman","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2007,9,18]]},"reference":[{"issue":"9","key":"13_CR1","doi-asserted-by":"publisher","first-page":"511","DOI":"10.1016\/j.tig.2005.06.012","volume":"21","author":"M Goodman","year":"2005","unstructured":"Goodman M, Grossman LI, Wildman DE: Moving primate genomics beyond the chimpanzee genome. Trends Genet. 2005, 21 (9): 511-517. 10.1016\/j.tig.2005.06.012.","journal-title":"Trends Genet"},{"issue":"1","key":"13_CR2","doi-asserted-by":"publisher","first-page":"205","DOI":"10.1006\/jmbi.2000.4042","volume":"302","author":"C Notredame","year":"2000","unstructured":"Notredame C, Higgins DG, Heringa J: T-Coffee: A novel method for fast and accurate multiple sequence alignment. Journal of molecular biology. 2000, 302 (1): 205-217. 10.1006\/jmbi.2000.4042.","journal-title":"Journal of molecular biology"},{"issue":"22","key":"13_CR3","doi-asserted-by":"publisher","first-page":"4673","DOI":"10.1093\/nar\/22.22.4673","volume":"22","author":"JD Thompson","year":"1994","unstructured":"Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic acids research. 1994, 22 (22): 4673-4680. 10.1093\/nar\/22.22.4673.","journal-title":"Nucleic acids research"},{"issue":"4","key":"13_CR4","doi-asserted-by":"publisher","first-page":"708","DOI":"10.1101\/gr.1933104","volume":"14","author":"M Blanchette","year":"2004","unstructured":"Blanchette M, Kent WJ, Riemer C, Elnitski L, Smit AF, Roskin KM, Baertsch R, Rosenbloom K, Clawson H, Green ED, et al.: Aligning multiple genomic sequences with the threaded blockset aligner. Genome research. 2004, 14 (4): 708-715. 10.1101\/gr.1933104.","journal-title":"Genome research"},{"issue":"5","key":"13_CR5","first-page":"555","volume":"13","author":"Z Yang","year":"1997","unstructured":"Yang Z: PAML: a program package for phylogenetic analysis by maximum likelihood. Comput Appl Biosci. 1997, 13 (5): 555-556.","journal-title":"Comput Appl Biosci"},{"issue":"1","key":"13_CR6","doi-asserted-by":"publisher","first-page":"73","DOI":"10.1093\/bioinformatics\/17.1.73","volume":"17","author":"C Letondal","year":"2001","unstructured":"Letondal C: A Web interface generator for molecular biology programs in Unix. Bioinformatics. 2001, 17 (1): 73-82. 10.1093\/bioinformatics\/17.1.73.","journal-title":"Bioinformatics"},{"issue":"6822","key":"13_CR7","doi-asserted-by":"publisher","first-page":"860","DOI":"10.1038\/35057062","volume":"409","author":"ES Lander","year":"2001","unstructured":"Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, et al.: Initial sequencing and analysis of the human genome. Nature. 2001, 409 (6822): 860-921. 10.1038\/35057062.","journal-title":"Nature"},{"issue":"7055","key":"13_CR8","doi-asserted-by":"publisher","first-page":"69","DOI":"10.1038\/nature04072","volume":"437","author":"The Chimpanzee Sequencing and Analysis Consortium","year":"2005","unstructured":"The Chimpanzee Sequencing and Analysis Consortium: Initial sequence of the chimpanzee genome and comparison with the human genome. Nature. 2005, 437 (7055): 69-87. 10.1038\/nature04072.","journal-title":"Nature"},{"issue":"5822","key":"13_CR9","doi-asserted-by":"publisher","first-page":"222","DOI":"10.1126\/science.1139247","volume":"316","author":"RA Gibbs","year":"2007","unstructured":"Gibbs RA, Rogers J, Katze MG, Bumgarner R, Weinstock GM, Mardis ER, Remington KA, Strausberg RL, Venter JC, Wilson RK, et al.: Evolutionary and biomedical insights from the rhesus macaque genome. Science. 2007, 316 (5822): 222-234. 10.1126\/science.1139247.","journal-title":"Science"},{"issue":"6915","key":"13_CR10","doi-asserted-by":"publisher","first-page":"520","DOI":"10.1038\/nature01262","volume":"420","author":"RH Waterston","year":"2002","unstructured":"Waterston RH, Lindblad-Toh K, Birney E, Rogers J, Abril JF, Agarwal P, Agarwala R, Ainscough R, Alexandersson M, An P, et al.: Initial sequencing and comparative analysis of the mouse genome. Nature. 2002, 420 (6915): 520-562. 10.1038\/nature01262.","journal-title":"Nature"},{"issue":"6982","key":"13_CR11","doi-asserted-by":"publisher","first-page":"493","DOI":"10.1038\/nature02426","volume":"428","author":"RA Gibbs","year":"2004","unstructured":"Gibbs RA, Weinstock GM, Metzker ML, Muzny DM, Sodergren EJ, Scherer S, Scott G, Steffen D, Worley KC, Burch PE, et al.: Genome sequence of the Brown Norway rat yields insights into mammalian evolution. Nature. 2004, 428 (6982): 493-521. 10.1038\/nature02426.","journal-title":"Nature"},{"issue":"7069","key":"13_CR12","doi-asserted-by":"publisher","first-page":"803","DOI":"10.1038\/nature04338","volume":"438","author":"K Lindblad-Toh","year":"2005","unstructured":"Lindblad-Toh K, Wade CM, Mikkelsen TS, Karlsson EK, Jaffe DB, Kamal M, Clamp M, Chang JL, Kulbokas EJ, Zody MC, et al.: Genome sequence, comparative analysis and haplotype structure of the domestic dog. Nature. 2005, 438 (7069): 803-819. 10.1038\/nature04338.","journal-title":"Nature"},{"issue":"7141","key":"13_CR13","doi-asserted-by":"publisher","first-page":"167","DOI":"10.1038\/nature05805","volume":"447","author":"TS Mikkelsen","year":"2007","unstructured":"Mikkelsen TS, Wakefield MJ, Aken B, Amemiya CT, Chang JL, Duke S, Garber M, Gentles AJ, Goodstadt L, Heger A, et al.: Genome of the marsupial Monodelphis domestica reveals innovation in non-coding sequences. Nature. 2007, 447 (7141): 167-177. 10.1038\/nature05805.","journal-title":"Nature"},{"issue":"7018","key":"13_CR14","doi-asserted-by":"publisher","first-page":"695","DOI":"10.1038\/nature03154","volume":"432","author":"International Chicken Genome Sequencing Consortium","year":"2004","unstructured":"International Chicken Genome Sequencing Consortium: Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution. Nature. 2004, 432 (7018): 695-716. 10.1038\/nature03154.","journal-title":"Nature"},{"key":"13_CR15","unstructured":"RefSeq mRNA databases. [ftp:\/\/ftp.ncbi.nih.gov\/refseq\/]"},{"key":"13_CR16","unstructured":"Ensembl. [ftp:\/\/ftp.ensembl.org\/pub\/release-41\/]"},{"key":"13_CR17","unstructured":"FASTA nr.gz. [ftp:\/\/ftp.ncbi.nih.gov\/blast\/db\/FASTA\/]"},{"key":"13_CR18","unstructured":"BLAST Executables. [ftp:\/\/ftp.ncbi.nih.gov\/blast\/executables\/release\/2.2.13\/]"},{"key":"13_CR19","unstructured":"Human RefSeq database. [ftp:\/\/ftp.ncbi.nih.gov\/refseq\/H_sapiens\/mRNA_Prot\/]"},{"key":"13_CR20","doi-asserted-by":"crossref","unstructured":"Ye J, McGinnis S, Madden TL: BLAST: improvements for better sequence analysis. Nucleic acids research. 2006, W6-9. 10.1093\/nar\/gkl164. 34 Web Server","DOI":"10.1093\/nar\/gkl164"},{"issue":"2","key":"13_CR21","doi-asserted-by":"publisher","first-page":"247","DOI":"10.1111\/j.1574-6968.1999.tb13575.x","volume":"174","author":"TA Tatusova","year":"1999","unstructured":"Tatusova TA, Madden TL: BLAST 2 Sequences, a new tool for comparing protein and nucleotide sequences. FEMS microbiology letters. 1999, 174 (2): 247-250. 10.1111\/j.1574-6968.1999.tb13575.x.","journal-title":"FEMS microbiology letters"},{"key":"13_CR22","volume-title":"MacClade 4: Analysis of Phylogeny and Character Evolution","author":"DR Maddison","year":"2000","unstructured":"Maddison DR, Maddison WP: MacClade 4: Analysis of Phylogeny and Character Evolution. 2000, Sunderland, MA: Sinauer"},{"issue":"12","key":"13_CR23","doi-asserted-by":"publisher","first-page":"1572","DOI":"10.1093\/bioinformatics\/btg180","volume":"19","author":"F Ronquist","year":"2003","unstructured":"Ronquist F, Huelsenbeck JP: MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003, 19 (12): 1572-1574. 10.1093\/bioinformatics\/btg180.","journal-title":"Bioinformatics"},{"key":"13_CR24","volume-title":"PAUP*. Phylogenetic analysis using parsimony (*and other methods","author":"DL Swofford","year":"2002","unstructured":"Swofford DL: PAUP*. Phylogenetic analysis using parsimony (*and other methods. 2002, Sunderland, MA: Sinauer"},{"key":"13_CR25","unstructured":"OCPAT All. [http:\/\/homopan.wayne.edu\/OCPAT_withPlatypus\/]"},{"issue":"36","key":"13_CR26","doi-asserted-by":"publisher","first-page":"14395","DOI":"10.1073\/pnas.0704342104","volume":"104","author":"DE Wildman","year":"2007","unstructured":"Wildman DE, Uddin M, Opazo JC, Liu G, Lefort V, Guindon S, Gascuel O, Grossman LI, Romero R, Goodman M: Genomics, biogeography, and the diversification of placental mammals. Proc Natl Acad Sci USA. 2007, 104 (36): 14395-14400. 10.1073\/pnas.0704342104.","journal-title":"Proc Natl Acad Sci USA"},{"issue":"5","key":"13_CR27","doi-asserted-by":"publisher","first-page":"P3","DOI":"10.1186\/gb-2003-4-5-p3","volume":"4","author":"G Dennis Jr","year":"2003","unstructured":"Dennis G, Sherman BT, Hosack DA, Yang J, Gao W, Lane HC, Lempicki RA: DAVID: Database for Annotation, Visualization, and Integrated Discovery. Genome biology. 2003, 4 (5): P3-10.1186\/gb-2003-4-5-p3.","journal-title":"Genome biology"}],"container-title":["Source Code for Biology and Medicine"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1751-0473-2-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T07:14:53Z","timestamp":1630480493000},"score":1,"resource":{"primary":{"URL":"https:\/\/scfbm.biomedcentral.com\/articles\/10.1186\/1751-0473-2-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,9,18]]},"references-count":27,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2007,12]]}},"alternative-id":["13"],"URL":"https:\/\/doi.org\/10.1186\/1751-0473-2-5","relation":{},"ISSN":["1751-0473"],"issn-type":[{"value":"1751-0473","type":"electronic"}],"subject":[],"published":{"date-parts":[[2007,9,18]]},"assertion":[{"value":"16 April 2007","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 September 2007","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 September 2007","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"5"}}