{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,13]],"date-time":"2025-10-13T19:42:26Z","timestamp":1760384546602},"reference-count":29,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2006,3,7]],"date-time":"2006-03-07T00:00:00Z","timestamp":1141689600000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/2.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2006,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Background<\/jats:title><jats:p>Processing raw DNA sequence data is an especially challenging task for relatively small laboratories and core facilities that produce as many as 5000 or more DNA sequences per week from multiple projects in widely differing species. To meet this challenge, we have developed the flexible, scalable, and automated sequence processing package described here.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>MAGIC-SPP is a DNA sequence processing package consisting of an Oracle 9i relational database, a Perl pipeline, and user interfaces implemented either as JavaServer Pages (JSP) or as a Java graphical user interface (GUI). The database not only serves as a data repository, but also controls processing of trace files. MAGIC-SPP includes an administrative interface, a laboratory information management system, and interfaces for exploring sequences, monitoring quality control, and troubleshooting problems related to sequencing activities. In the sequence trimming algorithm it employs new features designed to improve performance with respect to concerns such as concatenated linkers, identification of the expected start position of a vector insert, and extending the useful length of trimmed sequences by bridging short regions of low quality when the following high quality segment is sufficiently long to justify doing so.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusion<\/jats:title><jats:p>MAGIC-SPP has been designed to minimize human error, while simultaneously being robust, versatile, flexible and automated. It offers a unique combination of features that permit administration by a biologist with little or no informatics background. It is well suited to both individual research programs and core facilities.<\/jats:p><\/jats:sec>","DOI":"10.1186\/1471-2105-7-115","type":"journal-article","created":{"date-parts":[[2006,3,10]],"date-time":"2006-03-10T19:19:09Z","timestamp":1142018349000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":11,"title":["MAGIC-SPP: a database-driven DNA sequence processing package with associated management tools"],"prefix":"10.1186","volume":"7","author":[{"given":"Chun","family":"Liang","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Feng","family":"Sun","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Haiming","family":"Wang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Junfeng","family":"Qu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"suffix":"Jr","given":"Robert M","family":"Freeman","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lee H","family":"Pratt","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Marie-Mich\u00e8le","family":"Cordonnier-Pratt","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2006,3,7]]},"reference":[{"key":"854_CR1","doi-asserted-by":"publisher","first-page":"268","DOI":"10.1002\/cfg.399","volume":"5","author":"M-M Cordonnier-Pratt","year":"2004","unstructured":"Cordonnier-Pratt M-M, Liang C, Wang H, Kolychev D, Sun F, Freeman R, Sullivan R, Pratt LH: MAGIC Database and interfaces: an integrated package for gene discovery and expression. Comp Funct Genom 2004, 5: 268\u2013275. 10.1002\/cfg.399","journal-title":"Comp Funct Genom"},{"key":"854_CR2","doi-asserted-by":"publisher","first-page":"1093","DOI":"10.1093\/bioinformatics\/17.12.1093","volume":"17","author":"H-H Chou","year":"2001","unstructured":"Chou H-H, Holmes MH: DNA sequence quality trimming and vector removal. Bioinformatics 2001, 17: 1093\u20131104. 10.1093\/bioinformatics\/17.12.1093","journal-title":"Bioinformatics"},{"key":"854_CR3","doi-asserted-by":"publisher","first-page":"475","DOI":"10.1104\/pp.103.025015","volume":"133","author":"F Qiu","year":"2003","unstructured":"Qiu F, Guo L, Wen T-J, Liu F, Ashlock DA, Schnable PS: DNA sequence-based \"Bar codes\" for tracking the origins of expressed sequence tags from a maize cDNA library constructed using multiple mRNA sources. Plant Physiol 2003, 133: 475\u2013481. 10.1104\/pp.103.025015","journal-title":"Plant Physiol"},{"key":"854_CR4","doi-asserted-by":"publisher","first-page":"1067","DOI":"10.1093\/nar\/gkg170","volume":"31","author":"R Sorek","year":"2003","unstructured":"Sorek R, Safer HM: A novel algorithm for computational identification of contaminated EST libraries. Nucleic Acids Res 2003, 31: 1067\u20131074. 10.1093\/nar\/gkg170","journal-title":"Nucleic Acids Res"},{"key":"854_CR5","doi-asserted-by":"publisher","first-page":"219","DOI":"10.1038\/ng1114","volume":"33","author":"JC Venter","year":"2003","unstructured":"Venter JC, Levy S, Stockwell T, Remington K, Halpern A: Massive parallelism, randomness and genomic advances. Nature Genetics 2003, 33: 219\u2013227. 10.1038\/ng1114","journal-title":"Nature Genetics"},{"key":"854_CR6","doi-asserted-by":"publisher","first-page":"1318","DOI":"10.1093\/bioinformatics\/btg159","volume":"19","author":"TE Scheetz","year":"2003","unstructured":"Scheetz TE, Trivedi N, Roberts CA, Kucaba T, Berger B, Robinson NL, Birkett CL, Gavin AJ, O'Leary B, Braun TA, Bonaldo MF, Robinson JP, Sheffield VC, Soares MB, Casavant TL: ESTprep: preprocessing cDNA sequence reads. Bioinformatics 2003, 19: 1318\u20131324. 10.1093\/bioinformatics\/btg159","journal-title":"Bioinformatics"},{"key":"854_CR7","doi-asserted-by":"publisher","first-page":"233","DOI":"10.1007\/BF02900361","volume":"5","author":"R Staden","year":"1996","unstructured":"Staden R: The Staden sequence analysis package. Mol Biotechnol 1996, 5: 233\u2013241.","journal-title":"Mol Biotechnol"},{"key":"854_CR8","first-page":"179","volume":"1","author":"E Veklerov","year":"1996","unstructured":"Veklerov E, Eeckman FH, Martin CH: MTT: a software tool for quality control in sequence assembly. Microb Comp Genomics 1996, 1: 179\u2013184.","journal-title":"Microb Comp Genomics"},{"key":"854_CR9","first-page":"175","volume":"13","author":"TM Smith","year":"1997","unstructured":"Smith TM, Abajian C, Hood L: Hopper: software for automating data tracking and flow in DNA sequencing. Comput Appl Biosci 1997, 13: 175\u201382.","journal-title":"Comput Appl Biosci"},{"key":"854_CR10","doi-asserted-by":"crossref","first-page":"975","DOI":"10.1101\/gr.8.9.975","volume":"8","author":"MC Wendl","year":"1998","unstructured":"Wendl MC, Dear S, Hodgson D, Hillier L: Automated sequence preprocessing in a large-scale sequencing environment. Genome Res 1998, 8: 975\u2013984.","journal-title":"Genome Res"},{"key":"854_CR11","doi-asserted-by":"publisher","first-page":"464","DOI":"10.1147\/sj.402.0464","volume":"40","author":"JT Inman","year":"2001","unstructured":"Inman JT, Flores HR, May GD, Weller JW, Bell CJ: A high-throughput distributed DNA sequence analysis and database system. IBM SYSTEMS JOURNAL 2001, 40: 464\u2013486.","journal-title":"IBM SYSTEMS JOURNAL"},{"key":"854_CR12","doi-asserted-by":"publisher","first-page":"4761","DOI":"10.1093\/nar\/gkf585","volume":"30","author":"P Ayoubi","year":"2002","unstructured":"Ayoubi P, Jin X, Leite S, Liu X, Martajaja J, Abduraham A, Wan Q, Yan W, Misawa E, Prade RA: PipeOnline 2.0: automated EST processing and functional data sorting. Nucleic Acids Res 2002, 30: 4761\u20134769. 10.1093\/nar\/gkf585","journal-title":"Nucleic Acids Res"},{"key":"854_CR13","doi-asserted-by":"publisher","first-page":"1398","DOI":"10.1093\/bioinformatics\/bth101","volume":"20","author":"J Parkinson","year":"2004","unstructured":"Parkinson J, Anthony A, Wasmuth J, Schmid R, Hedley A, Blaxter M: PartiGene \u2013 constructing partial genomes. Bioinformatics 2004, 20: 1398\u20131404. 10.1093\/bioinformatics\/bth101","journal-title":"Bioinformatics"},{"key":"854_CR14","doi-asserted-by":"publisher","first-page":"1720","DOI":"10.1093\/bioinformatics\/btg205","volume":"19","author":"C Mao","year":"2003","unstructured":"Mao C, Cushman JC, May GD, Weller JW: ESTAP \u2013 an automated system for the analysis of EST data. Bioinformatics 2003, 19: 1720\u20131722. 10.1093\/bioinformatics\/btg205","journal-title":"Bioinformatics"},{"key":"854_CR15","doi-asserted-by":"publisher","first-page":"1587","DOI":"10.1093\/bioinformatics\/btg196","volume":"19","author":"ACM Paquola","year":"2003","unstructured":"Paquola ACM, Nishyiama MY Jr, Reis EM, da Silva AM, Verjovski-Almeida S: ESTWeb: bioinformatics services for EST sequencing projects. Bioinformatics 2003, 19: 1587\u20131588. 10.1093\/bioinformatics\/btg196","journal-title":"Bioinformatics"},{"key":"854_CR16","unstructured":"MAGIC SeqView[http:\/\/fungen.org\/seqview]"},{"key":"854_CR17","doi-asserted-by":"publisher","first-page":"9","DOI":"10.1145\/320434.320440","volume":"1","author":"PP Chen","year":"1976","unstructured":"Chen PP: The entity-relationship model \u2013 toward a unified view of data. ACM Transactions on Database Systems 1976, 1: 9\u201336. 10.1145\/320434.320440","journal-title":"ACM Transactions on Database Systems"},{"key":"854_CR18","doi-asserted-by":"publisher","DOI":"10.4018\/978-1-931777-44-5","volume-title":"UML and the unified process","author":"L Favre","year":"2003","unstructured":"Favre L: UML and the unified process. Hershey, PA, IRM Press; 2003."},{"key":"854_CR19","doi-asserted-by":"publisher","first-page":"175","DOI":"10.1101\/gr.8.3.175","volume":"8","author":"B Ewing","year":"1998","unstructured":"Ewing B, Hillier L, Wendl MC, Green P: Base-calling of automated sequencer traces using Phred. I. Accuracy assessment. Genome Res 1998, 8: 175\u2013185.","journal-title":"Genome Res"},{"key":"854_CR20","unstructured":"CROSS_MATCH[http:\/\/Bozeman.mbt.washington.edu\/phrap.docs\/phrap.html]"},{"key":"854_CR21","doi-asserted-by":"publisher","first-page":"1725","DOI":"10.1101\/gr.194201","volume":"11","author":"Z Ning","year":"2001","unstructured":"Ning Z, Cox AJ, Mullikin JC: SSAHA: A Fast Search Method for Large DNA Databases. Genome Res 2001, 11: 1725\u20131729. 10.1101\/gr.194201","journal-title":"Genome Res"},{"key":"854_CR22","unstructured":"MAGIC DB[http:\/\/fungen.org]"},{"key":"854_CR23","doi-asserted-by":"publisher","first-page":"362","DOI":"10.1002\/cfg.408","volume":"5","author":"D Weems","year":"2004","unstructured":"Weems D, Miller N, Garcia-Hernandez M, Huala E, Rhee SY: Design, implementation and maintenance of a model organism database for Arabidopsis thaliana. Comp Funct Genom 2004, 5: 362\u2013369. 10.1002\/cfg.408","journal-title":"Comp Funct Genom"},{"key":"854_CR24","doi-asserted-by":"publisher","first-page":"40","DOI":"10.1186\/1471-2105-5-40","volume":"5","author":"SP Shah","year":"2004","unstructured":"Shah SP, He DYM, Sawkins JN, Druce JC, Quon G, Lett D, Zheng GXY, Xu T, Ouellette BFF: Pegasys: software for executing and integrating analyses of biological sequences. BMC Bioinformatics 2004, 5: 40. 10.1186\/1471-2105-5-40","journal-title":"BMC Bioinformatics"},{"key":"854_CR25","doi-asserted-by":"publisher","first-page":"934","DOI":"10.1101\/gr.1859804","volume":"14","author":"SC Potter","year":"2004","unstructured":"Potter SC, Clarke L, Curwen V, Keenan S, Mongin E, Searle SMJ, Stabenau A, Storey R, Clamp M: The Ensembl Analysis Pipeline. Genome Res 2004, 14: 934\u2013941. 10.1101\/gr.1859804","journal-title":"Genome Res"},{"key":"854_CR26","doi-asserted-by":"crossref","first-page":"1904","DOI":"10.1101\/gr.1363103","volume":"13","author":"H Hoon","year":"2003","unstructured":"Hoon H, Ratnapu KK, Chia J-M, Kumarasamy B, Xiao J, Clamp M, Stabenau A, Potter S, Clarke L, Stupka E: Biopipe: A Flexible Framework for Protocol-Based Bioinformatics Analysis. Genome Res 2003, 13: 1904\u20131915.","journal-title":"Genome Res"},{"key":"854_CR27","unstructured":"Lucy Demonstration Data Set[http:\/\/www.tigr.org\/software\/sequencing.shtml]"},{"key":"854_CR28","unstructured":"MySQL[http:\/\/www.mysql.com]"},{"key":"854_CR29","unstructured":"PostgreSQL[http:\/\/www.postgresql.org]"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-7-115.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/1471-2105-7-115\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-7-115.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,6]],"date-time":"2023-05-06T18:37:31Z","timestamp":1683398251000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-7-115"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2006,3,7]]},"references-count":29,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2006,12]]}},"alternative-id":["854"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-7-115","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2006,3,7]]},"assertion":[{"value":"14 March 2005","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 March 2006","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 March 2006","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"115"}}