{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,2]],"date-time":"2025-12-02T15:17:44Z","timestamp":1764688664591},"reference-count":36,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2007,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Each major protein database uses its own conventions when assigning protein identifiers. Resolving the various, potentially unstable, identifiers that refer to identical proteins is a major challenge. This is a common problem when attempting to unify datasets that have been annotated with proteins from multiple data sources or querying data providers with one flavour of protein identifiers when the source database uses another. Partial solutions for protein identifier mapping exist but they are limited to specific species or techniques and to a very small number of databases. As a result, we have not found a solution that is generic enough and broad enough in mapping scope to suit our needs.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>We have created the Protein Identifier Cross-Reference (PICR) service, a web application that provides interactive and programmatic (SOAP and REST) access to a mapping algorithm that uses the UniProt Archive (UniParc) as a data warehouse to offer protein cross-references based on 100% sequence identity to proteins from over 70 distinct source databases loaded into UniParc. Mappings can be limited by source database, taxonomic ID and activity status in the source database. Users can copy\/paste or upload files containing protein identifiers or sequences in FASTA format to obtain mappings using the interactive interface. Search results can be viewed in simple or detailed HTML tables or downloaded as comma-separated values (CSV) or Microsoft Excel (XLS) files suitable for use in a local database or a spreadsheet. Alternatively, a SOAP interface is available to integrate PICR functionality in other applications, as is a lightweight REST interface.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusion<\/jats:title>\n            <jats:p>We offer a publicly available service that can interactively map protein identifiers and protein sequences to the majority of commonly used protein databases. Programmatic access is available through a standards-compliant SOAP interface or a lightweight REST interface. The PICR interface, documentation and code examples are available at <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" xlink:href=\"http:\/\/www.ebi.ac.uk\/Tools\/picr\" ext-link-type=\"uri\">http:\/\/www.ebi.ac.uk\/Tools\/picr<\/jats:ext-link>.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-8-401","type":"journal-article","created":{"date-parts":[[2007,10,18]],"date-time":"2007-10-18T18:14:03Z","timestamp":1192731243000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":107,"title":["The Protein Identifier Cross-Referencing (PICR) service: reconciling protein identifiers across multiple source databases"],"prefix":"10.1186","volume":"8","author":[{"given":"Richard G","family":"C\u00f4t\u00e9","sequence":"first","affiliation":[]},{"given":"Philip","family":"Jones","sequence":"additional","affiliation":[]},{"given":"Lennart","family":"Martens","sequence":"additional","affiliation":[]},{"given":"Samuel","family":"Kerrien","sequence":"additional","affiliation":[]},{"given":"Florian","family":"Reisinger","sequence":"additional","affiliation":[]},{"given":"Quan","family":"Lin","sequence":"additional","affiliation":[]},{"given":"Rasko","family":"Leinonen","sequence":"additional","affiliation":[]},{"given":"Rolf","family":"Apweiler","sequence":"additional","affiliation":[]},{"given":"Henning","family":"Hermjakob","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2007,10,18]]},"reference":[{"key":"1773_CR1","doi-asserted-by":"crossref","unstructured":"The UniProt Consortium: The Universal Protein Resource (UniProt). Nucleic Acids Res 2007, (35 Database):D193\u20137. Epub 2006 Nov 16, PMID: 17142230 Epub 2006 Nov 16, PMID: 17142230 10.1093\/nar\/gkl929","DOI":"10.1093\/nar\/gkl929"},{"key":"1773_CR2","volume-title":"Nucleic Acids Res","author":"TJ Hubbard","year":"2007","unstructured":"Hubbard TJ, et al.: Ensembl 2007. Nucleic Acids Res 2007, (35 Database):D610\u20137. Epub 2006 Dec 5, PMID: 17148474 Epub 2006 Dec 5, PMID: 17148474 10.1093\/nar\/gkl996"},{"key":"1773_CR3","volume-title":"NCBI reference sequences (RefSeq)","author":"KD Pruitt","year":"2007","unstructured":"Pruitt KD, Tatusova T, Maglott DR: NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res 2007, (35 Database):D61\u20135. Epub 2006 Nov 27, PMID: 17130148 Epub 2006 Nov 27, PMID: 17130148 10.1093\/nar\/gkl842"},{"issue":"1","key":"1773_CR4","doi-asserted-by":"publisher","first-page":"59","DOI":"10.1093\/bib\/5.1.59","volume":"5","author":"T Clark","year":"2004","unstructured":"Clark T, Martin S, Liefeld T: Globally distributed object identification for biological knowledgebases. Brief Bioinform 2004, 5(1):59\u201370. PMID: 15153306 PMID: 15153306 10.1093\/bib\/5.1.59","journal-title":"Brief Bioinform"},{"issue":"16","key":"1773_CR5","doi-asserted-by":"publisher","first-page":"4514","DOI":"10.1002\/pmic.200600032","volume":"6","author":"G Babnigg","year":"2006","unstructured":"Babnigg G, Giometti CS: A database of unique protein sequence identifiers for proteome studies. Proteomics 2006, 6(16):4514\u201322. PMID: 16858731 PMID: 16858731 10.1002\/pmic.200600032","journal-title":"Proteomics"},{"issue":"15","key":"1773_CR6","doi-asserted-by":"publisher","first-page":"4223","DOI":"10.1002\/pmic.200600018","volume":"6","author":"AM Boehm","year":"2006","unstructured":"Boehm AM, Sickmann A: A comprehensive dictionary of protein accession codes for complete protein accession identifier alias resolving. Proteomics 2006, 6(15):4223\u20136. PMID: 16888720 PMID: 16888720 10.1002\/pmic.200600018","journal-title":"Proteomics"},{"key":"1773_CR7","volume-title":"IDconverter and IDClight: conversion and annotation of gene and protein IDs","author":"A Alibes","year":"2007","unstructured":"Alibes A, Yankilevich P, Canada A, Diaz-Uriarte R: IDconverter and IDClight: conversion and annotation of gene and protein IDs. BMC Bioinformatics 2007 Jan 10; PMID: 17214880 2007 Jan 10; PMID: 17214880"},{"key":"1773_CR8","unstructured":"IDCLight[http:\/\/idclight.bioinfo.cnio.es\/]"},{"key":"1773_CR9","unstructured":"caBIG GeneConnect[https:\/\/cabig.nci.nih.gov\/tools\/GeneConnect\/]"},{"key":"1773_CR10","unstructured":"PIR ID Mapping[http:\/\/pir.georgetown.edu\/pirwww\/search\/idmapping.shtml]"},{"issue":"4","key":"1773_CR11","doi-asserted-by":"publisher","first-page":"R27","DOI":"10.1186\/gb-2003-4-4-r27","volume":"4","author":"KJ Bussey","year":"2003","unstructured":"Bussey KJ, Kane D, Sunshine M, Narasimhan S, Nishizuka S, Reinhold WC, Zeeberg B, Ajay W, Weinstein JN: MatchMiner: a tool for batch navigation among gene and gene product identifiers. Genome Biol 2003, 4(4):R27. Epub 2003 Mar 25, PMID: 12702208 Epub 2003 Mar 25, PMID: 12702208 10.1186\/gb-2003-4-4-r27","journal-title":"Genome Biol"},{"key":"1773_CR12","unstructured":"Onto-Translate[http:\/\/vortex.cs.wayne.edu\/projects.htm#Onto-Translate]"},{"key":"1773_CR13","unstructured":"SOURCE[http:\/\/source.stanford.edu]"},{"key":"1773_CR14","unstructured":"Resourcerer[http:\/\/compbio.dfci.harvard.edu\/tgi\/cgi-bin\/magic\/p1.pl]"},{"key":"1773_CR15","volume-title":"PROMPT","author":"T Schmidt","year":"2006","unstructured":"Schmidt T, Frishman D: PROMPT: a protein mapping and comparison tool. BMC Bioinformatics 7: 331. 2006 Jul 4, PMID: 16817977 2006 Jul 4, PMID: 16817977 10.1186\/1471-2105-7-331"},{"issue":"13","key":"1773_CR16","doi-asserted-by":"publisher","first-page":"3537","DOI":"10.1002\/pmic.200401303","volume":"5","author":"L Martens","year":"2005","unstructured":"Martens L, Hermjakob H, Jones P, Adamski M, Taylor C, States D, Gevaert K, Vandekerckhove J, Apweiler R: PRIDE: the proteomics identifications database. Proteomics 2005, 5(13):3537\u201345. Erratum in: Proteomics. 2005 Oct;5(15):4046 Erratum in: Proteomics. 2005 Oct;5(15):4046 10.1002\/pmic.200401303","journal-title":"Proteomics"},{"key":"1773_CR17","volume-title":"PRIDE","author":"P Jones","year":"2006","unstructured":"Jones P, Cote RG, Martens L, Quinn AF, Taylor CF, Derache W, Hermjakob H, Apweiler R: PRIDE: a public repository of protein and peptide identifications for the proteomics community. Nucleic Acids Res (34 Database):D659\u201363. 2006 Jan 1, PMID: 16381953 2006 Jan 1, PMID: 16381953"},{"key":"1773_CR18","volume-title":"IntAct\u2013open source resource for molecular interaction data","author":"S Kerrien","year":"2007","unstructured":"Kerrien S, Alam-Faruque Y, Aranda B, Bancarz I, Bridge A, Derow C, Dimmer E, Feuermann M, Friedrichsen A, Huntley R, Kohler C, Khadake J, Leroy C, Liban A, Lieftink C, Montecchi-Palazzi L, Orchard S, Risse J, Robbe K, Roechert B, Thorneycroft D, Zhang Y, Apweiler R, Hermjakob H: IntAct\u2013open source resource for molecular interaction data. Nucleic Acids Res 2007, (35 Database):D561\u20135. Epub 2006 Dec 1, PMID: 17145710 Epub 2006 Dec 1, PMID: 17145710 10.1093\/nar\/gkl958"},{"issue":"17","key":"1773_CR19","first-page":"3236","volume":"20","author":"R Leinonen","year":"2004","unstructured":"Leinonen R, Diez FG, Binns D, Fleischmann W, Lopez R, Apweiler R: UniProt archive. Bioinformatics 20(17):3236\u20137. 2004 Nov 22; Epub 2004 Mar 25, PMID: 15044231 2004 Nov 22; Epub 2004 Mar 25, PMID: 15044231 10.1093\/bioinformatics\/bth191","journal-title":"UniProt archive"},{"key":"1773_CR20","unstructured":"The Java API[http:\/\/java.sun.com\/]"},{"key":"1773_CR21","unstructured":"JAXB Reference Implementation[https:\/\/jaxb.dev.java.net\/]"},{"key":"1773_CR22","unstructured":"The Apache Struts Web Application Framework[http:\/\/struts.apache.org\/1.2.9\/]"},{"key":"1773_CR23","unstructured":"JAX-WS Reference Implementation[https:\/\/jax-ws.dev.java.net\/]"},{"key":"1773_CR24","unstructured":"Apache Commons DBCP[http:\/\/jakarta.apache.org\/commons\/dbcp\/]"},{"key":"1773_CR25","unstructured":"OpenSymphony Cache[http:\/\/www.opensymphony.com\/]"},{"key":"1773_CR26","unstructured":"Log4J Logging Services[http:\/\/logging.apache.org\/log4j\/docs\/]"},{"key":"1773_CR27","unstructured":"The JavaMail API[http:\/\/java.sun.com\/products\/javamail\/]"},{"issue":"13","key":"1773_CR28","doi-asserted-by":"publisher","first-page":"3822","DOI":"10.1093\/nar\/gkg516","volume":"31","author":"IQ Phan","year":"2003","unstructured":"Phan IQ, Pilbout SF, Fleischmann W, Bairoch A: NEWT, a new taxonomy portal. Nucleic Acids Res 31(13):3822\u20133. 2003 Jul 1, PMID: 12824428 2003 Jul 1, PMID: 12824428 10.1093\/nar\/gkg516","journal-title":"Nucleic Acids Res"},{"key":"1773_CR29","unstructured":"NCBI eUtilities[http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query\/static\/esoap_help.html]"},{"key":"1773_CR30","first-page":"7","volume-title":"BMC Bioinformatics","author":"RG Cote","year":"2006","unstructured":"Cote RG, Jones P, Apweiler R, Hermjakob H: The Ontology Lookup Service, a lightweight cross-platform tool for controlled vocabulary queries. BMC Bioinformatics 7: 97. 2006 Feb 28, PMID: 16507094 2006 Feb 28, PMID: 16507094 10.1186\/1471-2105-7-97"},{"key":"1773_CR31","volume-title":"The Vertebrate Genome Annotation (Vega) database","author":"JL Ashurst","year":"2005","unstructured":"Ashurst JL, Chen CK, Gilbert JG, Jekosch K, Keenan S, Meidl P, Searle SM, Stalker J, Storey R, Trevanion S, Wilming L, Hubbard T: The Vertebrate Genome Annotation (Vega) database. Nucleic Acids Res (33 Database):D459\u201365. 2005 Jan 1, PMID: 15608237 2005 Jan 1, PMID: 15608237"},{"key":"1773_CR32","volume-title":"databases of predicted protein sequences","author":"P Sperisen","year":"2004","unstructured":"Sperisen P, Iseli C, Pagni M, Stevenson BJ, Bucher P, Jongeneel CV: trome, trEST and trGEN: databases of predicted protein sequences. Nucleic Acids Res (32 Database):D509\u201311. 2004 Jan 1, PMID: 14681469 2004 Jan 1, PMID: 14681469"},{"issue":"11","key":"1773_CR33","doi-asserted-by":"publisher","first-page":"1048","DOI":"10.1093\/bioinformatics\/16.11.1048","volume":"16","author":"P Kersey","year":"2000","unstructured":"Kersey P, Hermjakob H, Apweiler R: VARSPLIC: alternatively-spliced protein sequences derived from SWISS-PROT and TrEMBL. Bioinformatics 2000, 16(11):1048\u20139. PMID: 11159319 PMID: 11159319 10.1093\/bioinformatics\/16.11.1048","journal-title":"Bioinformatics"},{"key":"1773_CR34","unstructured":"PICR SOAP developer documentation[http:\/\/www.ebi.ac.uk\/Tools\/picr\/WSDLDocumentation.do]"},{"key":"1773_CR35","unstructured":"PICR REST developer documentation[http:\/\/www.ebi.ac.uk\/Tools\/picr\/RESTDocumentation.do]"},{"key":"1773_CR36","unstructured":"PICR main search page[http:\/\/www.ebi.ac.uk\/Tools\/picr]"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-8-401.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T01:53:17Z","timestamp":1630461197000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-8-401"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,10,18]]},"references-count":36,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2007,12]]}},"alternative-id":["1773"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-8-401","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2007,10,18]]},"assertion":[{"value":"30 May 2007","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 October 2007","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 October 2007","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"401"}}