{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,16]],"date-time":"2026-04-16T04:52:08Z","timestamp":1776315128380,"version":"3.50.1"},"reference-count":42,"publisher":"Springer Science and Business Media LLC","issue":"12","license":[{"start":{"date-parts":[[2024,9,28]],"date-time":"2024-09-28T00:00:00Z","timestamp":1727481600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,9,28]],"date-time":"2024-09-28T00:00:00Z","timestamp":1727481600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100010661","name":"Horizon 2020 Framework Programme","doi-asserted-by":"publisher","award":["101095129"],"award-info":[{"award-number":["101095129"]}],"id":[{"id":"10.13039\/100010661","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Alma Mater Studiorum - Universit\u00e0 di Bologna"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Scientometrics"],"published-print":{"date-parts":[[2024,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>This article presents the OpenCitations Index, a collection of open citation data maintained by OpenCitations, an independent, not-for-profit infrastructure organisation for open scholarship dedicated to publishing open bibliographic and citation data using Semantic Web and Linked Open Data technologies. The collection involves citation data harvested from multiple sources. To address the possibility of different sources providing citation data for bibliographic entities represented with different identifiers, therefore potentially representing same citation, a deduplication mechanism has been implemented. This ensures that citations integrated into OpenCitations Index are accurately identified uniquely, even when different identifiers are used. 
This mechanism follows a specific workflow, which encompasses preprocessing of the original source data, management of the provided bibliographic metadata, and the generation of new citation data to be integrated into the OpenCitations Index. The process relies on another data collection\u2014OpenCitations Meta, and on the use of a new globally persistent identifier, namely OMID (OpenCitations Meta Identifier). As of July 2024, OpenCitations Index stores over 2 billion unique citation links, harvested from Crossref, the National Institutes of Health Open Citation Collection (NIH-OCC), DataCite, OpenAIRE, and the Japan Link Center (JaLC). OpenCitations Index can be systematically accessed and queried through several services, including a SPARQL endpoint, REST APIs, and web interfaces. Additionally, dataset dumps are available for free download and reuse (under a CC0 waiver) in various formats (CSV, N-Triples, and Scholix), including provenance and change tracking information.<\/jats:p>","DOI":"10.1007\/s11192-024-05160-7","type":"journal-article","created":{"date-parts":[[2024,9,28]],"date-time":"2024-09-28T09:01:37Z","timestamp":1727514097000},"page":"7923-7942","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":8,"title":["The OpenCitations Index: description of a database providing open citation 
data"],"prefix":"10.1007","volume":"129","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5366-5194","authenticated-orcid":false,"given":"Ivan","family":"Heibi","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5486-7070","authenticated-orcid":false,"given":"Arianna","family":"Moretti","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0530-4305","authenticated-orcid":false,"given":"Silvio","family":"Peroni","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0009-0008-1466-7742","authenticated-orcid":false,"given":"Marta","family":"Soricetti","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2024,9,28]]},"reference":[{"key":"5160_CR1","unstructured":"Albertoni, R., Browning, D., Cox, S. J. D., Gonzalez\u00a0Beltran, A., Perego, A., & Winstanley, P. (2024). Data catalog vocabulary (DCAT)\u2014version 3 W3C recommendation. World Wide Web Consortium. Retrieved from https:\/\/www.w3.org\/TR\/vocab-dcat-3\/."},{"key":"5160_CR2","unstructured":"Alexander, K., Cyganiak, R., Hausenblas, M., & Zhao, J. (2009). Describing linked datasets. In C.\u00a0Bizer, T.\u00a0Heath, T.\u00a0Berners-Lee, & K.\u00a0Idehen (Eds.), Proceedings of the WWW 2009 workshop on linked data on the web, LDOW 2009. Madrid, Spain: CEUR-WS. Retrieved from https:\/\/ceur-ws.org\/Vol-538\/ldow2009_paper20.pdf."},{"key":"5160_CR3","doi-asserted-by":"publisher","unstructured":"Beck, F., & Krause, C. (2022). Visually explaining publication ranks in citation-based literature search with PURE suggest. In M.\u00a0Krone, S.\u00a0Lenti, & J.\u00a0Schmidt (Eds.), Eurovis 2022\u2014posters. The Eurographics Association. https:\/\/doi.org\/10.2312\/evp.20221110.","DOI":"10.2312\/evp.20221110"},{"key":"5160_CR4","unstructured":"Beckett, D., Berners-Lee, T., Prud\u2019hommeaux, E., & Carothers, G. (2014). RDF 1.1 turtle: Terse RDF triple language [W3C recommendation]. 
Retrieved from https:\/\/www.w3.org\/TR\/turtle\/."},{"key":"5160_CR5","doi-asserted-by":"publisher","DOI":"10.1045\/january2017-burton","author":"A Burton","year":"2017","unstructured":"Burton, A., Aryani, A., Koers, H., Manghi, P., La Bruzzo, S., Stocker, M., et al. (2017). The scholix framework for interoperability in data-literature information exchange. D-Lib Magazine. https:\/\/doi.org\/10.1045\/january2017-burton","journal-title":"D-Lib Magazine"},{"issue":"2","key":"5160_CR6","doi-asserted-by":"publisher","first-page":"195","DOI":"10.3233\/SW-210439","volume":"13","author":"M Daquino","year":"2022","unstructured":"Daquino, M., Heibi, I., Peroni, S., & Shotton, D. (2022). Creating RESTful APIs over SPARQL endpoints using RAMOSE. Semantic Web, 13(2), 195\u2013213. https:\/\/doi.org\/10.3233\/SW-210439","journal-title":"Semantic Web"},{"key":"5160_CR7","doi-asserted-by":"crossref","unstructured":"Daquino, M., Peroni, S., Shotton, D., Colavizza, G., Ghavimi, B., Lauscher, A., et al. (2020). The opencitations data model. In International semantic web conference (pp. 447\u2013463).","DOI":"10.1007\/978-3-030-62466-8_28"},{"key":"5160_CR8","doi-asserted-by":"publisher","DOI":"10.1007\/s00799-023-00372-3","author":"E Entrup","year":"2023","unstructured":"Entrup, E., Eppelin, A., Ewerth, R., Hartwig, J., Tullney, M., Wohlgemuth, M., & Hoppe, A. (2023). Comparing different search methods for the open access journal recommendation tool b!son. International Journal on Digital Libraries. https:\/\/doi.org\/10.1007\/s00799-023-00372-3","journal-title":"International Journal on Digital Libraries"},{"key":"5160_CR9","doi-asserted-by":"publisher","unstructured":"Fenner, M. (2016). A common API for retrieving DataCite metadata [other]. Retrieved September 26, 2023, from  https:\/\/blog.front-matter.io\/posts\/a-common-api-for-retrieving-datacite-metadata. 
https:\/\/doi.org\/10.53731\/r79x5j1-97aq74v-ag59c.","DOI":"10.53731\/r79x5j1-97aq74v-ag59c"},{"key":"5160_CR10","doi-asserted-by":"crossref","unstructured":"Franchuk, N. (2023). \u0422\u0435\u0445\u043d\u043e\u043b\u043e\u0433i\u044f \u0412\u0438\u043a\u043e\u0440\u0438\u0441\u0442\u0430\u043d\u043d\u044f \u0432i\u0434\u043a\u0440\u0438\u0442\u043e\u0433\u043e \u0443\u043a\u0440\u0430\u00ef\u043d\u0441\u044c\u043a\u043e\u0433\u043e i\u043d\u0434\u0435\u043a\u0441\u0443 \u0446\u0438\u0442\u0443\u0432\u0430\u043d\u044c \u0434\u043b\u044f \u043e\u0446i\u043d\u044e\u0432\u0430\u043d\u043d\u044f \u0440\u0435\u0437\u0443\u043b\u044c\u0442\u0430\u0442\u0438\u0432\u043d\u043e\u0441\u0442i \u043f\u0435\u0434\u0430\u0433\u043e\u0433i\u0447\u043d\u0438\u0445 \u0434\u043e\u0441\u043bi\u0434\u0436\u0435\u043d\u044c. \u041e\u0441\u0432i\u0442\u0430. I\u043d\u043d\u043e\u0432\u0430\u0442\u0438\u043a\u0430. \u041f\u0440\u0430\u043a\u0442\u0438\u043a\u0430, 5(11), 95\u2013101, https:\/\/doi.org\/10.31110\/2616-650X-vol11i5-014","DOI":"10.31110\/2616-650X-vol11i5-014"},{"key":"5160_CR11","doi-asserted-by":"publisher","unstructured":"Grieco, G., Peroni, S., Moretti, A., dbrembilla, Heibi, I., Czygan, M. (2024). Opencitations index (v1.0.1). https:\/\/doi.org\/10.5281\/zenodo.12960640.","DOI":"10.5281\/zenodo.12960640"},{"key":"5160_CR12","doi-asserted-by":"publisher","unstructured":"Group, D. M. W., et\u00a0al. (2024). Datacite metadata schema documentation for the publication and citation of research data and other research outputs note. https:\/\/doi.org\/10.14454\/G8E5-6293.","DOI":"10.14454\/G8E5-6293"},{"key":"5160_CR13","unstructured":"Harris, S., & Seaborne, A. (2013). SPARQL 1.1 query language. Retrieved from https:\/\/www.w3.org\/TR\/sparql11-query\/."},{"issue":"1","key":"5160_CR14","doi-asserted-by":"publisher","first-page":"205","DOI":"10.3233\/DS-190016","volume":"2","author":"I Heibi","year":"2019","unstructured":"Heibi, I., Peroni, S., & Shotton, D. (2019a). 
Enabling text search on SPARQL endpoints through OSCAR. Data Science, 2(1), 205\u2013227. https:\/\/doi.org\/10.3233\/DS-190016.","journal-title":"Data Science"},{"issue":"2","key":"5160_CR15","doi-asserted-by":"publisher","first-page":"1213","DOI":"10.1007\/s11192-019-03217-6","volume":"121","author":"I Heibi","year":"2019","unstructured":"Heibi, I., Peroni, S., & Shotton, D. (2019). Software review: COCI, the OpenCitations index of crossref open DOI-to-DOI citations. Scientometrics, 121(2), 1213\u20131228. https:\/\/doi.org\/10.1007\/s11192-019-03217-6","journal-title":"Scientometrics"},{"key":"5160_CR16","unstructured":"Hendricks, G., Rittman, M., & Bartell, A. (2022). Amendments to membership terms to open reference distribution and include UK jurisdiction [website]. Retrieved April 9, 2024, from https:\/\/www.crossref.org\/blog\/amendments-to-membership-terms-to-open-reference-distribution-and-include-uk-jurisdiction\/."},{"issue":"1","key":"5160_CR17","doi-asserted-by":"publisher","first-page":"414","DOI":"10.1162\/qss_a_00022","volume":"1","author":"G Hendricks","year":"2020","unstructured":"Hendricks, G., Tkaczyk, D., Lin, J., & Feeney, P. (2020). Crossref: The sustainable source of community-owned scholarly metadata. Quantitative Science Studies, 1(1), 414\u2013427. https:\/\/doi.org\/10.1162\/qss_a_00022.","journal-title":"Quantitative Science Studies"},{"issue":"10","key":"5160_CR18","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pbio.3000385","volume":"17","author":"BI Hutchins","year":"2019","unstructured":"Hutchins, B. I., Baker, K. L., Davis, M. T., Diwersy, M. A., Haque, E., Harriman, R. M., & Santangelo, G. M. (2019). The NIH open citation collection: A public access, broad coverage resource. PLoS Biology, 17(10), e3000385. https:\/\/doi.org\/10.1371\/journal.pbio.3000385.","journal-title":"PLoS Biology"},{"key":"5160_CR19","doi-asserted-by":"publisher","unstructured":"ICite, Hutchins, B. I., & Santangelo, G. (2022). 
iCite database snapshots (NIH open citation collection). https:\/\/doi.org\/10.35092\/YHJC.C.4586573.","DOI":"10.35092\/YHJC.C.4586573"},{"issue":"1","key":"5160_CR20","doi-asserted-by":"publisher","first-page":"42","DOI":"10.1241\/johokanri.55.42","volume":"55","author":"T Kato","year":"2012","unstructured":"Kato, T., Tsuchiya, E., Kubota, S., & Miyagawa, Y. (2012). Japan link center (jalc): link management and doi assignment for Japanese electronic scholarly contents. Journal of Information Processing and Management, 55(1), 42\u201346. https:\/\/doi.org\/10.1241\/johokanri.55.42.","journal-title":"Journal of Information Processing and Management"},{"key":"5160_CR21","doi-asserted-by":"publisher","unstructured":"Kinney, R., Anastasiades, C., Authur, R., Beltagy, I., Bragg, J., Buraczynski, A., et al. (2023). The semantic scholar open data platform.  https:\/\/doi.org\/10.48550\/arXiv.2301.10140.","DOI":"10.48550\/arXiv.2301.10140"},{"key":"5160_CR22","doi-asserted-by":"publisher","unstructured":"La\u00a0Bruzzo, S., Baglioni, M., Atzori, C., & Manghi, P. (2023). Scholix dump of the OpenAIRE inferred citations. Zenodo. https:\/\/doi.org\/10.5281\/ZENODO.7845968.","DOI":"10.5281\/ZENODO.7845968"},{"key":"5160_CR23","unstructured":"La\u00a0Bruzzo, S., & Manghi, P. (2022). OpenAIRE ScholeXplorer service: Scholix JSON dump. [object Object]. Retrieved April 3, 2024, from https:\/\/zenodo.org\/record\/1200252."},{"key":"5160_CR24","unstructured":"Lebo, T., Sahoo, S., & McGuinness, D. (2013). PROV-O: The PROV ontology W3C recommendation. World Wide Web Consortium. Retrieved September 14, 2019, from https:\/\/www.w3.org\/TR\/prov-o\/."},{"key":"5160_CR25","doi-asserted-by":"publisher","unstructured":"Manghi, P., Bardi, A., Atzori, C., Baglioni, M., Manola, N., & Schirrwagen, J. et al. (2019). The openaire research graph data model. Zenodo. 
https:\/\/doi.org\/10.5281\/zenodo.2643199","DOI":"10.5281\/zenodo.2643199"},{"key":"5160_CR26","doi-asserted-by":"publisher","DOI":"10.1045\/september2012-manghi","author":"P Manghi","year":"2012","unstructured":"Manghi, P., Bolikowski, L., Manold, N., Schirrwagen, J., & Smith, T. (2012). OpenAIREplus: The European scholarly communication data infrastructure. D-Lib Magazine. https:\/\/doi.org\/10.1045\/september2012-manghi","journal-title":"D-Lib Magazine"},{"key":"5160_CR27","unstructured":"Manghi, P., Manola, N., Horstmann, W., & Peters, D. (2010). An infrastructure for managing ec funded research output: The openaire project. Grey Journal (TGJ), 6(1)."},{"key":"5160_CR28","doi-asserted-by":"publisher","DOI":"10.1162\/qss_a_00292","author":"A Massari","year":"2024","unstructured":"Massari, A., Mariani, F., Heibi, I., Peroni, S., & Shotton, D. (2024). OpenCitations meta. Quantitative Science Studies. https:\/\/doi.org\/10.1162\/qss_a_00292","journal-title":"Quantitative Science Studies"},{"key":"5160_CR29","doi-asserted-by":"publisher","unstructured":"Massari, A., Moretti, A., Soricetti, M., Rizzetto, E., & Heibi, I. (2024). Opencitations data source converter (v1.0.0).. https:\/\/doi.org\/10.5281\/zenodo.12911527.","DOI":"10.5281\/zenodo.12911527"},{"key":"5160_CR30","doi-asserted-by":"publisher","unstructured":"Massari, A., & Peroni, S. (2024). HERITRACE: Tracing evolution and bridging data for streamlined curatorial work in the GLAM domain. Atti del XIII Convegno Annuale AIUCD. ME.TE. Digitali\u2014Mediterraneo in rete tra testi e contesti. Catania, Italy. https:\/\/doi.org\/10.48550\/arxiv.2402.00477.","DOI":"10.48550\/arxiv.2402.00477"},{"key":"5160_CR31","doi-asserted-by":"publisher","DOI":"10.5334\/johd.178","author":"A Moretti","year":"2024","unstructured":"Moretti, A., Soricetti, M., Heibi, I., Massari, A., Peroni, S., & Rizzetto, E. (2024). The integration of the japan link center\u2019s bibliographic data into OpenCitations. 
Journal of Open Humanities Data. https:\/\/doi.org\/10.5334\/johd.178.","journal-title":"Journal of Open Humanities Data"},{"key":"5160_CR32","doi-asserted-by":"publisher","unstructured":"Nielsen, F. \u00c5., Mietchen, D., & Willighagen, E. (2017). Scholia, scientometrics and wikidata. In The semantic web: Eswc 2017 satellite events: Eswc 2017 satellite events, portoro\u017e, slovenia, May 28\u2013June 1, 2017, revised selected papers 14 (pp. 237\u2013259). Retrieved from https:\/\/doi.org\/10.1007\/978-3-319-70407-4_36.","DOI":"10.1007\/978-3-319-70407-4_36"},{"key":"5160_CR33","doi-asserted-by":"publisher","first-page":"33","DOI":"10.2139\/ssrn.3198992","volume":"17","author":"S Peroni","year":"2012","unstructured":"Peroni, S., & Shotton, D. (2012). FaBiO and CiTO: Ontologies for describing bibliographic resources and citations. Journal of Web Semantics, 17, 33\u201343. https:\/\/doi.org\/10.1016\/j.websem.2012.08.001","journal-title":"Journal of Web Semantics"},{"key":"5160_CR34","doi-asserted-by":"publisher","unstructured":"Peroni, S., & Shotton, D. (2018a). Open citation: Definition. https:\/\/doi.org\/10.6084\/m9.figshare.6683855.v1.","DOI":"10.6084\/m9.figshare.6683855.v1"},{"key":"5160_CR35","doi-asserted-by":"publisher","first-page":"119","DOI":"10.1007\/978-3-030-00668-6_8","volume-title":"The semantic web\u2014ISWC 2018","author":"S Peroni","year":"2018","unstructured":"Peroni, S., & Shotton, D. (2018b). The SPAR ontologies. In L. Rutkowski, R. Scherer, M. Korytkowski, W. Pedrycz, R. Tadeusiewicz, & J. M. Zurada (Eds.), The semantic web\u2014ISWC 2018 (Vol. 10842, pp. 119\u2013136). Springer. https:\/\/doi.org\/10.1007\/978-3-030-00668-6_8."},{"key":"5160_CR36","doi-asserted-by":"publisher","unstructured":"Peroni, S., & Shotton, D. (2019). Open citation identifier: Definition. 
https:\/\/doi.org\/10.6084\/m9.figshare.7127816.v2.","DOI":"10.6084\/m9.figshare.7127816.v2"},{"issue":"1","key":"5160_CR37","doi-asserted-by":"publisher","first-page":"428","DOI":"10.1162\/qss_a_00023","volume":"1","author":"S Peroni","year":"2020","unstructured":"Peroni, S., & Shotton, D. (2020). OpenCitations, an infrastructure organization for open scholarship. Quantitative Science Studies, 1(1), 428\u2013444. https:\/\/doi.org\/10.1162\/qss_a_00023","journal-title":"Quantitative Science Studies"},{"key":"5160_CR38","doi-asserted-by":"crossref","unstructured":"Peroni, S., Shotton, D., & Vitali, F. (2017). One year of the OpenCitations corpus: Releasing RDF-based scholarly citation data into the public domain. In: C.\u00a0d\u2019Amato et\u00a0al. (Eds.), The semantic web\u2014ISWC 2017. Lecture Notes in Computer Science. (Vol. 10588, pp. 184\u2013192). Springer. https:\/\/link.springer.com\/10.1007\/978-3-319-68204-4_19.","DOI":"10.1007\/978-3-319-68204-4_19"},{"key":"5160_CR39","unstructured":"Priem, J., Piwowar, H., & Orr, R. (2022). OpenAlex: A fully-open index of scholarly works, authors, venues, institutions, and concepts.  (No.\narXiv:2205.01833). (arXiv:2205.01833)."},{"issue":"3","key":"5160_CR40","doi-asserted-by":"publisher","first-page":"373","DOI":"10.3233\/SW-150197","volume":"8","author":"L Rietveld","year":"2016","unstructured":"Rietveld, L., & Hoekstra, R. (2016). The YASGUI family of SPARQL clients1. Semantic Web, 8(3), 373\u2013383. https:\/\/doi.org\/10.3233\/SW-150197","journal-title":"Semantic Web"},{"key":"5160_CR41","unstructured":"Sugimoto, C.R., Waltman, L., Larivi\u00e8re, V., van Eck, N.J., Boyack, K.W., Wouters, P., & de Rijcke, S. (2017). Open citations: A letter from the scientometric community to scholarly publishers."},{"issue":"10","key":"5160_CR42","doi-asserted-by":"publisher","first-page":"78","DOI":"10.1145\/2629489","volume":"57","author":"D Vrandecic","year":"2014","unstructured":"Vrandecic, D., & Kr\u00f6tzsch, M. (2014). 
Wikidata: A free collaborative knowledgebase. Communications of the ACM, 57(10), 78\u201385. https:\/\/doi.org\/10.1145\/2629489","journal-title":"Communications of the ACM"}],"container-title":["Scientometrics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11192-024-05160-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11192-024-05160-7\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11192-024-05160-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,12,14]],"date-time":"2024-12-14T09:05:42Z","timestamp":1734167142000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11192-024-05160-7"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,9,28]]},"references-count":42,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2024,12]]}},"alternative-id":["5160"],"URL":"https:\/\/doi.org\/10.1007\/s11192-024-05160-7","relation":{},"ISSN":["0138-9130","1588-2861"],"issn-type":[{"value":"0138-9130","type":"print"},{"value":"1588-2861","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,9,28]]},"assertion":[{"value":"6 August 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"16 September 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"28 September 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors have no 
conflict of interest to declare that are relevant to the content of this article.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}