{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"institution":[{"name":"medRxiv"}],"indexed":{"date-parts":[[2026,1,16]],"date-time":"2026-01-16T12:24:35Z","timestamp":1768566275008,"version":"3.49.0"},"posted":{"date-parts":[[2023,10,3]]},"group-title":"Intensive Care and Critical Care Medicine","reference-count":18,"publisher":"openRxiv","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"accepted":{"date-parts":[[2023,10,3]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                <jats:p>\n                  Pulse oximeters measure peripheral arterial oxygen saturation (SpO\n                  <jats:sub>2<\/jats:sub>\n                  ) noninvasively, while the gold standard (SaO\n                  <jats:sub>2<\/jats:sub>\n                  ) involves arterial blood gas measurement. There are known racial and ethnic disparities in their performance. BOLD is a new comprehensive dataset that aims to underscore the importance of addressing biases in pulse oximetry accuracy, which disproportionately affect darker-skinned patients.\n                <\/jats:p>\n                <jats:p>\n                  The dataset was created by harmonizing three Electronic Health Record databases (MIMIC-III, MIMIC-IV, eICU-CRD) comprising Intensive Care Unit stays of US patients. Paired SpO\n                  <jats:sub>2<\/jats:sub>\n                  and SaO\n                  <jats:sub>2<\/jats:sub>\n                  measurements were time-aligned and combined with various other sociodemographic and parameters to provide a detailed representation of each patient. BOLD includes 49,099 paired measurements, within a 5-minute window and with oxygen saturation levels between 70-100%. Minority racial and ethnic groups account for \u223c25% of the data \u2013 a proportion seldom achieved in previous studies. 
The codebase is publicly available.\n                <\/jats:p>\n                <jats:p>Given the prevalent use of pulse oximeters in the hospital and at home, we hope that BOLD will be leveraged to develop debiasing algorithms that can result in more equitable healthcare solutions.<\/jats:p>","DOI":"10.1101\/2023.10.03.23296485","type":"posted-content","created":{"date-parts":[[2023,10,3]],"date-time":"2023-10-03T23:50:16Z","timestamp":1696377016000},"source":"Crossref","is-referenced-by-count":0,"title":["BOLD: Blood-gas and Oximetry Linked Dataset \u2013 Open Source Research"],"prefix":"10.64898","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0312-1647","authenticated-orcid":false,"given":"Jo\u00e3o","family":"Matos","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0199-0184","authenticated-orcid":false,"given":"Tristan","family":"Struja","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1306-2334","authenticated-orcid":false,"given":"Jack","family":"Gallifant","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6847-6748","authenticated-orcid":false,"given":"Luis","family":"Nakayama","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5786-2627","authenticated-orcid":false,"given":"Marie-Laure","family":"Charpignon","sequence":"additional","affiliation":[]},{"given":"Xiaoli","family":"Liu","sequence":"additional","affiliation":[]},{"given":"Nicoleta","family":"Economou-Zavlanos","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3760-2473","authenticated-orcid":false,"given":"Jaime S.","family":"Cardoso","sequence":"additional","affiliation":[]},{"given":"Kimberly 
S","family":"Johnson","sequence":"additional","affiliation":[]},{"given":"Nrupen","family":"Bhavsar","sequence":"additional","affiliation":[]},{"given":"Judy","family":"Gichoya","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6712-6626","authenticated-orcid":false,"given":"Leo Anthony","family":"Celi","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5668-4251","authenticated-orcid":false,"given":"A. Ian","family":"Wong","sequence":"additional","affiliation":[]}],"member":"54368","reference":[{"key":"2023100902451459000_2023.10.03.23296485v1.1","doi-asserted-by":"publisher","DOI":"10.1056\/NEJMc2029240"},{"key":"2023100902451459000_2023.10.03.23296485v1.2","doi-asserted-by":"publisher","DOI":"10.1001\/jamanetworkopen.2021.31674"},{"key":"2023100902451459000_2023.10.03.23296485v1.3","doi-asserted-by":"publisher","DOI":"10.1136\/bmj-2021-069775"},{"key":"2023100902451459000_2023.10.03.23296485v1.4","doi-asserted-by":"publisher","DOI":"10.1001\/jamainternmed.2022.2587"},{"key":"2023100902451459000_2023.10.03.23296485v1.5","doi-asserted-by":"publisher","DOI":"10.13026\/C2WM1R"},{"key":"2023100902451459000_2023.10.03.23296485v1.6","doi-asserted-by":"crossref","unstructured":"Johnson, A. E. W. et al. MIMIC-III, a freely accessible critical care database. 
Sci Data 3, 160035 (2016).","DOI":"10.1038\/sdata.2016.35"},{"key":"2023100902451459000_2023.10.03.23296485v1.7","doi-asserted-by":"publisher","DOI":"10.13026\/S6N6-XD98"},{"key":"2023100902451459000_2023.10.03.23296485v1.8","doi-asserted-by":"publisher","DOI":"10.1109\/ENBENG58165.2023.10175316"},{"key":"2023100902451459000_2023.10.03.23296485v1.9","doi-asserted-by":"publisher","DOI":"10.1093\/jamia\/ocx084"},{"key":"2023100902451459000_2023.10.03.23296485v1.10","doi-asserted-by":"crossref","first-page":"810","DOI":"10.1111\/j.1553-2712.1995.tb03276.x","article-title":"Time to equilibration of oxygen saturation using pulse oximetry","volume":"2","year":"1995","journal-title":"Acad. Emerg. Med"},{"key":"2023100902451459000_2023.10.03.23296485v1.11","doi-asserted-by":"publisher","DOI":"10.1007\/s001340100900"},{"key":"2023100902451459000_2023.10.03.23296485v1.12","doi-asserted-by":"crossref","first-page":"621","DOI":"10.1001\/jama.2021.13304","article-title":"Updated Guidance on the Reporting of Race and Ethnicity in Medical and Science Journals","volume":"326","year":"2021","journal-title":"JAMA"},{"key":"2023100902451459000_2023.10.03.23296485v1.13","doi-asserted-by":"publisher","DOI":"10.1097\/MLR.0b013e31819432e5"},{"key":"2023100902451459000_2023.10.03.23296485v1.14","doi-asserted-by":"publisher","DOI":"10.1016\/0021-9681(87)90171-8"},{"key":"2023100902451459000_2023.10.03.23296485v1.15","doi-asserted-by":"publisher","DOI":"10.1007\/BF01709751"},{"key":"2023100902451459000_2023.10.03.23296485v1.16","doi-asserted-by":"crossref","first-page":"26","DOI":"10.1093\/jamiaopen\/ooy012","article-title":"
tableone: An open source Python package for producing summary statistics for research papers","volume":"1","year":"2018","journal-title":"JAMIA Open"},{"key":"2023100902451459000_2023.10.03.23296485v1.17","doi-asserted-by":"publisher","DOI":"10.1183\/20734735.001415"},{"key":"2023100902451459000_2023.10.03.23296485v1.18","doi-asserted-by":"crossref","unstructured":"Bilogur, A. Missingno: a missing data visualization suite. J. Open Source Softw. 3, 547 (2018).","DOI":"10.21105\/joss.00547"}],"container-title":[],"original-title":[],"link":[{"URL":"https:\/\/syndication.highwire.org\/content\/doi\/10.1101\/2023.10.03.23296485","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,1,15]],"date-time":"2026-01-15T19:42:25Z","timestamp":1768506145000},"score":1,"resource":{"primary":{"URL":"http:\/\/medrxiv.org\/lookup\/doi\/10.1101\/2023.10.03.23296485"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,10,3]]},"references-count":18,"URL":"https:\/\/doi.org\/10.1101\/2023.10.03.23296485","relation":{},"subject":[],"published":{"date-parts":[[2023,10,3]]},"subtype":"preprint"}}