{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,31]],"date-time":"2026-03-31T02:51:14Z","timestamp":1774925474533,"version":"3.50.1"},"reference-count":67,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2024,7,19]],"date-time":"2024-07-19T00:00:00Z","timestamp":1721347200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Graph."],"published-print":{"date-parts":[[2024,7,19]]},"abstract":"<jats:p>ColorVideoVDP is a video and image quality metric that models spatial and temporal aspects of vision for both luminance and color. The metric is built on novel psychophysical models of chromatic spatiotemporal contrast sensitivity and cross-channel contrast masking. It accounts for the viewing conditions, geometric, and photometric characteristics of the display. It was trained to predict common video-streaming distortions (e.g., video compression, rescaling, and transmission errors) and also 8 new distortion types related to AR\/VR displays (e.g., light source and waveguide non-uniformities). To address the latter application, we collected our novel XR-Display-Artifact-Video quality dataset (XR-DAVID), comprised of 336 distorted videos. Extensive testing on XR-DAVID, as well as several datasets from the literature, indicate a significant gain in prediction performance compared to existing metrics. ColorVideoVDP opens the doors to many novel applications that require the joint automated spatiotemporal assessment of luminance and color distortions, including video streaming, display specification, and design, visual comparison of results, and perceptually-guided quality optimization. The code for the metric can be found at https:\/\/github.com\/gfxdisp\/ColorVideoVDP.<\/jats:p>","DOI":"10.1145\/3658144","type":"journal-article","created":{"date-parts":[[2024,7,19]],"date-time":"2024-07-19T10:47:57Z","timestamp":1721386077000},"page":"1-20","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":29,"title":["ColorVideoVDP: A visual difference predictor for image, video and display distortions"],"prefix":"10.1145","volume":"43","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-2353-0349","authenticated-orcid":false,"given":"Rafal K.","family":"Mantiuk","sequence":"first","affiliation":[{"name":"Department of Computer Science and Technology, University of Cambridge, Cambridge, United Kingdom"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7985-4177","authenticated-orcid":false,"given":"Param","family":"Hanji","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Technology, University of Cambridge, Cambridge, United Kingdom"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8142-5611","authenticated-orcid":false,"given":"Maliha","family":"Ashraf","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Technology, University of Cambridge, Cambridge, United Kingdom"}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-8612-9349","authenticated-orcid":false,"given":"Yuta","family":"Asano","sequence":"additional","affiliation":[{"name":"Meta, Redmond, United States of America"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7367-0131","authenticated-orcid":false,"given":"Alexandre","family":"Chapiro","sequence":"additional","affiliation":[{"name":"Meta, Sunnyvale, United States of America"}]}],"member":"320","published-online":{"date-parts":[[2024,7,19]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1167\/14.8.22"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1016\/0042-6989(85)90104-X"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/3406183"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","unstructured":"Pontus Andersson Jim Nilsson Peter Shirley and Tomas Akenine-M\u00f6ller. 2021. Visualizing Errors in Rendered High Dynamic Range Images. In Eurographics Short Papers. 10.2312\/egs.20211015","DOI":"10.2312\/egs.20211015"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1002\/col.22493"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0145671"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1167\/jov.24.4.5"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1002\/sdtp.12190"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCOM.1983.1095851"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1167\/9.5.17"},{"key":"e_1_2_1_11_1","volume-title":"Training deep nets with sublinear memory cost. arXiv preprint arXiv:1604.06174","author":"Chen Tianqi","year":"2016","unstructured":"Tianqi Chen, Bing Xu, Chiyuan Zhang, and Carlos Guestrin. 2016. Training deep nets with sublinear memory cost. arXiv preprint arXiv:1604.06174 (2016)."},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW53098.2021.00054"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1002\/col.22588"},{"key":"e_1_2_1_15_1","unstructured":"CIE. 2018. CIE 015: 2018 Colorimetry. (2018)."},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1117\/12.135952"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1016\/0042-6989(82)90113-4"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/3386569.3392411"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1113\/jphysiol.1984.sp015499"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/2010324.1964991"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-020-01419-7"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1364\/JOSAA.11.001710"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1364\/JOSA.35.000268"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1016\/0042-6989(92)90112-V"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1002\/sdtp.13059"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46475-6_43"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.2352\/ISSN.2470-1173.2021.11.HVEI-153"},{"key":"e_1_2_1_29_1","doi-asserted-by":"crossref","unstructured":"Justin Laird Mitchell Rosen Jeff Pelz Ethan Montag and Scott Daly. 2006. Spatiovelocity CSF as a function of retinal velocity using unstabilized stimuli. In SPIE 6057 Human Vision and Electronic Imaging XI.","DOI":"10.1117\/12.647870"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.2352\/ISSN.2470-1173.2016.16.HVEI-103"},{"key":"e_1_2_1_32_1","unstructured":"Zhi Li Anne Aaron Ioannis Katsavounidis Anush Moorthy and Megha Manohara. 2016b. Toward A Practical Perceptual Video Quality Metric. https:\/\/netflixtechblog.com\/toward-a-practical-perceptual-video-quality-metric-653f208b9652"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/QoMEX.2019.8743252"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/3528223.3530115"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/PCS50896.2021.9477471"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/3450626.3459831"},{"key":"e_1_2_1_37_1","volume-title":"HDR-VDP-3: A multi-metric for predicting image differences, quality and contrast distortions in high dynamic range and regular content. (apr","author":"Mantiuk Rafal K.","year":"2023","unstructured":"Rafal K. Mantiuk, Dounia Hammou, and Param Hanji. 2023. HDR-VDP-3: A multi-metric for predicting image differences, quality and contrast distortions in high dynamic range and regular content. (apr 2023). arXiv:2304.13625 http:\/\/arxiv.org\/abs\/2304.13625"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/2010324.1964935"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0042-6989(00)00247-9"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2021.3076298"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPR48806.2021.9412676"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/WACV51458.2022.00010"},{"key":"e_1_2_1_43_1","doi-asserted-by":"crossref","unstructured":"Chun Wei Ooi and John Dingliana. 2022. Color LightField: Estimation Of View-point Dependant Color Dispersion In Waveguide Display. In SIGGRAPH Asia Posters. 1--2.","DOI":"10.1145\/3550082.3564189"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1364\/JOSAA.7.002032"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1364\/JOSAA.12.000817"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2019.2936103"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00194"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1364\/JOSA.68.000116"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.16910\/jemr.15.3.3"},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2010.2042111"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICIP46576.2022.9897940"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1002\/col.20070"},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2012.2214933"},{"key":"e_1_2_1_54_1","volume-title":"Brainard","author":"Stockman Andrew","year":"2010","unstructured":"Andrew Stockman and David H. Brainard. 2010. Color vision mechanisms. In OSA handbook of optics. 11."},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1364\/JOSA.62.001221"},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1364\/josaa.5.001149"},{"key":"e_1_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0042-6989(98)00219-3"},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICIP46576.2022.9897312"},{"key":"e_1_2_1_59_1","volume-title":"Bovik","author":"Venkataramanan Abhinau K.","year":"2023","unstructured":"Abhinau K. Venkataramanan, Cosmin Stejerean, Ioannis Katsavounidis, and Alan C. Bovik. 2023. One Transform To Compute Them All: Efficient Fusion-Based Full-Reference Video Quality Assessment. arXiv:2304.03412 (Nov. 2023). http:\/\/arxiv.org\/abs\/2304.03412 arXiv:2304.03412 [eess]."},{"key":"e_1_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACSSC.2003.1292216"},{"key":"e_1_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.1364\/JOSAA.14.002379"},{"key":"e_1_2_1_62_1","volume-title":"DCTune: a technique for visual optimization of DCT quantization matrices for individual images","author":"Watson Andrew B","unstructured":"Andrew B Watson. 1993. DCTune: a technique for visual optimization of DCT quantization matrices for individual images. In Society for Information Display Digest of Technical Papers XXIV. 946--949."},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1038\/302419a0"},{"key":"e_1_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.1167\/jov.20.4.23"},{"key":"e_1_2_1_65_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00558"},{"key":"e_1_2_1_66_1","doi-asserted-by":"publisher","DOI":"10.2352\/ISSN.2470-1173.2018.14.HVEI-517"},{"key":"e_1_2_1_67_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2014.2346028"},{"key":"e_1_2_1_68_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2011.2109730"},{"key":"e_1_2_1_69_1","doi-asserted-by":"publisher","unstructured":"Richard Zhang Phillip Isola Alexei A Efros Eli Shechtman and Oliver Wang. 2018. The Unreasonable Effectiveness of Deep Features as a Perceptual Metric. In CVPR. 586--595. 10.1109\/CVPR.2018.00068","DOI":"10.1109\/CVPR.2018.00068"},{"key":"e_1_2_1_70_1","doi-asserted-by":"publisher","DOI":"10.1889\/1.1985127"}],"container-title":["ACM Transactions on Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3658144","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3658144","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T20:05:54Z","timestamp":1750277154000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3658144"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,7,19]]},"references-count":67,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2024,7,19]]}},"alternative-id":["10.1145\/3658144"],"URL":"https:\/\/doi.org\/10.1145\/3658144","relation":{"is-basis-for":[{"id-type":"doi","id":"10.52843\/cassyni.89170n","asserted-by":"object"}]},"ISSN":["0730-0301","1557-7368"],"issn-type":[{"value":"0730-0301","type":"print"},{"value":"1557-7368","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,7,19]]},"assertion":[{"value":"2024-07-19","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}