{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T16:40:57Z","timestamp":1777653657849,"version":"3.51.4"},"reference-count":40,"publisher":"Association for Computing Machinery (ACM)","issue":"6","license":[{"start":{"date-parts":[[2011,12,1]],"date-time":"2011-12-01T00:00:00Z","timestamp":1322697600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000006","name":"Office of Naval Research","doi-asserted-by":"publisher","award":["N000141010766"],"award-info":[{"award-number":["N000141010766"]}],"id":[{"id":"10.13039\/100000006","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Graph."],"published-print":{"date-parts":[[2011,12]]},"abstract":"<jats:p>\n            The goal of this work is to find\n            <jats:italic>visually similar<\/jats:italic>\n            images even if they appear quite different at the raw pixel level. This task is particularly important for matching images across visual domains, such as photos taken over different seasons or lighting conditions, paintings, hand-drawn sketches, etc. We propose a surprisingly simple method that estimates the relative importance of different features in a query image based on the notion of \"data-driven uniqueness\". We employ standard tools from discriminative object detection in a novel way, yielding a generic approach that does not depend on a particular image representation or a specific visual domain. Our approach shows good performance on a number of difficult cross-domain visual tasks e.g., matching paintings or sketches to real photographs. 
The method also allows us to demonstrate novel applications such as\n            <jats:italic>Internet re-photography<\/jats:italic>\n            , and painting2gps. While at present the technique is too computationally intensive to be practical for interactive image retrieval, we hope that some of the ideas will eventually become applicable to that domain as well.\n          <\/jats:p>","DOI":"10.1145\/2070781.2024188","type":"journal-article","created":{"date-parts":[[2011,11,30]],"date-time":"2011-11-30T13:58:46Z","timestamp":1322661526000},"page":"1-10","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":129,"title":["Data-driven visual similarity for cross-domain image matching"],"prefix":"10.1145","volume":"30","author":[{"given":"Abhinav","family":"Shrivastava","sequence":"first","affiliation":[{"name":"Carnegie Mellon University"}]},{"given":"Tomasz","family":"Malisiewicz","sequence":"additional","affiliation":[{"name":"MIT"}]},{"given":"Abhinav","family":"Gupta","sequence":"additional","affiliation":[{"name":"Carnegie Mellon University"}]},{"given":"Alexei A.","family":"Efros","sequence":"additional","affiliation":[{"name":"Carnegie Mellon University"}]}],"member":"320","published-online":{"date-parts":[[2011,12,12]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/1805964.1805968"},{"key":"e_1_2_1_2_1","unstructured":"Baeza-Yates R. A. and Ribeiro-Neto B. 1999. Modern Information Retrieval. Addison-Wesley Longman Publishing."},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-006-0009-9"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2005.38"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/1961189.1961199"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/1618452.1618470"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1399504.1360660"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2005.177"},{"key":"e_1_2_1_9_1","doi-asserted-by":"crossref","unstructured":"Dale K. Johnson M. K. Sunkavalli K. Matusik W. and Pfister H. 2009. Image restoration using online photo collections. In ICCV.","DOI":"10.1109\/ICCV.2009.5459473"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/1348246.1348248"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/383259.383296"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2010.266"},{"key":"e_1_2_1_13_1","unstructured":"Everingham M. Gool L. V. Williams C. K. I. Winn J. and Zisserman A. 2007. The PASCAL Visual Object Classes Challenge."},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/38.988747"},{"key":"e_1_2_1_15_1","doi-asserted-by":"crossref","unstructured":"HaCohen Y. Fattal R. and Lischinski D. 2010. Image upsampling via texture hallucination. In ICCP.","DOI":"10.1109\/ICCPHOT.2010.5585097"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/1276377.1276382"},{"key":"e_1_2_1_17_1","doi-asserted-by":"crossref","unstructured":"Hays J. and Efros A. A. 
2008. im2gps: estimating geographic information from a single image. In CVPR.","DOI":"10.1109\/CVPR.2008.4587784"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/383259.383295"},{"key":"e_1_2_1_19_1","unstructured":"Hoiem D. Sukthankar R. Schneiderman H. and Huston L. 2004. Object-based image retrieval using the statistical structure of images. In CVPR."},{"key":"e_1_2_1_20_1","doi-asserted-by":"crossref","unstructured":"Itti L. and Koch C. 2000. A saliency-based search mechanism for overt and covert shifts of visual attention. Vision Research.","DOI":"10.1016\/S0042-6989(99)00163-7"},{"key":"e_1_2_1_21_1","doi-asserted-by":"crossref","unstructured":"J\u00e9gou H. Douze M. and Schmid C. 2008. Hamming embedding and weak geometric consistency for large scale image search. In ECCV.","DOI":"10.1007\/978-3-540-88682-2_24"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2010.233"},{"key":"e_1_2_1_23_1","doi-asserted-by":"crossref","unstructured":"Judd T. Ehinger K. Durand F. and Torralba A. 2009. Learning to predict where humans look. In ICCV.","DOI":"10.1109\/ICCV.2009.5459462"},{"key":"e_1_2_1_24_1","volume-title":"Proceedings of the IEEE.","author":"Kaneva B."},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/1964921.1964956"},{"key":"e_1_2_1_26_1","doi-asserted-by":"crossref","unstructured":"Lazebnik S. 
Schmid C. and Ponce J. 2009. Spatial pyramid matching. In Object Categorization: Computer and Human Vision Perspectives. Cambridge University Press.","DOI":"10.1017\/CBO9780511635465.022"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1023\/B:VISI.0000029664.99615.94"},{"key":"e_1_2_1_28_1","unstructured":"Malisiewicz T. and Efros A. A. 2009. Beyond categories: The visual memex model for reasoning about object relationships. In NIPS."},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2011.6126229"},{"key":"e_1_2_1_30_1","doi-asserted-by":"crossref","unstructured":"Oliva A. and Torralba A. 2006. Building the gist of a scene: the role of global image features in recognition. Progress in Brain Research.","DOI":"10.1016\/S0079-6123(06)55002-2"},{"key":"e_1_2_1_31_1","doi-asserted-by":"crossref","unstructured":"Russell B. C. Sivic J. Ponce J. and Dessales H. 2011. Automatic alignment of paintings and photographs depicting a 3d scene. In 3D Representation and Recognition (3dRR).","DOI":"10.1109\/ICCVW.2011.6130291"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/344779.345012"},{"key":"e_1_2_1_33_1","doi-asserted-by":"crossref","unstructured":"Shechtman E. and Irani M. 2007. Matching local self-similarities across images and videos. In CVPR.","DOI":"10.1109\/CVPR.2007.383198"},{"key":"e_1_2_1_34_1","doi-asserted-by":"crossref","unstructured":"Sivic J. and Zisserman A. 2003. Video google: A text retrieval approach to object matching in videos. In ICCV.","DOI":"10.1109\/ICCV.2003.1238663"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/1360612.1360614"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1023\/B:VISI.0000004830.93820.78"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2008.128"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2007.60"},{"key":"e_1_2_1_39_1","doi-asserted-by":"crossref","unstructured":"Whyte O. Sivic J. and Zisserman A. 2009. Get out of my picture! internet-based inpainting. In BMVC.","DOI":"10.5244\/C.23.116"},{"key":"e_1_2_1_40_1","doi-asserted-by":"crossref","unstructured":"Wolf L. Hassner T. and Taigman Y. 2009. The one-shot similarity kernel. In ICCV.","DOI":"10.1109\/ICCV.2009.5459323"}],"container-title":["ACM Transactions on Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2070781.2024188","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2070781.2024188","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2070781.2024188","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T10:06:03Z","timestamp":1750241163000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2070781.2024188"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,12]]},"references-count":40,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2011,12]]}},"alternative-id":["10.1145\/2070781.2024188"],"URL":"https:\/\/doi.org\/10.1145\/2070781.2024188","relation":{},"ISSN":["0730-0301","1557-7368"],"issn-type":[{"value":"0730-0301","type":"print"},{"value":"1557-7368","type":"electronic"}],"subject":[],"published":{"date-parts":[[2011,12]]},"assertion":[{"value":"2011-12-12","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}