{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,7]],"date-time":"2026-05-07T15:43:20Z","timestamp":1778168600682,"version":"3.51.4"},"reference-count":65,"publisher":"Springer Science and Business Media LLC","issue":"3","license":[{"start":{"date-parts":[[2012,5,30]],"date-time":"2012-05-30T00:00:00Z","timestamp":1338336000000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int J Comput Vis"],"published-print":{"date-parts":[[2012,12]]},"DOI":"10.1007\/s11263-012-0538-3","type":"journal-article","created":{"date-parts":[[2012,5,29]],"date-time":"2012-05-29T15:56:47Z","timestamp":1338307007000},"page":"275-293","source":"Crossref","is-referenced-by-count":223,"title":["Weakly Supervised Localization and Learning with Generic Knowledge"],"prefix":"10.1007","volume":"100","author":[{"given":"Thomas","family":"Deselaers","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Bogdan","family":"Alexe","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Vittorio","family":"Ferrari","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2012,5,30]]},"reference":[{"key":"538_CR1","volume-title":"ECCV","author":"B. Alexe","year":"2010","unstructured":"Alexe, B., Deselaers, T., & Ferrari, V. (2010a). ClassCut for unsupervised class segmentation. In ECCV."},{"key":"538_CR2","volume-title":"CVPR","author":"B. Alexe","year":"2010","unstructured":"Alexe, B., Deselaers, T., & Ferrari, V. (2010b). What is an object? In CVPR."},{"key":"538_CR3","doi-asserted-by":"crossref","unstructured":"Alexe, B., Deselaers, T., & Ferrari, V. (2012). Measuring the objectness of image windows. IEEE Transactions on Pattern Analysis and Machine Intelligence.","DOI":"10.1109\/TPAMI.2012.28"},{"key":"538_CR4","volume-title":"NIPS","author":"S. Andrews","year":"2002","unstructured":"Andrews, S., Tsochantaridis, I., & Hofmann, T. (2002). Support vector machines for multiple-instance learning. In NIPS."},{"key":"538_CR5","volume-title":"CVPR","author":"H. Arora","year":"2007","unstructured":"Arora, H., Loeff, N., Forsyth, D., & Ahuja, N. (2007). Unsupervised segmentation of objects using efficient learning. In CVPR."},{"key":"538_CR6","volume-title":"ICCV","author":"B. Babenko","year":"2009","unstructured":"Babenko, B., Branson, S., & Belongie, S. (2009). Similarity metrics for categorization: From monolithic to category specific. In ICCV."},{"key":"538_CR7","volume-title":"CVPR","author":"S. Bagon","year":"2010","unstructured":"Bagon, S., Brostovski, O., Galun, M., & Irani, M. (2010). Detecting and sketching the common. In CVPR."},{"key":"538_CR8","volume-title":"CVIU","author":"H. Bay","year":"2008","unstructured":"Bay, H., Ess, A., Tuytelaars, T., & van Gool, L. (2008). SURF: speeded up robust features. In CVIU."},{"key":"538_CR9","volume-title":"NIPS","author":"B. Blaschko","year":"2010","unstructured":"Blaschko, B., Vedaldi, A., & Zisserman, A. (2010). Simultaneous object detection and ranking with weak supervision. In NIPS."},{"key":"538_CR10","volume-title":"ECCV","author":"E. Borenstein","year":"2004","unstructured":"Borenstein, E., & Ullman, S. (2004). Learning to segment. In ECCV."},{"key":"538_CR11","volume-title":"ICCV","author":"L. Cao","year":"2007","unstructured":"Cao, L., & Li, F. F. (2007). Spatially coherent latent topic model for concurrent segmentation and classification of objects and scene. In ICCV."},{"key":"538_CR12","volume-title":"CVPR","author":"J. Carreira","year":"2010","unstructured":"Carreira, J., Li, F., & Sminchisescu, C. (2010). Constrained parametric min cuts for automatic object segmentation. In CVPR."},{"issue":"12","key":"538_CR13","doi-asserted-by":"crossref","first-page":"1931","DOI":"10.1109\/TPAMI.2006.248","volume":"28","author":"Y. Chen","year":"2006","unstructured":"Chen, Y., Bi, J., & Wang, J. Z. (2006). MILES: multiple-instance learning via embedded instance selection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 28(12), 1931\u20131947.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"538_CR14","volume-title":"CVPR","author":"O. Chum","year":"2007","unstructured":"Chum, O., & Zisserman, A. (2007). An exemplar model for learning object classes. In CVPR."},{"key":"538_CR15","volume-title":"ECCV","author":"D. J. Crandall","year":"2006","unstructured":"Crandall, D. J., & Huttenlocher, D. (2006). Weakly supervised learning of part-based spatial models for visual object recognition. In ECCV."},{"key":"538_CR16","volume-title":"CVPR","author":"N. Dalal","year":"2005","unstructured":"Dalal, N., & Triggs, B. (2005). Histogram of Oriented Gradients for human detection. In CVPR."},{"key":"538_CR17","volume-title":"ICML","author":"T. Deselaers","year":"2010","unstructured":"Deselaers, T., & Ferrari, V. (2010). A conditional random field for multiple-instance learning. In ICML."},{"key":"538_CR18","volume-title":"ECCV","author":"T. Deselaers","year":"2010","unstructured":"Deselaers, T., Alexe, B., & Ferrari, V. (2010). Localizing objects while learning their appearance. In ECCV."},{"key":"538_CR19","unstructured":"Dork\u00f3, G., & Schmid, C. (2005). Object class recognition using discriminative local features. Tech. Rep. RR-5497, INRIA, Rhone-Alpes."},{"key":"538_CR20","volume-title":"ECCV","author":"I. Endres","year":"2010","unstructured":"Endres, I., & Hoiem, D. (2010). Category independent object proposals. In ECCV."},{"key":"538_CR21","unstructured":"Everingham, M., Van Gool, L., Williams, C. K. I., & Zisserman, A. (2006). The PASCAL Visual Object Classes Challenge 2006 (VOC2006). http:\/\/pascallin.ecs.soton.ac.uk\/challenges\/VOC\/voc2006\/ ."},{"key":"538_CR22","unstructured":"Everingham, M., Van Gool, L., Williams, C., Winn, J., & Zisserman, A. (2007). The PASCAL Visual Object Classes Challenge 2007 Results."},{"key":"538_CR23","unstructured":"Everingham, M., et al. (2010). The PASCAL Visual Object Classes Challenge 2010 Results."},{"key":"538_CR24","first-page":"1134","volume-title":"ICCV","author":"L. Fei-Fei","year":"2003","unstructured":"Fei-Fei, L., Fergus, R., & Perona, P. (2003). A bayesian approach to unsupervised one-shot learning of object categories. In ICCV (pp.\u00a01134\u20131141)."},{"key":"538_CR25","volume-title":"CVPR workshop of generative model based vision","author":"L. Fei-Fei","year":"2004","unstructured":"Fei-Fei, L., Fergus, R., & Perona, P. (2004). Learning generative visual models from few training examples: an incremental bayesian approach tested on 101 object categories. In CVPR workshop of generative model based vision."},{"issue":"9","key":"538_CR26","doi-asserted-by":"crossref","first-page":"1627","DOI":"10.1109\/TPAMI.2009.167","volume":"32","author":"P. Felzenszwalb","year":"2010","unstructured":"Felzenszwalb, P., Girshick, R., McAllester, D., & Ramanan, D. (2010). Object detection with discriminatively trained part based models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32(9), 1627\u20131645.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"538_CR27","volume-title":"CVPR","author":"R. Fergus","year":"2003","unstructured":"Fergus, R., Perona, P., & Zisserman, A. (2003). Object class recognition by unsupervised scale-invariant learning. In CVPR."},{"key":"538_CR28","volume-title":"ICML","author":"T. Finley","year":"2008","unstructured":"Finley, T., & Joachims, T. (2008). Training structural svms when exact inference is intractable. In ICML."},{"key":"538_CR29","volume-title":"DAGM","author":"M. Fritz","year":"2006","unstructured":"Fritz, M., & Schiele, B. (2006). Towards unsupervised discovery of visual categories. In DAGM."},{"key":"538_CR30","volume-title":"ICCV","author":"A. Frome","year":"2007","unstructured":"Frome, A., Singer, Y., Sha, F., & Malik, J. (2007). Learning globally-consistent local distance functions for shape-based image retrieval and classification. In ICCV."},{"key":"538_CR31","volume-title":"BMVC","author":"A. Gaidon","year":"2009","unstructured":"Gaidon, A., Marszalek, M., & Schmid, C. (2009). Mining visual actions from movies. In BMVC."},{"key":"538_CR32","volume-title":"ECCV","author":"C. Galleguillos","year":"2008","unstructured":"Galleguillos, C., Babenko, B., Rabinovich, A., & Belongie, S. (2008). Weakly supervised object localization with stable segmentations. In ECCV."},{"key":"538_CR33","volume-title":"CVPR","author":"K. Grauman","year":"2006","unstructured":"Grauman, K., & Darrell, T. (2006). Unsupervised learning of categories from sets of partially matching image features. In CVPR."},{"key":"538_CR34","volume-title":"NIPS","author":"G. Kim","year":"2009","unstructured":"Kim, G., & Torralba, A. (2009). Unsupervised detection of regions of interest using iterative link analysis. In NIPS."},{"issue":"10","key":"538_CR35","doi-asserted-by":"crossref","first-page":"1568","DOI":"10.1109\/TPAMI.2006.200","volume":"28","author":"V. Kolmogorov","year":"2006","unstructured":"Kolmogorov, V. (2006a). Convergent tree-reweighted message passing for energy minimization. IEEE Transactions on Pattern Analysis and Machine Intelligence, 28(10), 1568\u20131583.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"issue":"10","key":"538_CR36","doi-asserted-by":"crossref","first-page":"1568","DOI":"10.1109\/TPAMI.2006.200","volume":"28","author":"V. Kolmogorov","year":"2006","unstructured":"Kolmogorov, V. (2006b). Convergent tree-reweighted message passing for energy minimization. IEEE Transactions on Pattern Analysis and Machine Intelligence, 28(10), 1568\u20131583.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"538_CR37","volume-title":"CVPR","author":"C. Lampert","year":"2009","unstructured":"Lampert, C., Nickisch, H., & Harmeling, S. (2009a). Learning to detect unseen object classes by between-class attribute transfer. In CVPR."},{"issue":"12","key":"538_CR38","doi-asserted-by":"crossref","first-page":"2129","DOI":"10.1109\/TPAMI.2009.144","volume":"31","author":"C. H. Lampert","year":"2009","unstructured":"Lampert, C. H., Blaschko, M. B., & Hofmann, T. (2009b). Efficient subwindow search: A branch and bound framework for object localization. IEEE Transactions on Pattern Analysis and Machine Intelligence, 31(12), 2129\u20132142.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"538_CR39","unstructured":"Lando, M., & Edelman, S. (1995). Generalization from a single view in face recognition. (technical report cs-tr 95-02). The Weizmann Institute of Science."},{"key":"538_CR40","volume-title":"CVPR","author":"Y. Lee","year":"2009","unstructured":"Lee, Y., & Grauman, K. (2009a). Shape discovery from unlabeled image collections. In CVPR."},{"key":"538_CR41","doi-asserted-by":"crossref","first-page":"143","DOI":"10.1007\/s11263-009-0252-y","volume":"85","author":"Y. J. Lee","year":"2009","unstructured":"Lee, Y. J., & Grauman, K. (2009b). Foreground focus: unsupervised learning from partially matching images. International Journal of Computer Vision, 85, 143\u2013166.","journal-title":"International Journal of Computer Vision"},{"key":"538_CR42","volume-title":"CVPR","author":"T. Malisiewicz","year":"2008","unstructured":"Malisiewicz, T., & Efros, A. A. (2008). Recognition by association via learning per-exemplar distances. In CVPR."},{"key":"538_CR43","volume-title":"ICCV","author":"M. Nguyen","year":"2009","unstructured":"Nguyen, M., Torresani, L., de la Torre, F., & Rother, C. (2009). Weakly supervised discriminative localization and classification: a joint learning process. In ICCV."},{"key":"538_CR44","volume-title":"CVPR","author":"E. Nowak","year":"2007","unstructured":"Nowak, E., & Jurie, F. (2007). Learning visual similarity measures for comparing never seen objects. In CVPR."},{"issue":"3","key":"538_CR45","doi-asserted-by":"crossref","first-page":"145","DOI":"10.1023\/A:1011139631724","volume":"42","author":"A. Oliva","year":"2001","unstructured":"Oliva, A., & Torralba, A. (2001). Modeling the shape of the scene: a holistic representation of the spatial envelope. International Journal of Computer Vision, 42(3), 145\u2013175.","journal-title":"International Journal of Computer Vision"},{"key":"538_CR46","volume-title":"ECCV","author":"N. Payet","year":"2010","unstructured":"Payet, N., & Todorovic, S. (2010). From a set of shapes to object discovery. In ECCV."},{"key":"538_CR47","volume-title":"CVPR","author":"A. Quattoni","year":"2008","unstructured":"Quattoni, A., Collins, M., & Darrell, T. (2008). Transfer learning for image classification with sparse prototype representations. In CVPR."},{"key":"538_CR48","volume-title":"ICML","author":"R. Raina","year":"2007","unstructured":"Raina, R., Battle, A., Lee, H., Packer, B., & Ng, A. (2007). Self-taught learning: transfer learning from unlabeled data. In ICML."},{"key":"538_CR49","volume-title":"NIPS","author":"D. Ramanan","year":"2006","unstructured":"Ramanan, D. (2006). Learning to parse images of articulated bodies. In NIPS."},{"key":"538_CR50","volume-title":"CVPR","author":"M. Rohrbach","year":"2010","unstructured":"Rohrbach, M., Stark, M., Szarvas, G., Gurevych, I., & Schiele, B. (2010). What helps where\u2014and why? semantic relatedness for knowledge transfer. In CVPR."},{"issue":"3","key":"538_CR51","first-page":"309","volume":"23","author":"C. Rother","year":"2004","unstructured":"Rother, C., Kolmogorov, V., & Blake, A. (2004). Grabcut: interactive foreground extraction using iterated graph cuts. Computer Graphics, 23(3), 309\u2013314.","journal-title":"Computer Graphics"},{"issue":"1\u20133","key":"538_CR52","doi-asserted-by":"crossref","first-page":"157","DOI":"10.1007\/s11263-007-0090-8","volume":"77","author":"B. C. Russel","year":"2008","unstructured":"Russel, B. C., & Torralba, A. (2008). LabelMe: a database and web-based tool for image annotation. International Journal of Computer Vision, 77(1\u20133), 157\u2013173.","journal-title":"International Journal of Computer Vision"},{"key":"538_CR53","volume-title":"CVPR","author":"B. C. Russell","year":"2006","unstructured":"Russell, B. C., Efros, A. A., Sivic, J., Freeman, W. T., & Zisserman, A. (2006). Using multiple segmentations to discover objects and their extent in image collections. In CVPR."},{"key":"538_CR54","volume-title":"ICCV","author":"M. Stark","year":"2009","unstructured":"Stark, M., Goesele, M., & Schiele, B. (2009). A shape-based object class model for knowledge transfer. In ICCV."},{"key":"538_CR55","volume-title":"ECCV","author":"M. Szummer","year":"2008","unstructured":"Szummer, M., Kohli, P., & Hoiem, D. (2008). Learning CRFs using graph cuts. In ECCV."},{"key":"538_CR56","volume-title":"NIPS","author":"S. Thrun","year":"1996","unstructured":"Thrun, S. (1996). Is learning the n-th thing any easier than learning the first? In NIPS."},{"key":"538_CR57","volume-title":"CVPR","author":"S. Todorovic","year":"2006","unstructured":"Todorovic, S., & Ahuja, N. (2006). Extracting subimages of an unknown category from a set of images. In CVPR."},{"key":"538_CR58","volume-title":"BMVC","author":"T. Tommasi","year":"2009","unstructured":"Tommasi, T., & Caputo, B. (2009). The more you know, the less you learn: from knowledge transfer to one-shot learning of object categories. In BMVC."},{"key":"538_CR59","volume-title":"CVPR","author":"T. Tommasi","year":"2010","unstructured":"Tommasi, T., Orabona, F., & Caputo, B. (2010). Safety in numbers: learning categories from few examples with multi model knowledge transfer. In CVPR."},{"key":"538_CR60","volume-title":"ECCV","author":"L. Torresani","year":"2010","unstructured":"Torresani, L., Szummer, M., & Fitzgibbon, A. (2010). Efficient object category recognition using classemes. In ECCV."},{"key":"538_CR61","first-page":"1453","volume":"6","author":"I. Tsochantaridis","year":"2005","unstructured":"Tsochantaridis, I., Joachims, T., Hofmann, T., & Altun, Y. (2005). Large margin methods for structured and interdependent output variables. Journal of Machine Learning Research, 6, 1453\u20131484.","journal-title":"Journal of Machine Learning Research"},{"key":"538_CR62","volume-title":"NIPS","author":"P. A. Viola","year":"2005","unstructured":"Viola, P. A., Platt, J., & Zhang, C. (2005). Multiple instance boosting for object detection. In NIPS."},{"key":"538_CR63","volume-title":"NIPS","author":"K. Q. Weinberger","year":"2005","unstructured":"Weinberger, K. Q., Blitzer, J., & Saul, L. K. (2005). Distance metric learning for large margin nearest neighbor classification. In NIPS."},{"key":"538_CR64","volume-title":"ICCV","author":"J. Winn","year":"2005","unstructured":"Winn, J., & Jojic, N. (2005a). LOCUS: learning object classes with unsupervised segmentation. In ICCV."},{"issue":"2","key":"538_CR65","doi-asserted-by":"crossref","first-page":"213","DOI":"10.1007\/s11263-006-9794-4","volume":"73","author":"J. Zhang","year":"2007","unstructured":"Zhang, J., Marszalek, M., Lazebnik, S., & Schmid, C. (2007). Local features and kernels for classification of texture and object categories: a comprehensive study. International Journal of Computer Vision, 73(2), 213\u2013238","journal-title":"International Journal of Computer Vision"}],"container-title":["International Journal of Computer Vision"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s11263-012-0538-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s11263-012-0538-3\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s11263-012-0538-3","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,6,29]],"date-time":"2019-06-29T02:35:33Z","timestamp":1561775733000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s11263-012-0538-3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,5,30]]},"references-count":65,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2012,12]]}},"alternative-id":["538"],"URL":"https:\/\/doi.org\/10.1007\/s11263-012-0538-3","relation":{},"ISSN":["0920-5691","1573-1405"],"issn-type":[{"value":"0920-5691","type":"print"},{"value":"1573-1405","type":"electronic"}],"subject":[],"published":{"date-parts":[[2012,5,30]]}}}