{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,28]],"date-time":"2025-10-28T10:37:49Z","timestamp":1761647869174},"reference-count":55,"publisher":"Springer Science and Business Media LLC","issue":"2","license":[{"start":{"date-parts":[[2011,7,1]],"date-time":"2011-07-01T00:00:00Z","timestamp":1309478400000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int J Comput Vis"],"published-print":{"date-parts":[[2012,4]]},"DOI":"10.1007\/s11263-011-0479-2","type":"journal-article","created":{"date-parts":[[2011,6,30]],"date-time":"2011-06-30T18:05:19Z","timestamp":1309457119000},"page":"191-209","source":"Crossref","is-referenced-by-count":27,"title":["Accurate Object Recognition with Shape Masks"],"prefix":"10.1007","volume":"97","author":[{"given":"Marcin","family":"Marsza\u0142ek","sequence":"first","affiliation":[]},{"given":"Cordelia","family":"Schmid","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2011,7,1]]},"reference":[{"key":"479_CR1","volume-title":"ECCV","author":"S. Agarwal","year":"2002","unstructured":"Agarwal, S., & Roth, D. (2002). Learning a sparse representation for object detection. In ECCV."},{"issue":"11","key":"479_CR2","doi-asserted-by":"crossref","first-page":"1475","DOI":"10.1109\/TPAMI.2004.108","volume":"26","author":"S. Agarwal","year":"2004","unstructured":"Agarwal, S., Awan, A., & Roth, D. (2004). Learning to detect objects in images via a sparse, part-based representation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(11), 1475\u20131490.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"479_CR3","volume-title":"ECCV","author":"E. Borenstein","year":"2002","unstructured":"Borenstein, E., & Ullman, S. (2002). Class-specific, top-down segmentation. In ECCV."},{"issue":"5","key":"479_CR4","doi-asserted-by":"crossref","first-page":"1055","DOI":"10.1109\/72.788646","volume":"10","author":"O. Chapelle","year":"1999","unstructured":"Chapelle, O., Haffner, P., & Vapnik, V. (1999). Support vector machines for histogram-based image classification. IEEE Transactions on Neural Networks, 10(5), 1055\u20131064.","journal-title":"IEEE Transactions on Neural Networks"},{"key":"479_CR5","volume-title":"ECCV workshop on statistical learning in computer vision","author":"G. Csurka","year":"2004","unstructured":"Csurka, G., Dance, C., Fan, L., Willamowski, J., & Bray, C. (2004). Visual categorization with bags of keypoints. In ECCV workshop on statistical learning in computer vision."},{"key":"479_CR6","volume-title":"ICCV","author":"G. Dork\u00f3","year":"2003","unstructured":"Dork\u00f3, G., & Schmid, C. (2003). Selection of scale-invariant parts for object class recognition. In ICCV."},{"key":"479_CR7","volume-title":"Selected proceedings of the first PASCAL challenges workshop","author":"M. Everingham","year":"2006","unstructured":"Everingham, M., Zisserman, A., Williams, C., & Gool, L.V., et al. (2006). The 2005 PASCAL visual object classes challenge. In Selected proceedings of the first PASCAL challenges workshop."},{"key":"479_CR8","volume-title":"The PASCAL VOC\u201908 challenge workshop in conj. with ECCV","author":"M. Everingham","year":"2008","unstructured":"Everingham, M., van Gool, L., Williams, C., Winn, J., & Zisserman,\u00a0A. (2008). Overview and results of the detection challenge. In The PASCAL VOC\u201908 challenge workshop in conj. with ECCV."},{"key":"479_CR9","unstructured":"Everingham, M., Van Gool, L., Williams, C. K. I., Winn, J., & Zisserman, A. (2009). The PASCAL visual object classes challenge 2009 (VOC2009) results. http:\/\/www.pascal-network.org\/challenges\/VOC\/voc2009\/workshop\/index.html ."},{"issue":"3","key":"479_CR10","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1007\/s11263-006-8707-x","volume":"71","author":"R. Fergus","year":"2007","unstructured":"Fergus, R., Perona, P., & Zisserman, A. (2007). Weakly supervised scale-invariant learning of models for visual recognition. International Journal of Computer Vision, 71(3), 273\u2013303.","journal-title":"International Journal of Computer Vision"},{"issue":"2","key":"479_CR11","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/TPAMI.2004.1307307","volume":"26","author":"C. Fowlkes","year":"2004","unstructured":"Fowlkes, C., Belongie, S., Chung, F., & Malik, J. (2004). Spectral grouping using the Nystr\u00f6m method. IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(2), 1\u201312.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"479_CR12","volume-title":"ICCV","author":"M. Fritz","year":"2005","unstructured":"Fritz, M., Leibe, B., Caputo, B., & Schiele, B. (2005). Integrating representative and discriminant models for object category detection. In ICCV."},{"key":"479_CR13","volume-title":"ICPR","author":"M. Fussenegger","year":"2006","unstructured":"Fussenegger, M., Opelt, A., & Pinz, A. (2006). Object localization\/segmentation using generic shape priors. In ICPR."},{"key":"479_CR14","volume-title":"ECCV","author":"C. Galleguillos","year":"2008","unstructured":"Galleguillos, C., Babenko, B., Rabinovich, A., & Belongie, S. (2008). Weakly supervised object localization with stable segmentations. In ECCV."},{"issue":"2","key":"479_CR15","doi-asserted-by":"crossref","first-page":"163","DOI":"10.1007\/BF00058750","volume":"17","author":"J. G\u00e5rding","year":"1996","unstructured":"G\u00e5rding, J., & Lindeberg, T. (1996). Direct computation of shape cues using scale-adapted spatial derivative operators. International Journal of Computer Vision, 17(2), 163\u2013191.","journal-title":"International Journal of Computer Vision"},{"key":"479_CR16","volume-title":"ICCV","author":"K. Grauman","year":"2005","unstructured":"Grauman, K., & Darrell, T. (2005). The pyramid match kernel: Discriminative classification with sets of image features. In ICCV."},{"key":"479_CR17","volume-title":"CVPR","author":"C. Gu","year":"2009","unstructured":"Gu, C., Lim, J., Arbelaez, P., & Malik, J. (2009). Recognition using regions. In CVPR."},{"key":"479_CR18","volume-title":"ECCV","author":"E. Hayman","year":"2004","unstructured":"Hayman, E., Caputo, B., Fritz, M., & Eklundh, JO (2004). On the significance of real-world conditions for material classification. In ECCV."},{"key":"479_CR19","volume-title":"ICME","author":"F. Jing","year":"2003","unstructured":"Jing, F., Li, M., Zhang, H. J., & Zhang, B. (2003). Support vector machines for region-based image retrieval. In ICME."},{"key":"479_CR20","volume-title":"ICCV","author":"S. Lazebnik","year":"2005","unstructured":"Lazebnik, S., Schmid, C., & Ponce, J. (2005). A maximum entropy framework for part-based texture and object recognition. In ICCV."},{"key":"479_CR21","volume-title":"CVPR","author":"B. Leibe","year":"2005","unstructured":"Leibe, B., Seemann, E., & Schiele, B. (2005). Pedestrian detection in crowded scenes. In CVPR."},{"issue":"1\u20133","key":"479_CR22","doi-asserted-by":"crossref","first-page":"259","DOI":"10.1007\/s11263-007-0095-3","volume":"77","author":"B. Leibe","year":"2008","unstructured":"Leibe, B., Leonardis, A., & Schiele, B. (2008). Robust object detection with interleaved categorization and segmentation. International Journal of Computer Vision, 77(1\u20133), 259\u2013289.","journal-title":"International Journal of Computer Vision"},{"key":"479_CR23","volume-title":"CVPR","author":"L. J. Li","year":"2009","unstructured":"Li, L. J., Socher, R., & Fei-Fei, L. (2009). Towards total scene understanding: classification, annotation and segmentation in an unsupervised framework. In CVPR."},{"issue":"2","key":"479_CR24","doi-asserted-by":"crossref","first-page":"79","DOI":"10.1023\/A:1008045108935","volume":"30","author":"T. Lindeberg","year":"1998","unstructured":"Lindeberg, T. (1998). Feature detection with automatic scale selection. International Journal of Computer Vision, 30(2), 79\u2013116.","journal-title":"International Journal of Computer Vision"},{"issue":"2","key":"479_CR25","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1023\/B:VISI.0000029664.99615.94","volume":"60","author":"D. Lowe","year":"2004","unstructured":"Lowe, D. (2004). Distinctive image features form scale-invariant keypoints. International Journal of Computer Vision, 60(2), 91\u2013110.","journal-title":"International Journal of Computer Vision"},{"key":"479_CR26","volume-title":"CVPR","author":"S. Lyu","year":"2005","unstructured":"Lyu, S. (2005). Mercer kernels for object recognition with local features. In CVPR."},{"key":"479_CR27","volume-title":"Vision","author":"D. Marr","year":"1982","unstructured":"Marr, D. (1982). Vision. New York: Freeman."},{"key":"479_CR28","volume-title":"CVPR","author":"M. Marsza\u0142ek","year":"2006","unstructured":"Marsza\u0142ek, M., & Schmid, C. (2006). Spatial weighting for bag-of-features. In CVPR."},{"key":"479_CR29","volume-title":"CVPR","author":"M. Marsza\u0142ek","year":"2007","unstructured":"Marsza\u0142ek, M., & Schmid, C. (2007). Accurate object localization with shape masks. In CVPR."},{"issue":"1","key":"479_CR30","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1023\/B:VISI.0000027790.02288.f2","volume":"60","author":"K. Mikolajczyk","year":"2004","unstructured":"Mikolajczyk, K., & Schmid, C. (2004). Scale and affine invariant interest point detectors. International Journal of Computer Vision, 60(1), 63\u201386.","journal-title":"International Journal of Computer Vision"},{"key":"479_CR31","volume-title":"SCIA","author":"A. Opelt","year":"2005","unstructured":"Opelt, A., & Pinz, A. (2005). Object localization with boosting and weak supervision for generic object recognition. In SCIA."},{"key":"479_CR32","unstructured":"Opelt, A., Fussenegger, M., Pinz, A., & Auer, P. (2004a). Generic object recognition with boosting. Tech. rep. TR-EMT-2004-01, TU Graz."},{"key":"479_CR33","volume-title":"ECCV","author":"A. Opelt","year":"2004","unstructured":"Opelt, A., Fussenegger, M., Pinz, A., & Auer, P. (2004b). Weak hypotheses and boosting for generic object detection and recognition. In ECCV."},{"issue":"3","key":"479_CR34","doi-asserted-by":"crossref","first-page":"416","DOI":"10.1109\/TPAMI.2006.54","volume":"28","author":"A. Opelt","year":"2006","unstructured":"Opelt, A., Pinz, A., Fussenegger, M., & Auer, P. (2006). Generic object recognition with boosting. IEEE Transactions on Pattern Analysis and Machine Intelligence, 28(3), 416\u2013431.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"479_CR35","doi-asserted-by":"crossref","first-page":"105","DOI":"10.1111\/1467-8721.ep10770552","volume":"3","author":"M. Peterson","year":"1994","unstructured":"Peterson, M. (1994). Object recognition processes can and do operate before figure-ground organization. Current Directions in Psychological Science, 3, 105\u2013111.","journal-title":"Current Directions in Psychological Science"},{"key":"479_CR36","volume-title":"CVPR","author":"D. Ramanan","year":"2007","unstructured":"Ramanan, D. (2007). Using segmentation to verify object hypotheses. In CVPR."},{"key":"479_CR37","volume-title":"CVPR","author":"F. Rothganger","year":"2003","unstructured":"Rothganger, F., Lazebnik, S., Schmid, C., & Ponce, J. (2003). 3D object modeling and recognition using affine-invariant patches and multi-view spatial constraints. In CVPR."},{"issue":"1","key":"479_CR38","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1109\/34.655647","volume":"20","author":"H. Rowley","year":"1998","unstructured":"Rowley, H., Baluja, S., & Kanade, T. (1998). Neural networks based face detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 20(1), 22\u201338.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"issue":"2","key":"479_CR39","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1023\/A:1026543900054","volume":"40","author":"Y. Rubner","year":"2000","unstructured":"Rubner, Y., Tomasi, C., & Guibas, L. (2000). The Earth Mover\u2019s distance as a metric for image retrieval. International Journal of Computer Vision, 40(2), 99\u2013121.","journal-title":"International Journal of Computer Vision"},{"key":"479_CR40","volume-title":"CVPR","author":"B. Russell","year":"2006","unstructured":"Russell, B., Efros, A., Sivic, J., Freeman, W., & Zisserman, A. (2006). Using multiple segmentations to discover objects and their extents in image collections. In CVPR."},{"key":"479_CR41","volume-title":"Learning with kernels: support vector machines, regularization, optimization and beyond","author":"B. Sch\u00f6lkopf","year":"2002","unstructured":"Sch\u00f6lkopf, B., & Smola, A. (2002). Learning with kernels: support vector machines, regularization, optimization and beyond. Cambridge: MIT Press."},{"key":"479_CR42","volume-title":"DAGM","author":"E. Seemann","year":"2006","unstructured":"Seemann, E., & Schiele, B. (2006). Cross-articulation learning for robust detection of pedestrians. In DAGM."},{"key":"479_CR43","volume-title":"CVPR","author":"E. Seemann","year":"2006","unstructured":"Seemann, E., Leibe, B., & Schiele, B. (2006). Multi-aspect detection of articulated objects. In CVPR."},{"key":"479_CR44","volume-title":"ICCV","author":"J. Shotton","year":"2005","unstructured":"Shotton, J., Blake, A., & Cipolla, R. (2005). Contour-based learning for object detection. In ICCV."},{"key":"479_CR45","volume-title":"CVPR","author":"J. Shotton","year":"2008","unstructured":"Shotton, J., Johnson, M., & Cipolla, R. (2008). Semantic texton forests for image categorization and segmentation. In CVPR."},{"key":"479_CR46","volume-title":"ICCV","author":"J. Sivic","year":"2003","unstructured":"Sivic, J., & Zisserman, A. (2003). Video Google: a text retrieval approach to object matching in videos. In ICCV."},{"key":"479_CR47","volume-title":"ICCV","author":"J. Sivic","year":"2005","unstructured":"Sivic, J., Russell, B., Efros, A., Zisserman, A., & Freeman, W. (2005). Discovering objects and their location in images. In ICCV."},{"key":"479_CR48","volume-title":"CVPR","author":"A. Thomas","year":"2006","unstructured":"Thomas, A., Ferrari, V., Leibe, B., Tuytelaars, T., Schiele, B., & Gool, L.\u00a0V. (2006). Towards multi-view object class detection. In CVPR."},{"key":"479_CR49","volume-title":"CVPR","author":"S. Todorovic","year":"2006","unstructured":"Todorovic, S., & Ahuja, N. (2006). Extracting subimages of an unknown category from a set of images. In CVPR."},{"issue":"2","key":"479_CR50","doi-asserted-by":"crossref","first-page":"441","DOI":"10.1037\/0096-1523.24.2.441","volume":"24","author":"S. Vecera","year":"1998","unstructured":"Vecera, S. (1998). Figure-ground organization and object recognition processes: an interactive account. Journal of Experimental Psychology. Human Perception and Performance, 24(2), 441\u2013462.","journal-title":"Journal of Experimental Psychology. Human Perception and Performance"},{"issue":"2","key":"479_CR51","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1023\/B:VISI.0000013087.49260.fb","volume":"57","author":"P. Viola","year":"2004","unstructured":"Viola, P., & Jones, M. (2004). Robust real-time object detection. International Journal of Computer Vision, 57(2), 137\u2013154.","journal-title":"International Journal of Computer Vision"},{"key":"479_CR52","volume-title":"ICCV","author":"J. Winn","year":"2005","unstructured":"Winn, J., & Joijic, N. (2005). LOCUS: learning object classes with unsupervised segmentation. In ICCV."},{"key":"479_CR53","volume-title":"CVPR","author":"B. Wu","year":"2007","unstructured":"Wu, B., & Nevatia, R. (2007). Simultaneous object detection and segmentation by boosting local shape feature based classifier. In CVPR."},{"key":"479_CR54","volume-title":"CVPR","author":"S. Yu","year":"2003","unstructured":"Yu, S., & Shi, J. (2003). Object-specific figure-ground segregation. In CVPR."},{"issue":"2","key":"479_CR55","doi-asserted-by":"crossref","first-page":"213","DOI":"10.1007\/s11263-006-9794-4","volume":"73","author":"J. Zhang","year":"2007","unstructured":"Zhang, J., Marsza\u0142ek, M., Lazebnik, S., & Schmid, C. (2007). Local features and kernels for classification of texture and object categories: a comprehensive study. International Journal of Computer Vision, 73(2), 213\u2013238.","journal-title":"International Journal of Computer Vision"}],"container-title":["International Journal of Computer Vision"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s11263-011-0479-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s11263-011-0479-2\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s11263-011-0479-2","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,6,1]],"date-time":"2019-06-01T12:16:47Z","timestamp":1559391407000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s11263-011-0479-2"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,7,1]]},"references-count":55,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2012,4]]}},"alternative-id":["479"],"URL":"https:\/\/doi.org\/10.1007\/s11263-011-0479-2","relation":{},"ISSN":["0920-5691","1573-1405"],"issn-type":[{"value":"0920-5691","type":"print"},{"value":"1573-1405","type":"electronic"}],"subject":[],"published":{"date-parts":[[2011,7,1]]}}}