{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,13]],"date-time":"2025-10-13T09:06:25Z","timestamp":1760346385085,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":30,"publisher":"ACM","license":[{"start":{"date-parts":[[2014,4,1]],"date-time":"2014-04-01T00:00:00Z","timestamp":1396310400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2014,4]]},"DOI":"10.1145\/2578726.2578739","type":"proceedings-article","created":{"date-parts":[[2014,4,15]],"date-time":"2014-04-15T17:50:38Z","timestamp":1397584238000},"page":"121-128","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":12,"title":["Towards Efficient Learning of Optimal Spatial Bag-of-Words Representations"],"prefix":"10.1145","author":[{"given":"Lu","family":"Jiang","sequence":"first","affiliation":[{"name":"School of Computer Science, Carnegie Mellon University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wei","family":"Tong","sequence":"additional","affiliation":[{"name":"School of Computer Science, Carnegie Mellon University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Deyu","family":"Meng","sequence":"additional","affiliation":[{"name":"School of Mathematics and Statistics, Xi'an Jiaotong University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alexander G.","family":"Hauptmann","sequence":"additional","affiliation":[{"name":"School of Computer Science, Carnegie Mellon University"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2014,4]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"JS Tiling webpage. https:\/\/code.google.com\/p\/learning2tile\/.  JS Tiling webpage. https:\/\/code.google.com\/p\/learning2tile\/."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2010.5539963"},{"key":"e_1_3_2_1_3_1","volume-title":"ICML","author":"Boureau Y.","year":"2010","unstructured":"Y. Boureau , J. Ponce , and Y. LeCun . A theoretical analysis of feature pooling in visual recognition . In ICML , 2010 . Y. Boureau, J. Ponce, and Y. LeCun. A theoretical analysis of feature pooling in visual recognition. In ICML, 2010."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/1961189.1961199"},{"key":"e_1_3_2_1_5_1","volume-title":"MoSift: Recognizing human actions in surveillance videos. Technical report","author":"Chen M.","year":"2009","unstructured":"M. Chen and A. Hauptmann . MoSift: Recognizing human actions in surveillance videos. Technical report , Carnegie Mellon University , 2009 . M. Chen and A. Hauptmann. MoSift: Recognizing human actions in surveillance videos. Technical report, Carnegie Mellon University, 2009."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1093\/comjnl\/31.3.283"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-009-0275-4"},{"key":"e_1_3_2_1_8_1","volume-title":"CVPR","author":"Feng J.","year":"2011","unstructured":"J. Feng , B. Ni , Q. Tian , and S. Yan . Geometric lp-norm feature pooling for image classification . In CVPR , 2011 . J. Feng, B. Ni, Q. Tian, and S. Yan. Geometric lp-norm feature pooling for image classification. In CVPR, 2011."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/2461466.2461482"},{"key":"e_1_3_2_1_10_1","volume-title":"CVPR","author":"Jia Y.","year":"2012","unstructured":"Y. Jia , C. Huang , and T. Darrell . Beyond spatial pyramids: Receptive field learning for pooled image features . In CVPR , 2012 . Y. Jia, C. Huang, and T. Darrell. Beyond spatial pyramids: Receptive field learning for pooled image features. In CVPR, 2012."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/2393347.2393412"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2006.68"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/2324796.2324801"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1023\/B:VISI.0000029664.99615.94"},{"key":"e_1_3_2_1_15_1","volume-title":"PASCAL VOC workshop","author":"Marszalek M.","year":"2007","unstructured":"M. Marszalek and C. Schmid . Learning representations for visual object class recognition . In PASCAL VOC workshop , 2007 . M. Marszalek and C. Schmid. Learning representations for visual object class recognition. In PASCAL VOC workshop, 2007."},{"key":"e_1_3_2_1_16_1","volume-title":"Efficient Generation of Set Partitions. Technical report","author":"Orlov M.","year":"2002","unstructured":"M. Orlov . Efficient Generation of Set Partitions. Technical report , University of ULM , 2002 . M. Orlov. Efficient Generation of Set Partitions. Technical report, University of ULM, 2002."},{"key":"e_1_3_2_1_17_1","volume-title":"NIST","author":"Over P.","year":"2012","unstructured":"P. Over , J. Fiscus , and G. Sanders . Trecvid 2012--an overview to the goals, tasks, data, evaluation mechanisms, and metrics. In TRECVID , NIST , 2012 . P. Over, J. Fiscus, and G. Sanders. Trecvid 2012--an overview to the goals, tasks, data, evaluation mechanisms, and metrics. In TRECVID, NIST, 2012."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.5244\/C.25.6"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.5555\/2354409.2355062"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0021-9800(68)80031-6"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00138-013-0529-6"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2011.5995347"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2009.132"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/1646396.1646441"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.5244\/C.23.124"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2010.5540018"},{"key":"e_1_3_2_1_27_1","volume-title":"CVPR","author":"Yang J.","year":"2009","unstructured":"J. Yang , K. Yu , Y. Gong , and T. Huang . Linear spatial pyramid matching using sparse coding for image classification . In CVPR , 2009 . J. Yang, K. Yu, Y. Gong, and T. Huang. Linear spatial pyramid matching using sparse coding for image classification. In CVPR, 2009."},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.5555\/1888150.1888160"},{"key":"e_1_3_2_1_29_1","volume-title":"NIST","author":"Yu S. I.","year":"2012","unstructured":"S. I. Yu , Z. Xu , D. Ding , W. Sze , F. Vicente , Z. Lan , Y. Cai , S. Rawat , P. Schulam , N. Markandaiah , 2012 multimedia event detection and recounting (med and mer). In TRECVID , NIST , 2012. S. I. Yu, Z. Xu, D. Ding, W. Sze, F. Vicente, Z. Lan, Y. Cai, S. Rawat, P. Schulam, N. Markandaiah, et al. Informedia E-LAMP@ trecvid 2012 multimedia event detection and recounting (med and mer). In TRECVID, NIST, 2012."},{"key":"e_1_3_2_1_30_1","volume-title":"NIST","author":"Zhang L.","year":"2011","unstructured":"L. Zhang , L. Jiang , L. Bao , S. Takahashi , Y. Li , and A. Hauptmann . Informedia@ trecvid 2011: Surveillance event detection. In TRECVID , NIST , 2011 . L. Zhang, L. Jiang, L. Bao, S. Takahashi, Y. Li, and A. Hauptmann. Informedia@ trecvid 2011: Surveillance event detection. In TRECVID, NIST, 2011."}],"event":{"name":"ICMR '14: International Conference on Multimedia Retrieval","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Glasgow United Kingdom","acronym":"ICMR '14"},"container-title":["Proceedings of International Conference on Multimedia Retrieval"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2578726.2578739","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2578726.2578739","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T08:09:50Z","timestamp":1750234190000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2578726.2578739"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,4]]},"references-count":30,"alternative-id":["10.1145\/2578726.2578739","10.1145\/2578726"],"URL":"https:\/\/doi.org\/10.1145\/2578726.2578739","relation":{},"subject":[],"published":{"date-parts":[[2014,4]]},"assertion":[{"value":"2014-04-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}