{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:13:46Z","timestamp":1750220026471,"version":"3.41.0"},"reference-count":67,"publisher":"Association for Computing Machinery (ACM)","issue":"3s","license":[{"start":{"date-parts":[[2023,2,24]],"date-time":"2023-02-24T00:00:00Z","timestamp":1677196800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100004733","name":"University of Macau","doi-asserted-by":"crossref","award":["MYRG2018-00053-FST"],"award-info":[{"award-number":["MYRG2018-00053-FST"]}],"id":[{"id":"10.13039\/501100004733","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/100012542","name":"Science and Technology Project of Sichuan","doi-asserted-by":"crossref","award":["2020YFG0459, 2021YFG0314, 2022ZHCG0033"],"award-info":[{"award-number":["2020YFG0459, 2021YFG0314, 2022ZHCG0033"]}],"id":[{"id":"10.13039\/100012542","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["U19A2078"],"award-info":[{"award-number":["U19A2078"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2023,10,31]]},"abstract":"<jats:p>Dictionary-based Classification (DC) has been a promising learning theory in multimedia computing. Previous studies focused on learning a discriminative dictionary as well as the sparsest representation based on the dictionary, to cope with the complex conditions in real-world applications. However, robustness by learning only one single dictionary is far from the optimal level. What is worse, it cannot take advantage of the available techniques proven in modern machine learning, like data augmentation, to mitigate the same problem. In this work, we propose a novel method that utilizes joint Augmented and Compressed Dictionaries for Robust Dictionary-based Classification (ACD-RDC). For optimization under the noise model introduced by real-world conditions, the objective function of ACD-RDC incorporates only two simple, but well-designed constraints, including one enhanced sparsity constraint by the general data augmentation, which requires less case-by-case and sophisticated tuning, and another discriminative constraint solved by a jointly learned dictionary. The optimization of the objective function is then deduced theoretically to an approximate linear problem. The sparsity and discrimination enhanced by data augmentation guarantees the robustness for image classification under various conditions, which constructs the first positive case using data augmentation to obtain robust dictionary-based classification. Numerous experiments have been conducted on popular facial and object image datasets. The results demonstrate that ACD-RDC obtains more promising classification on diversely collected images than the current dictionary-based classification methods. ACD-RDC is also confirmed to be a state-of-the-art classification method when using deep features as inputs.<\/jats:p>","DOI":"10.1145\/3572910","type":"journal-article","created":{"date-parts":[[2022,12,1]],"date-time":"2022-12-01T12:41:10Z","timestamp":1669898470000},"page":"1-24","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Joint Augmented and Compressed Dictionaries for Robust Image Classification"],"prefix":"10.1145","volume":"19","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4384-8787","authenticated-orcid":false,"given":"Shaoning","family":"Zeng","sequence":"first","affiliation":[{"name":"University of Electronic Science and Technology of China, Huzhou, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5433-7379","authenticated-orcid":false,"given":"Yunbo","family":"Rao","sequence":"additional","affiliation":[{"name":"University of Electronic Science and Technology of China, Huzhou, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2497-9519","authenticated-orcid":false,"given":"Bob","family":"Zhang","sequence":"additional","affiliation":[{"name":"University of Macau, Macao, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0530-2123","authenticated-orcid":false,"given":"Yong","family":"Xu","sequence":"additional","affiliation":[{"name":"Harbin Institute of Technology, Shenzhen, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2023,2,24]]},"reference":[{"key":"e_1_3_2_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSP.2006.881199"},{"key":"e_1_3_2_3_1","first-page":"1193","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Akhtar Naveed","year":"2017","unstructured":"Naveed Akhtar, Ajmal Mian, and Fatih Porikli. 2017a. Joint discriminative Bayesian dictionary and classifier learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1193\u20131202."},{"key":"e_1_3_2_4_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2016.12.017"},{"key":"e_1_3_2_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.322"},{"key":"e_1_3_2_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIT.2005.858979"},{"key":"e_1_3_2_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2015.2397444"},{"key":"e_1_3_2_8_1","first-page":"1","volume-title":"Proceedings of the 33rd AAAI Conference on Artificial Intelligence","volume":"6","author":"Chen Zitian","year":"2019","unstructured":"Zitian Chen, Yanwei Fu, Kaiyu Chen, and Yu-Gang Jiang. 2019. Image block augmentation for one-shot learning. In Proceedings of the 33rd AAAI Conference on Artificial Intelligence, Vol. 6. 1\u20138."},{"key":"e_1_3_2_9_1","first-page":"215","volume-title":"Proceedings of the International Conference on Artificial Intelligence and Statistics","volume":"15","author":"Coates Adam","year":"2011","unstructured":"Adam Coates, Andrew Y. Ng, and Honglak Lee. 2011. An analysis of single-layer networks in unsupervised feature learning. In Proceedings of the International Conference on Artificial Intelligence and Statistics, Vol. 15. 215\u2013223."},{"key":"e_1_3_2_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"e_1_3_2_11_1","doi-asserted-by":"publisher","DOI":"10.1002\/cpa.20132"},{"key":"e_1_3_2_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIT.2011.2173241"},{"key":"e_1_3_2_13_1","first-page":"364","volume-title":"Proceedings of the European Conference on Computer Vision (ECCV\u201918)","author":"Dvornik Nikita","year":"2018","unstructured":"Nikita Dvornik, Julien Mairal, and Cordelia Schmid. 2018. Modeling visual context is key to augmenting object detection datasets. In Proceedings of the European Conference on Computer Vision (ECCV\u201918). 364\u2013380."},{"key":"e_1_3_2_14_1","first-page":"2672","volume-title":"Advances in Neural Information Processing Systems","author":"Goodfellow Ian","year":"2014","unstructured":"Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative adversarial nets. In Advances in Neural Information Processing Systems. 2672\u20132680."},{"key":"e_1_3_2_15_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2018.08.021"},{"key":"e_1_3_2_16_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neunet.2020.01.020"},{"key":"e_1_3_2_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.328"},{"key":"e_1_3_2_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_19_1","doi-asserted-by":"crossref","first-page":"253","DOI":"10.1201\/b10905-11","article-title":"The data augmentation algorithm: Theory and methodology","author":"Hobert James P.","year":"2011","unstructured":"James P. Hobert. 2011. The data augmentation algorithm: Theory and methodology. In Handbook of Markov Chain Monte Carlo (2011), 253\u2013293.","journal-title":"Handbook of Markov Chain Monte Carlo"},{"key":"e_1_3_2_20_1","first-page":"609","volume-title":"Advances in Neural Information Processing Systems","author":"Huang Ke","year":"2007","unstructured":"Ke Huang and Selin Aviyente. 2007. Sparse representation for signal classification. In Advances in Neural Information Processing Systems. 609\u2013616."},{"key":"e_1_3_2_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIT.2009.2027527"},{"key":"e_1_3_2_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2013.88"},{"key":"e_1_3_2_23_1","unstructured":"arXiv 2009 Learning multiple layers of features from tiny images"},{"key":"e_1_3_2_24_1","first-page":"1097","volume-title":"Advances in Neural Information Processing Systems","author":"Krizhevsky Alex","year":"2012","unstructured":"Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. 2012. Imagenet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems. 1097\u20131105."},{"key":"e_1_3_2_25_1","doi-asserted-by":"publisher","DOI":"10.5555\/3122009.3176819"},{"key":"e_1_3_2_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2017.2764893"},{"key":"e_1_3_2_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/TNNLS.2015.2508025"},{"key":"e_1_3_2_28_1","volume-title":"The Theory of Error-correcting Codes","author":"MacWilliams Florence Jessie","year":"1977","unstructured":"Florence Jessie MacWilliams and Neil James Alexander Sloane. 1977. The Theory of Error-correcting Codes. Vol. 16. Elsevier."},{"key":"e_1_3_2_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2011.156"},{"key":"e_1_3_2_30_1","article-title":"The AR Face Database","volume":"24","author":"Martinez Aleix M.","year":"1998","unstructured":"Aleix M. Martinez. 1998. The AR Face Database. CVC Technical Report 24 (1998).","journal-title":"CVC Technical Report"},{"key":"e_1_3_2_31_1","unstructured":"Sameer A. Nene Shree K. Nayar and Hiroshi Murase. 1996. Columbia Object Image Library (coil-20) ."},{"key":"e_1_3_2_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2017.2774300"},{"issue":"51","key":"e_1_3_2_33_1","first-page":"1","article-title":"Bayesian combination of probabilistic classifiers using multivariate normal mixtures","volume":"20","author":"Pir\u0161 Gregor","year":"2019","unstructured":"Gregor Pir\u0161 and Erik \u0160trumbelj. 2019. Bayesian combination of probabilistic classifiers using multivariate normal mixtures. Journal of Machine Learning Research 20, 51 (2019), 1\u201318.","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_2_34_1","doi-asserted-by":"publisher","DOI":"10.1214\/11-BA601"},{"key":"e_1_3_2_35_1","first-page":"Early Access","volume-title":"International Joint Conference on Artificial Intelligence","author":"Ruan Wenjie","year":"2019","unstructured":"Wenjie Ruan, Min Wu, Youcheng Sun, Xiaowei Huang, Daniel Kroening, and Marta Kwiatkowska. 2019. Global robustness evaluation of deep neural networks with provable guarantees for the hamming distance. In International Joint Conference on Artificial Intelligence. Early Access."},{"key":"e_1_3_2_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298682"},{"key":"e_1_3_2_37_1","article-title":"Very deep convolutional networks for large-scale image recognition","author":"Simonyan Karen","year":"2014","unstructured":"Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014).","journal-title":"arXiv preprint arXiv:1409.1556"},{"key":"e_1_3_2_38_1","doi-asserted-by":"publisher","DOI":"10.1080\/01621459.1987.10478458"},{"key":"e_1_3_2_39_1","doi-asserted-by":"publisher","DOI":"10.1137\/151004549"},{"key":"e_1_3_2_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2018.2868350"},{"key":"e_1_3_2_41_1","first-page":"2797","volume-title":"Advances in Neural Information Processing Systems","author":"Tran Toan","year":"2017","unstructured":"Toan Tran, Trung Pham, Gustavo Carneiro, Lyle Palmer, and Ian Reid. 2017. A Bayesian data augmentation approach for learning deep models. In Advances in Neural Information Processing Systems. 2797\u20132806."},{"key":"e_1_3_2_42_1","doi-asserted-by":"publisher","DOI":"10.1198\/10618600152418584"},{"key":"e_1_3_2_43_1","first-page":"3097","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Wang Hua","year":"2013","unstructured":"Hua Wang, Feiping Nie, Heng Huang, and Chris Ding. 2013. Heterogeneous visual features fusion via sparse multimodal machine. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3097\u20133102."},{"key":"e_1_3_2_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00760"},{"key":"e_1_3_2_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCYB.2018.2884715"},{"key":"e_1_3_2_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2011.5995566"},{"key":"e_1_3_2_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/JPROC.2010.2044470"},{"key":"e_1_3_2_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2008.79"},{"key":"e_1_3_2_49_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00581"},{"key":"e_1_3_2_50_1","unstructured":"Han Xiao Kashif Rasul and Roland Vollgraf. 2017. Fashion-MNIST: A Novel Image Dataset for Benchmarking Machine Learning Algorithms. arXiv:cs.LG\/cs.LG\/1708.07747"},{"key":"e_1_3_2_51_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-014-0748-y"},{"key":"e_1_3_2_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2015.2490539"},{"key":"e_1_3_2_53_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2017.2695239"},{"key":"e_1_3_2_54_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2016.09.059"},{"issue":"2","key":"e_1_3_2_55_1","first-page":"543","article-title":"An improvement to the nearest neighbor classifier and face recognition experiments","volume":"9","author":"Xu Yong","year":"2013","unstructured":"Yong Xu, Qi Zhu, Yan Chen, Jeng-Shyang Pan, et\u00a0al. 2013. An improvement to the nearest neighbor classifier and face recognition experiments. Int. J. Innov. Comput. Inf. Control 9, 2 (2013), 543\u2013554.","journal-title":"Int. J. Innov. Comput. Inf. Control"},{"key":"e_1_3_2_56_1","first-page":"6566","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","volume":"34","author":"Yamaguchi Shin\u2019ya","year":"2020","unstructured":"Shin\u2019ya Yamaguchi, Sekitoshi Kanai, and Takeharu Eda. 2020. Effective data augmentation with multi-domain learning GANs. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 6566\u20136574."},{"key":"e_1_3_2_57_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2019.2903448"},{"key":"e_1_3_2_58_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/TPAMI.2019.2893953","article-title":"Shared multi-view data representation for multi-domain event detection","volume":"1","author":"Yang Zhenguo","year":"2019","unstructured":"Zhenguo Yang, Qing Li, Liu Wenyin, and Jianming Lv. 2019. Shared multi-view data representation for multi-domain event detection. IEEE Transactions on Pattern Analysis and Machine Intelligence 1, early access (2019), 1\u201314.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"e_1_3_2_59_1","first-page":"4390","volume-title":"Proceedings of the 32nd AAAI Conference on Artificial Intelligence","author":"You Shan","year":"2018","unstructured":"Shan You, Chang Xu, Chao Xu, and Dacheng Tao. 2018. Learning with single-teacher multi-student. In Proceedings of the 32nd AAAI Conference on Artificial Intelligence. 4390\u20134397."},{"key":"e_1_3_2_60_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCYB.2020.3025757"},{"key":"e_1_3_2_61_1","first-page":"502","volume-title":"Proceedings of the Asian Conference on Machine Learning","author":"Zeng Shaoning","year":"2018","unstructured":"Shaoning Zeng, Bob Zhang, Yanghao Zhang, and Jianping Gou. 2018. Collaboratively weighting deep and classic representation via \\(l_2\\) regularization for image classification. In Proceedings of the Asian Conference on Machine Learning. 502\u2013517."},{"key":"e_1_3_2_62_1","doi-asserted-by":"publisher","DOI":"10.1109\/LSP.2016.2603342"},{"key":"e_1_3_2_63_1","doi-asserted-by":"crossref","first-page":"471","DOI":"10.1109\/ICCV.2011.6126277","volume-title":"Proceedings of the 2011 International Conference on Computer Vision","author":"Zhang Lei","year":"2011","unstructured":"Lei Zhang, Meng Yang, and Xiangchu Feng. 2011. Sparse representation or collaborative representation: Which helps face recognition?. In Proceedings of the 2011 International Conference on Computer Vision. IEEE, 471\u2013478."},{"key":"e_1_3_2_64_1","first-page":"2691","volume-title":"Proceedings of the 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition","author":"Zhang Qiang","year":"2010","unstructured":"Qiang Zhang and Baoxin Li. 2010. Discriminative K-SVD for dictionary learning in face recognition. In Proceedings of the 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE, 2691\u20132698."},{"key":"e_1_3_2_65_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2019.05.059"},{"key":"e_1_3_2_66_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2015.2430359"},{"key":"e_1_3_2_67_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2018.2874313"},{"key":"e_1_3_2_68_1","doi-asserted-by":"publisher","DOI":"10.5555\/2627435.2638570"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3572910","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3572910","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T17:51:38Z","timestamp":1750182698000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3572910"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,2,24]]},"references-count":67,"journal-issue":{"issue":"3s","published-print":{"date-parts":[[2023,10,31]]}},"alternative-id":["10.1145\/3572910"],"URL":"https:\/\/doi.org\/10.1145\/3572910","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"type":"print","value":"1551-6857"},{"type":"electronic","value":"1551-6865"}],"subject":[],"published":{"date-parts":[[2023,2,24]]},"assertion":[{"value":"2021-06-06","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-11-20","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-02-24","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}