{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T05:01:29Z","timestamp":1750309289803,"version":"3.41.0"},"reference-count":53,"publisher":"Association for Computing Machinery (ACM)","issue":"5","license":[{"start":{"date-parts":[[2024,1,11]],"date-time":"2024-01-11T00:00:00Z","timestamp":1704931200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2024,5,31]]},"abstract":"<jats:p>In this work, we propose a novel principal component approximation network (PCANet) for image compression. The proposed network is based on the assumption that a set of images can be decomposed into several shared feature matrices, and an image can be reconstructed by the weighted sum of these matrices. The proposed PCANet is specifically devised to learn and approximate these feature matrices and weight vectors, which are used to encode images for compression. Unlike previous deep learning-based methods, a distinctive aspect of our approach is its consideration of network size in the bit-rate computation. Despite this inclusion, our proposed method yields promising results. Through extensive experiments conducted on standard datasets, we demonstrate the effectiveness of our approach in comparison to state-of-the-art techniques. 
To the best of our knowledge, this is the first machine learning approach that includes the size of networks during bitrate computation in image compression.<\/jats:p>","DOI":"10.1145\/3637490","type":"journal-article","created":{"date-parts":[[2023,12,13]],"date-time":"2023-12-13T11:42:50Z","timestamp":1702467770000},"page":"1-20","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Principal Component Approximation Network for Image Compression"],"prefix":"10.1145","volume":"20","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5642-5356","authenticated-orcid":false,"given":"Shupei","family":"Zhang","sequence":"first","affiliation":[{"name":"University of Alberta, Canada"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4574-4815","authenticated-orcid":false,"given":"Chenqiu","family":"Zhao","sequence":"additional","affiliation":[{"name":"University of Alberta, Canada"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7695-4148","authenticated-orcid":false,"given":"Anup","family":"Basu","sequence":"additional","affiliation":[{"name":"University of Alberta, Canada"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2024,1,11]]},"reference":[{"key":"e_1_3_1_2_2","article-title":"Soft-to-hard vector quantization for end-to-end learning compressible representations","volume":"30","author":"Agustsson Eirikur","year":"2017","unstructured":"Eirikur Agustsson, Fabian Mentzer, Michael Tschannen, Lukas Cavigelli, Radu Timofte, Luca Benini, and Luc V. Gool. 2017. Soft-to-hard vector quantization for end-to-end learning compressible representations. Adv. Neural Inf. Process. Syst. 30 (2017).","journal-title":"Adv. Neural Inf. Process. 
Syst."},{"key":"e_1_3_1_3_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1804.02958"},{"key":"e_1_3_1_4_2","doi-asserted-by":"publisher","DOI":"10.1109\/DCC.2018.00048"},{"key":"e_1_3_1_5_2","doi-asserted-by":"publisher","DOI":"10.1109\/T-C.1974.223784"},{"key":"e_1_3_1_6_2","doi-asserted-by":"publisher","DOI":"10.1145\/3471905"},{"key":"e_1_3_1_7_2","first-page":"677","volume-title":"Proceedings of the Hawaii International Conference: System Sciences","author":"Andrews H. C.","year":"1968","unstructured":"H. C. Andrews and W. K. Pratt. 1968. Fourier transform coding of images. In Proceedings of the Hawaii International Conference: System Sciences. 677\u2013679."},{"key":"e_1_3_1_8_2","volume-title":"Advances in Neural Information Processing Systems","author":"Baig Mohammad Haris","year":"2017","unstructured":"Mohammad Haris Baig, Vladlen Koltun, and Lorenzo Torresani. 2017. Learning to inpaint for image compression. In Advances in Neural Information Processing Systems, I. Guyon, U. Von Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (Eds.), Vol. 30. Curran Associates, Inc. Retrieved from https:\/\/proceedings.neurips.cc\/paper\/2017\/file\/013a006f03dbc5392effeb8f18fda755-Paper.pdf"},{"key":"e_1_3_1_9_2","volume-title":"Proceedings of the 5th International Conference on Learning Representations","author":"Ball\u00e9 Johannes","year":"2016","unstructured":"Johannes Ball\u00e9, Valero Laparra, and Eero P. Simoncelli. 2016. End-to-end optimized image compression. In Proceedings of the 5th International Conference on Learning Representations."},{"key":"e_1_3_1_10_2","volume-title":"Proceedings of the 6th International Conference on Learning Representations","author":"Ball\u00e9 Johannes","year":"2018","unstructured":"Johannes Ball\u00e9, David Minnen, Saurabh Singh, Sung Jin Hwang, and Nick Johnston. 2018. Variational image compression with a scale hyperprior. 
In Proceedings of the 6th International Conference on Learning Representations."},{"key":"e_1_3_1_11_2","article-title":"Calculation of average PSNR differences between RD-curves","author":"Bjontegaard Gisle","year":"2001","unstructured":"Gisle Bjontegaard. 2001. Calculation of average PSNR differences between RD-curves. ITU SG16 Doc. VCEG-M33 (2001).","journal-title":"ITU SG16 Doc. VCEG-M33"},{"key":"e_1_3_1_12_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2021.3058615"},{"key":"e_1_3_1_13_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP40776.2020.9053885"},{"key":"e_1_3_1_14_2","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR\u201919)","author":"Cheng Zhengxue","year":"2019","unstructured":"Zhengxue Cheng, Heming Sun, Masaru Takeuchi, and Jiro Katto. 2019. Learning image and video compression through spatial-temporal energy compaction. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR\u201919)."},{"key":"e_1_3_1_15_2","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR\u201920)","author":"Cheng Zhengxue","year":"2020","unstructured":"Zhengxue Cheng, Heming Sun, Masaru Takeuchi, and Jiro Katto. 2020. Learned image compression with discretized Gaussian mixture likelihoods and attention modules. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR\u201920)."},{"key":"e_1_3_1_16_2","doi-asserted-by":"publisher","DOI":"10.1109\/30.920468"},{"key":"e_1_3_1_17_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW.2014.126"},{"key":"e_1_3_1_18_2","first-page":"14677","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV\u201921)","author":"Gao Ge","year":"2021","unstructured":"Ge Gao, Pei You, Rong Pan, Shunyuan Han, Yuanyuan Zhang, Yuchao Dai, and Hojae Lee. 2021. 
Neural image compression via attentional multi-scale back projection and frequency decomposition. In Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV\u201921). 14677\u201314686."},{"key":"e_1_3_1_19_2","volume-title":"Proceedings of the ACM International Conference on Multimedia (ACM MM\u201923)","author":"Gao Wei","year":"2023","unstructured":"Wei Gao, Shangkun Sun, Huiming Zheng, Yuyang Wu, Hua Ye, and Yongchi Zhang. 2023. OpenDMC: An open-source library and performance evaluation for deep-learning-based multi-frame compression. In Proceedings of the ACM International Conference on Multimedia (ACM MM\u201923)."},{"key":"e_1_3_1_20_2","volume-title":"Advances in Neural Information Processing Systems","author":"Gregor Karol","year":"2016","unstructured":"Karol Gregor, Frederic Besse, Danilo Jimenez Rezende, Ivo Danihelka, and Daan Wierstra. 2016. Towards conceptual compression. In Advances in Neural Information Processing Systems, D. Lee, M. Sugiyama, U. Luxburg, I. Guyon, and R. Garnett (Eds.), Vol. 29. Curran Associates, Inc."},{"key":"e_1_3_1_21_2","first-page":"1462","volume-title":"Proceedings of the 32nd International Conference on Machine Learning (Proceedings of Machine Learning Research)","volume":"37","author":"Gregor Karol","year":"2015","unstructured":"Karol Gregor, Ivo Danihelka, Alex Graves, Danilo Rezende, and Daan Wierstra. 2015. DRAW: A recurrent neural network for image generation. In Proceedings of the 32nd International Conference on Machine Learning (Proceedings of Machine Learning Research), Francis Bach and David Blei (Eds.), Vol. 37. PMLR, 1462\u20131471."},{"key":"e_1_3_1_22_2","first-page":"5718","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR\u201922)","author":"He Dailan","year":"2022","unstructured":"Dailan He, Ziming Yang, Weikun Peng, Rui Ma, Hongwei Qin, and Yan Wang. 2022. 
ELIC: Efficient learned image compression with unevenly grouped space-channel contextual adaptive coding. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR\u201922). 5718\u20135727."},{"key":"e_1_3_1_23_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2021.3065339"},{"key":"e_1_3_1_24_2","doi-asserted-by":"publisher","DOI":"10.1109\/JRPROC.1952.273898"},{"key":"e_1_3_1_25_2","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201918)","author":"Johnston Nick","year":"2018","unstructured":"Nick Johnston, Damien Vincent, David Minnen, Michele Covell, Saurabh Singh, Troy Chinen, Sung Jin Hwang, Joel Shor, and George Toderici. 2018. Improved lossy image compression with priming and spatially adaptive bit rates for recurrent networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201918)."},{"key":"e_1_3_1_26_2","first-page":"5992","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR\u201922)","author":"Kim Jun-Hyuk","year":"2022","unstructured":"Jun-Hyuk Kim, Byeongho Heo, and Jong-Seok Lee. 2022. Joint global and local hierarchical priors for learned image compression. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR\u201922). 5992\u20136001."},{"key":"e_1_3_1_27_2","volume-title":"Proceedings of the 7th International Conference on Learning Representations","author":"Lee Jooyoung","year":"2019","unstructured":"Jooyoung Lee, Seunghyun Cho, and Seung-Kwon Beack. 2019. Context-adaptive entropy model for end-to-end optimized image compression. 
In Proceedings of the 7th International Conference on Learning Representations."},{"key":"e_1_3_1_28_2","first-page":"16113","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR\u201922)","author":"Lee Jae-Han","year":"2022","unstructured":"Jae-Han Lee, Seungmin Jeon, Kwang Pyo Choi, Youngo Park, and Chang-Su Kim. 2022. DPICT: Deep progressive image compression using trit-planes. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR\u201922). 16113\u201316122."},{"key":"e_1_3_1_29_2","first-page":"19669","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR\u201922)","author":"Lei Jianjun","year":"2022","unstructured":"Jianjun Lei, Xiangrui Liu, Bo Peng, Dengchao Jin, Wanqing Li, and Jingxiao Gu. 2022. Deep stereo image compression via bi-directional coding. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR\u201922). 19669\u201319678."},{"key":"e_1_3_1_30_2","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201918)","author":"Li Mu","year":"2018","unstructured":"Mu Li, Wangmeng Zuo, Shuhang Gu, Debin Zhao, and David Zhang. 2018. Learning convolutional networks for content-weighted image compression. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201918)."},{"key":"e_1_3_1_31_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"e_1_3_1_32_2","doi-asserted-by":"crossref","unstructured":"Ziwei Liu Ping Luo Xiaogang Wang and Xiaoou Tang. 2015. Deep learning face attributes in the wild. 
In Proceedings of International Conference on Computer Vision (ICCV\u201915).","DOI":"10.1109\/ICCV.2015.425"},{"key":"e_1_3_1_33_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2019.2910119"},{"key":"e_1_3_1_34_2","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201918)","author":"Mentzer Fabian","year":"2018","unstructured":"Fabian Mentzer, Eirikur Agustsson, Michael Tschannen, Radu Timofte, and Luc Van Gool. 2018. Conditional probability models for deep image compression. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201918)."},{"key":"e_1_3_1_35_2","volume-title":"Advances in Neural Information Processing Systems","author":"Minnen David","year":"2018","unstructured":"David Minnen, Johannes Ball\u00e9, and George D. Toderici. 2018. Joint autoregressive and hierarchical priors for learned image compression. In Advances in Neural Information Processing Systems, S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi, and R. Garnett (Eds.), Vol. 31. Curran Associates, Inc. Retrieved from https:\/\/proceedings.neurips.cc\/paper\/2018\/file\/53edebc543333dfbf7c5933af792c9c4-Paper.pdf"},{"key":"e_1_3_1_36_2","unstructured":"Matthew Muckley, Jordan Juravsky, Daniel Severo, Mannat Singh, Quentin Duval, and Karen Ullrich. 2021. Neural Compression. Retrieved from https:\/\/github.com\/facebookresearch\/NeuralCompression"},{"key":"e_1_3_1_37_2","first-page":"227","volume-title":"Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision","author":"Patel Yash","year":"2021","unstructured":"Yash Patel, Srikar Appalaraju, and R. Manmatha. 2021. Saliency driven perceptual image compression. In Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision. 
227\u2013236."},{"key":"e_1_3_1_38_2","doi-asserted-by":"publisher","DOI":"10.1109\/PROC.1969.6869"},{"key":"e_1_3_1_39_2","first-page":"6033","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR\u201922)","author":"Rhee Hochang","year":"2022","unstructured":"Hochang Rhee, Yeong Il Jang, Seyun Kim, and Nam Ik Cho. 2022. LC-FDNet: Learned lossless image compression with frequency decomposition network. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR\u201922). 6033\u20136042."},{"key":"e_1_3_1_40_2","first-page":"2922","volume-title":"Proceedings of the 34th International Conference on Machine Learning (Proceedings of Machine Learning Research)","volume":"70","author":"Rippel Oren","year":"2017","unstructured":"Oren Rippel and Lubomir Bourdev. 2017. Real-time adaptive image compression. In Proceedings of the 34th International Conference on Machine Learning (Proceedings of Machine Learning Research), Doina Precup and Yee Whye Teh (Eds.), Vol. 70. PMLR, 2922\u20132930."},{"key":"e_1_3_1_41_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2015.02.027"},{"key":"e_1_3_1_42_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2020.11.026"},{"key":"e_1_3_1_43_2","first-page":"2380","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV\u201921)","author":"Song Myungseo","year":"2021","unstructured":"Myungseo Song, Jinyoung Choi, and Bohyung Han. 2021. Variable-rate deep image compression through spatially-adaptive feature transform. In Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV\u201921). 2380\u20132389."},{"key":"e_1_3_1_44_2","volume-title":"Proceedings of the 5th International Conference on Learning Representations","author":"Theis Lucas","year":"2017","unstructured":"Lucas Theis, Wenzhe Shi, Andrew Cunningham, and Ferenc Husz\u00e1r. 2017. 
Lossy image compression with compressive autoencoders. In Proceedings of the 5th International Conference on Learning Representations."},{"key":"e_1_3_1_45_2","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201917)","author":"Toderici George","year":"2017","unstructured":"George Toderici, Damien Vincent, Nick Johnston, Sung Jin Hwang, David Minnen, Joel Shor, and Michele Covell. 2017. Full resolution image compression with recurrent neural networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201917)."},{"key":"e_1_3_1_46_2","doi-asserted-by":"publisher","DOI":"10.1109\/30.125072"},{"key":"e_1_3_1_47_2","first-page":"17379","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR\u201922)","author":"Wang Dezhao","year":"2022","unstructured":"Dezhao Wang, Wenhan Yang, Yueyu Hu, and Jiaying Liu. 2022. Neural data-dependent transform for learned image compression. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR\u201922). 17379\u201317388."},{"key":"e_1_3_1_48_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACSSC.2003.1292216"},{"key":"e_1_3_1_49_2","doi-asserted-by":"publisher","DOI":"10.1145\/214762.214771"},{"key":"e_1_3_1_50_2","first-page":"661","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR\u201922)","author":"W\u00f6dlinger Matthias","year":"2022","unstructured":"Matthias W\u00f6dlinger, Jan Kotera, Jan Xu, and Robert Sablatnig. 2022. SASIC: Stereo image compression with latent shifts and stereo attention. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR\u201922). 
661\u2013670."},{"key":"e_1_3_1_51_2","doi-asserted-by":"publisher","DOI":"10.1109\/WACV45572.2020.9093387"},{"key":"e_1_3_1_52_2","first-page":"2617","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops","author":"Zhou Lei","year":"2018","unstructured":"Lei Zhou, Chunlei Cai, Yue Gao, Sanbao Su, and Junmin Wu. 2018. Variational autoencoder for low bit-rate image compression. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. 2617\u20132620."},{"key":"e_1_3_1_53_2","first-page":"17612","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR\u201922)","author":"Zhu Xiaosu","year":"2022","unstructured":"Xiaosu Zhu, Jingkuan Song, Lianli Gao, Feng Zheng, and Heng Tao Shen. 2022. Unified multivariate gaussian mixture for efficient neural image compression. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR\u201922). 17612\u201317621."},{"key":"e_1_3_1_54_2","first-page":"17492","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR\u201922)","author":"Zou Renjie","year":"2022","unstructured":"Renjie Zou, Chunfeng Song, and Zhaoxiang Zhang. 2022. The devil is in the details: Window-based attention for image compression. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR\u201922). 
17492\u201317501."}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3637490","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3637490","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T00:03:25Z","timestamp":1750291405000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3637490"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,1,11]]},"references-count":53,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2024,5,31]]}},"alternative-id":["10.1145\/3637490"],"URL":"https:\/\/doi.org\/10.1145\/3637490","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"type":"print","value":"1551-6857"},{"type":"electronic","value":"1551-6865"}],"subject":[],"published":{"date-parts":[[2024,1,11]]},"assertion":[{"value":"2023-08-14","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-12-10","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-01-11","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}