{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T01:30:07Z","timestamp":1760146207950,"version":"build-2065373602"},"reference-count":46,"publisher":"MDPI AG","issue":"10","license":[{"start":{"date-parts":[[2024,10,10]],"date-time":"2024-10-10T00:00:00Z","timestamp":1728518400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100012166","name":"National Key Research and Development Program of China","doi-asserted-by":"publisher","award":["2022YFB3305300"],"award-info":[{"award-number":["2022YFB3305300"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Symmetry"],"abstract":"<jats:p>The rich information and complex background of industrial images make it a challenging task to improve the high compression rate of images. Current learning-based image compression methods mostly use customized convolutional neural networks (CNNs), which find it difficult to cope with the complex production background of industrial images. This causes useful information to be lost in the abundance of irrelevant data, making it difficult to accurately extract important features during the feature extraction stage. To address this, a Multi-path Residual Asymmetric Convolutional Compression Network (MRACNN) is proposed. Firstly, a Multi-path Residual Asymmetric Convolution Block (MRACB) is introduced, which includes the Multi-path Residual Asymmetric Convolution Down-sampling Module for down-sampling in the encoder to extract key features, and the Mult-path Residual Asymmetric Convolution Up-sampling Module for up-sampling in the decoder to recover details and reconstruct the image. This feature transfer and information flow enables the better capture of image details and important information, thereby improving the quality and efficiency of image compression and decompression. Furthermore, a two-branch enhanced local attention mechanisms, and a channel-squeezing entropy model based on the compression-based enhanced local attention module is proposed to enhance the performance of the modeled compression. Extensive experimental evaluations demonstrate that the proposed method outperforms state-of-the-art techniques, achieves superior Rate\u2013Distortion Performance, and excels in preserving local details.<\/jats:p>","DOI":"10.3390\/sym16101342","type":"journal-article","created":{"date-parts":[[2024,10,10]],"date-time":"2024-10-10T11:34:36Z","timestamp":1728560076000},"page":"1342","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["MRACNN: Multi-Path Residual Asymmetric Convolution and Enhanced Local Attention Mechanism for Industrial Image Compression"],"prefix":"10.3390","volume":"16","author":[{"given":"Zikang","family":"Yan","sequence":"first","affiliation":[{"name":"Faculty of Information Science and Engineering, Ocean University of China, Qingdao 266100, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7746-8061","authenticated-orcid":false,"given":"Peishun","family":"Liu","sequence":"additional","affiliation":[{"name":"Faculty of Information Science and Engineering, Ocean University of China, Qingdao 266100, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xuefang","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Mathematical Sciences, Ocean University of China, Qingdao 266100, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Haojie","family":"Gao","sequence":"additional","affiliation":[{"name":"Faculty of Information Science and Engineering, Ocean University of China, Qingdao 266100, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xiaolong","family":"Ma","sequence":"additional","affiliation":[{"name":"Faculty of Information Science and Engineering, Ocean University of China, Qingdao 266100, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xintong","family":"Hu","sequence":"additional","affiliation":[{"name":"Faculty of Information Science and Engineering, Ocean University of China, Qingdao 266100, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2024,10,10]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"30","DOI":"10.1145\/103085.103089","article-title":"The JPEG still picture compression standard","volume":"34","author":"Wallace","year":"1991","journal-title":"Commun. ACM"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Taubman, D.S., and Marcellin, M.W. (2002). JPEG2000: Image Compression Fundamentals, Standards and Practice, Springer Science+Business Media.","DOI":"10.1007\/978-1-4615-0799-4"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"867","DOI":"10.1016\/j.image.2012.01.011","article-title":"Objective assessment of the WebP image coding algorithm","volume":"27","author":"Ginesu","year":"2012","journal-title":"Signal Process. Image Commun."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"1649","DOI":"10.1109\/TCSVT.2012.2221191","article-title":"Overview of the high efficiency video coding (HEVC) standard","volume":"22","author":"Sullivan","year":"2012","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_5","unstructured":"Ball\u00e9, J., Laparra, V., and Simoncelli, E.P. (2016). End-to-end optimized image compression. arXiv."},{"key":"ref_6","first-page":"3","article-title":"A mathematical theory of communication","volume":"5","author":"Shannon","year":"2001","journal-title":"Bell Syst. Tech. J."},{"key":"ref_7","unstructured":"Wang, Z., Simoncelli, E.P., and Bovik, A.C. (2003, January 9\u201312). Multiscale structural similarity for image quality assessment. Proceedings of the Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, Pacific Grove, CA, USA."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Mentzer, F., Agustsson, E., Tschannen, M., Timofte, R., and Van Gool, L. (2018, January 18\u201322). Conditional probability models for deep image compression. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00462"},{"key":"ref_9","unstructured":"Ball\u00e9, J., Minnen, D., Singh, S., Hwang, S.J., and Johnston, N. (2018). Variational image compression with a scale hyperprior. arXiv."},{"key":"ref_10","unstructured":"Van Den Oord, A., Kalchbrenner, N., and Kavukcuoglu, K. (2016, January 20\u201322). Pixel recurrent neural networks. Proceedings of the International Conference on Machine Learning, New York, NY, USA."},{"key":"ref_11","unstructured":"Minnen, D., Ball\u00e9, J., and Toderici, G.D. (2018, January 3\u20138). Joint autoregressive and hierarchical priors for learned image compression. Proceedings of the 32nd International Conference on Neural Information Processing Systems, Montr\u00e9al, QC, Canada."},{"key":"ref_12","unstructured":"Liu, H., Chen, T., Shen, Q., Yue, T., and Ma, Z. (2018, January 18\u201322). Deep image compression via end-to-end learning. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, Salt Lake City, UT, USA."},{"key":"ref_13","unstructured":"Agustsson, E., Tschannen, M., Mentzer, F., Timofte, R., and Gool, L.V. (November, January 27). Generative adversarial networks for extreme learned image compression. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Huang, C., Liu, H., Chen, T., Shen, Q., and Ma, Z. (2019, January 1\u20134). Extreme image coding via multiscale autoencoders with generative adversarial optimization. Proceedings of the 2019 IEEE Visual Communications and Image Processing (VCIP), Sydney, NSW, Australia.","DOI":"10.1109\/VCIP47243.2019.8966059"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Cheng, Z., Sun, H., Takeuchi, M., and Katto, J. (2020, January 14\u201319). Learned image compression with discretized gaussian mixture likelihoods and attention modules. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00796"},{"key":"ref_16","unstructured":"Liu, H., Chen, T., Guo, P., Shen, Q., Cao, X., Wang, Y., and Ma, Z. (2019). Non-local attention optimized deep image compression. arXiv."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Zou, R., Song, C., and Zhang, Z. (2022, January 18\u201324). The devil is in the details: Window-based attention for image compression. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.01697"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., Lin, S., and Guo, B. (2021, January 11\u201317). Swin transformer: Hierarchical vision transformer using shifted windows. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Virtual.","DOI":"10.1109\/ICCV48922.2021.00986"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Johnston, N., Vincent, D., Minnen, D., Covell, M., Singh, S., Chinen, T., Hwang, S.J., Shor, J., and Toderici, G. (2018, January 18\u201323). Improved lossy image compression with priming and spatially adaptive bit rates for recurrent networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00461"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Lin, C., Yao, J., Chen, F., and Wang, L. (2020, January 14\u201319). A spatial rnn codec for end-to-end image compression. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01328"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Li, M., Zuo, W., Gu, S., Zhao, D., and Zhang, D. (2018, January 18\u201322). Learning convolutional networks for content-weighted image compression. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00339"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Dong, C., Deng, Y., Loy, C.C., and Tang, X. (2015, January 7\u201313). Compression artifacts reduction by a deep convolutional network. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.73"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Hu, X., Yang, K., Fei, L., and Wang, K. (2019, January 22\u201325). Acnet: Attention based network to exploit complementary features for rgbd semantic segmentation. Proceedings of the 2019 IEEE International Conference on Image Processing (ICIP), Taipei, Taiwan.","DOI":"10.1109\/ICIP.2019.8803025"},{"key":"ref_24","unstructured":"Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., and Wojna, Z. (July, January 26). Rethinking the inception architecture for computer vision. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Lo, S.-Y., Hang, H.-M., Chan, S.-W., and Lin, J.-J. (2019, January 15\u201318). Efficient dense modules of asymmetric convolution for real-time semantic segmentation. Proceedings of the 1st ACM International Conference on Multimedia in Asia, New York, NY, USA.","DOI":"10.1145\/3338533.3366558"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"3718","DOI":"10.1109\/TSMC.2021.3069265","article-title":"Asymmetric CNN for image superresolution","volume":"52","author":"Tian","year":"2021","journal-title":"IEEE Trans. Syst. Man Cybern. Syst."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"421","DOI":"10.1109\/TCSVT.2022.3199472","article-title":"Joint graph attention and asymmetric convolutional neural network for deep image compression","volume":"33","author":"Tang","year":"2022","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Wang, X., Girshick, R., Gupta, A., and He, K. (2018, January 18\u201323). Non-local neural networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00813"},{"key":"ref_29","unstructured":"Zhang, Y., Li, K., Li, K., Zhong, B., and Fu, Y. (2019). Residual non-local attention networks for image restoration. arXiv."},{"key":"ref_30","unstructured":"Zhou, J., Wang, P., Wang, F., Liu, Q., Li, H., and Jin, R. (2021). Elsa: Enhanced local self-attention for vision transformer. arXiv."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Chen, L.-C., Zhu, Y., Papandreou, G., Schroff, F., and Adam, H. (2018, January 8\u201314). Encoder-decoder with atrous separable convolution for semantic image segmentation. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_49"},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"7498","DOI":"10.1109\/TCSVT.2024.3371686","article-title":"Efficient contextformer: Spatio-channel window attention for fast context modeling in learned image compression","volume":"34","author":"Koyuncu","year":"2024","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_33","unstructured":"Lee, J., Cho, S., and Beack, S.K. (2018). Context-adaptive entropy model for end-to-end optimized image compression. arXiv."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Fu, J., Liu, J., Tian, H., Li, Y., Bao, Y., Fang, Z., and Lu, H. (2019, January 16\u201317). Dual attention network for scene segmentation. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00326"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Liu, J., Sun, H., and Katto, J. (2023, January 17\u201324). Learned image compression with mixed transformer-cnn architectures. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52729.2023.01383"},{"key":"ref_36","unstructured":"Ren, G., Cao, T., Kong, F., and Tang, J. (2022, January 25\u201327). Channel-Stationary Entropy Model for Multispectral Image Compression. Proceedings of the International Conference in Communications, Signal Processing, and Systems, Xi\u2019an, China."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"He, D., Yang, Z., Peng, W., Ma, R., Qin, H., and Wang, Y. (2022, January 18\u201324). Elic: Efficient learned image compression with unevenly grouped space-channel contextual adaptive coding. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.00563"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Minnen, D., and Singh, S. (2020, January 25\u201328). Channel-wise autoregressive entropy models for learned image compression. Proceedings of the 2020 IEEE International Conference on Image Processing (ICIP), Virtual.","DOI":"10.1109\/ICIP40778.2020.9190935"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Luo, A., Sun, H., Liu, J., and Katto, J. (2022, January 16\u201319). Memory-efficient learned image compression with pruned hyperprior module. Proceedings of the 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France.","DOI":"10.1109\/ICIP46576.2022.9897854"},{"key":"ref_40","unstructured":"Kodak, E.J.U. (2024, October 07). Kodak Lossless True Color Image Suite (PhotoCD PCD0992). Available online: https:\/\/r0k.us\/graphics\/kodak\/."},{"key":"ref_41","unstructured":"Toderici, G., Shi, W., Timofte, R., Theis, L., Balle, J., Agustsson, E., Johnston, N., and Mentzer, F. (2020, January 13\u201319). Workshop and challenge on learned image compression (clic2020). Proceedings of the CVPR, Seattle, WA, USA."},{"key":"ref_42","first-page":"18","article-title":"Openimages: A public dataset for large-scale multi-label and multi-class image classification","volume":"2","author":"Krasin","year":"2017","journal-title":"Dataset"},{"key":"ref_43","unstructured":"Kingma, D.P. (2014). Adam: A method for stochastic optimization. arXiv."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"600","DOI":"10.1109\/TIP.2003.819861","article-title":"Image quality assessment: From error visibility to structural similarity","volume":"13","author":"Wang","year":"2004","journal-title":"IEEE Trans. Image Process."},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Xie, Y., Cheng, K.L., and Chen, Q. (2021, January 20\u201324). Enhanced invertible encoding for learned image compression. Proceedings of the 29th ACM International Conference on Multimedia, Chengdu, China.","DOI":"10.1145\/3474085.3475213"},{"key":"ref_46","unstructured":"Zhu, Y., Yang, Y., and Cohen, T. (2022, January 25\u201329). Transformer-based transform coding. Proceedings of the International Conference on Learning Representations, Virtual."}],"container-title":["Symmetry"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2073-8994\/16\/10\/1342\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T16:10:44Z","timestamp":1760112644000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2073-8994\/16\/10\/1342"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,10,10]]},"references-count":46,"journal-issue":{"issue":"10","published-online":{"date-parts":[[2024,10]]}},"alternative-id":["sym16101342"],"URL":"https:\/\/doi.org\/10.3390\/sym16101342","relation":{},"ISSN":["2073-8994"],"issn-type":[{"type":"electronic","value":"2073-8994"}],"subject":[],"published":{"date-parts":[[2024,10,10]]}}}