{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T00:44:05Z","timestamp":1760143445523,"version":"build-2065373602"},"reference-count":38,"publisher":"MDPI AG","issue":"4","license":[{"start":{"date-parts":[[2024,2,9]],"date-time":"2024-02-09T00:00:00Z","timestamp":1707436800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Macau Science and Technology Development Funds","award":["0061\/2020\/A2","20211800904512","20231800935472","20231800500352"],"award-info":[{"award-number":["0061\/2020\/A2","20211800904512","20231800935472","20231800500352"]}]},{"name":"Science and Technology of Social Development Program","award":["0061\/2020\/A2","20211800904512","20231800935472","20231800500352"],"award-info":[{"award-number":["0061\/2020\/A2","20211800904512","20231800935472","20231800500352"]}]},{"name":"Dongguan Sci-tech Commissoner Program","award":["0061\/2020\/A2","20211800904512","20231800935472","20231800500352"],"award-info":[{"award-number":["0061\/2020\/A2","20211800904512","20231800935472","20231800500352"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>In recent years, advancements in deep Convolutional Neural Networks (CNNs) have brought about a paradigm shift in the realm of image super-resolution (SR). While augmenting the depth and breadth of CNNs can indeed enhance network performance, it often comes at the expense of heightened computational demands and greater memory usage, which can restrict practical deployment. To mitigate this challenge, we have incorporated a technique called factorized convolution and introduced the efficient Cross-Scale Interaction Block (CSIB). CSIB employs a dual-branch structure, with one branch extracting local features and the other capturing global features. Interaction operations take place in the middle of this dual-branch structure, facilitating the integration of cross-scale contextual information. To further refine the aggregated contextual information, we designed an Efficient Large Kernel Attention (ELKA) using large convolutional kernels and a gating mechanism. By stacking CSIBs, we have created a lightweight cross-scale interaction network for image super-resolution named \u201cCSINet\u201d. This innovative approach significantly reduces computational costs while maintaining performance, providing an efficient solution for practical applications. The experimental results convincingly demonstrate that our CSINet surpasses the majority of the state-of-the-art lightweight super-resolution techniques used on widely recognized benchmark datasets. Moreover, our smaller model, CSINet-S, shows an excellent performance record on lightweight super-resolution benchmarks with extremely low parameters and Multi-Adds (e.g., 33.82 dB@Set14 \u00d7 2 with only 248 K parameters).<\/jats:p>","DOI":"10.3390\/s24041135","type":"journal-article","created":{"date-parts":[[2024,2,9]],"date-time":"2024-02-09T03:53:46Z","timestamp":1707450826000},"page":"1135","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["CSINet: A Cross-Scale Interaction Network for Lightweight Image Super-Resolution"],"prefix":"10.3390","volume":"24","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1813-5985","authenticated-orcid":false,"given":"Gang","family":"Ke","sequence":"first","affiliation":[{"name":"School of Computer Science and Engineering, Macau University of Science and Technology, Macau 999078, China"},{"name":"School of Electronic Information, Dongguan Polytechnic, Dongguan 523109, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sio-Long","family":"Lo","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Macau University of Science and Technology, Macau 999078, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3641-2686","authenticated-orcid":false,"given":"Hua","family":"Zou","sequence":"additional","affiliation":[{"name":"School of Computer Science, Wuhan University, Wuhan 430072, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yi-Feng","family":"Liu","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Macau University of Science and Technology, Macau 999078, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhen-Qiang","family":"Chen","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Macau University of Science and Technology, Macau 999078, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jing-Kai","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Macau University of Science and Technology, Macau 999078, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2024,2,9]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Dong, C., Loy, C.C., and Tang, X. (2016, January 11\u201314). Accelerating the super-resolution convolutional neural network. Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46475-6_25"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Kim, J., Lee, J.K., and Lee, K.M. (2016, January 27\u201330). Accurate image super-resolution using very deep convolutional networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.182"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Lim, B., Son, S., Kim, H., Nah, S., and Mu Lee, K. (2017, January 21\u201326). Enhanced deep residual networks for single image super-resolution. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, Honolulu, HI, USA.","DOI":"10.1109\/CVPRW.2017.151"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Zhang, Y., Li, K., Li, K., Wang, L., Zhong, B., and Fu, Y. (2018, January 8\u201314). Image super-resolution using very deep residual channel attention networks. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_18"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Kong, F., Li, M., Liu, S., Liu, D., He, J., Bai, Y., Chen, F., and Fu, L. (2022, January 18\u201324). Residual Local Feature Network for Efficient Super-Resolution. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPRW56347.2022.00092"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Li, Z., Liu, Y., Chen, X., Cai, H., Gu, J., Qiao, Y., and Dong, C. (2022, January 18\u201324). Blueprint Separable Residual Network for Efficient Image Super-Resolution. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPRW56347.2022.00099"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Wang, L., Dong, X., Wang, Y., Ying, X., Lin, Z., An, W., and Guo, Y. (2021, January 20\u201325). Exploring sparsity in image super-resolution for efficient inference. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.00488"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Zhao, H., Kong, X., He, J., Qiao, Y., and Dong, C. (2020, January 23\u201328). Efficient image super-resolution using pixel attention. Proceedings of the European Conference on Computer Vision, Glasgow, UK.","DOI":"10.1007\/978-3-030-67070-2_3"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Du, Z., Liu, D., Liu, J., Tang, J., Wu, G., and Fu, L. (2022, January 18\u201324). Fast and Memory-Efficient Network Towards Efficient Image Super-Resolution. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPRW56347.2022.00101"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Wang, Y. (2022, January 18\u201324). Edge-Enhanced Feature Distillation Network for Efficient Super-Resolution. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, New Orleans, LA, USA.","DOI":"10.1109\/CVPRW56347.2022.00093"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Hui, Z., Gao, X., Yang, Y., and Wang, X. (2019, January 21\u201325). Lightweight Image Super-Resolution with Information Multi-distillation Network. Proceedings of the 27th ACM International Conference on Multimedia, Nice, France.","DOI":"10.1145\/3343031.3351084"},{"key":"ref_12","first-page":"20343","article-title":"Lapar: Linearly-assembled pixel-adaptive regression network for single image super-resolution and beyond","volume":"33","author":"Li","year":"2020","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Liu, J., Tang, J., and Wu, G. (2020, January 23\u201328). Residual feature distillation network for lightweight image super-resolution. Proceedings of the European Conference on Computer Vision, Glasgow, UK.","DOI":"10.1007\/978-3-030-67070-2_2"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Ahn, N., Kang, B., and Sohn, K.A. (2018, January 8\u201314). Fast, accurate, and lightweight super-resolution with cascading residual network. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01249-6_16"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Hui, Z., Wang, X., and Gao, X. (2018, January 18\u201323). Fast and accurate single image super-resolution via information distillation network. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00082"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Chen, S., Huang, K., Li, B., Xiong, D., Jiang, H., and Claesen, L. (2020, January 23\u201328). Adaptive hybrid composition based super-resolution network via fine-grained channel pruning. Proceedings of the European Conference on Computer Vision, Glasgow, UK.","DOI":"10.1007\/978-3-030-67070-2_7"},{"key":"ref_17","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, \u0141., and Polosukhin, I. (2017, January 4\u20139). Attention Is All You Need. Proceedings of the Advances in Neural Information Processing Systems, Long Beach, CA, USA."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., and Sun, G. (2018, January 18\u201323). Squeeze-and-excitation networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Wang, Q., Wu, B., Zhu, P., Li, P., Zuo, W., and Hu, Q. (2020, January 13\u201319). ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks. Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01155"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Wang, X., Girshick, R., Gupta, A., and He, K. (2018, January 18\u201323). Non-Local Neural Networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00813"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"733","DOI":"10.1007\/s41095-023-0364-2","article-title":"Visual attention network","volume":"9","author":"Guo","year":"2023","journal-title":"Comput. Vis. Media"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"109376","DOI":"10.1016\/j.knosys.2022.109376","article-title":"LKASR: Large kernel attention for lightweight image super-resolution","volume":"252","author":"Feng","year":"2022","journal-title":"Knowl.-Based Syst."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Xie, C., Zhang, X., Li, L., Meng, H., Zhang, T., Li, T., and Zhao, X. (2023, January 17\u201324). Large Kernel Distillation Network for Efficient Single Image Super-Resolution. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPRW59228.2023.00135"},{"key":"ref_24","unstructured":"Howard, A.G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., and Adam, H. (2017). Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"263","DOI":"10.1109\/TITS.2017.2750080","article-title":"Erfnet: Efficient residual factorized convnet for real-time semantic segmentation","volume":"19","author":"Romera","year":"2017","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_26","unstructured":"Li, G., Yun, I., Kim, J., and Kim, J. (2019). Dabnet: Depth-wise asymmetric bottleneck for real-time semantic segmentation. arXiv."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Wang, Y., Zhou, Q., Liu, J., Xiong, J., Gao, G., Wu, X., and Latecki, L.J. (2019, January 22\u201325). Lednet: A lightweight encoder-decoder network for real-time semantic segmentation. Proceedings of the 2019 IEEE International Conference on Image Processing (ICIP), Taipei, Taiwan.","DOI":"10.1109\/ICIP.2019.8803154"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"25489","DOI":"10.1109\/TITS.2021.3098355","article-title":"MSCFNet: A lightweight network with multi-scale context fusion for real-time semantic segmentation","volume":"23","author":"Gao","year":"2021","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_29","unstructured":"Li, H., Li, J., Wei, H., Liu, Z., Zhan, Z., and Ren, Q. (2022). Slim-neck by GSConv: A better design paradigm of detector architectures for autonomous vehicles. arXiv."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Agustsson, E., and Timofte, R. (2017, January 21\u201326). Ntire 2017 challenge on single image super-resolution: Dataset and study. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, Honolulu, HI, USA.","DOI":"10.1109\/CVPRW.2017.150"},{"key":"ref_31","unstructured":"Bevilacqua, M., Roumy, A., Guillemot, C., and Alberi-Morel, M.L. (2024, February 06). Low-Complexity Single-Image Super-Resolution Based on Nonnegative Neighbor Embedding British Machine Vision Conference. Available online: https:\/\/api.semanticscholar.org\/CorpusID:5250573."},{"key":"ref_32","unstructured":"Zeyde, R., Elad, M., and Protter, M. (2010, January 24\u201330). On single image scale-up using sparse-representations. Proceedings of the International Conference on Curves and Surfaces, Avignon, France."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"416","DOI":"10.1109\/ICCV.2001.937655","article-title":"A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics","volume":"Volume 2","author":"Martin","year":"2001","journal-title":"Proceedings of the Eighth IEEE International Conference on Computer Vision. ICCV 2001"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Huang, J.B., Singh, A., and Ahuja, N. (2015, January 7\u201312). Single image super-resolution from transformed self-exemplars. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7299156"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"21811","DOI":"10.1007\/s11042-016-4020-z","article-title":"Sketch-based manga retrieval using manga109 dataset","volume":"76","author":"Matsui","year":"2017","journal-title":"Multimed. Tools Appl."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Muqeet, A., Hwang, J., Yang, S., Kang, J., Kim, Y., and Bae, S.H. (2020, January 23\u201328). Multi-attention based ultra lightweight image super-resolution. Proceedings of the European Conference on Computer Vision, Glasgow, UK.","DOI":"10.1007\/978-3-030-67070-2_6"},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"4927","DOI":"10.1109\/TCSVT.2021.3138431","article-title":"Cross-srn: Structure-preserving super-resolution network with cross convolution","volume":"32","author":"Liu","year":"2021","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_38","unstructured":"Gao, G., Li, W., Li, J., Wu, F., Lu, H., and Yu, Y. (March, January 22). Feature distillation interaction weighting network for lightweight image super-resolution. Proceedings of the AAAI Conference on Artificial Intelligence, Online."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/24\/4\/1135\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T13:57:43Z","timestamp":1760104663000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/24\/4\/1135"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,2,9]]},"references-count":38,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2024,2]]}},"alternative-id":["s24041135"],"URL":"https:\/\/doi.org\/10.3390\/s24041135","relation":{},"ISSN":["1424-8220"],"issn-type":[{"type":"electronic","value":"1424-8220"}],"subject":[],"published":{"date-parts":[[2024,2,9]]}}}