{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,23]],"date-time":"2026-03-23T10:55:29Z","timestamp":1774263329856,"version":"3.50.1"},"reference-count":66,"publisher":"MDPI AG","issue":"3","license":[{"start":{"date-parts":[[2026,3,22]],"date-time":"2026-03-22T00:00:00Z","timestamp":1774137600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Jilin Provincial Department of Education Science and Technology Research Project","award":["JJKH20251376KJ"],"award-info":[{"award-number":["JJKH20251376KJ"]}]},{"name":"Jilin Provincial Natural Science Foundation Project","award":["YDZJ202501ZYTS589"],"award-info":[{"award-number":["YDZJ202501ZYTS589"]}]},{"name":"Jilin Provincial Natural Science Foundation Project","award":["YDZJ202501ZYTS619"],"award-info":[{"award-number":["YDZJ202501ZYTS619"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Algorithms"],"abstract":"<jats:p>This paper presents a channel segmentation proofreading network for crack counting with imbalanced samples. The network is built by stacking basic blocks called channel segmentation proofreading blocks, which are composed of the Approximate Overlapping Window Transformer and the Counting Proofreading Module. The former is designed to extract sufficient high-level semantic information, enhancing the ability of the network to judge crack quantities. Guided by the calculation results of the self-attention mechanism in the classical Transformer, Approximate Overlapping Window Transformer employs distinct computation steps to obtain the same results. Confining the computation process within overlapping windows, we continuously adjust to obtain the most suitable feature extraction process and internal structure for crack counting. Furthermore, to prevent the misidentification of multiple cracks as a single crack due to incorrect connection predictions of crack regions, the Counting Proofreading Module employs channel separation techniques. Following the concept of splitting positive and negative weights, it constructs positive and negative values with different characteristics, further confirming crack regions. Through the combined action of both components, when trained and tested on the crack counting dataset, our network achieves optimal results across all metrics.<\/jats:p>","DOI":"10.3390\/a19030236","type":"journal-article","created":{"date-parts":[[2026,3,23]],"date-time":"2026-03-23T09:48:42Z","timestamp":1774259322000},"page":"236","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Channel Segmentation Proofreading Network for Crack Counting with Imbalanced Samples"],"prefix":"10.3390","volume":"19","author":[{"given":"Mingsi","family":"Sun","sequence":"first","affiliation":[{"name":"College of Computer Science and Technology, Changchun University, Changchun 130022, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Fangai","family":"Xu","sequence":"additional","affiliation":[{"name":"China Mobile Communications Group Jilin Co., Ltd., Changchun 130061, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Fachao","family":"Zhang","sequence":"additional","affiliation":[{"name":"Longwang Township Comprehensive Service Center, Nongan County, Changchun 130218, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3265-6461","authenticated-orcid":false,"given":"Jian","family":"Zhao","sequence":"additional","affiliation":[{"name":"College of Computer Science and Technology, Changchun University, Changchun 130022, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hongwei","family":"Zhao","sequence":"additional","affiliation":[{"name":"College of Computer Science and Technology, Jilin University, Changchun 130012, China"},{"name":"Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2026,3,22]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1016\/j.neucom.2019.01.036","article-title":"DeepCrack: A Deep Hierarchical Feature Learning Architecture for Crack Segmentation","volume":"338","author":"Liu","year":"2019","journal-title":"Neurocomputing"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Schmugge, S.J., Rice, L., Lindberg, J., Grizziy, R., Joffey, C., and Shin, M.C. (2017). Crack segmentation by leveraging multiple frames of varying illumination. Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision (WACV), Santa Rosa, CA, USA, 24\u201331 March 2017, IEEE.","DOI":"10.1109\/WACV.2017.121"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"1525","DOI":"10.1109\/TITS.2019.2910595","article-title":"Feature pyramid and hierarchical boosting network for pavement crack detection","volume":"21","author":"Yang","year":"2019","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"104646","DOI":"10.1016\/j.autcon.2022.104646","article-title":"Pavement crack detection based on transformer network","volume":"145","author":"Guo","year":"2023","journal-title":"Autom. Constr."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"809","DOI":"10.1049\/ipr2.12388","article-title":"Road crack detection network under noise based on feature pyramid structure with feature enhancement (road crack detection under noise)","volume":"16","author":"Sun","year":"2022","journal-title":"IET Image Process."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Inoue, Y., and Nagayoshi, H. (2021). Crack detection as a weakly-supervised problem: Towards achieving less annotation-intensive crack detectors. Proceedings of the 2020 25th International Conference on Pattern Recognition (ICPR), Milan, Italy, 10\u201315 January 2021, IEEE.","DOI":"10.1109\/ICPR48806.2021.9412041"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Kirillov, A., Mintun, E., Ravi, N., Mao, H., Rolland, C., Gustafson, L., Xiao, T., Whitehead, S., Berg, A.C., and Lo, W.Y. (2023, January 2\u20136). Segment anything. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Paris, France.","DOI":"10.1109\/ICCV51070.2023.00371"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"105899","DOI":"10.1016\/j.autcon.2024.105899","article-title":"Semi-supervised crack detection using segment anything model and deep transfer learning","volume":"170","author":"Li","year":"2025","journal-title":"Autom. Constr."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"59519","DOI":"10.1007\/s11042-023-17846-w","article-title":"A multi-task mean teacher with two stage decoder for semi-supervised crack detection","volume":"83","author":"Sun","year":"2024","journal-title":"Multimed. Tools Appl."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., and Sun, G. (2018, January 18\u201323). Squeeze-and-excitation networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Subirats, P., Dumoulin, J., Legeay, V., and Barba, D. (2006). Automation of pavement surface crack detection using the continuous wavelet transform. Proceedings of the 2006 International Conference on Image Processing, Las Vegas, NV, USA, 26\u201329 June 2006, IEEE.","DOI":"10.1109\/ICIP.2006.313007"},{"key":"ref_12","unstructured":"Huang, W., and Zhang, N. (2012). A novel road crack detection and identification method using digital image processing techniques. Proceedings of the 2012 7th International Conference on Computing and Convergence Technology (ICCCT), Seoul, Republic of Korea, 3\u20135 December 2012, IEEE."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Xu, W., Tang, Z., Zhou, J., and Ding, J. (2013). Pavement crack detection based on saliency and statistical features. Proceedings of the 2013 IEEE International Conference on Image Processing, Melbourne, Australia, 15\u201318 September 2013, IEEE.","DOI":"10.1109\/ICIP.2013.6738843"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1016\/j.patrec.2011.11.004","article-title":"CrackTree: Automatic crack detection from pavement images","volume":"33","author":"Zou","year":"2012","journal-title":"Pattern Recognit. Lett."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Tang, J., and Gu, Y. (2013). Automatic crack detection and segmentation using a hybrid algorithm for road distress analysis. Proceedings of the 2013 IEEE International Conference on Systems, Man, and Cybernetics, Manchester, UK, 13\u201316 October 2013, IEEE.","DOI":"10.1109\/SMC.2013.516"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Kapela, R., \u015aniata\u0142a, P., Turkot, A., Rybarczyk, A., Po\u017carycki, A., Rydzewski, P., Wycza\u0142ek, M., and B\u0142och, A. (2015). Asphalt surfaced pavement cracks detection based on histograms of oriented gradients. Proceedings of the 2015 22nd International Conference Mixed Design of Integrated Circuits & Systems (MIXDES), Toru\u0144, Poland, 25\u201327 June 2015, IEEE.","DOI":"10.1109\/MIXDES.2015.7208590"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"608","DOI":"10.1109\/TITS.2015.2482222","article-title":"A simplified computer vision system for road surface inspection and maintenance","volume":"17","author":"Quintana","year":"2015","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Zakeri, H., Nejad, F.M., Fahimifar, A., Torshizi, A.D., and Zarandi, M.F. (2013). A multi-stage expert system for classification of pavement cracking. Proceedings of the 2013 Joint IFSA World Congress and NAFIPS Annual Meeting (IFSA\/NAFIPS), Edmonton, AB, Canada, 24\u201328 June 2013, IEEE.","DOI":"10.1109\/IFSA-NAFIPS.2013.6608558"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Maode, Y., Shaobo, B., Kun, X., and Yuyao, H. (2007). Pavement crack detection and analysis for high-grade highway. Proceedings of the 2007 8th International Conference on Electronic Measurement and Instruments, Xi\u2019an, China, 16\u201318 August 2007, IEEE.","DOI":"10.1109\/ICEMI.2007.4351202"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"2718","DOI":"10.1109\/TITS.2015.2477675","article-title":"Automatic crack detection on two-dimensional pavement images: An algorithm based on minimal path selection","volume":"17","author":"Amhaz","year":"2016","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Zhang, L., Yang, F., Zhang, Y.D., and Zhu, Y.J. (2016). Road crack detection using deep convolutional neural network. Proceedings of the 2016 IEEE International Conference on Image Processing (ICIP), Phoenix, AZ, USA, 25\u201328 September 2016, IEEE.","DOI":"10.1109\/ICIP.2016.7533052"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Pauly, L., Hogg, D., Fuentes, R., and Peel, H. (2017). Deeper networks for pavement crack detection. Proceedings of the 34th ISARC, Taipei, Taiwan, 28 June\u20131 July 2017, IAARC.","DOI":"10.22260\/ISARC2017\/0066"},{"key":"ref_23","first-page":"298","article-title":"Deep active learning for civil infrastructure defect detection and classification","volume":"2017","author":"Feng","year":"2017","journal-title":"Comput. Civ. Eng."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Eisenbach, M., Stricker, R., Seichter, D., Amende, K., Debes, K., Sesselmann, M., Ebersbach, D., Stoeckert, U., and Gross, H.M. (2017). How to get pavement distress detection ready for deep learning? A systematic approach. Proceedings of the 2017 International Joint Conference on Neural Networks (IJCNN), Anchorage, AK, USA, 14\u201319 May 2017, IEEE.","DOI":"10.1109\/IJCNN.2017.7966101"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Chen, Z., Zhang, J., Lai, Z., Zhu, G., Liu, Z., Chen, J., and Li, J. (2023, January 2\u20136). The devil is in the crack orientation: A new perspective for crack detection. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Paris, France.","DOI":"10.1109\/ICCV51070.2023.00612"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"121346","DOI":"10.1016\/j.eswa.2023.121346","article-title":"MOD-YOLO: Rethinking the YOLO architecture at the level of feature information and applying it to crack detection","volume":"237","author":"Su","year":"2024","journal-title":"Expert Syst. Appl."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Dong, X., Liu, Y., and Dai, J. (2024). Concrete surface crack detection algorithm based on improved YOLOv8. Sensors, 24.","DOI":"10.3390\/s24165252"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"114892","DOI":"10.1109\/ACCESS.2020.3003638","article-title":"Automated pavement crack segmentation using u-net-based convolutional neural network","volume":"8","author":"Lau","year":"2020","journal-title":"IEEE Access"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Xu, H., Su, X., Wang, Y., Cai, H., Cui, K., and Chen, X. (2019). Automatic bridge crack detection using a convolutional neural network. Appl. Sci., 9.","DOI":"10.3390\/app9142867"},{"key":"ref_30","first-page":"1","article-title":"Tunnel crack detection with linear seam based on mixed attention and multiscale feature fusion","volume":"71","author":"Zhou","year":"2022","journal-title":"IEEE Trans. Instrum. Meas."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"911","DOI":"10.1007\/s11760-021-02034-w","article-title":"DcsNet: A real-time deep network for crack segmentation","volume":"16","author":"Pang","year":"2022","journal-title":"Signal Image Video Process."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"51740","DOI":"10.1109\/ACCESS.2023.3279888","article-title":"Adaptive Canny and Semantic Segmentation Networks Based on Feature Fusion for Road Crack Detection","volume":"11","author":"Luo","year":"2023","journal-title":"IEEE Access"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Khan, M.A.M., Kee, S.H., and Nahid, A.A. (2023). Vision-based concrete-crack detection on railway sleepers using dense U-Net model. Algorithms, 16.","DOI":"10.20944\/preprints202311.1952.v1"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Ai, W., Zou, J., Liu, Z., Wang, S., and Teng, S. (2025). Light propagation and multi-scale enhanced DeepLabV3+ for underwater crack detection. Algorithms, 18.","DOI":"10.3390\/a18080462"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., Lin, S., and Guo, B. (2021, January 10\u201317). Swin transformer: Hierarchical vision transformer using shifted windows. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, QC, Canada.","DOI":"10.1109\/ICCV48922.2021.00986"},{"key":"ref_36","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Xie, S., Girshick, R., Doll\u00e1r, P., Tu, Z., and He, K. (2017, January 21\u201326). Aggregated residual transformations for deep neural networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.634"},{"key":"ref_39","unstructured":"Tan, M., and Le, Q. (2019). Efficientnet: Rethinking model scaling for convolutional neural networks. Proceedings of the International Conference on Machine Learning, Long Beach, CA, USA, 9\u201315 June 2019, PMLR."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Liu, Z., Mao, H., Wu, C.Y., Feichtenhofer, C., Darrell, T., and Xie, S. (2022, January 19\u201320). A convnet for the 2020s. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.01167"},{"key":"ref_41","unstructured":"Trockman, A., and Kolter, J.Z. (2022). Patches are all you need?. arXiv."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Lou, M., and Yu, Y. (2025, January 11\u201315). OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels. Proceedings of the Computer Vision and Pattern Recognition Conference, Nashville, TN, USA.","DOI":"10.1109\/CVPR52734.2025.00021"},{"key":"ref_43","unstructured":"Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., and Gelly, S. (2020). An image is worth 16 \u00d7 16 words: Transformers for image recognition at scale. arXiv."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Wang, W., Xie, E., Li, X., Fan, D.P., Song, K., Liang, D., Lu, T., Luo, P., and Shao, L. (2021, January 10\u201317). Pyramid vision transformer: A versatile backbone for dense prediction without convolutions. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, QC, Canada.","DOI":"10.1109\/ICCV48922.2021.00061"},{"key":"ref_45","unstructured":"Mehta, S., and Rastegari, M. (2021). Mobilevit: Light-weight, general-purpose, and mobile-friendly vision transformer. arXiv."},{"key":"ref_46","unstructured":"Liu, J., Huang, X., Liu, Y., and Li, H. (2022). Mixmim: Mixed and masked image modeling for efficient visual representation learning. arXiv."},{"key":"ref_47","unstructured":"Touvron, H., Cord, M., Douze, M., Massa, F., Sablayrolles, A., and J\u00e9gou, H. (2021). Training data-efficient image transformers & distillation through attention. Proceedings of the International Conference on Machine Learning, Virtual, 18\u201324 July 2021, PMLR."},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Shi, D. (2024, January 17\u201318). Transnext: Robust foveal visual perception for vision transformers. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR52733.2024.01683"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Fan, Q., Huang, H., Chen, M., Liu, H., and He, R. (2024, January 17\u201318). Rmt: Retentive networks meet vision transformers. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR52733.2024.00539"},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Su, L., Ma, X., Zhu, X., Niu, C., Lei, Z., and Zhou, J.Z. (2025, January 3). Can we get rid of handcrafted feature extractors? sparsevit: Nonsemantics-centered, parameter-efficient image manipulation localization through spare-coding transformer. Proceedings of the AAAI Conference on Artificial Intelligence, Philadelphia, PA, USA.","DOI":"10.1609\/aaai.v39i7.32754"},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Wang, Z., Liu, Y., Tian, Y., Liu, Y., Wang, Y., and Ye, Q. (2025, January 11\u201315). Building Vision Models upon Heat Conduction. Proceedings of the Computer Vision and Pattern Recognition Conference, Nashville, TN, USA.","DOI":"10.1109\/CVPR52734.2025.00907"},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Xu, J., Le, H., Nguyen, V., Ranjan, V., and Samaras, D. (2023, January 17\u201324). Zero-shot object counting. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52729.2023.01492"},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Zhu, H., Yuan, J., Yang, Z., Guo, Y., Wang, Z., Zhong, X., and He, S. (2024). Zero-shot object counting with good exemplars. Proceedings of the European Conference on Computer Vision, Milan, Italy, 29 September\u20134 October 2024, Springer.","DOI":"10.1007\/978-3-031-72652-1_22"},{"key":"ref_54","doi-asserted-by":"crossref","first-page":"111998","DOI":"10.1016\/j.patcog.2025.111998","article-title":"Dynamic example network for class-agnostic object counting","volume":"170","author":"Liu","year":"2026","journal-title":"Pattern Recognit."},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"He, J., Li, P., Geng, Y., and Xie, X. (2023, January 18\u201322). Fastinst: A simple query-based model for real-time instance segmentation. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52729.2023.02266"},{"key":"ref_56","doi-asserted-by":"crossref","unstructured":"Wei, Z., Chen, P., Yu, X., Li, G., Jiao, J., and Han, Z. (2024, January 17\u201318). Semantic-aware sam for point-prompted instance segmentation. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR52733.2024.00344"},{"key":"ref_57","doi-asserted-by":"crossref","first-page":"042074","DOI":"10.1088\/1742-6596\/1529\/4\/042074","article-title":"Analysis of surface crack using various crack growth models","volume":"1529","author":"Akramin","year":"2020","journal-title":"J. Phys. Conf. Ser."},{"key":"ref_58","doi-asserted-by":"crossref","first-page":"4637","DOI":"10.1007\/s00477-024-02743-x","article-title":"Image classification for sub-surface crack identification in concrete dam based on borehole CCTV images using deep dense hybrid model","volume":"39","author":"Dai","year":"2025","journal-title":"Stoch. Environ. Res. Risk Assess."},{"key":"ref_59","doi-asserted-by":"crossref","first-page":"3434","DOI":"10.1109\/TITS.2016.2552248","article-title":"Automatic road crack detection using random structured forests","volume":"17","author":"Shi","year":"2016","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_60","doi-asserted-by":"crossref","unstructured":"Li, J., Wen, Y., and He, L. (2023, January 18\u201322). SCConv: Spatial and Channel Reconstruction Convolution for Feature Redundancy. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52729.2023.00596"},{"key":"ref_61","doi-asserted-by":"crossref","unstructured":"Howard, A., Sandler, M., Chu, G., Chen, L.C., Chen, B., Tan, M., Wang, W., Zhu, Y., Pang, R., and Vasudevan, V. (2019, January 27\u201328). Searching for mobilenetv3. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea.","DOI":"10.1109\/ICCV.2019.00140"},{"key":"ref_62","doi-asserted-by":"crossref","unstructured":"Ding, M., Xiao, B., Codella, N., Luo, P., Wang, J., and Yuan, L. (2022). Davit: Dual attention vision transformers. Proceedings of the Computer Vision\u2013ECCV 2022: 17th European Conference, Tel Aviv, Israel, 23\u201327 October 2022, Springer. Proceedings, Part XXIV.","DOI":"10.1007\/978-3-031-20053-3_5"},{"key":"ref_63","doi-asserted-by":"crossref","unstructured":"Ding, X., Zhang, X., Han, J., and Ding, G. (2022, January 18\u201324). Scaling up your kernels to 31x31: Revisiting large kernel design in cnns. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.01166"},{"key":"ref_64","doi-asserted-by":"crossref","unstructured":"Mangalam, K., Fan, H., Li, Y., Wu, C.Y., Xiong, B., Feichtenhofer, C., and Malik, J. (2022, January 18\u201324). Reversible vision transformers. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.01056"},{"key":"ref_65","unstructured":"Han, D., Li, T., Wang, Z., and Huang, G. (2025). Vision Transformers are Circulant Attention Learners. arXiv."},{"key":"ref_66","unstructured":"Wang, F., Ren, S., Zhang, T., Neskovic, P., Bhattad, A., Xie, C., and Yuille, A. (2026). ViT-5: Vision Transformers for The Mid-2020s. arXiv."}],"container-title":["Algorithms"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-4893\/19\/3\/236\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,3,23]],"date-time":"2026-03-23T09:58:36Z","timestamp":1774259916000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-4893\/19\/3\/236"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,3,22]]},"references-count":66,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2026,3]]}},"alternative-id":["a19030236"],"URL":"https:\/\/doi.org\/10.3390\/a19030236","relation":{},"ISSN":["1999-4893"],"issn-type":[{"value":"1999-4893","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,3,22]]}}}