{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,15]],"date-time":"2026-06-15T14:34:16Z","timestamp":1781534056746,"version":"3.54.5"},"reference-count":27,"publisher":"MDPI AG","issue":"5","license":[{"start":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T00:00:00Z","timestamp":1777593600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["62262054"],"award-info":[{"award-number":["62262054"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"award":["62262054"],"award-info":[{"award-number":["62262054"]}],"id":[{"id":"https:\/\/ror.org\/01h0zpd94","id-type":"ROR","asserted-by":"publisher"}]},{"name":"Natural Science Foundation of Ningxia Hui Autonomous Region","award":["2026AAC030942"],"award-info":[{"award-number":["2026AAC030942"]}]},{"name":"Science and Technology Research and Development Program Project of Guyuan","award":["2025GKJYF0002"],"award-info":[{"award-number":["2025GKJYF0002"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Algorithms"],"abstract":"<jats:p>The scale features of rock art exhibit significant diversity and graduality. Among the existing semantic segmentation methods for rock art, although some models have taken note of the scale differences in rock art patterns and the complexity of directional features, and proposed targeted improvement strategies, most of these methods view scale adaptation and directional representation as unconnected problems. They fail to model the intrinsic correlation between the scale adaptation and directional representation, and particularly overlook the restrictive effect of scale accuracy on the extraction of directional features. This ultimately leads to the problem of \u201cspatial representation misalignment\u201d in the semantic segmentation of rock art. To address the above problems, this paper proposes a Dynamic Fine-tuning Rotation Network (DFTR-Net), which aims to solve the problems of imprecise scale feature extraction and directional misalignment for rock art patterns with arbitrary orientations. The network consists of a dynamic selective convolution structure and a shapeaware spatial feature extraction module. Specifically, the dynamic selective convolution dynamically adjusts the coverage range of the receptive field through inter-layer feature aggregation. It uses stacked small dilated convolution kernels to replace large convolution kernels with the same receptive field for extracting the neighborhood details of patterns. Then, by combining with feature aggregation, it constructs spatial feature differences and realizes intra-layer dynamic weighted fusion, thereby achieving accurate scale feature extraction. After obtaining fine-grained scale features, the shape-aware module first corrects the initial segmentation candidate regions of the patterns to generate directional guide boxes. Subsequently, it drives the rotational sampling of convolution kernels based on the angles of the guide boxes, forming region-constrained deformable convolutions that adapt to the shape of the patterns. These convolution kernels obtain strong supervision based on pixel-level annotations, which enhances the sensitivity to the directional features of the patterns and effectively alleviates the problem of directional misalignment. Extensive experiments show that DFTR-Net can achieve higher performance on the 3D-pitoti and Petroglyph Annotation datasets compared with the existing methods.<\/jats:p>","DOI":"10.3390\/a19050349","type":"journal-article","created":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T16:54:07Z","timestamp":1777654447000},"page":"349","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Dynamic Fine-Tuning Rotation Network for Semantic Segmentation of Rock Paintings"],"prefix":"10.3390","volume":"19","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5156-1068","authenticated-orcid":false,"given":"Chuanping","family":"Bai","sequence":"first","affiliation":[{"name":"School of Mathematics and Computer Science, Ningxia Normal University, Guyuan 756000, China"},{"name":"Center of Research for Artificial Intelligence and Intelligent Medicine Engineering Technology, Ningxia Normal University, Guyuan 756000, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3021-5371","authenticated-orcid":false,"given":"Donglin","family":"Jing","sequence":"additional","affiliation":[{"name":"Shanghai Aerospace Control Technology Institute, Shanghai 201109, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Zhixue","family":"Wang","sequence":"additional","affiliation":[{"name":"Center of Research for Artificial Intelligence and Intelligent Medicine Engineering Technology, Ningxia Normal University, Guyuan 756000, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Fangqin","family":"Zhang","sequence":"additional","affiliation":[{"name":"Center of Research for Artificial Intelligence and Intelligent Medicine Engineering Technology, Ningxia Normal University, Guyuan 756000, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2026,5,1]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"6505805","DOI":"10.1109\/LGRS.2022.3144513","article-title":"FAR-Net: Fast anchor refining for arbitrary-oriented object detection","volume":"19","author":"Deng","year":"2022","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"4200715","DOI":"10.1109\/TGRS.2026.3663195","article-title":"ETD-Det: Oriented Object Detection with Spectral Diffusion Encoding and Extended-Gaussian Decoding","volume":"64","author":"Zhu","year":"2026","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Seidl, M., and Breiteneder, C. (2012, January 16\u201319). Automated petroglyph image segmentation with interactive classifier fusion. Proceedings of the Eighth Indian Conference on Computer Vision, Graphics and Image Processing, Mumbai, India.","DOI":"10.1145\/2425333.2425399"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Deufemia, V., and Paolino, L. (2014). Segmentation and Recognition of Petroglyphs Using Generic Fourier Descriptors, Springer.","DOI":"10.1007\/978-3-319-07998-1_56"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Poier, G., Seidl, M., Zeppelzauer, M., Reinbacher, C., Schaich, M., Bellandi, G., Marretta, A., and Bischof, H. (2017, January 19\u201321). The 3d-pitoti dataset: A dataset for high-resolution 3D surface segmentation. Proceedings of the 15th International Workshop on Content-Based Multimedia Indexing, Florence, Italy.","DOI":"10.1145\/3095713.3095719"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"473","DOI":"10.26599\/CVM.2025.9450512","article-title":"Open-vocabulary camouflaged object segmentation with cascaded vision language models","volume":"12","author":"Zhao","year":"2026","journal-title":"Comput. Vis. Media"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"104033","DOI":"10.1016\/j.media.2026.104033","article-title":"PCa-Mamba: Spatiotemporal state space models for prostate cancer detection in multi-parametric MRI","volume":"111","author":"Zhao","year":"2026","journal-title":"Med. Image Anal."},{"key":"ref_8","unstructured":"Poier, G., Seidl, M., Zeppelzauer, M., Reinbacher, C., and Bischof, H. (2016). PetroSurf3D\u2014A high-resolution 3D dataset of rock art for surface segmentation. arXiv."},{"key":"ref_9","first-page":"13","article-title":"Deep Segmentation of Corrupted Glyphs","volume":"15","author":"Melnik","year":"2022","journal-title":"ACM Trans. Intell. Syst. Technol."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"17","DOI":"10.1186\/s40494-022-00857-5","article-title":"BEGL: Boundary enhancement with Gaussian Loss for rock-art image segmentation","volume":"11","author":"Bai","year":"2023","journal-title":"Herit. Sci."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015). U-net: Convolutional networks for biomedical image segmentation. International Conference on Medical Image Computing and Computer-Assisted Intervention, Springer International.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"He, K., Gkioxari, G., Doll\u00e1r, P., and Girshick, R. (2017, January 22\u201329). Mask r-cnn. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.322"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"5615515","DOI":"10.1109\/TGRS.2023.3294520","article-title":"Toward hierarchical adaptive alignment for aerial object detection in remote sensing images","volume":"61","author":"Deng","year":"2023","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Zhu, H., and Jing, D. (2024). Optimizing slender target detection in remote sensing with adaptive boundary perception. Remote Sens., 16.","DOI":"10.3390\/rs16142643"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Zhu, Q., Wang, X., Keogh, E., and Lee, S.H. (2009\u20131, January 28). Augmenting the generalized hough transform to enable the mining of petroglyphs. Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Paris, France.","DOI":"10.1145\/1557019.1557133"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1007\/s10618-010-0200-z","article-title":"An efficient and effective similarity measure to enable data mining of petroglyphs","volume":"23","author":"Zhu","year":"2011","journal-title":"Data Min. Knowl. Discov."},{"key":"ref_17","unstructured":"Badrinarayanan, V., Handa, A., and Cipolla, R. (2015). SegNet: A Deep Convolutional Encoder-Decoder Architecture for Robust Semantic Pixel-Wise Labelling. arXiv."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Chen, L.C., Zhu, Y., Papandreou, G., Schroff, F., and Adam, H. (2018). Encoder-decoder with atrous separable convolution for semantic image segmentation. Proceedings of the European Conference on Computer Vision (ECCV), Springer International.","DOI":"10.1007\/978-3-030-01234-2_49"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Chen, L., Yi, Y., Jiang, W., Wei, X., and Yuille, A. (2016). Attention to Scale: Scale-Aware Semantic Image Segmentation. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Springer International.","DOI":"10.1109\/CVPR.2016.396"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Yu, C., Wang, J., Peng, C., Gao, C., Yu, G., and Sang, N. (2018). BiSeNet: Bilateral segmentation network for real-time semantic segmentation. Proceedings of the European Conference on Computer Vision (ECCV), Springer International.","DOI":"10.1007\/978-3-030-01261-8_20"},{"key":"ref_21","unstructured":"Hong, Y., Pan, X., Sun, W., Liu, D., Cheng, G., Ren, J., and Shao, L. (2021). Deep dual-resolution networks for real-time and accurate semantic segmentation of traffic scenes. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE."},{"key":"ref_22","first-page":"3349","article-title":"Deep high-resolution representation learning for visual recognition","volume":"43","author":"Sun","year":"2020","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell. (TPAMI)"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Zhao, H., Qi, X., Shen, X., Shi, J., and Jia, J. (2018). ICNet for real-time semantic segmentation on high-resolution images. Proceedings of the European Conference on Computer Vision (ECCV), Springer International.","DOI":"10.1007\/978-3-030-01219-9_25"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Yuan, Y., Chen, X., and Wang, J. (2020). Object-contextual representations for semantic segmentation. European Conference on Computer Vision (ECCV), Springer International.","DOI":"10.1007\/978-3-030-58539-6_11"},{"key":"ref_25","first-page":"3467","article-title":"SegNeXt: Rethinking convolutional attention design for semantic segmentation","volume":"35","author":"Guo","year":"2022","journal-title":"Adv. Neural Inf. Process. Syst. (NeurIPS)"},{"key":"ref_26","first-page":"12077","article-title":"SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers","volume":"34","author":"Xie","year":"2021","journal-title":"Adv. Neural Inf. Process. Syst. (NeurIPS)"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Cheng, B., Misra, I., Schwing, A.G., Kirillov, A., and Girdhar, R. (2022). Masked-attention Mask Transformer for Universal Image Segmentation. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE.","DOI":"10.1109\/CVPR52688.2022.00135"}],"container-title":["Algorithms"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-4893\/19\/5\/349\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,5,15]],"date-time":"2026-05-15T04:13:37Z","timestamp":1778818417000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-4893\/19\/5\/349"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,5,1]]},"references-count":27,"journal-issue":{"issue":"5","published-online":{"date-parts":[[2026,5]]}},"alternative-id":["a19050349"],"URL":"https:\/\/doi.org\/10.3390\/a19050349","relation":{},"ISSN":["1999-4893"],"issn-type":[{"value":"1999-4893","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,5,1]]}}}