{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,11]],"date-time":"2026-04-11T09:59:04Z","timestamp":1775901544637,"version":"3.50.1"},"reference-count":58,"publisher":"Association for Computing Machinery (ACM)","issue":"4","funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["62371344"],"award-info":[{"award-number":["62371344"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100012226","name":"Fundamental Research Funds for the Central Universities","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100012226","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2026,4,30]]},"abstract":"<jats:p>3D reconstruction faces challenges such as geometric warping and structural ambiguity, particularly in intricate topologies, heavy occlusions and complex backgrounds. These problems are partly attributed to excessive feature entanglement, which induces semantic confusion and spatial ambiguity. To address these limitations, we propose an occlusion sensitivity-guided semantic-disentangled Mamba-CNN network that enables human-controllable disentanglement of multi-attribute information within a bias-free semi-supervised framework. Specifically, we create various occlusion conditions and assign pseudo-labels to the augmented data within a semi-supervised framework, which enables the exploration of occlusion sensitivity of different semantic attributes for human-controllable semantic disentanglement. To reduce the bias between the augmented samples and their assigned pseudo-labels, we use linear PIoU and nonlinear MS-SSIM algorithms to calculate the confidence of pseudo-labels, which minimizes the error propagation caused by the bias. Then, we develop a disentangled multi-depth Mamba-CNN block that combines CNN\u2019s local feature extraction capability with Mamba\u2019s ability to capture long-range dependencies. This allows our model to effectively capture disentangled multi-attribute spatial features and semantic representations. However, critical cross-level semantic attribute connections could be lost in the disentanglement process. To tackle this, we propose a multi-attribute semantic query block to dynamically re-establish these connections and minimize cross-attribute information loss. Extensive quantitative and qualitative evaluations on object and face reconstruction demonstrate that our method outperforms existing state-of-the-art approaches. Codes and all data are publicly available at https:\/\/github.com\/Ray-tju\/sensitivity-guided-semantic-disentangled-Mamba-CNN.<\/jats:p>","DOI":"10.1145\/3796709","type":"journal-article","created":{"date-parts":[[2026,2,23]],"date-time":"2026-02-23T14:05:27Z","timestamp":1771855527000},"page":"1-24","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Bias-Free Semi-Supervised 3D Reconstruction via Occlusion Sensitivity-Guided Semantic Disentanglement"],"prefix":"10.1145","volume":"22","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0253-516X","authenticated-orcid":false,"given":"Lei","family":"Li","sequence":"first","affiliation":[{"name":"College of Electronic and Information Engineering, Tongji University, Shanghai, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5312-9903","authenticated-orcid":false,"given":"Fuqiang","family":"Liu","sequence":"additional","affiliation":[{"name":"College of Electronic and Information Engineering, Tongji University, Shanghai, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-2657-0343","authenticated-orcid":false,"given":"Yanni","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Information Science and Technology, Fudan University, Shanghai, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1838-8336","authenticated-orcid":false,"given":"Junyuan","family":"Wang","sequence":"additional","affiliation":[{"name":"College of Electronic and Information Engineering, Tongji University, Shanghai, China"}]}],"member":"320","published-online":{"date-parts":[[2026,4,11]]},"reference":[{"key":"e_1_3_1_2_2","doi-asserted-by":"crossref","first-page":"108","DOI":"10.1016\/j.neunet.2022.09.019","article-title":"3D face-model reconstruction from a single image: A feature aggregation approach using hierarchical transformer with weak supervision","volume":"156","author":"Basak Shubhajit","year":"2022","unstructured":"Shubhajit Basak, Peter Corcoran, Rachel McDonnell, and Michael Schukat. 2022. 3D face-model reconstruction from a single image: A feature aggregation approach using hierarchical transformer with weak supervision. Neural Networks 156 (2022), 108\u2013122.","journal-title":"Neural Networks"},{"issue":"11","key":"e_1_3_1_3_2","first-page":"2399","article-title":"Manifold regularization: A geometric framework for learning from labeled and unlabeled examples","volume":"7","author":"Belkin Mikhail","year":"2006","unstructured":"Mikhail Belkin, Partha Niyogi, and Vikas Sindhwani. 2006. Manifold regularization: A geometric framework for learning from labeled and unlabeled examples. Journal of Machine Learning Research 7, 11 (2006), 2399\u20132434.","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_1_4_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2017.03.006"},{"key":"e_1_3_1_5_2","first-page":"92","volume-title":"Proceedings of the 11th Annual Conference on Computational Learning Theory","author":"Blum Avrim","year":"1998","unstructured":"Avrim Blum and Tom Mitchell. 1998. Combining labeled and unlabeled data with co-training. In Proceedings of the 11th Annual Conference on Computational Learning Theory, 92\u2013100."},{"key":"e_1_3_1_6_2","first-page":"1513","volume-title":"Proceedings of the IEEE International Conference on Computer Vision","author":"Burgos-Artizzu Xavier P.","year":"2013","unstructured":"Xavier P. Burgos-Artizzu, Pietro Perona, and Piotr Doll\u00e1r. 2013. Robust face landmark estimation under occlusion. In Proceedings of the IEEE International Conference on Computer Vision, 1513\u20131520."},{"issue":"2","key":"e_1_3_1_7_2","doi-asserted-by":"crossref","first-page":"177","DOI":"10.1007\/s11263-013-0667-3","article-title":"Face alignment by explicit shape regression","volume":"107","author":"Cao Xudong","year":"2014","unstructured":"Xudong Cao, Yichen Wei, Fang Wen, and Jian Sun. 2014. Face alignment by explicit shape regression. International Journal of Computer Vision 107, 2 (2014), 177\u2013190.","journal-title":"International Journal of Computer Vision"},{"key":"e_1_3_1_8_2","unstructured":"Angel X. Chang Thomas Funkhouser Leonidas Guibas Pat Hanrahan Qixing Huang Zimo Li Silvio Savarese Manolis Savva Shuran Song Hao Su et al. 2015. ShapeNet: An information-rich 3D model repository. arXiv:1512.03012. Retrieved from https:\/\/arxiv.org\/abs\/1512.03012"},{"key":"e_1_3_1_9_2","doi-asserted-by":"crossref","first-page":"446","DOI":"10.1016\/j.neuroimage.2017.04.041","article-title":"VoxResNet: Deep voxelwise residual networks for brain segmentation from 3D MR images","volume":"170","author":"Chen Hao","year":"2018","unstructured":"Hao Chen, Qi Dou, Lequan Yu, Jing Qin, and Pheng-Ann Heng. 2018. VoxResNet: Deep voxelwise residual networks for brain segmentation from 3D MR images. NeuroImage 170 (2018), 446\u2013455.","journal-title":"NeuroImage"},{"key":"e_1_3_1_10_2","first-page":"628","volume-title":"Proceedings of the European Conference on Computer Vision","author":"Choy Christopher B.","year":"2016","unstructured":"Christopher B. Choy, Danfei Xu, JunYoung Gwak, Kevin Chen, and Silvio Savarese. 2016. 3D-R2N2: A unified approach for single and multi-view 3D object reconstruction. In Proceedings of the European Conference on Computer Vision. Springer, 628\u2013644."},{"issue":"1","key":"e_1_3_1_11_2","doi-asserted-by":"crossref","first-page":"38","DOI":"10.1006\/cviu.1995.1004","article-title":"Active shape models-their training and application","volume":"61","author":"Cootes Timothy F.","year":"1995","unstructured":"Timothy F. Cootes, Christopher J. Taylor, David H. Cooper, and Jim Graham. 1995. Active shape models-their training and application. Computer Vision and Image Understanding 61, 1 (1995), 38\u201359.","journal-title":"Computer Vision and Image Understanding"},{"key":"e_1_3_1_12_2","first-page":"1","volume-title":"Proceedings of the 2023 International Joint Conference on Neural Networks","author":"Deng Jia","year":"2023","unstructured":"Jia Deng, Xiaofei Li, Xing Wang, and Xiangzheng Li. 2023. Deformable feature interaction network and graph structure reasoning for 3D dense alignment and face reconstruction. In Proceedings of the 2023 International Joint Conference on Neural Networks. IEEE, 1\u20138."},{"key":"e_1_3_1_13_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3713075","article-title":"Unleashing creativity in the metaverse: Generative AI and multimodal content","volume":"21","author":"El Saddik Abdulmotaleb","year":"2024","unstructured":"Abdulmotaleb El Saddik, Jamil Ahmad, Mustaqeem Khan, Saad Abouzahir, and Wail Gueaieb. 2024. Unleashing creativity in the metaverse: Generative AI and multimodal content. ACM Transactions on Multimedia Computing, Communications, and Applications 21 (2024), 1\u201343.","journal-title":"ACM Transactions on Multimedia Computing, Communications, and Applications"},{"key":"e_1_3_1_14_2","first-page":"605","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Fan Haoqiang","year":"2017","unstructured":"Haoqiang Fan, Hao Su, and Leonidas J. Guibas. 2017. A point set generation network for 3D object reconstruction from a single image. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 605\u2013613."},{"issue":"8","key":"e_1_3_1_15_2","first-page":"4803","article-title":"Domain-adaptive crowd counting via high-quality image translation and density reconstruction","volume":"34","author":"Gao Junyu","year":"2021","unstructured":"Junyu Gao, Tao Han, Yuan, Yuan, and Qi Wang. 2021. Domain-adaptive crowd counting via high-quality image translation and density reconstruction. IEEE Transactions on Neural Networks and Learning Systems 34, 8 (2021), 4803\u20134815.","journal-title":"IEEE Transactions on Neural Networks and Learning Systems"},{"issue":"2","key":"e_1_3_1_16_2","doi-asserted-by":"crossref","first-page":"1422","DOI":"10.1109\/TAES.2024.3456748","article-title":"Imbalanced aircraft data anomaly detection","volume":"61","author":"Gao Junyu","year":"2025","unstructured":"Junyu Gao, Hao Yang, Da Zhang, Yuan, Yuan, and Xuelong Li. 2025. Imbalanced aircraft data anomaly detection. IEEE Transactions on Aerospace and Electronic Systems 61, 2 (2025), 1422\u20131432.","journal-title":"IEEE Transactions on Aerospace and Electronic Systems"},{"key":"e_1_3_1_17_2","first-page":"216","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Groueix Thibault","year":"2018","unstructured":"Thibault Groueix, Matthew Fisher, Vladimir G. Kim, Bryan C. Russell, and Mathieu Aubry. 2018. A papier-m\u00e2ch\u00e9 approach to learning 3D surface generation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 216\u2013224."},{"key":"e_1_3_1_18_2","first-page":"1474","article-title":"Hippo: Recurrent memory with optimal polynomial projections","volume":"33","author":"Gu Albert","year":"2020","unstructured":"Albert Gu, Tri Dao, Stefano Ermon, Atri Rudra, and Christopher R\u00e9. 2020. Hippo: Recurrent memory with optimal polynomial projections. In Advances in Neural Information Processing Systems, Vol. 33, 1474\u20131487.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_1_19_2","unstructured":"Albert Gu Karan Goel and Christopher R\u00e9. 2021. Efficiently modeling long sequences with structured state spaces. arXiv:2111.00396. Retrieved from https:\/\/arxiv.org\/abs\/2111.00396"},{"key":"e_1_3_1_20_2","doi-asserted-by":"crossref","first-page":"3039","DOI":"10.1016\/S1573-4412(05)80019-4","article-title":"State-space models","volume":"4","author":"Hamilton James D.","year":"1994","unstructured":"James D. Hamilton. 1994. State-space models. Handbook of Econometrics, Vol. 4. J. J. Heckman and E. E. Leamer (Eds.), 3039\u20133080.","journal-title":"Handbook of Econometrics"},{"key":"e_1_3_1_21_2","first-page":"1","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision Workshops","author":"Jiang Lei","year":"2019","unstructured":"Lei Jiang, Xiao-Jun Wu, and Josef Kittler. 2019. Dual attention MobDenseNet (DAMDNet) for robust 3D face alignment. In Proceedings of the IEEE\/CVF International Conference on Computer Vision Workshops, 1\u201310."},{"key":"e_1_3_1_22_2","unstructured":"Diederik P. Kingma and Max Welling. 2013. Auto-encoding variational bayes. arXiv:1312.6114. Retrieved from https:\/\/arxiv.org\/abs\/1312.6114"},{"key":"e_1_3_1_23_2","first-page":"3404","volume-title":"Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision (WACV)","author":"Koch Sebastian","year":"2024","unstructured":"Sebastian Koch, Pedro Hermosilla, Narunas Vaskevicius, Mirco Colosi, and Timo Ropinski. 2024. SGRec3D: Self-supervised 3D scene graph learning via object-level scene reconstruction. In Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision (WACV). IEEE, 3404\u20133414."},{"key":"e_1_3_1_24_2","first-page":"2144","volume-title":"Proceedings of the IEEE International Conference on Computer Vision Workshops","author":"Koestinger Martin","year":"2011","unstructured":"Martin Koestinger, Paul Wohlhart, Peter M. Roth, and Horst Bischof. 2011. Annotated facial landmarks in the wild: A large-scale, real-world database for facial landmark localization. In Proceedings of the IEEE International Conference on Computer Vision Workshops, 2144\u20132151."},{"key":"e_1_3_1_25_2","first-page":"896","volume-title":"Proceedings of the Workshop on Challenges in Representation Learning, ICML","volume":"3","author":"Hyun Lee Dong","year":"2013","unstructured":"Dong Hyun Lee. 2013. Pseudo-label: The simple and efficient semi-supervised learning method for deep neural networks. In Proceedings of the Workshop on Challenges in Representation Learning, ICML, Vol. 3, 896."},{"key":"e_1_3_1_26_2","first-page":"4265","volume-title":"Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing","author":"Li Lei","year":"2021","unstructured":"Lei Li, Xiangzheng Li, Kangbo Wu, Kui Lin, and Suping Wu. 2021. Multi-granularity feature interaction and relation reasoning for 3D dense alignment and face reconstruction. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 4265\u20134269."},{"key":"e_1_3_1_27_2","first-page":"1","volume-title":"Proceedings of the 2024 International Joint Conference on Neural Networks (IJCNN)","author":"Li Lei","year":"2024","unstructured":"Lei Li, Fuqiang Liu, Qi Wang, Jiahao Li, Junyuan Wang, Xiang Li, and Yanni Wang. 2024. Multi-attribute dynamic learning for 3D face reconstruction and dense alignment in the unconstrained environment. In Proceedings of the 2024 International Joint Conference on Neural Networks (IJCNN). IEEE, 1\u20138."},{"issue":"3","key":"e_1_3_1_28_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3568678","article-title":"Multi-scale edge-guided learning for 3D reconstruction","volume":"19","author":"Li Lei","year":"2023","unstructured":"Lei Li, Zhiyuan Zhou, Suping Wu, and Yongrong Cao. 2023. Multi-scale edge-guided learning for 3D reconstruction. ACM Transactions on Multimedia Computing, Communications, and Applications 19, 3 (2023), 1\u201324.","journal-title":"ACM Transactions on Multimedia Computing, Communications, and Applications"},{"key":"e_1_3_1_29_2","first-page":"10241","volume-title":"Proceedings of the 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Li Manyi","year":"2021","unstructured":"Manyi Li and Hao Zhang. 2021. D2IM-Net: Learning detail disentangled implicit fields from single images. In Proceedings of the 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, 10241\u201310250."},{"issue":"12","key":"e_1_3_1_30_2","doi-asserted-by":"crossref","first-page":"8348","DOI":"10.1109\/TPAMI.2024.3404334","article-title":"Learning disentangled representation for one-shot progressive face swapping","volume":"46","author":"Li Qi","year":"2024","unstructured":"Qi Li, Weining Wang, Chengzhong Xu, Zhenan Sun, and Ming-Hsuan Yang. 2024. Learning disentangled representation for one-shot progressive face swapping. IEEE Transactions on Pattern Analysis and Machine Intelligence 46, 12 (2024), 8348\u20138364.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"issue":"11","key":"e_1_3_1_31_2","doi-asserted-by":"crossref","first-page":"7701","DOI":"10.1007\/s00371-023-03202-4","article-title":"Disentangled representation transformer network for 3D face reconstruction and robust dense alignment","volume":"40","author":"Li Xiangzheng","year":"2024","unstructured":"Xiangzheng Li, Xiaofei Li, and Jia Deng. 2024. Disentangled representation transformer network for 3D face reconstruction and robust dense alignment. The Visual Computer 40, 11 (2024), 7701\u20137718.","journal-title":"The Visual Computer"},{"key":"e_1_3_1_32_2","first-page":"7226","volume-title":"Proceedings of the IEEE International Conference on Pattern Recognition","author":"Li Xiangzheng","year":"2021","unstructured":"Xiangzheng Li and Suping Wu. 2021. Multi-attribute regression network for face reconstruction. In Proceedings of the IEEE International Conference on Pattern Recognition, 7226\u20137233."},{"issue":"6","key":"e_1_3_1_33_2","doi-asserted-by":"crossref","first-page":"3727","DOI":"10.1007\/s11263-025-02348-z","article-title":"Sample-cohesive pose-aware contrastive facial representation learning","volume":"133","author":"Liu Yuanyuan","year":"2025","unstructured":"Yuanyuan Liu, Shaoze Feng, Shuyang Liu, Yibing Zhan, Dapeng Tao, Zijing Chen, and Zhe Chen. 2025. Sample-cohesive pose-aware contrastive facial representation learning. International Journal of Computer Vision 133, 6 (2025), 3727\u20133745.","journal-title":"International Journal of Computer Vision"},{"key":"e_1_3_1_34_2","first-page":"1619","volume-title":"Proceedings of the IEEE International Conference on Computer Vision Workshops","author":"Liu Yaojie","year":"2017","unstructured":"Yaojie Liu, Amin Jourabloo, William Ren, and Xiaoming Liu. 2017. Dense face alignment. In Proceedings of the IEEE International Conference on Computer Vision Workshops, 1619\u20131628."},{"issue":"6","key":"e_1_3_1_35_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/2816795.2818013","article-title":"SMPL: A skinned multi-person linear model","volume":"34","author":"Loper Matthew","year":"2015","unstructured":"Matthew Loper, Naureen Mahmood, Javier Romero, Gerard Pons-Moll, and Michael J. Black. 2015. SMPL: A skinned multi-person linear model. ACM Transactions on Graphics 34, 6 (2015), 1\u201316.","journal-title":"ACM Transactions on Graphics"},{"key":"e_1_3_1_36_2","first-page":"4460","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Mescheder Lars","year":"2019","unstructured":"Lars Mescheder, Michael Oechsle, Michael Niemeyer, Sebastian Nowozin, and Andreas Geiger. 2019. Occupancy networks: Learning 3D reconstruction in function space. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, 4460\u20134470."},{"issue":"8","key":"e_1_3_1_37_2","doi-asserted-by":"crossref","first-page":"1979","DOI":"10.1109\/TPAMI.2018.2858821","article-title":"Virtual adversarial training: A regularization method for supervised and semi-supervised learning","volume":"41","author":"Miyato Takeru","year":"2018","unstructured":"Takeru Miyato, Shin-Ichi Maeda, Masanori Koyama, and Shin Ishii. 2018. Virtual adversarial training: A regularization method for supervised and semi-supervised learning. IEEE Transactions on Pattern Analysis and Machine Intelligence 41, 8 (2018), 1979\u20131993.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"e_1_3_1_38_2","first-page":"4004","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Song Hyun Oh","year":"2016","unstructured":"Hyun Oh Song, Yu Xiang, Stefanie Jegelka, and Silvio Savarese. 2016. Deep metric learning via lifted structured feature embedding. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 4004\u20134012."},{"key":"e_1_3_1_39_2","first-page":"3577","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Riegler Gernot","year":"2017","unstructured":"Gernot Riegler, Ali Osman Ulusoy, and Andreas Geiger. 2017. OctNet: Learning deep 3D representations at high resolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 3577\u20133586."},{"key":"e_1_3_1_40_2","first-page":"1163","article-title":"Regularization with stochastic transformations and perturbations for deep semi-supervised learning","volume":"29","author":"Sajjadi Mehdi","year":"2016","unstructured":"Mehdi Sajjadi, Mehran Javanmardi, and Tolga Tasdizen. 2016. Regularization with stochastic transformations and perturbations for deep semi-supervised learning. Advances in Neural Information Processing Systems, Vol. 29, 1163\u20131171.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_1_41_2","first-page":"596","article-title":"FixMatch: Simplifying semi-supervised learning with consistency and confidence","volume":"33","author":"Sohn Kihyuk","year":"2020","unstructured":"Kihyuk Sohn, David Berthelot, Nicholas Carlini, Zizhao Zhang, Han Zhang, Colin A. Raffel, Ekin Dogus Cubuk, Alexey Kurakin, and Chun-Liang Li. 2020. FixMatch: Simplifying semi-supervised learning with consistency and confidence. In Advances in Neural Information Processing Systems, Vol. 33, 596\u2013608.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_1_42_2","first-page":"398","volume-title":"Proceedings of the European Conference on Computer Vision","author":"Tan Shuai","year":"2024","unstructured":"Shuai Tan, Bin Ji, Mengxiao Bi, and Ye Pan. 2024. EDTalk: Efficient disentanglement for emotional talking head synthesis. In Proceedings of the European Conference on Computer Vision. Springer, 398\u2013416."},{"key":"e_1_3_1_43_2","first-page":"7346","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Tran Luan","year":"2018","unstructured":"Luan Tran and Xiaoming Liu. 2018. Nonlinear 3D face morphable model. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 7346\u20137355."},{"key":"e_1_3_1_44_2","first-page":"1160","article-title":"3D face reconstruction from a single image assisted by 2D face images in the wild","volume":"23","author":"Tu Xiaoguang","year":"2020","unstructured":"Xiaoguang Tu, Jian Zhao, Mei Xie, Zihang Jiang, Akshaya Balamurugan, Yao Luo, Yang Zhao, Lingxiao He, Zheng Ma, and Jiashi Feng. 2020. 3D face reconstruction from a single image assisted by 2D face images in the wild. IEEE Transactions on Multimedia 23 (2020), 1160\u20131172.","journal-title":"IEEE Transactions on Multimedia"},{"key":"e_1_3_1_45_2","first-page":"52","volume-title":"Proceedings of the European Conference on Computer Vision (ECCV)","author":"Wang Nanyang","year":"2018","unstructured":"Nanyang Wang, Yinda Zhang, Zhuwen Li, Yanwei Fu, Wei Liu, and Yu-Gang Jiang. 2018. Pixel2Mesh: Generating 3D mesh models from single RGB images. In Proceedings of the European Conference on Computer Vision (ECCV), 52\u201367."},{"key":"e_1_3_1_46_2","doi-asserted-by":"crossref","first-page":"3591","DOI":"10.1109\/TCE.2025.3572477","article-title":"A collaborative network of mamba and CNN for lightweight image super-resolution","volume":"71","author":"Wang Xin","year":"2025","unstructured":"Xin Wang, Jinxing Li, Jinkai Li, Shiqi Wang, Liang Yan, and Yong Xu. 2025. A collaborative network of mamba and CNN for lightweight image super-resolution. IEEE Transactions on Consumer Electronics 71 (2025), 3591\u20133604","journal-title":"IEEE Transactions on Consumer Electronics"},{"key":"e_1_3_1_47_2","doi-asserted-by":"crossref","first-page":"111859","DOI":"10.1016\/j.patcog.2025.111859","article-title":"CMANet: A CNN-Mamba aggregation network for face super-resolution","volume":"168","author":"Wu Ziyi","year":"2025","unstructured":"Ziyi Wu, Tao Lu, Yanduo Zhang, and Xiaoyu Chai. 2025. CMANet: A CNN-Mamba aggregation network for face super-resolution. Pattern Recognition 168 (2025), 111859.","journal-title":"Pattern Recognition"},{"key":"e_1_3_1_48_2","first-page":"1912","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Wu Zhirong","year":"2015","unstructured":"Zhirong Wu, Shuran Song, Aditya Khosla, Fisher Yu, Linguang Zhang, Xiaoou Tang, and Jianxiong Xiao. 2015. 3D ShapeNets: A deep representation for volumetric shapes. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 1912\u20131920."},{"key":"e_1_3_1_49_2","first-page":"392","volume-title":"Proceedings of the IEEE International Conference on Computer Vision Workshops","author":"Yan Junjie","year":"2013","unstructured":"Junjie Yan, Zhen Lei, Dong Yi, and Stan Li. 2013. Learn to combine multiple hypotheses for accurate face alignment. In Proceedings of the IEEE International Conference on Computer Vision Workshops, 392\u2013396."},{"key":"e_1_3_1_50_2","doi-asserted-by":"crossref","first-page":"3663","DOI":"10.1109\/TMM.2023.3322895","article-title":"Learning 3D face reconstruction from the cycle-consistency of dynamic faces","volume":"26","author":"Yang Wenwu","year":"2023","unstructured":"Wenwu Yang, Yeqing Zhao, Bailin Yang, and Jianbing Shen. 2023. Learning 3D face reconstruction from the cycle-consistency of dynamic faces. IEEE Transactions on Multimedia 26 (2023), 3663\u20133675.","journal-title":"IEEE Transactions on Multimedia"},{"key":"e_1_3_1_51_2","first-page":"4723","volume-title":"Proceedings of the IEEE International Conference on Computer Vision","author":"Yu Ronald","year":"2017","unstructured":"Ronald Yu, Shunsuke Saito, Haoxiang Li, Duygu Ceylan, and Hao Li. 2017. Learning dense facial correspondences in unconstrained images. In Proceedings of the IEEE International Conference on Computer Vision, 4723\u20134732."},{"issue":"11","key":"e_1_3_1_52_2","first-page":"2212","article-title":"Face landmark fitting via optimized part mixtures and cascaded deformable model","volume":"38","author":"Yu Xiang","year":"2015","unstructured":"Xiang Yu, Junzhou Huang, Shaoting Zhang, and Dimitris N. Metaxas. 2015. Face landmark fitting via optimized part mixtures and cascaded deformable model. IEEE Transactions on Pattern Analysis and Machine Intelligence 38, 11 (2015), 2212\u20132226.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"e_1_3_1_53_2","first-page":"1","volume-title":"Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP \u201903)","author":"Zhang Shikun","year":"2023","unstructured":"Shikun Zhang, Fengyi Song, Ge Song, and Ming Yang. 2023. SDRNet: Shape decoupled regression network for 3D face reconstruction. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP \u201903). IEEE, 1\u20135."},{"issue":"3","key":"e_1_3_1_54_2","doi-asserted-by":"crossref","first-page":"308","DOI":"10.1109\/TBIOM.2023.3261272","article-title":"Replay attention and data augmentation network for 3-D face and object reconstruction","volume":"5","author":"Zhou Zhiyuan","year":"2023","unstructured":"Zhiyuan Zhou, Lei Li, Suping Wu, Xinyu Li, Kehua Ma, and Xitie Zhang. 2023. Replay attention and data augmentation network for 3-D face and object reconstruction. IEEE Transactions on Biometrics, Behavior, and Identity Science 5, 3 (2023), 308\u2013320.","journal-title":"IEEE Transactions on Biometrics, Behavior, and Identity Science"},{"key":"e_1_3_1_55_2","first-page":"11751","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV)","author":"Zhu Congcong","year":"2021","unstructured":"Congcong Zhu, Xiaoqiang Li, Jide Li, and Songmin Dai. 2021. Improving robustness of facial landmark detection by defending against adversarial attacks. In Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV), 11751\u201311760."},{"key":"e_1_3_1_56_2","first-page":"912","volume-title":"Proceedings of the 20th International Conference on Machine Learning (ICML \u201903)","author":"Zhu Xiaojin","year":"2003","unstructured":"Xiaojin Zhu, Zoubin Ghahramani, and John D. Lafferty. 2003. Semi-supervised learning using gaussian fields and harmonic functions. In Proceedings of the 20th International Conference on Machine Learning (ICML \u201903), 912\u2013919."},{"key":"e_1_3_1_57_2","first-page":"146","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Zhu Xiangyu","year":"2016","unstructured":"Xiangyu Zhu, Zhen Lei, Xiaoming Liu, Hailin Shi, and Stan Z. Li. 2016. Face alignment across large poses: A 3D solution. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 146\u2013155."},{"issue":"1","key":"e_1_3_1_58_2","first-page":"78","article-title":"Face alignment in full pose range: A 3D total solution","volume":"41","author":"Zhu Xiangyu","year":"2017","unstructured":"Xiangyu Zhu, Xiaoming Liu, Zhen Lei, and Stan Z. Li. 2017. Face alignment in full pose range: A 3D total solution. IEEE Transactions on Pattern Analysis and Machine Intelligence 41, 1 (2017), 78\u201392.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"issue":"9","key":"e_1_3_1_59_2","doi-asserted-by":"crossref","first-page":"8310","DOI":"10.1109\/TCSVT.2024.3377121","article-title":"DFIE3D: 3D-aware disentangled face inversion and editing via facial-contrastive learning","volume":"34","author":"Zhu Xiaoqiang","year":"2024","unstructured":"Xiaoqiang Zhu, Jiayu Zhou, Lihua You, Xiaosong Yang, Jian Chang, Jian Jun Zhang, and Dan Zeng. 2024. DFIE3D: 3D-aware disentangled face inversion and editing via facial-contrastive learning. IEEE Transactions on Circuits and Systems for Video Technology 34, 9 (2024), 8310\u20138326.","journal-title":"IEEE Transactions on Circuits and Systems for Video Technology"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3796709","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,11]],"date-time":"2026-04-11T09:19:24Z","timestamp":1775899164000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3796709"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,4,11]]},"references-count":58,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2026,4,30]]}},"alternative-id":["10.1145\/3796709"],"URL":"https:\/\/doi.org\/10.1145\/3796709","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"value":"1551-6857","type":"print"},{"value":"1551-6865","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,4,11]]},"assertion":[{"value":"2025-04-07","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2026-01-17","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2026-04-11","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}