{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,19]],"date-time":"2026-06-19T17:23:02Z","timestamp":1781889782359,"version":"3.54.5"},"reference-count":26,"publisher":"MDPI AG","issue":"21","license":[{"start":{"date-parts":[[2023,10,27]],"date-time":"2023-10-27T00:00:00Z","timestamp":1698364800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"University Key Projects stability fund","award":["WDZC20200821140447001"],"award-info":[{"award-number":["WDZC20200821140447001"]}]},{"name":"University Key Projects stability fund","award":["JSGG20201102160801004"],"award-info":[{"award-number":["JSGG20201102160801004"]}]},{"name":"Technical Breakthrough projects","award":["WDZC20200821140447001"],"award-info":[{"award-number":["WDZC20200821140447001"]}]},{"name":"Technical Breakthrough projects","award":["JSGG20201102160801004"],"award-info":[{"award-number":["JSGG20201102160801004"]}]},{"name":"Shenzhen Grand Technology Corporation","award":["WDZC20200821140447001"],"award-info":[{"award-number":["WDZC20200821140447001"]}]},{"name":"Shenzhen Grand Technology Corporation","award":["JSGG20201102160801004"],"award-info":[{"award-number":["JSGG20201102160801004"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>The main challenges in reconstruction-based anomaly detection include the breakdown of the generalization gap due to improved fitting capabilities and the overfitting problem arising from simulated defects. To overcome this, we propose a new method called PRFF-AD, which utilizes progressive reconstruction and hierarchical feature fusion. It consists of a reconstructive sub-network and a discriminative sub-network. The former achieves anomaly-free reconstruction while maintaining nominal patterns, and the latter locates defects based on pre- and post-reconstruction information. Given defective samples, we find that adopting a progressive reconstruction approach leads to higher-quality reconstructions without compromising the assumption of a generalization gap. Meanwhile, to alleviate the network\u2019s overfitting of synthetic defects and address the issue of reconstruction errors, we fuse hierarchical features as guidance for discriminating defects. Moreover, with the help of an attention mechanism, the network achieves higher classification and localization accuracy. In addition, we construct a large dataset for packaging chips, named GTanoIC, with 1750 real non-defective samples and 470 real defective samples, and we provide their pixel-level annotations. Evaluation results demonstrate that our method outperforms other reconstruction-based methods on two challenging datasets: MVTec AD and GTanoIC.<\/jats:p>","DOI":"10.3390\/s23218750","type":"journal-article","created":{"date-parts":[[2023,10,27]],"date-time":"2023-10-27T11:50:18Z","timestamp":1698407418000},"page":"8750","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":8,"title":["Anomaly Detection via Progressive Reconstruction and Hierarchical Feature Fusion"],"prefix":"10.3390","volume":"23","author":[{"ORCID":"https:\/\/orcid.org\/0009-0003-6901-0661","authenticated-orcid":false,"given":"Fei","family":"Liu","sequence":"first","affiliation":[{"name":"Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen 518055, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-3695-7288","authenticated-orcid":false,"given":"Xiaoming","family":"Zhu","sequence":"additional","affiliation":[{"name":"Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen 518055, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Pingfa","family":"Feng","sequence":"additional","affiliation":[{"name":"Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen 518055, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3090-6319","authenticated-orcid":false,"given":"Long","family":"Zeng","sequence":"additional","affiliation":[{"name":"Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen 518055, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2023,10,27]]},"reference":[{"key":"ref_1","first-page":"1","article-title":"Variational autoencoder based anomaly detection using reconstruction probability","volume":"2","author":"An","year":"2015","journal-title":"Spec. Lect. IE"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Bergmann, P., L\u00f6we, S., Fauser, M., Sattlegger, D., and Steger, C. (2018). Improving unsupervised defect segmentation by applying structural similarity to autoencoders. arXiv.","DOI":"10.5220\/0007364503720380"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Haselmann, M., Gruber, D.P., and Tabatabai, P. (2018, January 17\u201320). Anomaly detection using deep learning based image completion. Proceedings of the 2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA), Orlando, FL, USA.","DOI":"10.1109\/ICMLA.2018.00201"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"101105","DOI":"10.1016\/j.aei.2020.101105","article-title":"Anomaly detection of defects on concrete structures with the convolutional autoencoder","volume":"45","author":"Chow","year":"2020","journal-title":"Adv. Eng. Inform."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Tang, T.W., Hsu, H., Huang, W.R., and Li, K.M. (2022). Industrial Anomaly Detection with Skip Autoencoder and Deep Feature Extractor. Sensors, 22.","DOI":"10.2139\/ssrn.4109686"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Oluwasanmi, A., Aftab, M.U., Baagyere, E., Qin, Z., Ahmad, M., and Mazzara, M. (2022). Attention Autoencoder for Generative Latent Representational Learning in Anomaly Detection. Sensors, 22.","DOI":"10.3390\/s22010123"},{"key":"ref_7","unstructured":"Baur, C., Wiestler, B., Albarqouni, S., and Navab, N. (2019). Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries: 4th International Workshop, BrainLes 2018, Held in Conjunction with MICCAI 2018, Granada, Spain, 16 September 2018, Springer. Revised Selected Papers, Part I 4."},{"key":"ref_8","unstructured":"Vasilev, A., Golkov, V., Meissner, M., Lipp, I., Sgarlata, E., Tomassini, V., Jones, D.K., and Cremers, D. (2020). Computational Diffusion MRI: MICCAI Workshop, Shenzhen, China, October 2019, Springer."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Niu, Z., Yu, K., and Wu, X. (2020). LSTM-Based VAE-GAN for Time-Series Anomaly Detection. Sensors, 20.","DOI":"10.3390\/s20133738"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Schlegl, T., Seeb\u00f6ck, P., Waldstein, S.M., Schmidt-Erfurth, U., and Langs, G. (2017, January 25\u201330). Unsupervised anomaly detection with generative adversarial networks to guide marker discovery. Proceedings of the International Conference on Information Processing in Medical Imaging, Boone, NC, USA.","DOI":"10.1007\/978-3-319-59050-9_12"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Zhang, L., Dai, Y., Fan, F., and He, C. (2023). Anomaly Detection of GAN Industrial Image Based on Attention Feature Fusion. Sensors, 23.","DOI":"10.3390\/s23010355"},{"key":"ref_12","unstructured":"Liu, T., Li, B., Zhao, Z., Du, X., Jiang, B., and Geng, L. (2022). Reconstruction from edge image combined with color and gradient difference for industrial surface anomaly detection. arXiv."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"You, Z., Yang, K., Luo, W., Cui, L., Zheng, Y., and Le, X. (2022, January 22\u201326). Adtr: Anomaly detection transformer with feature reconstruction. Proceedings of the International Conference on Neural Information Processing, Virtual.","DOI":"10.1007\/978-3-031-30111-7_26"},{"key":"ref_14","unstructured":"Li, Z., Li, N., Jiang, K., Ma, Z., Wei, X., Hong, X., and Gong, Y. (2020, January 7\u201310). Superpixel Masking and Inpainting for Self-Supervised Anomaly Detection. Proceedings of the BMVC, Virtual."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"107706","DOI":"10.1016\/j.patcog.2020.107706","article-title":"Reconstruction by inpainting for visual anomaly detection","volume":"112","author":"Zavrtanik","year":"2021","journal-title":"Pattern Recognit."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Pirnay, J., and Chai, K. (2022, January 16\u201319). Inpainting transformer for anomaly detection. Proceedings of the International Conference on Image Analysis and Processing, Bordeaux, France.","DOI":"10.1007\/978-3-031-06430-2_33"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Zavrtanik, V., Kristan, M., and Sko\u010daj, D. (2021, January 11\u201317). Draem\u2014A discriminatively trained reconstruction embedding for surface anomaly detection. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, BC, Canada.","DOI":"10.1109\/ICCV48922.2021.00822"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Li, C.L., Sohn, K., Yoon, J., and Pfister, T. (2021, January 20\u201325). Cutpaste: Self-supervised learning for anomaly detection and localization. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.00954"},{"key":"ref_19","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015). Medical Image Computing and Computer-Assisted Intervention\u2013MICCAI 2015: 18th International Conference, Munich, Germany, 5\u20139 October 2015, Proceedings, Part III 18, Springer."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., Lin, S., and Guo, B. (2021, January 11\u201317). Swin transformer: Hierarchical vision transformer using shifted windows. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, BC, Canada.","DOI":"10.1109\/ICCV48922.2021.00986"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Xiao, T., Liu, Y., Zhou, B., Jiang, Y., and Sun, J. (2018, January 8\u201314). Unified perceptual parsing for scene understanding. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01228-1_26"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Bergmann, P., Fauser, M., Sattlegger, D., and Steger, C. (2019, January 15\u201320). MVTec AD\u2014A comprehensive real-world dataset for unsupervised anomaly detection. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00982"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Huang, H., Lin, L., Tong, R., Hu, H., Zhang, Q., Iwamoto, Y., Han, X., Chen, Y.W., and Wu, J. (2020, January 4\u20138). Unet 3+: A full-scale connected unet for medical image segmentation. Proceedings of the ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain.","DOI":"10.1109\/ICASSP40776.2020.9053405"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Goyal, P., Girshick, R., He, K., and Doll\u00e1r, P. (2017, January 22\u201329). Focal loss for dense object detection. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.324"},{"key":"ref_25","unstructured":"Gong, D., Liu, L., Le, V., Saha, B., Mansour, M.R., Venkatesh, S., and Hengel, A.v.d. (November, January 27). Memorizing normality to detect anomaly: Memory-augmented deep autoencoder for unsupervised anomaly detection. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea."},{"key":"ref_26","first-page":"6840","article-title":"Denoising diffusion probabilistic models","volume":"33","author":"Ho","year":"2020","journal-title":"Adv. Neural Inf. Process. Syst."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/21\/8750\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T21:12:35Z","timestamp":1760130755000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/21\/8750"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,10,27]]},"references-count":26,"journal-issue":{"issue":"21","published-online":{"date-parts":[[2023,11]]}},"alternative-id":["s23218750"],"URL":"https:\/\/doi.org\/10.3390\/s23218750","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,10,27]]}}}