{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,26]],"date-time":"2025-12-26T22:33:26Z","timestamp":1766788406816,"version":"build-2065373602"},"reference-count":43,"publisher":"MDPI AG","issue":"5","license":[{"start":{"date-parts":[[2025,5,19]],"date-time":"2025-05-19T00:00:00Z","timestamp":1747612800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Key R&amp;D Program of China","award":["2022YFF0608804","LQ23F050002"],"award-info":[{"award-number":["2022YFF0608804","LQ23F050002"]}]},{"name":"Zhejiang Provincial Natural Science Foundation, China","award":["2022YFF0608804","LQ23F050002"],"award-info":[{"award-number":["2022YFF0608804","LQ23F050002"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Computers"],"abstract":"<jats:p>Plant-parasiticnematodes represent a significant biosecurity threat in cross-border plant quarantine, necessitating precise identification for effective border control. While DL models have demonstrated success in nematode image classification based on morphological features, the limited availability of high-quality samples and the species-specific nature of nematodes result in insufficient training data, which constrains model performance. Although generative models have shown promise in data augmentation, they often struggle to balance morphological fidelity and phenotypic diversity. This paper proposes a novel few-shot data augmentation framework based on a morphology-constrained latent diffusion model, which, for the first time, integrates morphological constraints into the latent diffusion process. By geometrically parameterizing nematode morphology, the proposed approach enhances topological fidelity in the generated images and addresses key limitations of traditional generative models in controlling biological shapes. This framework is designed to augment nematode image datasets and improve classification performance under limited data conditions. The framework consists of three key components: First, we incorporate a fine-tuning strategy that preserves the generalization capability of model in few-shot settings. Second, we extract morphological constraints from nematode images using edge detection and a moving least squares method, capturing key structural details. Finally, we embed these constraints into the latent space of the diffusion model, ensuring generated images maintain both fidelity and diversity. Experimental results demonstrate that our approach significantly enhances classification accuracy. For imbalanced datasets, the Top-1 accuracy of multiple classification models improved by 7.34\u201314.66% compared to models trained without augmentation, and by 2.0\u20135.67% compared to models using traditional data augmentation. Additionally, when replacing up to 25% of real images with generated ones in a balanced dataset, model performance remained nearly unchanged, indicating the robustness and effectiveness of the method. Ablation experiments demonstrate that the morphology-guided strategy achieves superior image quality compared to both unconstrained and edge-based constraint methods, with a Fr\u00e9chet Inception Distance of 12.95 and an Inception Score of 1.21 \u00b1 0.057. These results indicate that the proposed method effectively balances morphological fidelity and phenotypic diversity in image generation.<\/jats:p>","DOI":"10.3390\/computers14050198","type":"journal-article","created":{"date-parts":[[2025,5,19]],"date-time":"2025-05-19T11:54:26Z","timestamp":1747655666000},"page":"198","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Few-Shot Data Augmentation by Morphology-Constrained Latent Diffusion for Enhanced Nematode Recognition"],"prefix":"10.3390","volume":"14","author":[{"ORCID":"https:\/\/orcid.org\/0009-0000-0772-5424","authenticated-orcid":false,"given":"Xiong","family":"Ouyang","sequence":"first","affiliation":[{"name":"Faculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo 315211, China"},{"name":"Ningbo Institute of Materials Technology & Engineering, Chinese Academy of Sciences, Ningbo 315201, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8350-6116","authenticated-orcid":false,"given":"Jiayan","family":"Zhuang","sequence":"additional","affiliation":[{"name":"Ningbo Institute of Materials Technology & Engineering, Chinese Academy of Sciences, Ningbo 315201, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8514-813X","authenticated-orcid":false,"given":"Jianfeng","family":"Gu","sequence":"additional","affiliation":[{"name":"Ningbo Entry Exit Inspect & Quarantine Bur, Ctr Tech, Ningbo 315100, China"}]},{"given":"Sichao","family":"Ye","sequence":"additional","affiliation":[{"name":"Ningbo Institute of Materials Technology & Engineering, Chinese Academy of Sciences, Ningbo 315201, China"}]}],"member":"1968","published-online":{"date-parts":[[2025,5,19]]},"reference":[{"key":"ref_1","unstructured":"Nicol, J.M. (2002). Important nematode pests. Bread Wheat: Improvement and Production, FAO Plant Production and Protection Series."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"209","DOI":"10.1016\/j.cmpb.2018.01.016","article-title":"Taxonomy of multi-focal nematode image stacks by a CNN based image fusion approach","volume":"156","author":"Liu","year":"2018","journal-title":"Comput. Methods Programs Biomed."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"106058","DOI":"10.1016\/j.compag.2021.106058","article-title":"Image based species identification of Globodera quarantine nematodes using computer vision and deep learning","volume":"186","author":"Thevenoux","year":"2021","journal-title":"Comput. Electron. Agric."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"39","DOI":"10.1016\/j.biosystemseng.2021.11.016","article-title":"NemaNet: A convolutional neural network model for identification of soybean nematodes","volume":"213","author":"Abade","year":"2022","journal-title":"Biosyst. Eng."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"107710","DOI":"10.1016\/j.compag.2023.107710","article-title":"Domain generalization in nematode classification","volume":"207","author":"Zhu","year":"2023","journal-title":"Comput. Electron. Agric."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Zhou, B.T., Nah, W.L., Lee, K.W., and Baek, J.H. (2005, January 15\u201319). A General Image Based Nematode Identification System Design. Proceedings of the International Conference on Computational Intelligence and Security, Xi\u2019an, China.","DOI":"10.1007\/11596981_132"},{"key":"ref_7","unstructured":"DeVries, T., and Taylor, G.W. (2017). Improved regularization of convolutional neural networks with cutout. arXiv."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Cubuk, E.D., Zoph, B., Shlens, J., and Le, Q.V. (2020, January 14\u201319). Randaugment: Practical automated data augmentation with a reduced search space. Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, USA.","DOI":"10.1109\/CVPRW50498.2020.00359"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Zhang, H., Cisse, M., Dauphin, Y.N., and Lopez-Paz, D. (2017). mixup: Beyond empirical risk minimization. arXiv.","DOI":"10.1007\/978-1-4899-7687-1_79"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Yun, S., Han, D., Oh, S.J., Chun, S., Choe, J., and Yoo, Y.J. (November, January 27). CutMix: Regularization Strategy to Train Strong Classifiers With Localizable Features. Proceedings of the 2019 IEEE\/CVF International Conference on Computer Vision (ICCV), Seoul, Republic of Korea.","DOI":"10.1109\/ICCV.2019.00612"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"012066","DOI":"10.1088\/1742-6596\/1827\/1\/012066","article-title":"Challenges and Corresponding Solutions of Generative Adversarial Networks (GANs): A Survey Study","volume":"1827","author":"Chen","year":"2021","journal-title":"J. Phys. Conf. Ser."},{"key":"ref_12","unstructured":"Kingma, D.P., and Welling, M. (2013). Auto-encoding variational bayes. arXiv."},{"key":"ref_13","first-page":"6840","article-title":"Denoising diffusion probabilistic models","volume":"33","author":"Ho","year":"2020","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Rombach, R., Blattmann, A., Lorenz, D., Esser, P., and Ommer, B. (2022, January 18\u201324). High-Resolution Image Synthesis with Latent Diffusion Models. Proceedings of the 2022 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.01042"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Zhou, T., Chen, X., Shen, Y., Nieuwoudt, M., Pun, C.M., and Wang, S. (2023, January 4\u20135). Generative AI Enables EEG Data Augmentation for Alzheimer\u2019s Disease Detection Via Diffusion Model. Proceedings of the 2023 IEEE International Symposium on Product Compliance Engineering\u2014Asia (ISPCE-ASIA), Shanghai, China.","DOI":"10.1109\/ISPCE-ASIA60405.2023.10365931"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/TGRS.2023.3334492","article-title":"Diverse Hyperspectral Remote Sensing Image Synthesis With Diffusion Models","volume":"61","author":"Liu","year":"2023","journal-title":"IEEE Trans. Geosci. Remote. Sens."},{"key":"ref_17","unstructured":"Hu, J.E., Shen, Y., Wallis, P., Allen-Zhu, Z., Li, Y., Wang, S., and Chen, W. (2021). LoRA: Low-Rank Adaptation of Large Language Models. arXiv."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Zhang, L., Rao, A., and Agrawala, M. (, January 1\u20136). Adding Conditional Control to Text-to-Image Diffusion Models. Proceedings of the 2023 IEEE\/CVF International Conference on Computer Vision (ICCV), Paris, France.","DOI":"10.1109\/ICCV51070.2023.00355"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"588","DOI":"10.1109\/JAS.2017.7510583","article-title":"Generative adversarial networks: Introduction and outlook","volume":"4","author":"Wang","year":"2017","journal-title":"IEEE\/Caa J. Autom. Sin."},{"key":"ref_20","unstructured":"Tomczak, J.M., and Welling, M. (2017). VAE with a VampPrior. arXiv."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"da Silva Abade, A., Faria Porto, L., Afonso Ferreira, P., and de Barros Vidal, F. (2021). Nemanet: A convolutional neural network model for identification of nematodes soybean crop in brazil. arXiv.","DOI":"10.1016\/j.biosystemseng.2021.11.016"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"108517","DOI":"10.1016\/j.compag.2023.108517","article-title":"Synthetic data augmentation by diffusion probabilistic models to enhance weed recognition","volume":"216","author":"Chen","year":"2024","journal-title":"Comput. Electron. Agric."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"220","DOI":"10.1038\/s42256-023-00626-4","article-title":"Parameter-efficient fine-tuning of large-scale pre-trained language models","volume":"5","author":"Ding","year":"2023","journal-title":"Nat. Mach. Intell."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Ruiz, N., Li, Y., Jampani, V., Pritch, Y., Rubinstein, M., and Aberman, K. (2022, January 17\u201324). DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation. Proceedings of the 2023 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52729.2023.02155"},{"key":"ref_25","unstructured":"Gal, R., Alaluf, Y., Atzmon, Y., Patashnik, O., Bermano, A.H., Chechik, G., and Cohen-Or, D. (2022). An image is worth one word: Personalizing text-to-image generation using textual inversion. arXiv."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"1153","DOI":"10.1109\/TASSP.1981.1163711","article-title":"Cubic convolution interpolation for digital image processing","volume":"29","author":"Keys","year":"1981","journal-title":"IEEE Trans. Acoust. Speech Signal Process."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"679","DOI":"10.1109\/TPAMI.1986.4767851","article-title":"A Computational Approach to Edge Detection","volume":"PAMI-8","author":"Canny","year":"1986","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1007\/s11263-017-1004-z","article-title":"Holistically-Nested Edge Detection","volume":"125","author":"Xie","year":"2015","journal-title":"Int. J. Comput. Vis."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Su, Z., Liu, W., Yu, Z., Hu, D., Liao, Q., Tian, Q., Pietik\u00e4inen, M., and Liu, L. (2021, January 11\u201317). Pixel Difference Networks for Efficient Edge Detection. Proceedings of the 2021 IEEE\/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada.","DOI":"10.1109\/ICCV48922.2021.00507"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"533","DOI":"10.1145\/1141911.1141920","article-title":"Image deformation using moving least squares","volume":"25","author":"Schaefer","year":"2006","journal-title":"ACM Trans. Graph."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015). U-Net: Convolutional Networks for Biomedical Image Segmentation. arXiv.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_32","unstructured":"Radford, A., Kim, J.W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., Sastry, G., Askell, A., Mishkin, P., and Clark, J. (2021, January 18\u201324). Learning Transferable Visual Models From Natural Language Supervision. Proceedings of the International Conference on Machine Learning, Online."},{"key":"ref_33","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, \u0141., and Polosukhin, I. (2017). Attention is all you need. arXiv."},{"key":"ref_34","unstructured":"Schuhmann, C., Beaumont, R., Vencu, R., Gordon, C., Wightman, R., Cherti, M., Coombes, T., Katta, A., Mullis, C., and Wortsman, M. (2022). LAION-5B: An open large-scale dataset for training next generation image-text models. arXiv."},{"key":"ref_35","unstructured":"Kingma, D.P., and Ba, J. (2014). Adam: A method for stochastic optimization. arXiv."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep Residual Learning for Image Recognition. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Huang, G., Liu, Z., and Weinberger, K.Q. (2016, January 21\u201326). Densely Connected Convolutional Networks. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.243"},{"key":"ref_38","unstructured":"Tan, M., and Le, Q.V. (2021, January 18\u201324). EfficientNetV2: Smaller Models and Faster Training. Proceedings of the International Conference on Machine Learning, Virtual."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Zhang, H., Wu, C., Zhang, Z., Zhu, Y., Zhang, Z.L., Lin, H., Sun, Y., He, T., Mueller, J.W., and Manmatha, R. (2022, January 19\u201320). ResNeSt: Split-Attention Networks. Proceedings of the 2022 IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), New Orleans, LA, USA.","DOI":"10.1109\/CVPRW56347.2022.00309"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Ding, X., Zhang, X., Ma, N., Han, J., Ding, G., and Sun, J. (2021, January 20\u201325). RepVGG: Making VGG-style ConvNets Great Again. Proceedings of the 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.01352"},{"key":"ref_41","unstructured":"Heusel, M., Ramsauer, H., Unterthiner, T., Nessler, B., and Hochreiter, S. (2017, January 4\u20139). GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium. Proceedings of the Neural Information Processing Systems, Long Beach, CA, USA."},{"key":"ref_42","unstructured":"Salimans, T., Goodfellow, I.J., Zaremba, W., Cheung, V., Radford, A., and Chen, X. (2016). Improved Techniques for Training GANs. arXiv."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S.E., Anguelov, D., Erhan, D., Vanhoucke, V., and Rabinovich, A. (2015, January 7\u201312). Going deeper with convolutions. Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298594"}],"container-title":["Computers"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2073-431X\/14\/5\/198\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T17:35:14Z","timestamp":1760031314000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2073-431X\/14\/5\/198"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,5,19]]},"references-count":43,"journal-issue":{"issue":"5","published-online":{"date-parts":[[2025,5]]}},"alternative-id":["computers14050198"],"URL":"https:\/\/doi.org\/10.3390\/computers14050198","relation":{},"ISSN":["2073-431X"],"issn-type":[{"type":"electronic","value":"2073-431X"}],"subject":[],"published":{"date-parts":[[2025,5,19]]}}}