{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,3]],"date-time":"2026-04-03T12:02:25Z","timestamp":1775217745797,"version":"3.50.1"},"reference-count":60,"publisher":"Institution of Engineering and Technology (IET)","issue":"1","license":[{"start":{"date-parts":[[2025,8,24]],"date-time":"2025-08-24T00:00:00Z","timestamp":1755993600000},"content-version":"vor","delay-in-days":235,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"},{"start":{"date-parts":[[2025,1,1]],"date-time":"2025-01-01T00:00:00Z","timestamp":1735689600000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/doi.wiley.com\/10.1002\/tdm_license_1.1"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61906044"],"award-info":[{"award-number":["61906044"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100003995","name":"Natural Science Foundation of Anhui Province","doi-asserted-by":"publisher","award":["2408085MF154"],"award-info":[{"award-number":["2408085MF154"]}],"id":[{"id":"10.13039\/501100003995","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100009558","name":"University Natural Science Research Project of Anhui Province","doi-asserted-by":"publisher","award":["2022AH051324"],"award-info":[{"award-number":["2022AH051324"]}],"id":[{"id":"10.13039\/501100009558","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100009558","name":"University Natural Science Research Project of Anhui Province","doi-asserted-by":"publisher","award":["2023AH050406"],"award-info":[{"award-number":["2023AH050406"]}],"id":[{"id":"10.13039\/501100009558","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100009558","name":"University Natural Science Research Project of Anhui Province","doi-asserted-by":"publisher","award":["2023AH050418"],"award-info":[{"award-number":["2023AH050418"]}],"id":[{"id":"10.13039\/501100009558","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["ietresearch.onlinelibrary.wiley.com"],"crossmark-restriction":true},"short-container-title":["IET Image Processing"],"published-print":{"date-parts":[[2025,1]]},"abstract":"<jats:title>ABSTRACT<\/jats:title>\n                  <jats:p>Currently, existing few\u2010shot learning methods encounter significant bottlenecks in semantic enhancement and data augmentation. Traditional prompt templates are limited by their fixed format, making it difficult to fully capture the characteristics of categories. On the other hand, category description methods based on large language models are susceptible to category polysemy, which can result in semantic bias. Therefore, we propose a multimodal semantic enhancement (MSE) module, which jointly analyzes the visual\u2010semantic relationship between category names and example samples through a multimodal large model. By leveraging visual information to guide the generation of discriminative category descriptions, MSE effectively mitigates semantic polysemy issues. To mitigate the issue of insufficient support set data, we introduce a multimodal image generation (MIG) module, which utilizes the image generation capability of text\u2010to\u2010image models and generates diverse images based on various textual information. Additionally, we draw inspiration from the prototypical networks and combine it with gaussian discriminant analysis to build a training\u2010free visual\u2010textual classifier. Our method (MSAG) significantly improves classification accuracy across 15 benchmark datasets, validating the effectiveness of the multimodal information collaborative enhancement strategy in alleviating the problem of data\u00a0scarcity.<\/jats:p>","DOI":"10.1049\/ipr2.70189","type":"journal-article","created":{"date-parts":[[2025,8,25]],"date-time":"2025-08-25T06:49:02Z","timestamp":1756104542000},"update-policy":"https:\/\/doi.org\/10.1002\/crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["MSAG: Semantic Enhancement and Image Generation Based on Multimodal Large Models for Supporting Few\u2010Shot Learning"],"prefix":"10.1049","volume":"19","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7440-0109","authenticated-orcid":false,"given":"Jia","family":"Zhao","sequence":"first","affiliation":[{"name":"School of Computer and Information Engineering Fuyang Normal University Fuyang China"},{"name":"Anhui Engineering Research Center for Intelligent Computing and Information Innovation Fuyang Normal University Fuyang China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0009-6617-3346","authenticated-orcid":false,"given":"Ziyang","family":"Cao","sequence":"additional","affiliation":[{"name":"School of Computer and Information Engineering Fuyang Normal University Fuyang China"}]},{"given":"Huiling","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Computer and Information Engineering Fuyang Normal University Fuyang China"},{"name":"Anhui Engineering Research Center for Intelligent Computing and Information Innovation Fuyang Normal University Fuyang China"}]},{"given":"Xu","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Computer and Information Engineering Fuyang Normal University Fuyang China"}]},{"given":"Yingzhou","family":"Chen","sequence":"additional","affiliation":[{"name":"School of Computer and Information Engineering Fuyang Normal University Fuyang China"}]}],"member":"265","published-online":{"date-parts":[[2025,8,24]]},"reference":[{"key":"e_1_2_10_2_1","doi-asserted-by":"crossref","unstructured":"Z.Liu Y.Lin Y.Cao H.Hu Y.Wei Z.Zhang S.Lin andB.Guo \u201cSwin Transformer: Hierarchical Vision Transformer Using Shifted Windows \u201d inProceedings of the IEEE\/CVF International Conference on Computer Vision(IEEE 2021) 10012\u201310022.","DOI":"10.1109\/ICCV48922.2021.00986"},{"key":"e_1_2_10_3_1","doi-asserted-by":"crossref","unstructured":"Z.Liu H.Mao C.\u2010Y.Wu C.Feichtenhofer T.Darrell andS.Xie \u201cA ConvNet for the 2020s \u201d inProceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition(IEEE 2022) 11976\u201311986.","DOI":"10.1109\/CVPR52688.2022.01167"},{"key":"e_1_2_10_4_1","first-page":"3965","article-title":"CoAtNet: Marrying Convolution and Attention for All Data Sizes","volume":"34","author":"Dai Z.","year":"2021","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_2_10_5_1","first-page":"9355","article-title":"Twins: Revisiting the Design of Spatial Attention in Vision Transformers","volume":"34","author":"Chu X.","year":"2021","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_2_10_6_1","unstructured":"L. Y. L.Chan C.Li andY.Yuan \u201cAutoPET Challenge: Tumour Synthesis for Data Augmentation \u201darXiv preprint arXiv:2409.08068(2024)."},{"key":"e_1_2_10_7_1","doi-asserted-by":"publisher","DOI":"10.3389\/fmicb.2024.1453870"},{"key":"e_1_2_10_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/3582688"},{"key":"e_1_2_10_9_1","unstructured":"A.Radford J.Kim C.Hallacy A.Ramesh G.Goh S.Agarwal G.Sastry A.Askell P.Mishkin J.Clark G.Krueger andI.Sutskever \u201cLearning Transferable Visual Models from Natural Language Supervision \u201darXiv preprint arXiv:2103.00020(2021)."},{"key":"e_1_2_10_10_1","doi-asserted-by":"crossref","unstructured":"W.Wang H.Bao L.Dong J.Bjorck Z.Peng Q.Liu K.Aggarwal O.Khan S.Singhal S.Som andF.Wei \u201cImage as a Foreign Language: BEiT Pretraining for All Vision and Vision\u2010Language Tasks \u201darXiv preprint arXiv:2208.10442(2022).","DOI":"10.1109\/CVPR52729.2023.01838"},{"key":"e_1_2_10_11_1","doi-asserted-by":"crossref","unstructured":"R.Zhang W.Zhang R.Fang P.Gao K.Li J.Dai Y.Qiao H.Li H.Kong andS.Research \u201cTip\u2010Adapter: Training\u2010Free Adaptation of CLIP for Few\u2010Shot Classification \u201darXiv preprint arXiv:2111.03930(2021).","DOI":"10.1007\/978-3-031-19833-5_29"},{"key":"e_1_2_10_12_1","doi-asserted-by":"crossref","unstructured":"K.Palanisamy Y.\u2010W.Chao X.Du Y.Xiang et\u00a0al. \u201cProto\u2010CLIP: Vision\u2010Language Prototypical Network for Few\u2010Shot Learning \u201d in2024 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS)(IEEE 2024) 2594\u20132601.","DOI":"10.1109\/IROS58592.2024.10801660"},{"key":"e_1_2_10_13_1","unstructured":"Z.Wang J.Liang L.Sheng R.He Z.Wang andT.Tan \u201cA Hard\u2010to\u2010Beat Baseline for Training\u2010Free CLIP\u2010Based Adaptation \u201darXiv preprint arXiv:2205.07853(2022)."},{"key":"e_1_2_10_14_1","unstructured":"A.Fang G.Ilharco M.Wortsman Y.Wan V.Shankar A.Dave andL.Schmidt \u201cData Determines Distributional Robustness in Contrastive Language Image Pre\u2010Training (CLIP) \u201d inInternational Conference on Machine Learning(PMLR 2022) 6216\u20136234."},{"key":"e_1_2_10_15_1","doi-asserted-by":"crossref","unstructured":"V.Udandarao A.Gupta andS.Albanie \u201cSUS\u2010X: Training\u2010Free Name\u2010Only Transfer of Vision\u2010Language Models \u201d inProceedings of the IEEE\/CVF International Conference on Computer Vision(IEEE 2022) 2725\u20132736.","DOI":"10.1109\/ICCV51070.2023.00257"},{"key":"e_1_2_10_16_1","doi-asserted-by":"crossref","unstructured":"R.Zhang X.Hu B.Li S.Huang H.Deng Y.Qiao P.Gao andH.Li \u201cPrompt Generate Then Cache: Cascade of Foundation Models Makes Strong Few\u2010Shot Learners \u201d inProceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition(IEEE 2023) 15211\u201315222.","DOI":"10.1109\/CVPR52729.2023.01460"},{"key":"e_1_2_10_17_1","unstructured":"V.Udandarao A.Prabhu A.Ghosh Y.Sharma P.Torr A.Bibi S.Albanie andM.Bethge \u201cNo \u2018Zero\u2010Shot\u2019 Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance \u201d inProceedings of the Thirty\u2010Eighth Annual Conference on Neural Information Processing Systems (NeurIPS)(Curran Associates Inc. 2024)."},{"key":"e_1_2_10_18_1","doi-asserted-by":"crossref","unstructured":"O.Saha G.Van Horn andS.Maji \u201cImproved Zero\u2010Shot Classification by Adapting VLMs With Text Descriptions \u201d inProceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition(IEEE 2024) 17542\u201317552.","DOI":"10.1109\/CVPR52733.2024.01661"},{"key":"e_1_2_10_19_1","unstructured":"T.Brown B.Mann N.Ryder M.Subbiah J.Kaplan P.Dhariwal A.Neelakantan P.Shyam G.Sastry A.Askell et\u00a0al. \u201cLanguage Models Are Few\u2010Shot Learners \u201darXiv preprint arXiv:2005.14165 2020."},{"key":"e_1_2_10_20_1","unstructured":"J.Achiam S.Adler S.Agarwal L.Ahmad I.Akkaya F. L.Aleman D.Almeida J.Altenschmidt S.Altman andS.Anadkat \u201cGPT\u20104 Technical Report \u201darXiv preprintarXiv:2303.08774 2023."},{"key":"e_1_2_10_21_1","unstructured":"X.Chen Z.Wu X.Liu Z.Pan W.Liu Z.Xie X.Yu andC.Ruan \u201cJANUS\u2010Pro: Unified Multimodal Understanding and Generation With Data and Model Scaling \u201darXiv preprint arXiv:2501.17811 2025."},{"key":"e_1_2_10_22_1","doi-asserted-by":"publisher","DOI":"10.1186\/s40537-019-0197-0"},{"issue":"9","key":"e_1_2_10_23_1","first-page":"5149","article-title":"Meta\u2010Learning in Neural Networks: A Survey","volume":"44","author":"Hospedales T.","year":"2021","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"e_1_2_10_24_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2023.109381"},{"key":"e_1_2_10_25_1","volume-title":"Advances in Neural Information Processing Systems","author":"Goodfellow I.","year":"2014"},{"key":"e_1_2_10_26_1","doi-asserted-by":"publisher","DOI":"10.1561\/2200000056"},{"issue":"14","key":"e_1_2_10_27_1","first-page":"71","article-title":"DALL\u2010E: Creating Images from Text","volume":"8","author":"Reddy M. D. M.","year":"2021","journal-title":"UGC Care Group I Journal"},{"key":"e_1_2_10_28_1","unstructured":"Y.Bai X.Xu Y.Liu S.Khan F.Khan W.Zuo R.Siow M.Goh andC.\u2010M.Feng \u201cSentence\u2010Level Prompts Benefit Composed Image Retrieval \u201darXiv preprint arXiv:2305.19033 2023."},{"key":"e_1_2_10_29_1","doi-asserted-by":"publisher","DOI":"10.1049\/cvi2.12249"},{"key":"e_1_2_10_30_1","unstructured":"P.Wang A.Yang R.Men et\u00a0al. \u201cOFA: Unifying Architectures Tasks and Modalities through a Simple Sequence\u2010to\u2010Sequence Learning Framework \u201d inInternational Conference on Machine Learning (ICML)(PMLR 2022) 23318\u201323340."},{"key":"e_1_2_10_31_1","doi-asserted-by":"crossref","unstructured":"Z.Guo R.Zhang L.Qiu X.Ma X.Miao X.He andB.Cui \u201cCaLiP: Zero\u2010Shot Enhancement of CLIP With Parameter\u2010Free Attention \u201d inProceedings of the AAAI Conference on Artificial Intelligence Vol.37(IEEE Information Theory Society 2023) 746\u2013754.","DOI":"10.1609\/aaai.v37i1.25152"},{"key":"e_1_2_10_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2024.3387743"},{"key":"e_1_2_10_33_1","doi-asserted-by":"crossref","unstructured":"R.Rombach A.Blattmann D.Lorenz P.Esser andB.Ommer \u201cHigh\u2010Resolution Image Synthesis With Latent Diffusion Models \u201d inProceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition(IEEE 2022) 10684\u201310695.","DOI":"10.1109\/CVPR52688.2022.01042"},{"key":"e_1_2_10_34_1","first-page":"36479","article-title":"Photorealistic Text\u2010to\u2010Image Diffusion Models With Deep Language Understanding","volume":"35","author":"Saharia C.","year":"2022","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_2_10_35_1","unstructured":"Z.\u2010Y.Hu Y.Li M. R.Lyu andL.Wang \u201cVL\u2010PET: Vision\u2010and\u2010Language Parameter\u2010Efficient Tuning via Granularity Control \u201d inProceedings of the IEEE\/CVF International Conference on Computer Vision(IEEE 2023) 3010\u20133020."},{"key":"e_1_2_10_36_1","doi-asserted-by":"crossref","unstructured":"M.Li J.Zhong C.Li L.Li N.Lin andM.Sugiyama \u201cVision\u2010Language Model Fine\u2010Tuning via Simple Parameter\u2010Efficient Modification \u201darXiv preprint arXiv:2409.16718(2024).","DOI":"10.18653\/v1\/2024.emnlp-main.797"},{"key":"e_1_2_10_37_1","unstructured":"Y.Zhai H.Wang J.Chang et\u00a0al. \u201cVision\u2010Language Instruction\u2010Enhanced Tuning via Parameter\u2010Efficient Learning \u201dunpublished manuscript."},{"key":"e_1_2_10_38_1","doi-asserted-by":"crossref","unstructured":"M. U.Khattak H.Rasheed M.Maaz S.Khan andF. S.Khan \u201cMAPLE: Multi\u2010Modal Prompt Learning \u201d inProceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition(IEEE 2022) 19113\u201319122.","DOI":"10.1109\/CVPR52729.2023.01832"},{"key":"e_1_2_10_39_1","doi-asserted-by":"crossref","unstructured":"H.Chen Y.Li Z.Huang et\u00a0al. \u201cConditional Prototype Rectification Prompt Learning \u201dIEEE Transactions on Circuits and Systems for Video Technology forthcoming 2025.","DOI":"10.1109\/TCSVT.2025.3585626"},{"key":"e_1_2_10_40_1","doi-asserted-by":"crossref","unstructured":"J.Yang Z.Li S.Xie W.Zhu W.Yu andS.Li \u201cCross\u2010Modal Adapter: Parameter\u2010Efficient Transfer Learning Approach for Vision\u2010Language Models \u201d in2024 IEEE International Conference on Multimedia and Expo (ICME)(IEEE 2024) 1\u20136.","DOI":"10.1109\/ICME57554.2024.10688369"},{"key":"e_1_2_10_41_1","doi-asserted-by":"crossref","unstructured":"X.Zhu B.Zhu Y.Tan S.Wang Y.Hao andH.Zhang \u201cSelective Vision\u2010Language Subspace Projection for Few\u2010Shot CLIP \u201d inProceedings of the 32nd ACM International Conference on Multimedia(ACM 2024) 3848\u20133857.","DOI":"10.1145\/3664647.3680885"},{"key":"e_1_2_10_42_1","unstructured":"M.Zhao Q.Zhang andC.Li \u201cCo\u2010Modulation of CLIP for Few\u2010Shot Classification \u201dSSRN 5260236."},{"key":"e_1_2_10_43_1","doi-asserted-by":"crossref","unstructured":"S.Pratt I.Covert R.Liu andA.Farhadi \u201cWhat Does a Platypus Look Like? Generating Customized Prompts for Zero\u2010Shot Image Classification \u201d inProceedings of the IEEE\/CVF International Conference on Computer Vision(IEEE 2023) 15691\u201315701.","DOI":"10.1109\/ICCV51070.2023.01438"},{"key":"e_1_2_10_44_1","doi-asserted-by":"crossref","unstructured":"J.Deng W.Dong R.Socher L.\u2010J.Li K.Li andL.Fei\u2010Fei \u201cImageNet: A Large\u2010Scale Hierarchical Image Database \u201d in2009 IEEE Conference on Computer Vision and Pattern Recognition(IEEE 2009) 248\u2013255.","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"e_1_2_10_45_1","unstructured":"S.Maji E.Rahtu J.Kannala M.Blaschko andA.Vedaldi \u201cFine\u2010Grained Visual Classification of Aircraft \u201darXiv:1306.5151(2013)."},{"key":"e_1_2_10_46_1","doi-asserted-by":"crossref","unstructured":"O. M.Parkhi A.Vedaldi A.Zisserman andC. V.Jawahar \u201cCats and Dogs \u201d in2012 IEEE Conference on Computer Vision and Pattern Recognition(IEEE 2012) 3498\u20133505.","DOI":"10.1109\/CVPR.2012.6248092"},{"key":"e_1_2_10_47_1","doi-asserted-by":"crossref","unstructured":"J.Krause M.Stark J.Deng andL.Fei\u2010Fei \u201c3D Object Representations for Fine\u2010Grained Categorization \u201d in2013 IEEE International Conference on Computer Vision Workshops(IEEE 2013) 554\u2013561.","DOI":"10.1109\/ICCVW.2013.77"},{"key":"e_1_2_10_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/JSTARS.2019.2918242"},{"key":"e_1_2_10_49_1","unstructured":"F.\u2010F.Li R.Fergus andP.Perona \u201cLearning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories \u201d in2004 Conference on Computer Vision and Pattern Recognition Workshop(IEEE 2004) 178\u2013178."},{"key":"e_1_2_10_50_1","doi-asserted-by":"crossref","unstructured":"J.Xiao J.Hays K. A.Ehinger A.Oliva andA.Torralba \u201cSUN Database: Large\u2010Scale Scene Recognition from Abbey to Zoo \u201d in2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition(IEEE 2010) 3485\u20133492.","DOI":"10.1109\/CVPR.2010.5539970"},{"key":"e_1_2_10_51_1","doi-asserted-by":"crossref","unstructured":"M.Cimpoi S.Maji I.Kokkinos S.Mohamed andA.Vedaldi \u201cDescribing Textures in the Wild \u201d in2014 IEEE Conference on Computer Vision and Pattern Recognition(IEEE 2014) 3606\u20133613.","DOI":"10.1109\/CVPR.2014.461"},{"key":"e_1_2_10_52_1","doi-asserted-by":"crossref","unstructured":"L.Bossard M.Guillaumin andL.Van Gool \u201cFood\u2010101 \u2010 Mining Discriminative Components With Random Forests \u201d inEuropean Conference on Computer Vision (ECCV)(Springer 2014) 446\u2013461.","DOI":"10.1007\/978-3-319-10599-4_29"},{"key":"e_1_2_10_53_1","doi-asserted-by":"crossref","unstructured":"M. E.NilsbackandA.Zisserman \u201cAutomated Flower Classification over a Large Number of Classes \u201d in2008 Sixth Indian Conference on Computer Vision Graphics & Image Processing(IEEE 2008) 722\u2013729.","DOI":"10.1109\/ICVGIP.2008.47"},{"key":"e_1_2_10_54_1","unstructured":"K.Soomro A.Zamir andM.Shah \u201cUCF101: A Dataset of 101 Human Action Classes from Videos in the Wild \u201darXiv preprint arXiv:1212.0402(2012)."},{"key":"e_1_2_10_55_1","unstructured":"B.Recht R.Roelofs L.Schmidt andV.Shankar \u201cDo ImageNet Classifiers Generalize to ImageNet? \u201darXiv preprint arXiv:1902.10811(2019)."},{"key":"e_1_2_10_56_1","unstructured":"H.Wang S.Ge E.Xing andZ.Lipton \u201cLearning Robust Global Representations by Penalizing Local Predictive Power \u201darXiv preprint arXiv:1905.13549(2019)."},{"key":"e_1_2_10_57_1","unstructured":"D.Hendrycks K.Zhao S.Basart J.Steinhardt andD.Song \u201cNatural Adversarial Examples \u201darXiv preprint arXiv:1907.07174(2019)."},{"key":"e_1_2_10_58_1","doi-asserted-by":"crossref","unstructured":"D.Hendrycks S.Basart N.Mu S.Kadavath F.Wang E.Dorundo R.Desai T.Zhu S.Parajuli M.Guo D.Song J.Steinhardt andJ.Gilmer \u201cThe Many Faces of Robustness: A Critical Analysis of Out\u2010of\u2010Distribution Generalization \u201darXiv preprint arXiv:2006.16241(2020).","DOI":"10.1109\/ICCV48922.2021.00823"},{"key":"e_1_2_10_59_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-022-01653-1"},{"key":"e_1_2_10_60_1","doi-asserted-by":"crossref","unstructured":"K.Zhou J.Yang C. C.Loy andZ.Liu \u201cConditional Prompt Learning for Vision\u2010Language Models \u201d inProceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition(IEEE 2022) 16816\u201316825.","DOI":"10.1109\/CVPR52688.2022.01631"},{"key":"e_1_2_10_61_1","doi-asserted-by":"crossref","unstructured":"Y.Wang X.Jiang D.Cheng D.Li andC.Zhao \u201cLearning Hierarchical Prompt With Structured Linguistic Knowledge for Vision\u2010Language Models \u201d inProceedings of the AAAI Conference on Artificial IntelligenceVol.38(IEEE Information Theory Society 2024) 5749\u20135757.","DOI":"10.1609\/aaai.v38i6.28387"}],"container-title":["IET Image Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/ietresearch.onlinelibrary.wiley.com\/doi\/pdf\/10.1049\/ipr2.70189","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/ietresearch.onlinelibrary.wiley.com\/doi\/full-xml\/10.1049\/ipr2.70189","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/ietresearch.onlinelibrary.wiley.com\/doi\/pdf\/10.1049\/ipr2.70189","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,3]],"date-time":"2026-04-03T11:23:02Z","timestamp":1775215382000},"score":1,"resource":{"primary":{"URL":"https:\/\/ietresearch.onlinelibrary.wiley.com\/doi\/10.1049\/ipr2.70189"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,1]]},"references-count":60,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2025,1]]}},"alternative-id":["10.1049\/ipr2.70189"],"URL":"https:\/\/doi.org\/10.1049\/ipr2.70189","archive":["Portico"],"relation":{},"ISSN":["1751-9659","1751-9667"],"issn-type":[{"value":"1751-9659","type":"print"},{"value":"1751-9667","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,1]]},"assertion":[{"value":"2025-05-09","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-08-14","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-08-24","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}],"article-number":"e70189"}}