{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,21]],"date-time":"2026-05-21T18:14:17Z","timestamp":1779387257631,"version":"3.53.1"},"reference-count":57,"publisher":"Institution of Engineering and Technology (IET)","issue":"2","license":[{"start":{"date-parts":[[2026,3,3]],"date-time":"2026-03-03T00:00:00Z","timestamp":1772496000000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"},{"start":{"date-parts":[[2026,3,3]],"date-time":"2026-03-03T00:00:00Z","timestamp":1772496000000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/doi.wiley.com\/10.1002\/tdm_license_1.1"}],"content-domain":{"domain":["ietresearch.onlinelibrary.wiley.com"],"crossmark-restriction":true},"short-container-title":["CAAI Trans on Intel Tech"],"published-print":{"date-parts":[[2026,4]]},"abstract":"<jats:title>ABSTRACT<\/jats:title>\n                  <jats:p>Analysing learners' facial expressions during learning and exploring their learning processes and emotional changes are of great significance for assisting teachers' teaching and promoting smart education. In complex learning environments, static facial expression recognition fails to capture the dynamic changes of learners' expressions losing the continuous features in the learning process, and its recognition effect is easily interfered with by factors such as occlusion and lighting variations during learning. To address the above issues, a network model based on adaptive global attention and temporal difference is proposed to recognise learners' dynamic expression sequences. Firstly, we have designed an Adaptive Global Attention (AGA) block, which adaptively models inter\u2010channel relationships to dynamically enhance key channels that are highly correlated with learners' states while suppressing redundant information, thereby improving the model's feature representation capability under noisy environments. Secondly, we have designed a Differential Temporal Transformer (DTFormer) to extract differential information between consecutive frames, increasing the model's sensitivity to learners' facial expression dynamics and improving recognition performance. The two components complement each other in terms of spatial feature enhancement and temporal dynamic modelling effectively improving the model's overall capability for representing learners' dynamic facial expressions. Experiments were conducted on public datasets DFEW, FERV39k and the learner E\u2010learning emotional state data set DAiSEE, and comparisons were made with classical methods using objective indicators. The results demonstrate that the proposed method outperforms the comparison methods in multiple performance indicators, thereby verifying its effectiveness.<\/jats:p>","DOI":"10.1049\/cit2.70115","type":"journal-article","created":{"date-parts":[[2026,3,8]],"date-time":"2026-03-08T15:25:22Z","timestamp":1772983522000},"page":"514-528","update-policy":"https:\/\/doi.org\/10.1002\/crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Dynamic Facial Expression Recognition of Learners via Adaptive Global Attention and Differential Temporal Transformer"],"prefix":"10.1049","volume":"11","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6468-3232","authenticated-orcid":false,"given":"Wei","family":"Liu","sequence":"first","affiliation":[{"name":"College of Computer Science and Engineering Shandong University of Science and Technology  Qingdao China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Lujia","family":"Li","sequence":"additional","affiliation":[{"name":"College of Computer Science and Engineering Shandong University of Science and Technology  Qingdao China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Chun","family":"Yan","sequence":"additional","affiliation":[{"name":"College of Mathematics and Systems Science Shandong University of Science and Technology  Qingdao China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yulin","family":"Zhang","sequence":"additional","affiliation":[{"name":"College of Mathematics and Systems Science Shandong University of Science and Technology  Qingdao China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xiaochun","family":"Cheng","sequence":"additional","affiliation":[{"name":"Computer Science Department Swansea University  Swansea UK"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xinyan","family":"Zhao","sequence":"additional","affiliation":[{"name":"College of Computer Science and Engineering Shandong University of Science and Technology  Qingdao China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Mingshi","family":"Liu","sequence":"additional","affiliation":[{"name":"College of Mathematics and Systems Science Shandong University of Science and Technology  Qingdao China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"265","published-online":{"date-parts":[[2026,3,3]]},"reference":[{"key":"e_1_2_10_2_1","doi-asserted-by":"publisher","DOI":"10.3390\/bs13070555"},{"key":"e_1_2_10_3_1","doi-asserted-by":"publisher","DOI":"10.1155\/2022\/6453499"},{"key":"e_1_2_10_4_1","doi-asserted-by":"publisher","DOI":"10.16452\/j.cnki.sdkjzk.2023.05.010"},{"key":"e_1_2_10_5_1","doi-asserted-by":"publisher","DOI":"10.16452\/j.cnki.sdkjzk.2025.05.012"},{"key":"e_1_2_10_6_1","doi-asserted-by":"publisher","DOI":"10.61369\/etr.6415"},{"key":"e_1_2_10_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2023.3322454"},{"key":"e_1_2_10_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/TAFFC.2022.3188390"},{"key":"e_1_2_10_9_1","unstructured":"A.Gupta A.D\u2019Cunha K.Awasthi andV.Balasubramanian \u201cDaisee: Towards User Engagement Recognition in the Wild \u201d preprint arXiv:1609.01885 (2016) https:\/\/doi.org\/10.48550\/arXiv.1609.01885."},{"key":"e_1_2_10_10_1","doi-asserted-by":"publisher","DOI":"10.1088\/1742\u20106596\/1168\/2\/022043"},{"key":"e_1_2_10_11_1","doi-asserted-by":"publisher","DOI":"10.11772\/j.issn.1001\u20109081.2021040846"},{"key":"e_1_2_10_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/3383923.3383949"},{"key":"e_1_2_10_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/Confluence47617.2020.9057967"},{"key":"e_1_2_10_14_1","doi-asserted-by":"publisher","DOI":"10.3390\/s24206748"},{"key":"e_1_2_10_15_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2025.129656"},{"key":"e_1_2_10_16_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10489\u2010023\u201004858\u20100"},{"key":"e_1_2_10_17_1","unstructured":"D.Wang C.Yang andG.Chen \u201cUsing Vision Language Models to Detect Students\u2019 Academic Emotion Through Facial Expressions \u201d preprint arXiv:2506.10334 (2025) https:\/\/doi.org\/10.48550\/arXiv.2506.10334."},{"key":"e_1_2_10_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2023.3294099"},{"key":"e_1_2_10_19_1","doi-asserted-by":"publisher","DOI":"10.3778\/j.issn.1002\u20108331.2203\u20100170"},{"key":"e_1_2_10_20_1","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0312359"},{"key":"e_1_2_10_21_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2023.121419"},{"key":"e_1_2_10_22_1","first-page":"1144","volume-title":"2019 3rd International Conference on Electronic Information Technology and Computer Engineering (EITCE)","author":"Wang L.","year":"2019"},{"key":"e_1_2_10_23_1","doi-asserted-by":"publisher","DOI":"10.11772\/j.issn.1001\u20109081.2022101472"},{"key":"e_1_2_10_24_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00371\u2010019\u201001635\u20104"},{"key":"e_1_2_10_25_1","doi-asserted-by":"publisher","DOI":"10.16452\/j.cnki.sdkjzk.2025.01.010"},{"key":"e_1_2_10_26_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.comcom.2023.12.032"},{"key":"e_1_2_10_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/3474085.3475292"},{"key":"e_1_2_10_28_1","unstructured":"F.Ma B.Sun andS.Li \u201cSpatio\u2010Temporal Transformer for Dynamic Facial Expression Recognition in the Wild \u201d preprint arXiv:2205.04749 (2022) https:\/\/doi.org\/10.48550\/arXiv.2205.04749."},{"key":"e_1_2_10_29_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.asoc.2024.111680"},{"key":"e_1_2_10_30_1","unstructured":"H.Li M.Sui andZ.Zhu \u201cNr\u2010dfernet: Noise\u2010Robust Network for Dynamic Facial Expression Recognition \u201d preprint arXiv:2206.04975 (2022) https:\/\/doi.org\/10.48550\/arXiv.2206.04975."},{"key":"e_1_2_10_31_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.dsp.2025.105470"},{"key":"e_1_2_10_32_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2025.130020"},{"key":"e_1_2_10_33_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neunet.2025.108384"},{"key":"e_1_2_10_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/TNNLS.2020.3027849"},{"key":"e_1_2_10_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52733.2024.00536"},{"key":"e_1_2_10_36_1","doi-asserted-by":"publisher","DOI":"10.1155\/2022\/3518879"},{"key":"e_1_2_10_37_1","doi-asserted-by":"publisher","DOI":"10.1007\/s41870\u2010023\u201001183\u20100"},{"key":"e_1_2_10_38_1","first-page":"3015","volume-title":"ICASSP 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","author":"Wang L.","year":"2024"},{"key":"e_1_2_10_39_1","first-page":"1","volume-title":"ICASSP 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","author":"Ma F.","year":"2023"},{"key":"e_1_2_10_40_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2024.120953"},{"key":"e_1_2_10_41_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"e_1_2_10_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/tcsvt.2023.3312321"},{"key":"e_1_2_10_43_1","doi-asserted-by":"publisher","DOI":"10.1038\/s41598\u2010024\u201056623\u2010z"},{"key":"e_1_2_10_44_1","doi-asserted-by":"publisher","DOI":"10.1145\/3394171.3413620"},{"key":"e_1_2_10_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.02025"},{"key":"e_1_2_10_46_1","doi-asserted-by":"publisher","DOI":"10.3390\/su15010198"},{"key":"e_1_2_10_47_1","doi-asserted-by":"publisher","DOI":"10.14569\/IJACSA.2023.0140371"},{"key":"e_1_2_10_48_1","volume-title":"Student Online Learning Dynamic Facial Expression Recognition Based on an Improved 3D Residual Neural Network(In Chinese)","author":"Zhu D.","year":"2025"},{"key":"e_1_2_10_49_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.502"},{"key":"e_1_2_10_50_1","unstructured":"K.SimonyanandA.Zisserman \u201cVery Deep Convolutional Networks for Large\u2010Scale Image Recognition \u201d preprint arXiv:1409.1556 (2014)."},{"key":"e_1_2_10_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_2_10_52_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.compeleceng.2024.109125"},{"key":"e_1_2_10_53_1","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2511.10958"},{"key":"e_1_2_10_54_1","doi-asserted-by":"publisher","DOI":"10.1109\/taffc.2025.3530973"},{"key":"e_1_2_10_55_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICEIEC.2019.8784507"},{"key":"e_1_2_10_56_1","doi-asserted-by":"publisher","DOI":"10.1109\/CRV52889.2021.00028"},{"key":"e_1_2_10_57_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.imavis.2024.104915"},{"key":"e_1_2_10_58_1","doi-asserted-by":"publisher","DOI":"10.1145\/3581783.3612365"}],"container-title":["CAAI Transactions on Intelligence Technology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/ietresearch.onlinelibrary.wiley.com\/doi\/pdf\/10.1049\/cit2.70115","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/ietresearch.onlinelibrary.wiley.com\/doi\/full-xml\/10.1049\/cit2.70115","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/ietresearch.onlinelibrary.wiley.com\/doi\/pdf\/10.1049\/cit2.70115","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,5,21]],"date-time":"2026-05-21T17:56:34Z","timestamp":1779386194000},"score":1,"resource":{"primary":{"URL":"https:\/\/ietresearch.onlinelibrary.wiley.com\/doi\/10.1049\/cit2.70115"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,3,3]]},"references-count":57,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2026,4]]}},"alternative-id":["10.1049\/cit2.70115"],"URL":"https:\/\/doi.org\/10.1049\/cit2.70115","archive":["Portico"],"relation":{},"ISSN":["2468-6557","2468-2322"],"issn-type":[{"value":"2468-6557","type":"print"},{"value":"2468-2322","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,3,3]]},"assertion":[{"value":"2025-10-12","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2026-02-08","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2026-03-03","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}