{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,9]],"date-time":"2026-06-09T11:31:16Z","timestamp":1781004676254,"version":"3.54.1"},"reference-count":63,"publisher":"MDPI AG","issue":"8","license":[{"start":{"date-parts":[[2025,8,21]],"date-time":"2025-08-21T00:00:00Z","timestamp":1755734400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J. Imaging"],"abstract":"<jats:p>Sign language is a visual language articulated through body movements. Existing approaches predominantly leverage RGB inputs, incurring substantial computational overhead and remaining susceptible to interference from foreground and background noise. A second fundamental challenge lies in accurately modeling the nonlinear temporal dynamics and inherent asynchrony across body parts that characterize sign language sequences. To address these challenges, we propose a novel part-wise graph Fourier learning method for skeleton-based continuous sign language recognition (PGF-SLR), which uniformly models the spatiotemporal relations of multiple body parts in a globally ordered yet locally unordered manner. Specifically, different parts within different time steps are treated as nodes, while the frequency domain attention between parts is treated as edges to construct a part-level Fourier fully connected graph. This enables the graph Fourier learning module to jointly capture spatiotemporal dependencies in the frequency domain, while our adaptive frequency enhancement method further amplifies discriminative action features in a lightweight and robust fashion. Finally, a dual-branch action learning module featuring an auxiliary action prediction branch to assist the recognition branch is designed to enhance the understanding of sign language. Our experimental results show that the proposed PGF-SLR achieved relative improvements of 3.31%\/3.70% and 2.81%\/7.33% compared to SOTA methods on the dev\/test sets of the PHOENIX14 and PHOENIX14-T datasets. It also demonstrated highly competitive recognition performance on the CSL-Daily dataset, showcasing strong generalization while reducing computational costs in both offline and online settings.<\/jats:p>","DOI":"10.3390\/jimaging11080286","type":"journal-article","created":{"date-parts":[[2025,8,21]],"date-time":"2025-08-21T15:19:02Z","timestamp":1755789542000},"page":"286","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["Part-Wise Graph Fourier Learning for Skeleton-Based Continuous Sign Language Recognition"],"prefix":"10.3390","volume":"11","author":[{"ORCID":"https:\/\/orcid.org\/0009-0004-8601-8638","authenticated-orcid":false,"given":"Dong","family":"Wei","sequence":"first","affiliation":[{"name":"College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-0933-9855","authenticated-orcid":false,"given":"Hongxiang","family":"Hu","sequence":"additional","affiliation":[{"name":"College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0004-8398-2923","authenticated-orcid":false,"given":"Gang-Feng","family":"Ma","sequence":"additional","affiliation":[{"name":"College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2025,8,21]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Camgoz, N.C., Hadfield, S., Koller, O., Ney, H., and Bowden, R. (2018, January 18\u201322). Neural sign language translation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00812"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Min, Y., Hao, A., Chai, X., and Chen, X. (2021, January 11\u201317). Visual alignment constraint for continuous sign language recognition. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, BC, Canada.","DOI":"10.1109\/ICCV48922.2021.01134"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Laines, D., Gonzalez-Mendoza, M., Ochoa-Ruiz, G., and Bejarano, G. (2023, January 17\u201324). Isolated sign language recognition based on tree structure skeleton images. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPRW59228.2023.00033"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Rao, Q., Sun, K., Wang, X., Wang, Q., and Zhang, B. (2024, January 20\u201327). Cross-sentence gloss consistency for continuous sign language recognition. Proceedings of the AAAI Conference on Artificial Intelligence, Vancouver, BC, Canada.","DOI":"10.1609\/aaai.v38i5.28265"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"11221","DOI":"10.1109\/TPAMI.2023.3269220","article-title":"Signbert+: Hand-model-aware self-supervised pre-training for sign language understanding","volume":"45","author":"Hu","year":"2023","journal-title":"IEEE Trans. Pattern Analysis Mach. Intell."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Piergiovanni, A., and Ryoo, M.S. (2019, January 16\u201317). Representation flow for action recognition. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.01018"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Jiao, P., Min, Y., Li, Y., Wang, X., Lei, L., and Chen, X. (2023, January 1\u20136). Cosign: Exploring co-occurrence signals in skeleton-based continuous sign language recognition. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Paris, France.","DOI":"10.1109\/ICCV51070.2023.01890"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Koller, O., Zargaran, S., and Ney, H. (2017, January 21\u201326). Re-sign: Re-aligned end-to-end sequence modelling with deep recurrent cnn-hmms. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.364"},{"key":"ref_9","unstructured":"Cheng, K.L., Yang, Z., Chen, Q., and Tai, Y.-W. (2020, January 23\u201328). Fully convolutional networks for continuous sign language recognition. Proceedings of the Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK. Proceedings, Part XXIV 16."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"687","DOI":"10.1109\/THMS.2022.3144000","article-title":"Sign language recognition based on r (2 + 1) d with spatial\u2013temporal\u2013channel attention","volume":"52","author":"Han","year":"2022","journal-title":"IEEE Trans. Hum.-Mach. Syst."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Graves, A., Fern\u00e1ndez, S., Gomez, F., and Schmidhuber, J. (2006, January 25\u201329). Connectionist temporal classification: Labelling unsegmented sequence data with recurrent neural networks. Proceedings of the 23rd International Conference on Machine Learning, Pittsburgh, PA, USA.","DOI":"10.1145\/1143844.1143891"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Zhang, H., Guo, Z., Yang, Y., Liu, X., and Hu, D. (2023, January 1\u20136). C2st: Cross-modal contextualized sequence transduction for continuous sign language recognition. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Paris, France.","DOI":"10.1109\/ICCV51070.2023.01925"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Yan, S., Xiong, Y., and Lin, D. (2018, January 2\u20137). Spatial temporal graph convolutional networks for skeleton-based action recognition. Proceedings of the AAAI Conference on Artificial Intelligence, New Orleans, LA, USA.","DOI":"10.1609\/aaai.v32i1.12328"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"111521","DOI":"10.1016\/j.patcog.2025.111521","article-title":"A generically contrastive spatiotemporal representation enhancement for 3d skeleton action recognition","volume":"164","author":"Zhang","year":"2025","journal-title":"Pattern Recognit."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Zhou, Y., Yan, X., Cheng, Z.-Q., Yan, Y., Dai, Q., and Hua, X.-S. (2024, January 16\u201322). Blockgcn: Redefine topology awareness for skeleton-based action recognition. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR52733.2024.00200"},{"key":"ref_16","first-page":"5024713","article-title":"Adaptive part-level embedding GCN: Towards robust skeleton-based one-shot action recognition","volume":"74","author":"Liu","year":"2025","journal-title":"IEEE Trans. Instrum. Meas."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Tunga, A., Nuthalapati, S.V., and Wachs, J. (2021, January 5\u20139). Pose-based sign language recognition using gcn and bert. Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision, Online.","DOI":"10.1109\/WACVW52041.2021.00008"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"17","DOI":"10.1038\/s43586-024-00294-7","article-title":"Graph neural networks","volume":"4","author":"Corso","year":"2024","journal-title":"Nat. Rev. Methods Prim."},{"key":"ref_19","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, \u0141., and Polosukhin, I. (2017). Attention is all you need. Advances in Neural Information Processing Systems, NIPS."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"111602","DOI":"10.1016\/j.patcog.2025.111602","article-title":"Mska: Multi-stream keypoint attention network for sign language recognition and translation","volume":"165","author":"Guan","year":"2025","journal-title":"Pattern Recognit."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"357","DOI":"10.1109\/TCSVT.2024.3465845","article-title":"Asynchronous joint-based temporal pooling for skeleton-based action recognition","volume":"35","author":"Gunasekara","year":"2024","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Gan, S., Yin, Y., Jiang, Z., Wen, H., Xie, L., and Lu, S. (2024, January 16\u201322). Signgraph: A sign sequence is worth graphs of nodes. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR52733.2024.01279"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Hu, L., Gao, L., Liu, Z., and Feng, W. (2023, January 17\u201324). Continuous sign language recognition with correlation network. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52729.2023.00249"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Zuo, R., and Mak, B. (2022, January 19\u201320). C2slr: Consistency-enhanced continuous sign language recognition. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.00507"},{"key":"ref_25","unstructured":"MMPose Contributors (2025, August 16). OpenMMLab Pose Estimation Toolbox and Benchmark. Available online: https:\/\/github.com\/open-mmlab\/mmpose."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"4","DOI":"10.1109\/MMUL.2012.24","article-title":"Microsoft kinect sensor and its effect","volume":"19","author":"Zhang","year":"2012","journal-title":"IEEE Multimed."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Li, S., Zheng, L., Zhu, C., and Gao, Y. (2020, January 10\u201321). Bidirectional independently recurrent neural network for skeleton-based hand gesture recognition. Proceedings of the 2020 IEEE International Symposium on Circuits and Systems (ISCAS), Sevilla, Spain.","DOI":"10.1109\/ISCAS45731.2020.9181028"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Chen, Y., Zhang, Z., Yuan, C., Li, B., Deng, Y., and Hu, W. (2021, January 11\u201317). Channel-wise topology refinement graph convolution for skeleton-based action recognition. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, BC, Canada.","DOI":"10.1109\/ICCV48922.2021.01311"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Do, J., and Kim, M. (2024). Skateformer: Skeletal-temporal transformer for human action recognition. European Conference on Computer Vision, Springer.","DOI":"10.1007\/978-3-031-72940-9_23"},{"key":"ref_30","unstructured":"Shi, L., Zhang, Y., Cheng, J., and Lu, H. (December, January 30). Decoupled spatial-temporal attention network for skeleton-based action-gesture recognition. Proceedings of the Asian Conference on Computer Vision, Kyoto, Japan."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Wang, Q., Shi, S., He, J., Peng, J., Liu, T., and Weng, R. (2023, January 15\u201318). Iip-transformer: Intra-inter-part transformer for skeleton-based action recognition. Proceedings of the 2023 IEEE International Conference on Big Data (BigData), Sorrento, Italy.","DOI":"10.1109\/BigData59044.2023.10386970"},{"key":"ref_32","unstructured":"Du, Y., Wang, W., and Wang, L. (2015, January 7\u201312). Hierarchical recurrent neural network for skeleton based action recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Chen, T., Zhou, D., Wang, J., Wang, S., He, Q., Hu, C., Ding, E., Guan, Y., and He, X. (2023, January 5\u20138). Part-aware prototypical graph network for one-shot skeleton-based action recognition. Proceedings of the 2023 IEEE 17th International Conference on Automatic Face and Gesture Recognition (FG), Waikoloa Beach, HI, USA.","DOI":"10.1109\/FG57933.2023.10042671"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"1474","DOI":"10.1109\/TPAMI.2022.3157033","article-title":"Constructing stronger and faster baselines for skeleton-based action recognition","volume":"45","author":"Song","year":"2022","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"110188","DOI":"10.1016\/j.patcog.2023.110188","article-title":"Multi-grained clip focus for skeleton-based action recognition","volume":"148","author":"Qiu","year":"2024","journal-title":"Pattern Recognit."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"126646","DOI":"10.1016\/j.eswa.2025.126646","article-title":"Multi-modal and multi-part with skeletons and texts for action recognition","volume":"272","author":"Zhou","year":"2025","journal-title":"Expert Syst. Appl."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"109231","DOI":"10.1016\/j.patcog.2022.109231","article-title":"Spatiotemporal focus for skeleton-based action recognition","volume":"136","author":"Wu","year":"2023","journal-title":"Pattern Recognit."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Xiang, W., Li, C., Zhou, Y., Wang, B., and Zhang, L. (2023, January 1\u20136). Generative action description prompts for skeleton-based action recognition. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Paris, France.","DOI":"10.1109\/ICCV51070.2023.00943"},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"104213","DOI":"10.1016\/j.cviu.2024.104213","article-title":"A gcn and transformer complementary network for skeleton-based action recognition","volume":"249","author":"Xiang","year":"2024","journal-title":"Comput. Vis. Image Underst."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Hang, R., and Li, M. (2022, January 4\u20138). Spatial-temporal adaptive graph convolutional network for skeleton-based action recognition. Proceedings of the Asian Conference on Computer Vision, Macao, China.","DOI":"10.1007\/978-3-031-26316-3_11"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Xie, J., Meng, Y., Zhao, Y., Nguyen, A., Yang, X., and Zheng, Y. (2024, January 26\u201327). Dynamic semantic-based spatial graph convolution network for skeleton-based human action recognition. Proceedings of the AAAI Conference on Artificial intelligence, Vancouver, BC, Canada.","DOI":"10.1609\/aaai.v38i6.28440"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Huang, Q., Geng, Q., Chen, Z., Li, X., Li, Y., and Li, X. (2025, January 6\u201311). Joint-wise distributed perception graph convolutional network for skeleton-based action recognition. Proceedings of the ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Hyderabad, India.","DOI":"10.1109\/ICASSP49660.2025.10888560"},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"1406","DOI":"10.1109\/JAS.2022.105743","article-title":"Complex-valued neural networks: A comprehensive survey","volume":"9","author":"Lee","year":"2022","journal-title":"IEEE\/CAA J. Autom. Sin."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"24","DOI":"10.1038\/s41524-024-01208-7","article-title":"Ultrafast bragg coherent diffraction imaging of epitaxial thin films using deep complex-valued neural networks","volume":"10","author":"Yu","year":"2024","journal-title":"npj Comput. Mater."},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"260","DOI":"10.1016\/j.acha.2015.02.005","article-title":"Vertex-frequency analysis on graphs","volume":"40","author":"Shuman","year":"2016","journal-title":"Appl. Comput. Harmon. Anal."},{"key":"ref_46","unstructured":"Ai, G., Pang, G., Qiao, H., Gao, Y., and Yan, H. (2025, January 13\u201319). Grokformer: Graph fourier kolmogorov-arnold transformers. Proceedings of the 42st International Conference on Machine Learning (ICML), Vancouver, BC, Canada."},{"key":"ref_47","first-page":"69638","article-title":"Fouriergnn: Rethinking multivariate time series forecasting from a pure graph perspective","volume":"36","author":"Yi","year":"2023","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Katznelson, Y. (2004). An Introduction to Harmonic Analysis, Cambridge University Press.","DOI":"10.1017\/CBO9781139165372"},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"7354","DOI":"10.1109\/TCSVT.2025.3544212","article-title":"Cross-modal adaptive prototype learning for continuous sign language recognition. IEEE Trans","volume":"35","author":"Wei","year":"2025","journal-title":"Circuits Syst. Video Technol."},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Guo, L., Xue, W., Guo, Q., Liu, B., Zhang, K., Yuan, T., and Chen, S. (2023, January 17\u201324). Distilling cross-temporal contexts for continuous sign language recognition. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52729.2023.01037"},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","article-title":"Long short-term memory","volume":"9","author":"Hochreiter","year":"1997","journal-title":"Neural Comput."},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"108","DOI":"10.1016\/j.cviu.2015.09.013","article-title":"Continuous sign language recognition: Towards large vocabulary statistical recognition systems handling multiple signers","volume":"141","author":"Koller","year":"2015","journal-title":"Comput. Vis. Image Underst."},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Zhou, H., Zhou, W., Qi, W., Pu, J., and Li, H. (2021, January 19\u201325). Improving sign language translation with monolingual data by sign back-translation. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.00137"},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Hao, A., Min, Y., and Chen, X. (2021, January 11\u201317). Self-mutual distillation learning for continuous sign language recognition. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, BC, Canada.","DOI":"10.1109\/ICCV48922.2021.01111"},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Hu, L., Gao, L., Liu, Z., and Feng, W. (2022). Temporal lift pooling for continuous sign language recognition. European Conference on Computer Vision, Springer.","DOI":"10.1007\/978-3-031-19833-5_30"},{"key":"ref_56","unstructured":"Hu, L., Gao, L., Liu, Z., Pun, C.-M., and Feng, W. (November, January 9). Adabrowse: Adaptive video browser for efficient continuous sign language recognition. Proceedings of the 31st ACM International Conference on Multimedia, Ottawa, ON, Canada."},{"key":"ref_57","doi-asserted-by":"crossref","unstructured":"Gan, S., Yin, Y., Jiang, Z., Xia, K., Xie, L., and Lu, S. (2023, January 19\u201325). Contrastive learning for sign language recognition and translation. Proceedings of the IJCAI, Macao, China.","DOI":"10.24963\/ijcai.2023\/85"},{"key":"ref_58","unstructured":"Hu, L., Gao, L., and Liu, Z. (2023, January 8\u201312). Self-emphasizing network for continuous sign language recognition. Proceedings of the AAAI Conference on Artificial Intelligence, Washington, DC, USA."},{"key":"ref_59","doi-asserted-by":"crossref","unstructured":"Kan, J., Hu, K., Hagenbuchner, M., Tsoi, A.C., Bennamoun, M., and Wang, Z. (2022, January 3\u20138). Sign language translation with hierarchical spatio-temporal graph neural network. Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision, Waikoloa, HI, USA.","DOI":"10.1109\/WACV51458.2022.00219"},{"key":"ref_60","doi-asserted-by":"crossref","unstructured":"Lu, H., Salah, A.A., and Poppe, R. (2024, January 26\u201327). Tcnet: Continuous sign language recognition from trajectories and correlated regions. Proceedings of the AAAI Conference on Artificial Intelligence, Vancouver, BC, Canada.","DOI":"10.1609\/aaai.v38i4.28181"},{"key":"ref_61","doi-asserted-by":"crossref","first-page":"768","DOI":"10.1109\/TMM.2021.3059098","article-title":"Spatial-temporal multi-cue network for sign language recognition and translation","volume":"24","author":"Zhou","year":"2021","journal-title":"IEEE Trans. Multimed."},{"key":"ref_62","doi-asserted-by":"crossref","first-page":"19","DOI":"10.1016\/S0167-6393(01)00041-3","article-title":"Testing the correlation of word error rate and perplexity","volume":"38","author":"Klakow","year":"2002","journal-title":"Speech Commun."},{"key":"ref_63","first-page":"17043","article-title":"Two-stream network for sign language recognition and translation","volume":"35","author":"Chen","year":"2022","journal-title":"Adv. Neural Inf. Process. Syst."}],"container-title":["Journal of Imaging"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2313-433X\/11\/8\/286\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T18:33:04Z","timestamp":1760034784000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2313-433X\/11\/8\/286"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,8,21]]},"references-count":63,"journal-issue":{"issue":"8","published-online":{"date-parts":[[2025,8]]}},"alternative-id":["jimaging11080286"],"URL":"https:\/\/doi.org\/10.3390\/jimaging11080286","relation":{},"ISSN":["2313-433X"],"issn-type":[{"value":"2313-433X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,8,21]]}}}