{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,23]],"date-time":"2026-01-23T10:32:59Z","timestamp":1769164379286,"version":"3.49.0"},"reference-count":19,"publisher":"Emerald","issue":"5","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,10,16]]},"abstract":"<jats:sec>\n                  <jats:title>Purpose<\/jats:title>\n                  <jats:p>To solve the 3C assembly task, the traditional robot needs a lot of manual coding and the skills learned are faced with the problems of adapting to the scene and task diversity, lack of generalization ability and so on. This paper aims to propose a skill knowledge and multimodal information fusion algorithm (SKMIF).<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Design\/methodology\/approach<\/jats:title>\n                  <jats:p>This method combines skill knowledge and multimodal information in large language model to enhance 3C assembly task skills. The SKMIF algorithm is used to conduct experiments in simulated and real 3C assembly tasks, which verifies the effectiveness of the method in single-task and multi-task scenarios, and solves the problem of insufficient generalization ability of automated programming.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Findings<\/jats:title>\n                  <jats:p>Through the transfer from simulation to reality, the assembly strategy learned in the virtual environment is applied to the real scene, which significantly improves the assembly accuracy and success rate in the real environment. The verification of the enhanced 3C soft-row line assembly task shows that the success rate is 96%.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Originality\/value<\/jats:title>\n                  <jats:p>This paper proposes a new algorithm to enhance 3C assembly skills, to improve generalization ability and adaptability to multitasking environments.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1108\/ir-10-2024-0495","type":"journal-article","created":{"date-parts":[[2025,2,17]],"date-time":"2025-02-17T05:40:56Z","timestamp":1739770856000},"page":"722-728","source":"Crossref","is-referenced-by-count":1,"title":["Integrating skill knowledge and multimodal information to enhance 3C assembly skills"],"prefix":"10.1108","volume":"52","author":[{"given":"Liqun","family":"Liu","sequence":"first","affiliation":[{"name":"Northeast Petroleum University School of Physics and Electronic Engineering, , Daqing,","place":["China"]}]},{"given":"Yupeng","family":"Xie","sequence":"additional","affiliation":[{"name":"Northeast Petroleum University School of Physics and Electronic Engineering, , Daqing,","place":["China"]}]},{"given":"Ting","family":"Liu","sequence":"additional","affiliation":[{"name":"Northeast Petroleum University School of Physics and Electronic Engineering, , Daqing,","place":["China"]}]}],"member":"140","published-online":{"date-parts":[[2025,2,19]]},"reference":[{"key":"2025101507171599300_ref001","doi-asserted-by":"publisher","DOI":"10.1109\/roman.2009.5326198","article-title":"A probabilistic approach for attention-based multi-modal human-robot interaction","author":"Begum","year":"2009","journal-title":"Robot and Human Interactive Communication"},{"key":"2025101507171599300_ref002","doi-asserted-by":"publisher","DOI":"10.1109\/ICMCCE.2018.00085","article-title":"Analysis on the trajectory planning and simulation of six degrees of freedom manipulator","author":"Cheng","year":"2018"},{"issue":"11","key":"2025101507171599300_ref003","first-page":"13467","article-title":"Towards large-scale small object detection: survey and benchmarks","volume":"44","author":"Cheng","year":"2023","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2025101507171599300_ref004","doi-asserted-by":"publisher","first-page":"194","DOI":"10.1016\/j.neunet.2018.06.016","article-title":"Social babbling: the emergence of symbolic gestures and words","volume":"106","author":"Cohen","year":"2018","journal-title":"Neural Networks"},{"key":"2025101507171599300_ref005","first-page":"551","article-title":"Design of a visual guidance robotic assembly system for flexible satellite equipment unit assembly","author":"Feng","year":"2023"},{"key":"2025101507171599300_ref006","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-57496-2_9","article-title":"Enabling seamless human-robot collaboration in manufacturing using LLMs","author":"Gkournelos","year":"2024"},{"key":"2025101507171599300_ref007","first-page":"175","article-title":"Instruction-driven history-aware policies for robotic manipulations","author":"Guhur","year":"2023"},{"issue":"4","key":"2025101507171599300_ref008","doi-asserted-by":"crossref","first-page":"475","DOI":"10.4271\/10-07-04-0031","article-title":"Robust multiagent reinforcement learning toward coordinated decision-making of automated vehicles","volume":"7","author":"He","year":"2023","journal-title":"SAE International Journal of Vehicle Dynamics, Stability, and NVH"},{"key":"2025101507171599300_ref009","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1705.05548","article-title":"Intel RealSense stereoscopic depth cameras","author":"Keselman","year":"2017"},{"key":"2025101507171599300_ref010","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2004.1389727","article-title":"Design and use paradigms for Gazebo, an open-source multi-robot simulator","volume-title":"IEEE","author":"Koenig","year":"2004"},{"issue":"8","key":"2025101507171599300_ref011","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2024.3415931","article-title":"Enhancing the LLM-based robot manipulation through human-robot collaboration","volume":"9","author":"Liu","year":"2024","journal-title":"IEEE Robotics and Automation Letters"},{"issue":"1","key":"2025101507171599300_ref012","doi-asserted-by":"crossref","first-page":"125","DOI":"10.1108\/IR-07-2023-0156","article-title":"Using digital twin to enhance Sim2real transfer for reinforcement learning in 3C assembly","volume":"51","author":"Mu","year":"2024","journal-title":"Industrial Robot: The International Journal of Robotics Research and Application"},{"key":"2025101507171599300_ref013","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-13841-6_4","volume-title":"Model Predictive 6D Image-Based Visual Servoing for 3C Products Assembly","author":"Qu","year":"2022"},{"issue":"2","key":"2025101507171599300_ref014","doi-asserted-by":"crossref","first-page":"397","DOI":"10.1109\/LCSYS.2020.3002852","article-title":"Self-configuring robot path planning with obstacle avoidance via deep reinforcement learning","volume":"5","author":"Sangiovanni","year":"2021","journal-title":"IEEE Control Syst. Lett"},{"key":"2025101507171599300_ref015","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-00126-0_25","article-title":"Toward auto-learning hyperparameters for deep learning-based recommender systems","author":"Sun","year":"2022"},{"issue":"7","key":"2025101507171599300_ref016","doi-asserted-by":"crossref","DOI":"10.1109\/TCYB.2024.3368148","article-title":"Digital-twin-assisted skill learning for 3C assembly tasks","volume":"54","author":"Sun","year":"2024","journal-title":"IEEE Transactions on Cybernetics"},{"key":"2025101507171599300_ref017","doi-asserted-by":"publisher","DOI":"10.1109\/CIS-RAM61939.2024.10672715","article-title":"Multi-modal LLM-enabled long-horizon skill learning for robotic manipulation","author":"Tan","year":"2024"},{"issue":"6","key":"2025101507171599300_ref018","doi-asserted-by":"publisher","first-page":"550","DOI":"10.1016\/j.rcim.2004.12.002","article-title":"Dynamic scheduling in flexible assembly system based on timed petri nets model","volume":"21","author":"Zhang","year":"2015","journal-title":"Robotics and Computer-Integrated Manufacturing"},{"issue":"8","key":"2025101507171599300_ref019","doi-asserted-by":"crossref","first-page":"1389","DOI":"10.1109\/JAS.2021.1004084","article-title":"An RGB-D camera based visual positioning system for assistive navigation by a robotic navigation aid","volume":"8","author":"Zhang","year":"2021","journal-title":"IEEE\/CAA Journal of Automatica Sinica"}],"container-title":["Industrial Robot: the international journal of robotics research and application"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/IR-10-2024-0495\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/ir\/article-pdf\/52\/5\/722\/10355721\/ir-10-2024-0495en.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/www.emerald.com\/ir\/article-pdf\/52\/5\/722\/10355721\/ir-10-2024-0495en.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,15]],"date-time":"2025-10-15T11:17:23Z","timestamp":1760527043000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.emerald.com\/ir\/article\/52\/5\/722\/1256537\/Integrating-skill-knowledge-and-multimodal"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,2,19]]},"references-count":19,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2025,10,16]]}},"URL":"https:\/\/doi.org\/10.1108\/ir-10-2024-0495","relation":{},"ISSN":["0143-991X","1758-5791"],"issn-type":[{"value":"0143-991X","type":"print"},{"value":"1758-5791","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,2,19]]}}}