{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,2]],"date-time":"2026-05-02T06:35:10Z","timestamp":1777703710759,"version":"3.51.4"},"reference-count":23,"publisher":"SAGE Publications","issue":"1","license":[{"start":{"date-parts":[[2018,6,5]],"date-time":"2018-06-05T00:00:00Z","timestamp":1528156800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"published-print":{"date-parts":[[2018,7,27]]},"abstract":"<jats:p>A robot demonstration method is proposed based on the combination of locally weighted regression(LWR) and Q-learning algorithm. It is applied on a 6-DOF hitting-ball-system. This method can adapt to the work task by learning from demonstration and generating new actions. With the LWR algorithm, the mapping between target values and actions is established. According to deviation of landing position, a Q-learning algorithm is proposed to adjust the parameters of manipulator and compensate the errors caused by model and the controller. The model of LWR fits a local small space to approximate the global state and decision space. It turns out to reduce the dimension and simplify the training of Q-learning. The convergence rate is enhanced and the precision of performing task is improved. The simulation and experiment demonstrate the applicability of the proposed method.<\/jats:p>","DOI":"10.3233\/jifs-169564","type":"journal-article","created":{"date-parts":[[2018,6,5]],"date-time":"2018-06-05T14:30:30Z","timestamp":1528209030000},"page":"35-46","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":3,"title":["A robot demonstration method based on LWR and Q-learning algorithm"],"prefix":"10.1177","volume":"35","author":[{"given":"Guangzhe","family":"Zhao","sequence":"first","affiliation":[{"name":"Beijing University of Civil Engineering and Architecture, Beijing, China"},{"name":"University of Chinese Academy of Sciences, Beijing, China"},{"name":"Yanbian University, Yanji, China"}]},{"given":"Yong","family":"Tao","sequence":"additional","affiliation":[{"name":"Beihang University, Beijing, China"}]},{"given":"Hui","family":"Liu","sequence":"additional","affiliation":[{"name":"Beihang University, Beijing, China"}]},{"given":"Xianling","family":"Deng","sequence":"additional","affiliation":[{"name":"Chongqing University of Science and Technology, Chongqing, China"}]},{"given":"Youdong","family":"Chen","sequence":"additional","affiliation":[{"name":"Beihang University, Beijing, China"}]},{"given":"Hegen","family":"Xiong","sequence":"additional","affiliation":[{"name":"Wuhan University of Science and Technology, Wuhan, China"}]},{"given":"Xianwu","family":"Xie","sequence":"additional","affiliation":[{"name":"Wuhan University of Science and Technology, Wuhan, China"}]},{"given":"Zengliang","family":"Fang","sequence":"additional","affiliation":[{"name":"Beihang University, Beijing, China"}]}],"member":"179","published-online":{"date-parts":[[2018,6,5]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.robot.2004.03.001"},{"key":"e_1_3_2_3_2","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511489808"},{"key":"e_1_3_2_4_2","doi-asserted-by":"crossref","unstructured":"EnglertP. ParaschosA. PetersJ. and DeisenrothM.P. Model-based imitation learning by probabilistic trajectory matching IEEE International Conference on Robotics and Automation IEEE2013 pp. 1922\u20131927.","DOI":"10.1109\/ICRA.2013.6630832"},{"key":"e_1_3_2_5_2","doi-asserted-by":"publisher","DOI":"10.1109\/TSMCB.2006.886952"},{"key":"e_1_3_2_6_2","doi-asserted-by":"publisher","DOI":"10.1109\/TNNLS.2016.2543000"},{"key":"e_1_3_2_7_2","doi-asserted-by":"crossref","unstructured":"HwangK.S. JiangW.C. and TsengY.C. An unified approach to inverse reinforcement learning by oppositive demonstrations IEEE International Conference on Industrial Technology IEEE2016 pp. 1664\u20131668.","DOI":"10.1109\/ICIT.2016.7475012"},{"key":"e_1_3_2_8_2","doi-asserted-by":"crossref","unstructured":"LevasA. SelfridgeM. A user-friendly high-level robot teaching system IEEE IEEE International Conference on Robotics and Automation Proceedings (Vol.1) (1984) pp. 413\u2013416.","DOI":"10.1109\/ROBOT.1984.1087193"},{"key":"e_1_3_2_9_2","unstructured":"KreuzigerJ. KaiserM. and DillmannR. Robot Programming by Demonstration(RPD) - Using Machine Learning and User Interaction Methods for the Development of Easy and Comfortable Robot Programming Systems International Symposium on Industrial Robots(1994) pp. 685\u2013693."},{"key":"e_1_3_2_10_2","doi-asserted-by":"crossref","unstructured":"MuellingK. KoberJ. PetersJ. Learning table tennis with a Mixture of Motor Primitives IEEE-Ras International Conference on Humanoid Robots IEEE2010 pp. 411\u2013416.","DOI":"10.1109\/ICHR.2010.5686298"},{"key":"e_1_3_2_11_2","unstructured":"RasmussenC.E. and WilliamsC.K.I. Gaussian Processes for Machine Learning(Adaptive Computation and Machine Learning) The MIT Press2005."},{"key":"e_1_3_2_12_2","doi-asserted-by":"crossref","unstructured":"Garcia-SillasD. Gorrostieta-HurtadoE. Soto-VargasE. Diaz-DelgadoG. and Rodriguez-RiveroC. Learning from demonstration with Gaussian processes Mechatronics Adaptive and Intelligent Systems IEEE(2016) pp. 1\u20136.","DOI":"10.1109\/MAIS.2016.7761899"},{"key":"e_1_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.1109\/TRO.2005.844689"},{"key":"e_1_3_2_14_2","doi-asserted-by":"crossref","unstructured":"SchaalS. AtkesonC.G. VijayakumarS. Real-time robot learning with locally weighted statistical learning. In Robotics and Automation Proceedings ICRA\u201900 IEEE International Conference on(Vol. 1) IEEE2000 pp. 288\u2013293.","DOI":"10.1109\/ROBOT.2000.844072"},{"key":"e_1_3_2_15_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCST.2012.2208193"},{"key":"e_1_3_2_16_2","doi-asserted-by":"crossref","unstructured":"PetersJ. and SchaalS. Policy gradient methods for robotics In Intelligent Robots and Systems 2006 IEEE\/RSJ International Conference on IEEE 2006 pp. 2219\u20132225","DOI":"10.1109\/IROS.2006.282564"},{"key":"e_1_3_2_17_2","doi-asserted-by":"crossref","unstructured":"EnglertP. ParaschosA. PetersJ. DeisenrothM.P. Model-based imitation learning by probabilistic trajectory matching In Robotics and Automation(ICRA) 2013 IEEE International Conference on IEEE 2013 pp. 1922\u20131927.","DOI":"10.1109\/ICRA.2013.6630832"},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.1177\/1059712313491614"},{"key":"e_1_3_2_19_2","doi-asserted-by":"publisher","DOI":"10.1163\/156855307782148550"},{"key":"e_1_3_2_20_2","doi-asserted-by":"crossref","unstructured":"BayoumiA. BennewitzM. Learning optimal navigation actions for foresighted robot behavior during assistance tasks In Robotics and Automation(ICRA) 2016 IEEE International Conference on IEEE 2016 pp. 207\u2013212.","DOI":"10.1109\/ICRA.2016.7487135"},{"key":"e_1_3_2_21_2","doi-asserted-by":"crossref","unstructured":"AnsariY. FaloticoE. MollardY. BuschB. CianchettiM. LaschiC. A Multiagent Reinforcement Learning approach for inverse kinematics of high dimensional manipulators with precision positioning In Biomedical Robotics and Biomechatronics(BioRob) 2016 6th IEEE International Conference on IEEE 2016 pp. 457\u2013463.","DOI":"10.1109\/BIOROB.2016.7523669"},{"key":"e_1_3_2_22_2","doi-asserted-by":"crossref","unstructured":"GhadirzadehA. B\u00fctepageJ. MakiA. KragicD. Bj\u00f6rkmanM. A sensorimotor reinforcement learning framework for physical human-robot interaction In Intelligent Robots and Systems(IROS) 2016 IEEE\/RSJ International Conference on IEEE 2016 pp. 2682\u20132688.","DOI":"10.1109\/IROS.2016.7759417"},{"key":"e_1_3_2_23_2","doi-asserted-by":"crossref","unstructured":"ChebotarY. HausmanK. SuZ. SukhatmeG.S. SchaalS. Self-supervised regrasping using spatio-temporal tactile features and reinforcement learning In Intelligent Robots and Systems(IROS) 2016 IEEE\/RSJ International Conference on IEEE 2016 pp. 1960\u20131966.","DOI":"10.1109\/IROS.2016.7759309"},{"key":"e_1_3_2_24_2","doi-asserted-by":"crossref","unstructured":"WangJ. LiangZ. ZhouZ. ZhangY. Kicking motion design of humanoid robots using gradual accumulation learning method based on Q-learning In Control and Decision Conference(CCDC) 2016 Chinese IEEE 2016 pp. 5274\u20135279.","DOI":"10.1109\/CCDC.2016.7531941"}],"container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-169564","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.3233\/JIFS-169564","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-169564","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T09:39:23Z","timestamp":1777455563000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.3233\/JIFS-169564"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,6,5]]},"references-count":23,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2018,7,27]]}},"alternative-id":["10.3233\/JIFS-169564"],"URL":"https:\/\/doi.org\/10.3233\/jifs-169564","relation":{},"ISSN":["1064-1246","1875-8967"],"issn-type":[{"value":"1064-1246","type":"print"},{"value":"1875-8967","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,6,5]]}}}