{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,21]],"date-time":"2026-02-21T07:28:51Z","timestamp":1771658931978,"version":"3.50.1"},"reference-count":14,"publisher":"MDPI AG","issue":"6","license":[{"start":{"date-parts":[[2023,3,10]],"date-time":"2023-03-10T00:00:00Z","timestamp":1678406400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Science and Technology Council (NSTC)","award":["MOST 109-2221-E-018-001-MY2"],"award-info":[{"award-number":["MOST 109-2221-E-018-001-MY2"]}]},{"name":"National Science and Technology Council (NSTC)","award":["MOST 111-2623-E-005-003"],"award-info":[{"award-number":["MOST 111-2623-E-005-003"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>With the rise of Industry 4.0 and artificial intelligence, the demand for industrial automation and precise control has increased. Machine learning can reduce the cost of machine parameter tuning and improve high-precision positioning motion. In this study, a visual image recognition system was used to observe the displacement of an XXY planar platform. Ball-screw clearance, backlash, nonlinear frictional force, and other factors affect the accuracy and reproducibility of positioning. Therefore, the actual positioning error was determined by inputting images captured by a charge-coupled device camera into a reinforcement Q-learning algorithm. Time-differential learning and accumulated rewards were used to perform Q-value iteration to enable optimal platform positioning. A deep Q-network model was constructed and trained through reinforcement learning for effectively estimating the XXY platform\u2019s positioning error and predicting the command compensation according to the error history. The constructed model was validated through simulations. The adopted methodology can be extended to other control applications based on the interaction between feedback measurement and artificial intelligence.<\/jats:p>","DOI":"10.3390\/s23063027","type":"journal-article","created":{"date-parts":[[2023,3,13]],"date-time":"2023-03-13T03:28:33Z","timestamp":1678678113000},"page":"3027","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["Manipulating XXY Planar Platform Positioning Accuracy by Computer Vision Based on Reinforcement Learning"],"prefix":"10.3390","volume":"23","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4065-7731","authenticated-orcid":false,"given":"Yi-Cheng","family":"Huang","sequence":"first","affiliation":[{"name":"Department of Mechanical Engineering, National Chung Hsing University, Taichung 40227, Taiwan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yung-Chun","family":"Chan","sequence":"additional","affiliation":[{"name":"Department of Mechatronics Engineering, National Changhua University of Education, Changhua City 50074, Taiwan"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2023,3,10]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Lin, C.-J., Hsu, H.-H., Cheng, C.-H., and Li, Y.-C. (2016). Design of an Image-Servo Mask alignment system using dual CCDs with an XXY stage. Appl. Sci., 6.","DOI":"10.3390\/app6020042"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"592312","DOI":"10.1155\/2013\/592312","article-title":"Vision servo motion control and error analysis of a coplanar XXY stage for image alignment motion","volume":"2013","author":"Lee","year":"2013","journal-title":"Math. Probl. Eng."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","article-title":"Long short-term memory","volume":"9","author":"Hochreiter","year":"1997","journal-title":"Neural Comput."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"881","DOI":"10.1177\/00202940221083574","article-title":"An Improved Particle Swarm Optimization Using Long Short-Term Memory Model for Positioning Control of a Coplanar XXY Stage","volume":"55","author":"Huang","year":"2022","journal-title":"Meas. Control"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1292","DOI":"10.1016\/j.mspro.2014.07.108","article-title":"Optimization of Machining Parameters in EDM Process Using Cast and Sintered Copper Electrodes","volume":"6","author":"Balasubramaniana","year":"2014","journal-title":"Procedia Mater. Sci."},{"key":"ref_6","unstructured":"Sutton, R.S., and Barto, A.G. (2018). Reinforcement Learning: An Introduction, The MIT Press. [2nd ed.]."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"844","DOI":"10.1016\/j.ifacol.2018.11.440","article-title":"Online Tuning of PID controller using Black Box Multi-Objective Optimization and Reinforcement Learning","volume":"51","author":"Pandit","year":"2018","journal-title":"IFAC-PapersOnLine"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Nie, L., Mu, C., Yin, Z., and Jiang, W. (2020, January 27\u201328). Control law design of variable cycle engine based on DQN. Proceedings of the 3rd International Conference on Unmanned Systems (ICUS), Harbin, China.","DOI":"10.1109\/ICUS50048.2020.9274888"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1016\/j.engappai.2014.01.015","article-title":"Reinforcement learning of ball screw feed drive controllers","volume":"30","author":"Gauna","year":"2014","journal-title":"Eng. Appl. Artif. Intell."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Chen, S., Yan, D., Zhang, Y., Tan, Y., and Wang, W. (2019, January 20\u201322). Live Working Manipulator Control Model based on DPPO-DQN Combined Algorithm. Proceedings of the IEEE 4th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC), Chengdu, China.","DOI":"10.1109\/IAEAC47372.2019.8997839"},{"key":"ref_11","unstructured":"Sakryukin, A., Raissi, C., and Kankanhalli, M. (2020, January 13\u201318). Inferring DQN structure for high-dimensional continuous control. Proceedings of the 37th International Conference on Machine Learning, Virtual Event."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"529","DOI":"10.1038\/nature14236","article-title":"Human-level control through deep reinforcement learning","volume":"518","author":"Mnih","year":"2015","journal-title":"Nature"},{"key":"ref_13","unstructured":"Chiuan Yan Technology Co., Ltd (2022, December 05). XXY-25-7 Model Description. Available online: https:\/\/www.aafteck.com\/en\/product-609758\/XXY25-07-XXY25-07.html."},{"key":"ref_14","unstructured":"Adlink Technology Inc (2009). Motion Control Card User\u2019s Manual, PCI-8253\/8256 DSP-Based 3\/6-Axis Analog Motion Control Card User\u2019s Manual, Adlink Technology Inc."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/6\/3027\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T18:52:27Z","timestamp":1760122347000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/6\/3027"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,3,10]]},"references-count":14,"journal-issue":{"issue":"6","published-online":{"date-parts":[[2023,3]]}},"alternative-id":["s23063027"],"URL":"https:\/\/doi.org\/10.3390\/s23063027","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,3,10]]}}}