{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,4]],"date-time":"2026-05-04T16:08:47Z","timestamp":1777910927971,"version":"3.51.4"},"reference-count":24,"publisher":"SAGE Publications","issue":"16","license":[{"start":{"date-parts":[[2024,4,11]],"date-time":"2024-04-11T00:00:00Z","timestamp":1712793600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"funder":[{"DOI":"10.13039\/501100006606","name":"Natural Science Foundation of Tianjin Municipality","doi-asserted-by":"publisher","award":["18JCYBJC87700"],"award-info":[{"award-number":["18JCYBJC87700"]}],"id":[{"id":"10.13039\/501100006606","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62103298"],"award-info":[{"award-number":["62103298"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Transactions of the Institute of Measurement and Control"],"published-print":{"date-parts":[[2024,12]]},"abstract":"<jats:p>In this paper, a multi-agent\u2013based reinforcement learning (RL) algorithm is proposed to solve the leveling control problem of a multi-cylinder hydraulic press with coupling phenomena. This algorithm is a model-free control algorithm, which can avoid the modeling difficulties and low efficiency caused by the complexity of the model. The control algorithm of the hydraulic press adopts Multi-Agent Soft Actor\u2013Critic (MASAC). The concept of multi-agent is introduced to control each coupling input separately. The distributed updating method is used to realize accurate and stable control of the hydraulic press. At the same time, a reward function of the piecewise function type is proposed in this paper. Compared with common algorithms such as the quadratic reward function, this algorithm has a faster and more stable convergence effect in the whole process. Experiments show that the proposed algorithm has better convergence speed and leveling accuracy than the traditional single-agent algorithm.<\/jats:p>","DOI":"10.1177\/01423312241238034","type":"journal-article","created":{"date-parts":[[2024,4,11]],"date-time":"2024-04-11T05:48:45Z","timestamp":1712814525000},"page":"3153-3168","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":2,"title":["Leveling control of multi-cylinder hydraulic press based on multi-agent reinforcement learning"],"prefix":"10.1177","volume":"46","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-0563-5796","authenticated-orcid":false,"given":"Chao","family":"Jia","sequence":"first","affiliation":[{"name":"School of Electrical Engineering and Automation, Tianjin University of Technology, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Peng","family":"Liu","sequence":"additional","affiliation":[{"name":"School of Electrical Engineering and Automation, Tianjin University of Technology, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"179","published-online":{"date-parts":[[2024,4,11]]},"reference":[{"key":"bibr1-01423312241238034","doi-asserted-by":"publisher","DOI":"10.1243\/09596518JSCE484"},{"key":"bibr2-01423312241238034","doi-asserted-by":"publisher","DOI":"10.1109\/TCST.2021.3075557"},{"key":"bibr3-01423312241238034","doi-asserted-by":"publisher","DOI":"10.1016\/j.engappai.2022.105321"},{"key":"bibr4-01423312241238034","doi-asserted-by":"publisher","DOI":"10.1016\/j.tics.2019.02.006"},{"issue":"11","key":"bibr5-01423312241238034","first-page":"1275","volume":"16","author":"Cheng G","year":"2008","journal-title":"Control Theory & Applications"},{"key":"bibr6-01423312241238034","doi-asserted-by":"publisher","DOI":"10.1049\/rpg2.12534"},{"key":"bibr7-01423312241238034","doi-asserted-by":"publisher","DOI":"10.1109\/TCST.2021.3102476"},{"key":"bibr8-01423312241238034","doi-asserted-by":"publisher","DOI":"10.1109\/TCST.2022.3223185"},{"key":"bibr9-01423312241238034","doi-asserted-by":"publisher","DOI":"10.1515\/htmp-2022-0040"},{"key":"bibr10-01423312241238034","first-page":"01290","volume":"1801","author":"Haarnoja T","year":"2018","journal-title":"arXiv:"},{"key":"bibr11-01423312241238034","author":"Haarnoja T","year":"2018","journal-title":"arXiv:"},{"key":"bibr12-01423312241238034","doi-asserted-by":"publisher","DOI":"10.1109\/TCST.2021.3139762"},{"key":"bibr13-01423312241238034","doi-asserted-by":"publisher","DOI":"10.1016\/j.sysconle.2021.104912"},{"key":"bibr14-01423312241238034","doi-asserted-by":"publisher","DOI":"10.1109\/CAC51589.2020.9326526"},{"key":"bibr15-01423312241238034","doi-asserted-by":"publisher","DOI":"10.1002\/asjc.3038"},{"key":"bibr16-01423312241238034","author":"Lillicrap TP","year":"2015","journal-title":"arXiv:"},{"key":"bibr17-01423312241238034","doi-asserted-by":"publisher","DOI":"10.1109\/TCYB.2020.2977374"},{"key":"bibr18-01423312241238034","doi-asserted-by":"publisher","DOI":"10.1109\/TCST.2022.3227502"},{"key":"bibr19-01423312241238034","author":"Schulman J","year":"2017","journal-title":"arXiv:"},{"key":"bibr20-01423312241238034","doi-asserted-by":"publisher","DOI":"10.1016\/j.asoc.2009.10.020"},{"key":"bibr21-01423312241238034","doi-asserted-by":"publisher","DOI":"10.1016\/j.automatica.2023.110999"},{"key":"bibr22-01423312241238034","doi-asserted-by":"publisher","DOI":"10.1016\/j.media.2021.102193"},{"key":"bibr23-01423312241238034","doi-asserted-by":"publisher","DOI":"10.26599\/TST.2021.9010012"},{"key":"bibr24-01423312241238034","doi-asserted-by":"publisher","DOI":"10.1002\/asjc.2432"}],"container-title":["Transactions of the Institute of Measurement and Control"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/01423312241238034","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/01423312241238034","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/01423312241238034","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T15:10:13Z","timestamp":1777648213000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/01423312241238034"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,4,11]]},"references-count":24,"journal-issue":{"issue":"16","published-print":{"date-parts":[[2024,12]]}},"alternative-id":["10.1177\/01423312241238034"],"URL":"https:\/\/doi.org\/10.1177\/01423312241238034","relation":{},"ISSN":["0142-3312","1477-0369"],"issn-type":[{"value":"0142-3312","type":"print"},{"value":"1477-0369","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,4,11]]}}}