{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,23]],"date-time":"2026-02-23T23:34:56Z","timestamp":1771889696489,"version":"3.50.1"},"reference-count":27,"publisher":"MDPI AG","issue":"21","license":[{"start":{"date-parts":[[2023,10,24]],"date-time":"2023-10-24T00:00:00Z","timestamp":1698105600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Due to the multi-loop coupling characteristics of multivariable systems, it is difficult for traditional control methods to achieve precise control effects. Therefore, this paper proposes a control method based on deep reinforcement learning to achieve stable and accurate control of multivariable coupling systems. Based on the proximal policy optimization (PPO) algorithm, this method selects tanh as the activation function and normalizes the advantage function. At the same time, based on the characteristics of the multivariable coupling system, the structures of the reward function and controller are redesigned, achieving stable and precise control of the controlled system. In addition, this study used the amplitude of the control quantity output by the controller as an indicator to evaluate the controller\u2019s performance. Finally, simulation verification was conducted in MATLAB\/Simulink. 
The experimental results show that compared with decentralized control, decoupled control and traditional PPO control, the method proposed in this article achieves better control effects.<\/jats:p>","DOI":"10.3390\/s23218679","type":"journal-article","created":{"date-parts":[[2023,10,24]],"date-time":"2023-10-24T11:39:04Z","timestamp":1698147544000},"page":"8679","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["Multivariable Coupled System Control Method Based on Deep Reinforcement Learning"],"prefix":"10.3390","volume":"23","author":[{"given":"Jin","family":"Xu","sequence":"first","affiliation":[{"name":"School of Artificial Intelligence, Shenyang Aerospace University, Shenyang 110136, China"}]},{"given":"Han","family":"Li","sequence":"additional","affiliation":[{"name":"School of Artificial Intelligence, Shenyang Aerospace University, Shenyang 110136, China"}]},{"given":"Qingxin","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Artificial Intelligence, Shenyang Aerospace University, Shenyang 110136, China"}]}],"member":"1968","published-online":{"date-parts":[[2023,10,24]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Almeida, A.M.D., Lenzi, M.K., and Lenzi, E.K. (2020). A Survey of Fractional Order Calculus Applications of Multiple-Input, Multiple-Output (MIMO) Process Control. Fractal Fract., 4.","DOI":"10.3390\/fractalfract4020022"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"124","DOI":"10.1109\/JSYST.2021.3079293","article-title":"A Robust Stability Region-Based Decentralized PI Controller for a Multivariable Liquid Level System","volume":"16","author":"Mahapatro","year":"2022","journal-title":"IEEE Syst. J."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Liu, J., and Li, P. (2021). Control and Real-Time Data Acquisition of an Experimental Platform for Stored Grain Aeration Study. 
Sensors, 21.","DOI":"10.3390\/s21165403"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"342","DOI":"10.1109\/TSMC.2016.2602826","article-title":"Virtual Unmodeled Dynamics Modeling for Nonlinear Multivariable Adaptive Control with Decoupling Design","volume":"48","author":"Zhang","year":"2018","journal-title":"IEEE Trans. Syst. Man Cybern. Syst."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1145","DOI":"10.1016\/j.aej.2019.09.016","article-title":"Decoupled control scheme for output tracking of a general industrial nonlinear MIMO system using improved active disturbance rejection scheme","volume":"58","author":"Ibraheem","year":"2019","journal-title":"Alex. Eng. J."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"1635","DOI":"10.1109\/TPEL.2022.3213692","article-title":"Multivariable Control Design for Grid-Forming Inverters with Decoupled Active and Reactive Power Loops","volume":"38","author":"Rathnayake","year":"2023","journal-title":"IEEE Trans. Power Electron."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1016\/j.automatica.2017.07.063","article-title":"A data-driven approach to robust control of multivariable systems by convex optimization","volume":"85","author":"Karimi","year":"2017","journal-title":"Automatica"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Yousfi, M., Ben Njima, C., and Garna, T. (2022). Robust multimodel control for uncertain nonlinear MIMO systems based on ARX-Laguerre multimodel and LSDP approach. Int. J. Control., 1\u201319.","DOI":"10.1080\/00207179.2022.2122574"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Belmonte, L.M., Morales, R., Fern\u00e1ndez-Caballero, A., and Somolinos, J.A. (2016). Robust Decentralized Nonlinear Control for a Twin Rotor MIMO System. 
Sensors, 16.","DOI":"10.5772\/64875"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"5591","DOI":"10.1109\/JESTPE.2022.3162140","article-title":"Model Predictive Control for Grid-Connected Current-Source Converter with Enhanced Robustness and Grid-Current Feedback Only","volume":"10","author":"Xue","year":"2022","journal-title":"IEEE J. Emerg. Sel. Top. Power Electron."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"108112","DOI":"10.1016\/j.compchemeng.2022.108112","article-title":"Tube-based distributionally robust model predictive control for nonlinear process systems via linearization","volume":"170","author":"Zhong","year":"2023","journal-title":"Comput. Chem. Eng."},{"key":"ref_12","first-page":"1080","article-title":"Multivariable Inverted Decoupling Active Disturbance Rejection Control and Its Application to a Distillation Column Process","volume":"43","author":"Cheng","year":"2017","journal-title":"Zidonghua Xuebao\/Acta Autom. Sin."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"125344","DOI":"10.1016\/j.energy.2022.125344","article-title":"Multivariable active disturbance rejection control for compression liquid chiller system","volume":"262","author":"Wu","year":"2023","journal-title":"Energy"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1007\/s40435-016-0252-z","article-title":"Decentralized PID controller design for TITO processes with experimental validation","volume":"5","author":"Hajare","year":"2017","journal-title":"Int. J. Dyn. Control."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"604","DOI":"10.1007\/s00170-006-0474-x","article-title":"Robust control of a 3-DOF hybrid robot manipulator","volume":"33","author":"Zhou","year":"2007","journal-title":"Int. J. Adv. Manuf. 
Technol."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"1327","DOI":"10.1007\/s00170-021-07682-3","article-title":"Review on model predictive control: An engineering perspective","volume":"117","author":"Schwenzer","year":"2021","journal-title":"Int. J. Adv. Manuf. Technol."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"529","DOI":"10.1038\/nature14236","article-title":"Human-level control through deep reinforcement learning","volume":"518","author":"Mnih","year":"2015","journal-title":"Nature"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"5284","DOI":"10.1109\/TSMC.2021.3122802","article-title":"Reinforcement-Learning-Based Tracking Control of Waste Water Treatment Process Under Realistic System Conditions and Control Performance Requirements","volume":"52","author":"Yang","year":"2022","journal-title":"IEEE Trans. Syst. Man Cybern. Syst."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"109450","DOI":"10.1016\/j.asoc.2022.109450","article-title":"Reinforcement learning based adaptive PID controller design for control of linear\/nonlinear unstable processes","volume":"128","author":"Shuprajhaa","year":"2022","journal-title":"Appl. Soft Comput."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"107972","DOI":"10.1016\/j.ast.2022.107972","article-title":"Intelligent direct thrust control for multivariable turbofan engine based on reinforcement and deep learning methods","volume":"131","author":"Zhu","year":"2022","journal-title":"Aerosp. Sci. Technol."},{"key":"ref_21","first-page":"1772","article-title":"Approach of inverted decoupling suitable for high order multivariable system","volume":"38","author":"Zheng","year":"2012","journal-title":"J. Beijing Univ. 
Technol."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"26","DOI":"10.1109\/MSP.2017.2743240","article-title":"Deep Reinforcement Learning A brief survey","volume":"34","author":"Arulkumaran","year":"2017","journal-title":"IEEE Signal Process. Mag."},{"key":"ref_23","first-page":"1889","article-title":"Trust Region Policy Optimization","volume":"Volume 37","author":"Schulman","year":"2015","journal-title":"Proceedings of the 32nd International Conference on International Conference on Machine Learning"},{"key":"ref_24","unstructured":"Nachum, O., Norouzi, M., Xu, K., and Schuurmans, D. (2017). Trust-PCL: An Off-Policy Trust Region Method for Continuous Control. arXiv."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"3067","DOI":"10.1002\/sim.9755","article-title":"Relative Sparsity for Medical Decision Problems","volume":"42","author":"Weisenthal","year":"2022","journal-title":"Stat. Med."},{"key":"ref_26","unstructured":"Schulman, J., Wolski, F., Dhariwal, P., Radford, A., and Klimov, O.J.A. (2017). Proximal Policy Optimization Algorithms. arXiv."},{"key":"ref_27","unstructured":"Engstrom, L., Ilyas, A., Santurkar, S., Tsipras, D., Janoos, F., Rudolph, L., and Madry, A.J. (2020). Implementation matters in deep policy gradients: A case study on ppo and trpo. 
arXiv."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/21\/8679\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T21:11:01Z","timestamp":1760130661000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/21\/8679"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,10,24]]},"references-count":27,"journal-issue":{"issue":"21","published-online":{"date-parts":[[2023,11]]}},"alternative-id":["s23218679"],"URL":"https:\/\/doi.org\/10.3390\/s23218679","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,10,24]]}}}