{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,27]],"date-time":"2026-03-27T19:52:05Z","timestamp":1774641125166,"version":"3.50.1"},"reference-count":40,"publisher":"MDPI AG","issue":"14","license":[{"start":{"date-parts":[[2024,7,14]],"date-time":"2024-07-14T00:00:00Z","timestamp":1720915200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Korean government (MSIT)","award":["IITP-2024-RS-2023-00259678"],"award-info":[{"award-number":["IITP-2024-RS-2023-00259678"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>This paper presents a control strategy synthesis method for dynamical systems with differential constraints, emphasizing the prioritization of specific rules. Special attention is given to scenarios where not all rules can be simultaneously satisfied to complete a given task, necessitating decisions on the extent to which each rule is satisfied, including which rules must be upheld or disregarded. We propose a learning-based Model Predictive Control (MPC) method designed to address these challenges. Our approach integrates a learning method with a traditional control scheme, enabling the controller to emulate human expert behavior. Rules are represented as Signal Temporal Logic (STL) formulas. A robustness margin, quantifying the degree of rule satisfaction, is learned from expert demonstrations using a Conditional Variational Autoencoder (CVAE). This learned margin is then applied in the MPC process to guide the prioritization or exclusion of rules. In a track driving simulation, our method demonstrates the ability to generate behavior resembling that of human experts and effectively manage rule-based dilemmas.<\/jats:p>","DOI":"10.3390\/s24144567","type":"journal-article","created":{"date-parts":[[2024,7,15]],"date-time":"2024-07-15T14:15:49Z","timestamp":1721052949000},"page":"4567","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Model Predictive Control with Variational Autoencoders for Signal Temporal Logic Specifications"],"prefix":"10.3390","volume":"24","author":[{"given":"Eunji","family":"Im","sequence":"first","affiliation":[{"name":"Department of Information and Telecommunication Engineering, Incheon National University, Incheon 22012, Republic of Korea"}]},{"given":"Minji","family":"Choi","sequence":"additional","affiliation":[{"name":"Department of Information and Telecommunication Engineering, Incheon National University, Incheon 22012, Republic of Korea"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4679-6660","authenticated-orcid":false,"given":"Kyunghoon","family":"Cho","sequence":"additional","affiliation":[{"name":"Department of Information and Telecommunication Engineering, Incheon National University, Incheon 22012, Republic of Korea"}]}],"member":"1968","published-online":{"date-parts":[[2024,7,14]]},"reference":[{"key":"ref_1","unstructured":"Camacho, E.F., and Alba, C.B. (2013). Model Predictive Control, Springer Science & Business Media."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Dantec, E., Naveau, M., Fernbach, P., Villa, N., Saurel, G., Stasse, O., Taix, M., and Mansard, N. (2022, January 28\u201330). Whole-body model predictive control for biped locomotion on a torque-controlled humanoid robot. Proceedings of the IEEE-RAS International Conference on Humanoid Robots, Ginowan, Japan.","DOI":"10.1109\/Humanoids53995.2022.10000129"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"4712","DOI":"10.1109\/TRO.2023.3308773","article-title":"Hybrid iLQR model predictive control for contact implicit stabilization on legged robots","volume":"39","author":"Kong","year":"2023","journal-title":"IEEE Trans. Robot."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"1617","DOI":"10.1109\/TRO.2024.3351554","article-title":"Fast contact-implicit model predictive control","volume":"40","author":"Howell","year":"2024","journal-title":"IEEE Trans. Robot."},{"key":"ref_5","unstructured":"Jang, E., Irpan, A., Khansari, M., Kappler, D., Ebert, F., Lynch, C., Levine, S., and Finn, C. (2021, January 8\u201311). Bc-z: Zero-shot task generalization with robotic imitation learning. Proceedings of the Conference on Robot Learning, London, UK."},{"key":"ref_6","unstructured":"Zare, M., Kebria, P.M., Khosravi, A., and Nahavandi, S. (2023). A survey of imitation learning: Algorithms, recent developments, and challenges. arXiv."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Abbeel, P., and Ng, A.Y. (2004, January 4\u20138). Apprenticeship learning via inverse reinforcement learning. Proceedings of the International Conference on Machine Learning, Banff, AB, Canada.","DOI":"10.1145\/1015330.1015430"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Cho, K., and Oh, S. (2018, January 21\u201325). Learning-based model predictive control under signal temporal logic specifications. Proceedings of the IEEE International Conference on Robotics and Automation, Brisbane, QLD, Australia.","DOI":"10.1109\/ICRA.2018.8460811"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Maler, O., and Nickovic, D. (2004, January 22\u201324). Monitoring temporal properties of continuous signals. Proceedings of the FORMATS\/FTRTFT, Grenoble, France.","DOI":"10.1007\/978-3-540-30206-3_12"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Donz\u00e9, A., and Maler, O. (2010, January 8\u201310). Robust satisfaction of temporal logic over real-valued signals. Proceedings of the FORMATS, Klosterneuburg, Austria.","DOI":"10.1007\/978-3-642-15297-9_9"},{"key":"ref_11","unstructured":"Fainekos, G.E., Kress-Gazit, H., and Pappas, G.J. (2005, January 18\u201322). Temporal logic motion planning for mobile robots. Proceedings of the IEEE International Conference on Robotics and Automation, Barcelona, Spain."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Karaman, S., and Frazzoli, E. (2008, January 11\u201313). Complex mission optimization for multiple-UAVs using linear temporal logic. Proceedings of the IEEE American Control Conference, Seattle, WA, USA.","DOI":"10.1109\/ACC.2008.4586787"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Wongpiromsarn, T., Topcu, U., and Murray, R.M. (2009, January 15\u201318). Receding horizon temporal logic planning for dynamical systems. Proceedings of the IEEE Conference on Decision and Control, Shanghai, China.","DOI":"10.1109\/CDC.2009.5399536"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Pant, Y.V., Abbas, H., and Mangharam, R. (2022, January 23\u201325). Distributed Trajectory Planning for Multi-rotor UAVs with Signal Temporal Logic Objectives. Proceedings of the IEEE Conference on Control Technology and Applications, Trieste, Italy.","DOI":"10.1109\/CCTA49430.2022.9966096"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"7719","DOI":"10.1109\/LRA.2023.3315536","article-title":"Signal temporal logic neural predictive control","volume":"8","author":"Meng","year":"2023","journal-title":"IEEE Robot. Autom. Lett."},{"key":"ref_16","unstructured":"Sohn, K., Lee, H., and Yan, X. (2015, January 7\u201312). Learning structured output representation using deep conditional generative models. Proceedings of the Advances in Neural Information Processing Systems, Montreal, QC, Canada."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Karaman, S., Sanfelice, R.G., and Frazzoli, E. (2008, January 9\u201311). Optimal control of mixed logical dynamical systems with linear temporal logic specifications. Proceedings of the IEEE Conference on Decision and Control, Cancun, Mexico.","DOI":"10.1109\/CDC.2008.4739370"},{"key":"ref_18","unstructured":"Kwon, Y., and Agha, G. (2008, January 22\u201324). LTLC: Linear temporal logic for control. Proceedings of the Hybrid Systems: Computation and Control, St. Louis, MO, USA."},{"key":"ref_19","unstructured":"Wolff, E.M., Topcu, U., and Murray, R.M. (June, January 31). Optimization-based control of nonlinear systems with linear temporal logic specifications. Proceedings of the IEEE International Conference on Robotics and Automation, Hong Kong, China."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"25865","DOI":"10.1109\/ACCESS.2023.3257185","article-title":"Learning-based path planning under co-safe temporal logic specifications","volume":"11","author":"Cho","year":"2023","journal-title":"IEEE Access"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Raman, V., Donz\u00e9, A., Maasoumy, M., Murray, R.M., Sangiovanni-Vincentelli, A., and Seshia, S.A. (2014, January 15\u201317). Model predictive control with signal temporal logic specifications. Proceedings of the IEEE Conference on Decision and Control, Los Angeles, CA, USA.","DOI":"10.1109\/CDC.2014.7039363"},{"key":"ref_22","unstructured":"Sadigh, D., and Kapoor, A. (2015). Safe control under uncertainty. arXiv."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Mao, Y., Acikmese, B., Garoche, P.L., and Chapoutot, A. (2022, January 4\u20136). Successive convexification for optimal control with signal temporal logic specifications. Proceedings of the ACM International Conference on Hybrid Systems: Computation and Control, Milan, Italy.","DOI":"10.1145\/3501710.3519518"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Lenz, I., Knepper, R.A., and Saxena, A. (2015, January 13\u201317). DeepMPC: Learning Deep Latent Features for Model Predictive Control. Proceedings of the Robotics: Science and Systems, Rome, Italy.","DOI":"10.15607\/RSS.2015.XI.012"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"3758","DOI":"10.1109\/LRA.2019.2929987","article-title":"Data-driven model predictive control for trajectory tracking with a robotic arm","volume":"4","author":"Carron","year":"2019","journal-title":"IEEE Robot. Autom. Lett."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"221","DOI":"10.1109\/TIV.2020.3012947","article-title":"Comparison of deep reinforcement learning and model predictive control for adaptive cruise control","volume":"6","author":"Lin","year":"2020","journal-title":"IEEE Trans. Intell. Veh."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Kong, Z., Jones, A., Medina Ayala, A., Aydin Gol, E., and Belta, C. (2014, January 15\u201317). Temporal logic inference for classification and prediction from data. Proceedings of the International Conference on Hybrid Systems: Computation and Control, Berlin, Germany.","DOI":"10.1145\/2562059.2562146"},{"key":"ref_28","unstructured":"Castro, L.I.R., Chaudhari, P., Tumova, J., Karaman, S., Frazzoli, E., and Rus, D. (2013, January 10\u201313). Incremental sampling-based algorithm for minimum-violation motion planning. Proceedings of the IEEE Conference on Decision and Control, Firenze, Italy."},{"key":"ref_29","unstructured":"Lee, S., and Seo, S. (June, January 29). A learning-based framework for handling dilemmas in urban automated driving. Proceedings of the IEEE Conference on Robotics and Automation, Singapore."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Xu, H., Gao, Y., Yu, F., and Darrell, T. (2017, January 21\u201326). End-to-end learning of driving models from large-scale video datasets. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.376"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Codevilla, F., Miiller, M., L\u00f3pez, A., Koltun, V., and Dosovitskiy, A. (2018, January 21\u201325). End-to-end driving via conditional imitation learning. Proceedings of the IEEE International Conference on Robotics and Automation, Brisbane, QLD, Australia.","DOI":"10.1109\/ICRA.2018.8460487"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Schmerling, E., Leung, K., Vollprecht, W., and Pavone, M. (2018, January 21\u201325). Multimodal probabilistic model-based planning for human-robot interaction. Proceedings of the IEEE International Conference on Robotics and Automation, Brisbane, QLD, Australia.","DOI":"10.1109\/ICRA.2018.8460766"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Drews, P., Williams, G., Goldfain, B., Theodorou, E.A., and Rehg, J.M. (2017). Aggressive deep driving: Model predictive control with a cnn cost model. arXiv.","DOI":"10.1109\/ICRA.2016.7487277"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Jaritz, M., de Charette, R., Toromanoff, M., Perot, E., and Nashashibi, F. (2018, January 21\u201325). End-to-end race driving with deep reinforcement learning. Proceedings of the IEEE International Conference on Robotics and Automation, Brisbane, QLD, Australia.","DOI":"10.1109\/ICRA.2018.8460934"},{"key":"ref_35","unstructured":"Paszke, A., Gross, S., Chintala, S., Chanan, G., Yang, E., DeVito, Z., Lin, Z., Desmaison, A., Antiga, L., and Lerer, A. (2017, January 4\u20139). Automatic differentiation in PyTorch. Proceedings of the NIPS-W, Long Beach, CA, USA."},{"key":"ref_36","unstructured":"Gurobi Optimization, LLC (2024, March 03). Gurobi Optimizer Reference Manual. Available online: http:\/\/www.gurobi.com."},{"key":"ref_37","first-page":"22","article-title":"The next generation simulation program","volume":"74","author":"Alexiadis","year":"2004","journal-title":"Inst. Transp. Eng."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Krajewski, R., Bock, J., Kloeker, L., and Eckstein, L. (2018, January 4\u20137). The highD Dataset: A Drone Dataset of Naturalistic Vehicle Trajectories on German Highways for Validation of Highly Automated Driving Systems. Proceedings of the IEEE International Conference on Intelligent Transportation Systems, Maui, HI, USA.","DOI":"10.1109\/ITSC.2018.8569552"},{"key":"ref_39","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, \u0141., and Polosukhin, I. (2017, January 4\u20139). Attention is all you need. In Proceedings of the Advances in Neural Information Processing Systems, Long Beach, CA, USA."},{"key":"ref_40","unstructured":"Mnih, V., Kavukcuoglu, K., Silver, D., Graves, A., Antonoglou, I., Wierstra, D., and Riedmiller, M. (2013). Playing atari with deep reinforcement learning. arXiv."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/24\/14\/4567\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T15:16:49Z","timestamp":1760109409000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/24\/14\/4567"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,7,14]]},"references-count":40,"journal-issue":{"issue":"14","published-online":{"date-parts":[[2024,7]]}},"alternative-id":["s24144567"],"URL":"https:\/\/doi.org\/10.3390\/s24144567","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,7,14]]}}}