{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,4]],"date-time":"2026-05-04T16:05:01Z","timestamp":1777910701316,"version":"3.51.4"},"reference-count":30,"publisher":"SAGE Publications","issue":"4","license":[{"start":{"date-parts":[[2021,9,27]],"date-time":"2021-09-27T00:00:00Z","timestamp":1632700800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Transactions of the Institute of Measurement and Control"],"published-print":{"date-parts":[[2022,2]]},"abstract":"<jats:p>In this paper, a novel dynamic position control (PC) approach for mobile nodes (MNs) is proposed for ocean sensor networks (OSNs) which directly utilizes a neural network to represent a PC strategy. The calculation of position estimation no longer needs to be carried out in the proposed scheme, so the localization error is eliminated. In addition, reinforcement learning is used to train the PC strategy, so that the MN can learn a more highly accurate and fast response control strategy. Moreover, to verify its applicability to the real-world environment, we conducted field experiment deployment in OSNs consisting of a MN designed by us and some fixed nodes. The experimental results demonstrate the effectiveness of our proposed control scheme with impressive improvements on PC accuracy by more than 53% and response speed by more than 15%.<\/jats:p>","DOI":"10.1177\/01423312211043034","type":"journal-article","created":{"date-parts":[[2021,9,27]],"date-time":"2021-09-27T15:58:06Z","timestamp":1632758286000},"page":"926-940","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":3,"title":["Reinforcement learning-based dynamic position control of mobile node for ocean sensor networks"],"prefix":"10.1177","volume":"44","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7323-898X","authenticated-orcid":false,"given":"Weijun","family":"Wang","sequence":"first","affiliation":[{"name":"Merchant Marine College, Shanghai Maritime University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3150-3407","authenticated-orcid":false,"given":"Huafeng","family":"Wu","sequence":"additional","affiliation":[{"name":"Merchant Marine College, Shanghai Maritime University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xianglun","family":"Kong","sequence":"additional","affiliation":[{"name":"China TranComm Technologies Co., Ltd, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yuanyuan","family":"Zhang","sequence":"additional","affiliation":[{"name":"Merchant Marine College, Shanghai Maritime University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yang","family":"Ye","sequence":"additional","affiliation":[{"name":"Shanghai Zhuochen Info Tech Co., Ltd, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhongcheng","family":"Zeng","sequence":"additional","affiliation":[{"name":"Fujian Wanjiaxian Technology Co., Ltd, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jian","family":"Cheng","sequence":"additional","affiliation":[{"name":"China TranComm Technologies Co., Ltd, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Quandi","family":"Zhang","sequence":"additional","affiliation":[{"name":"Fujian Wanjiaxian Technology Co., Ltd, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"179","published-online":{"date-parts":[[2021,9,27]]},"reference":[{"key":"bibr1-01423312211043034","doi-asserted-by":"publisher","DOI":"10.1109\/IWCMC.2015.7289313"},{"key":"bibr2-01423312211043034","doi-asserted-by":"publisher","DOI":"10.1016\/j.adhoc.2007.06.004"},{"key":"bibr3-01423312211043034","doi-asserted-by":"publisher","DOI":"10.1109\/JSEN.2016.2517084"},{"key":"bibr4-01423312211043034","doi-asserted-by":"publisher","DOI":"10.1109\/TMC.2012.82"},{"key":"bibr5-01423312211043034","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2012.11.027"},{"issue":"3","key":"bibr6-01423312211043034","first-page":"1113","volume":"95","author":"Elmokadem T","year":"2018","journal-title":"Journal of Intelligent & Robotic Systems"},{"key":"bibr7-01423312211043034","unstructured":"Haarnoja T, Zhou A, Abbeel P, et al. (2018) Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor. Available at: https:\/\/proceedings.mlr.press\/v80\/haarnoja18b\/haarnoja18b.pdf (accessed July 2021)."},{"key":"bibr8-01423312211043034","doi-asserted-by":"publisher","DOI":"10.1016\/j.apor.2015.07.005"},{"key":"bibr9-01423312211043034","doi-asserted-by":"publisher","DOI":"10.1109\/JOE.2017.2651242"},{"key":"bibr10-01423312211043034","doi-asserted-by":"publisher","DOI":"10.1109\/TCYB.2018.2890046"},{"key":"bibr11-01423312211043034","unstructured":"Juliani A, Berges V, Vckay E, et al. (2018) Unity: A general platform for intelligent agents. Available at: https:\/\/arxiv.org\/pdf\/1809.02627.pdf (accessed July 2021)."},{"key":"bibr12-01423312211043034","unstructured":"Kingma DP, Ba JL (2014) Adam: A method for stochastic optimization. Available at: https:\/\/arxiv.org\/pdf\/1412.6980.pdf (accessed June 2021)."},{"key":"bibr13-01423312211043034","doi-asserted-by":"publisher","DOI":"10.1109\/FSKD.2016.7603477"},{"key":"bibr14-01423312211043034","doi-asserted-by":"publisher","DOI":"10.1016\/j.apor.2021.102609"},{"key":"bibr15-01423312211043034","unstructured":"Lillicrap T, Hunt J, Pritzel A, et al. (2015) Continuous control with deep reinforcement learning. Available at: https:\/\/arxiv.org\/pdf\/1509.02971.pdf (accessed July 2021)."},{"key":"bibr16-01423312211043034","unstructured":"Mnih V, Badia A, Mirza M, et al. (2016) Asynchronous methods for deep reinforcement learning. In: International Conference on Machine Learning, 1928\u20131937. Available at: https:\/\/arxiv.org\/pdf\/1602.01783.pdf (accessed July 2021)."},{"key":"bibr17-01423312211043034","doi-asserted-by":"publisher","DOI":"10.1016\/j.oceaneng.2010.05.009"},{"key":"bibr18-01423312211043034","doi-asserted-by":"publisher","DOI":"10.1109\/MWC.2019.1800354"},{"key":"bibr19-01423312211043034","doi-asserted-by":"publisher","DOI":"10.1142\/S0218126621501371"},{"key":"bibr20-01423312211043034","doi-asserted-by":"publisher","DOI":"10.1007\/s12083-020-00945-y"},{"key":"bibr21-01423312211043034","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2019.2902371"},{"key":"bibr22-01423312211043034","unstructured":"Schulman J, Levine S, Moritz P, et al. (2015) Trust region policy optimization. Computer Science (1): 1889\u20131897. Available at: https:\/\/arxiv.org\/pdf\/1502.05477.pdf (accessed July 2021)."},{"key":"bibr23-01423312211043034","unstructured":"Schulman J, Wolski F, Dhariwal P, et al. (2017) Proximal policy optimization algorithms. Available at: https:\/\/arxiv.org\/pdf\/1707.06347.pdf (accessed July 2021)."},{"key":"bibr24-01423312211043034","doi-asserted-by":"publisher","DOI":"10.1109\/TII.2016.2617464"},{"key":"bibr25-01423312211043034","doi-asserted-by":"publisher","DOI":"10.1109\/TII.2019.2922823"},{"key":"bibr26-01423312211043034","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2019.2923425"},{"key":"bibr27-01423312211043034","doi-asserted-by":"publisher","DOI":"10.1016\/S1001-6058(14)60104-9"},{"key":"bibr28-01423312211043034","doi-asserted-by":"publisher","DOI":"10.1109\/TCYB.2018.2829730"},{"key":"bibr29-01423312211043034","doi-asserted-by":"publisher","DOI":"10.1016\/j.engappai.2013.06.016"},{"key":"bibr30-01423312211043034","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2018.2872779"}],"container-title":["Transactions of the Institute of Measurement and Control"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/01423312211043034","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/01423312211043034","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/01423312211043034","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T15:04:06Z","timestamp":1777647846000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/01423312211043034"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,9,27]]},"references-count":30,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2022,2]]}},"alternative-id":["10.1177\/01423312211043034"],"URL":"https:\/\/doi.org\/10.1177\/01423312211043034","relation":{},"ISSN":["0142-3312","1477-0369"],"issn-type":[{"value":"0142-3312","type":"print"},{"value":"1477-0369","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,9,27]]}}}