{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,2]],"date-time":"2025-08-02T17:43:02Z","timestamp":1754156582502,"version":"3.41.2"},"reference-count":28,"publisher":"Emerald","issue":"3","license":[{"start":{"date-parts":[[2024,4,30]],"date-time":"2024-04-30T00:00:00Z","timestamp":1714435200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IJICC"],"published-print":{"date-parts":[[2024,7,17]]},"abstract":"<jats:sec><jats:title content-type=\"abstract-subheading\">Purpose<\/jats:title><jats:p>In order to meet the different quality of service (QoS) requirements of vehicle-to-infrastructure (V2I) and multiple vehicle-to-vehicle (V2V) links in vehicle networks, an efficient V2V spectrum access mechanism is proposed in this paper.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Design\/methodology\/approach<\/jats:title><jats:p>A long-short-term-memory-based multi-agent hybrid proximal policy optimization (LSTM-H-PPO) algorithm is proposed, through which the distributed spectrum access and continuous power control of V2V link are realized.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Findings<\/jats:title><jats:p>Simulation results show that compared with the baseline algorithm, the proposed algorithm has significant advantages in terms of total system capacity, payload delivery success rate of V2V link and convergence speed.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Originality\/value<\/jats:title><jats:p>The LSTM layer uses the time sequence information to estimate the accurate system state, which ensures the choice of V2V spectrum access based on local observation effective. The hybrid PPO framework shares training parameters among agents which speeds up the entire training process. The proposed algorithm adopts the mode of centralized training and distributed execution, so that the agent can achieve the optimal spectrum access based on local observation information with less signaling overhead.<\/jats:p><\/jats:sec>","DOI":"10.1108\/ijicc-01-2024-0027","type":"journal-article","created":{"date-parts":[[2024,4,29]],"date-time":"2024-04-29T01:19:36Z","timestamp":1714353576000},"page":"486-502","source":"Crossref","is-referenced-by-count":0,"title":["An LSTM-based hybrid proximal policy optimization spectrum access algorithm in vehicular network"],"prefix":"10.1108","volume":"17","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3892-3922","authenticated-orcid":false,"given":"Lin","family":"Kang","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0008-6588-9580","authenticated-orcid":false,"given":"Junjie","family":"Chen","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0000-9753-5252","authenticated-orcid":false,"given":"Jie","family":"Wang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-7383-411X","authenticated-orcid":false,"given":"Yaqi","family":"Wei","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"140","published-online":{"date-parts":[[2024,4,30]]},"reference":[{"article-title":"Technical specification group radio access network; study on LTE-based V2X services; (Release 14)","year":"2016","author":"3rd Generation Partnership Project","key":"key2024071612511893600_ref001"},{"article-title":"Technical specification group radio access network; study enhancement 3GPP support for 5G V2X services; (Release 15)","year":"2017","author":"3rd Generation Partnership Project","key":"key2024071612511893600_ref002"},{"key":"key2024071612511893600_ref003","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2023.119797","article-title":"NT-DPTC: a non-negative temporal dimension preserved tensor completion model for missing traffic data imputation","volume":"653","year":"2024","journal-title":"Information Sciences"},{"issue":"7","key":"key2024071612511893600_ref004","doi-asserted-by":"publisher","first-page":"6229","DOI":"10.1109\/JIOT.2020.2966660","article-title":"Deep-learning-based joint optimization of renewable energy storage and routing in vehicular energy network","volume":"7","year":"2020","journal-title":"IEEE Internet of Things Journal"},{"issue":"4","key":"key2024071612511893600_ref005","doi-asserted-by":"publisher","first-page":"905","DOI":"10.1109\/JSAC.2019.2898743","article-title":"Resource allocation for low-latency vehicular communications: an effective capacity perspective","volume":"37","year":"2019","journal-title":"IEEE Journal on Selected Areas in Communications"},{"issue":"1","key":"key2024071612511893600_ref006","doi-asserted-by":"publisher","first-page":"222","DOI":"10.1109\/COMST.2020.3029723","article-title":"Challenges and solutions for cellular based V2X communications","volume":"23","year":"2020","journal-title":"IEEE Communications Surveys and Tutorials"},{"issue":"7","key":"key2024071612511893600_ref007","doi-asserted-by":"publisher","first-page":"25","DOI":"10.23919\/JCC.2021.07.003","article-title":"A joint power and bandwidth allocation method based on deep reinforcement learning for V2V communications in 5G","volume":"18","year":"2021","journal-title":"China Communications"},{"issue":"9","key":"key2024071612511893600_ref008","doi-asserted-by":"publisher","first-page":"9132","DOI":"10.1109\/TVT.2021.3100591","article-title":"LSTM-based channel access scheme for vehicles in cognitive vehicular networks with multi-agent settings","volume":"70","year":"2021","journal-title":"IEEE Transactions on Vehicular Technology"},{"issue":"1","key":"key2024071612511893600_ref009","doi-asserted-by":"publisher","first-page":"235","DOI":"10.11999\/JEIT211160","article-title":"Research on resource allocation algorithm based on service heterogeneity in V2V communication in C-V2X","volume":"45","year":"2022","journal-title":"Journal of Electronics and Information Technology"},{"issue":"8","key":"key2024071612511893600_ref010","doi-asserted-by":"publisher","first-page":"8810","DOI":"10.1109\/TVT.2022.3173057","article-title":"Federated multi-agent deep reinforcement learning for resource allocation of vehicle-to-vehicle communications","volume":"71","year":"2022","journal-title":"IEEE Transactions on Vehicular Technology"},{"key":"key2024071612511893600_ref011","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1109\/TIV.2023.3346506","article-title":"DNN partition and offloading strategy with improved particle swarm genetic algorithm in VEC","year":"2023","journal-title":"IEEE Transactions on Intelligent Vehicles"},{"issue":"7","key":"key2024071612511893600_ref012","doi-asserted-by":"publisher","first-page":"4579","DOI":"10.1109\/TWC.2018.2827958","article-title":"Graph-based resource sharing in vehicular communication","volume":"17","year":"2018","journal-title":"IEEE Transactions on Wireless Communications"},{"issue":"10","key":"key2024071612511893600_ref013","doi-asserted-by":"publisher","first-page":"2282","DOI":"10.1109\/JSAC.2019.2933962","article-title":"Spectrum sharing in vehicular networks based on multi-agent reinforcement learning","volume":"37","year":"2019","journal-title":"IEEE Journal on Selected Areas in Communications"},{"issue":"4","key":"key2024071612511893600_ref014","doi-asserted-by":"publisher","first-page":"3133","DOI":"10.1109\/COMST.2019.2916583","article-title":"Applications of deep reinforcement learning in communications and networking: a survey","volume":"21","year":"2019","journal-title":"IEEE Communications Surveys and Tutorials"},{"first-page":"2681","article-title":"Deep decentralized multi-task multi-agent reinforcement learning under partial observability","year":"2017","key":"key2024071612511893600_ref015"},{"key":"key2024071612511893600_ref016","doi-asserted-by":"publisher","DOI":"10.1016\/j.vehcom.2022.100543","article-title":"Joint spectrum allocation and power control in vehicular communications based on dueling double DQN","volume":"38","year":"2022","journal-title":"Vehicular Communications"},{"key":"key2024071612511893600_ref017","unstructured":"Schulman, J., Moritz, P., Levine, S., Jordan, M. and Abbeel, P. (2015), \u201cHigh-dimensional continuous control using generalized advantage estimation\u201d, arXiv preprint, arXiv:1506.02438. doi: 10.48550\/arXiv.1506.02438."},{"key":"key2024071612511893600_ref018","unstructured":"Schulman, J., Wolski, F., Dhariwal, P., Radford, A. and Klimov, O. (2017), \u201cProximal policy optimization algorithms\u201d, arXiv preprint, arXiv:1707.06347. doi: 10.48550\/arXiv.1707.06347."},{"issue":"5","key":"key2024071612511893600_ref019","doi-asserted-by":"publisher","first-page":"3410","DOI":"10.1109\/JIOT.2020.3022572","article-title":"Multiagent deep-reinforcement-learning-based virtual resource allocation through network function virtualization in Internet of Things","volume":"8","year":"2020","journal-title":"IEEE Internet of Things Journal"},{"key":"key2024071612511893600_ref020","doi-asserted-by":"publisher","first-page":"6792","DOI":"10.1109\/ACCESS.2020.2964697","article-title":"5G vehicular network resource management for improving radio access through machine learning","volume":"8","year":"2020","journal-title":"IEEE Access"},{"issue":"2","key":"key2024071612511893600_ref021","doi-asserted-by":"publisher","first-page":"1218","DOI":"10.1109\/JIOT.2018.2791627","article-title":"A distributed position-based protocol for emergency messages broadcasting in vehicular ad hoc networks","volume":"5","year":"2018","journal-title":"IEEE Internet of Things Journal"},{"issue":"3","key":"key2024071612511893600_ref022","doi-asserted-by":"publisher","first-page":"1683","DOI":"10.1109\/JIOT.2021.3089823","article-title":"Multiagent deep-reinforcement-learning-based resource allocation for heterogeneous QoS guarantees for vehicular networks","volume":"9","year":"2021","journal-title":"IEEE Internet of Things Journal"},{"key":"key2024071612511893600_ref023","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1109\/ISWCS56560.2022.9940399","article-title":"Joint spectrum allocation and power control in vehicular networks based on reinforcement learning","volume-title":"International Symposium on Wireless Communication Systems (ISWCS)","year":"2022"},{"key":"key2024071612511893600_ref024","doi-asserted-by":"publisher","first-page":"18797","DOI":"10.1109\/ACCESS.2020.2968595","article-title":"Deep deterministic policy gradient (DDPG)-based resource allocation scheme for NOMA vehicular communications","volume":"8","year":"2020","journal-title":"IEEE Access"},{"issue":"10","key":"key2024071612511893600_ref025","doi-asserted-by":"publisher","first-page":"11001","DOI":"10.1109\/tits.2023.3279321","article-title":"HRST-LR: a hessian regularization spatio-temporal low rank algorithm for traffic data imputation","volume":"24","year":"2023","journal-title":"IEEE Transactions on Intelligent Transportation Systems"},{"key":"key2024071612511893600_ref026","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1109\/TMC.2023.3344645","article-title":"Adaptive swarm intelligent offloading based on digital twin-assisted prediction in VEC","year":"2023","journal-title":"IEEE Transactions on Mobile Computing"},{"key":"key2024071612511893600_ref027","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1109\/TNSE.2023.3344428","article-title":"Adaptive multi-UAV trajectory planning leveraging digital twin technology for urban IIoT applications","year":"2023","journal-title":"IEEE Transactions on Network Science and Engineering"},{"key":"key2024071612511893600_ref028","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1109\/TITS.2024.3350876","article-title":"Collaborative intelligent delivery with one truck and multiple heterogeneous drones in COVID-19 pandemic environment","year":"2024","journal-title":"IEEE Transactions on Intelligent Transportation Systems"}],"container-title":["International Journal of Intelligent Computing and Cybernetics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/IJICC-01-2024-0027\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/IJICC-01-2024-0027\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,24]],"date-time":"2025-07-24T22:54:03Z","timestamp":1753397643000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/ijicc\/article\/17\/3\/486-502\/1228086"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,4,30]]},"references-count":28,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2024,4,30]]},"published-print":{"date-parts":[[2024,7,17]]}},"alternative-id":["10.1108\/IJICC-01-2024-0027"],"URL":"https:\/\/doi.org\/10.1108\/ijicc-01-2024-0027","relation":{},"ISSN":["1756-378X"],"issn-type":[{"type":"print","value":"1756-378X"}],"subject":[],"published":{"date-parts":[[2024,4,30]]}}}