Reinforcement learning for dynamic optimization of Eco-Driving in smart healthcare transportation networks
Cai, Wang ; Anwlnkom, Tomley ; Zhang, Lingling ; Basheer, Shakila ; Yang, Jing
Cai, Wang
Anwlnkom, Tomley
Zhang, Lingling
Basheer, Shakila
Yang, Jing
Citations
Altmetric:
Other Names
Location
Time Period
Advisors
Original Date
Digitization Date
Issue Date
2025-05-08
Type
Article
Genre
Keywords
Dynamic optimization,Eco-driving,Reinforcement learning,Smart healthcare transportation networks,Twin delayed deep deterministic policy gradient algorithm
Subjects (LCSH)
Citation
W. Cai, T. Anwlnkom, L. Zhang, S. Basheer and J. Yang, "Reinforcement Learning for Dynamic Optimization of Eco-Driving in Smart Healthcare Transportation Networks," in IEEE Transactions on Intelligent Transportation Systems, doi: 10.1109/TITS.2025.3561034.
Abstract
Smart transportation networks face increasing demands for efficiency and sustainability. This study presents a reinforcement learning approach that optimizes eco-driving strategies for connected and automated vehicles (CAVs) in urban environments, with a particular application to healthcare logistics. Specifically, we propose a novel approach using reinforcement learning, specifically a twin delayed deep deterministic policy gradient (TD3) algorithm, to dynamically optimize CAV trajectories at signalized intersections. The proposed healthcare eco-driving trajectory optimization (TD3-HETO) model incorporates real-time traffic conditions, signal timing information, and healthcare urgency levels to generate optimal acceleration profiles. The reward function is designed to balance energy efficiency, traffic flow, safety, comfort, and healthcare delivery timeliness. Additionally, the model introduces a dynamic exploration strategy that adapts to healthcare task urgency, enabling efficient balancing between energy consumption and delivery timelines. Experimental results show that TD3-HETO reduces energy consumption by up to 28.7% compared to baseline methods while improving average speeds by 3.7% for urgent healthcare deliveries. The model achieves superior safety performance with 98.7% of time steps showing zero conflicts, compared to 95.3% for the best baseline. TD3-HETO also demonstrates remarkable adaptability to varying traffic demands and signal timings, maintaining consistent performance even at high traffic volumes. This research contributes to developing intelligent transportation systems to enhance environmental sustainability and healthcare accessibility in smart cities, potentially improving patient outcomes and operational efficiency in urban healthcare logistics. © 2000-2011 IEEE.
Table of Contents
Description
Click on the DOI link to access this article at the publishers website (may not be free).
Publisher
Institute of Electrical and Electronics Engineers Inc.
Journal
IEEE Transactions on Intelligent Transportation Systems
Book Title
Series
Digital Collection
Finding Aid URL
Use and Reproduction
Archival Collection
PubMed ID
ISSN
15249050
