Research on Energy Management Optimization for Hybrid-Powered Port Tugboat Systems Based on a Dual-Delay Deep Deterministic Policy Gradient Algorithm.
Saved in:
| Title: | Research on Energy Management Optimization for Hybrid-Powered Port Tugboat Systems Based on a Dual-Delay Deep Deterministic Policy Gradient Algorithm. |
|---|---|
| Authors: | Li, Zhao1 (AUTHOR), Long, Wuqiang1 (AUTHOR), Tian, Hua1 (AUTHOR) tianhua@dlut.edu.cn |
| Source: | Energies (19961073). Feb2026, Vol. 19 Issue 4, p905. 33p. |
| Subject Terms: | *Real-time control, *Reinforcement learning, *Hardware-in-the-loop simulation, *Tugboats, *Methanol as fuel, *Electric power management, *Greenhouse gas mitigation |
| Abstract: | To address the energy management challenge for methanol range-extended series hybrid systems in port tugboats, characterized by highly transient and intermittent operations, this study proposes a real-time energy management strategy based on the Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm. A high-fidelity forward simulation model was constructed and validated to train the TD3 agent. In simulations of typical port operation cycles, TD3 reduced methanol consumption by approximately 18.5%, 10.2%, and 7.3% compared to rule-based (RB), equivalent consumption minimization strategy (ECMS), and deep deterministic policy gradient (DDPG) approaches, respectively. Emissions such as NOx and carbon dioxide (CO2) were also significantly reduced, while maintaining superior battery state of charge (SOC). Its overall performance approximates global optimal (DP) performance with a gap of less than 2.5%, while retaining real-time online decision-making capability. Hardware-in-the-loop (HIL) testing further demonstrates that TD3 exhibits less than 1.8% performance degradation under actual communication and execution conditions, validating its engineering feasibility and deployment potential. This study provides methodological and experimental foundations for developing high-performance, low-emission, real-time energy management algorithms for port tugboats. [ABSTRACT FROM AUTHOR] |
| Database: | Energy & Power Source |
|
Full text is not displayed to guests.
Login for full access.
|
|
| Abstract: | To address the energy management challenge for methanol range-extended series hybrid systems in port tugboats, characterized by highly transient and intermittent operations, this study proposes a real-time energy management strategy based on the Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm. A high-fidelity forward simulation model was constructed and validated to train the TD3 agent. In simulations of typical port operation cycles, TD3 reduced methanol consumption by approximately 18.5%, 10.2%, and 7.3% compared to rule-based (RB), equivalent consumption minimization strategy (ECMS), and deep deterministic policy gradient (DDPG) approaches, respectively. Emissions such as NOx and carbon dioxide (CO2) were also significantly reduced, while maintaining superior battery state of charge (SOC). Its overall performance approximates global optimal (DP) performance with a gap of less than 2.5%, while retaining real-time online decision-making capability. Hardware-in-the-loop (HIL) testing further demonstrates that TD3 exhibits less than 1.8% performance degradation under actual communication and execution conditions, validating its engineering feasibility and deployment potential. This study provides methodological and experimental foundations for developing high-performance, low-emission, real-time energy management algorithms for port tugboats. [ABSTRACT FROM AUTHOR] |
|---|---|
| ISSN: | 19961073 |
| DOI: | 10.3390/en19040905 |