Deep reinforcement learning for real-time latching control of wave energy converter in non-predicted irregular wave environments

Journal Article

Title: Deep reinforcement learning for real-time latching control of wave energy converter in non-predicted irregular wave environments

Author: Su, H.; Qin, H.; Wen, Z.; Liang, H.; Jiang, H.

Publication Date: February 1, 2026

Journal: Renewable Energy

Volume: 257

Pages: 124821

Publisher: Elsevier

Affiliation: China University of Geosciences, Beijing Institute of Technology, Xi'an Jiaotong University, Shenzhen University

Technology: Wave, Attenuator

Collection Method: Modeling

Engineering: Control, Performance

Language: English

Document Access

Website: External Link

Citation

Su, H.; Qin, H.; Wen, Z.; Liang, H.; Jiang, H. (2026). Deep reinforcement learning for real-time latching control of wave energy converter in non-predicted irregular wave environments. Renewable Energy, 257, 124821. https://doi.org/10.1016/j.renene.2025.124821

@article{Su-2026-,
author = {Su, H and Qin, H and Wen, Z and Liang, H and Jiang, H},
title = {Deep reinforcement learning for real-time latching control of wave energy converter in non-predicted irregular wave environments},
journal = {Renewable Energy},
year = {2026},
month = {feb},
publisher = {Elsevier},
volume = {257},
pages = {124821},
doi = {10.1016/j.renene.2025.124821},
url = {https://www.sciencedirect.com/science/article/pii/S0960148125024851},
keywords = {Wave, Attenuator, Modeling, Control, Performance},
}

TY - JOUR
TI - Deep reinforcement learning for real-time latching control of wave energy converter in non-predicted irregular wave environments
AU - Su, H
AU - Qin, H
AU - Wen, Z
AU - Liang, H
AU - Jiang, H
T2 - Renewable Energy
AB - This paper proposes a data-driven coupled model for the latching control of the Edinburgh Duck wave energy converter. To solve the complex state space problem of irregular wave environments caused by inherent randomness and wide spectrum, the coupled model consists of a deep reinforcement learning (DRL) algorithm based on the Soft Actor-Critic (SAC) method and a numerical wave flume implemented using computational fluid dynamics (CFD) methods. The wave dataset used by the algorithm comprises irregular waves generated through the Pierson-Moskowitz (P-M) spectrum. The wave-making capability of the numerical wave flume is validated, and the latching control agent is trained by coupling the DRL algorithm with multiple parallel numerical wave flume (NWF) environments. In the irregular wave testing set, the non-predictive DRL method driven by the environmental state is compared with both predictive and non-predictive benchmark control methods. Under the testing wave, the energy capture efficiency of the DRL control is 8.85 % higher than that of the predictive benchmark method and 17.3 % higher than that of the non-predictive benchmark method. Additionally, the study of the peak angular velocity of the wave energy converter (WEC) under different control methods demonstrates the load advantage of DRL control over the benchmark methods. The DRL control neural network output delay results confirm the algorithm's real-time performance. The generalization capability of DRL control was further validated under extreme waves and different water depths. This research demonstrates the effectiveness of a discrete latching action reinforcement learning model in the irregular wave environment and proves the practicality of the DRL method in terms of both energy capture efficiency and application.
DA - 2026/02//
PY - 2026
PB - Elsevier
VL - 257
SP - 124821
UR - https://www.sciencedirect.com/science/article/pii/S0960148125024851
DO - 10.1016/j.renene.2025.124821
LA - English
KW - Wave
KW - Attenuator
KW - Modeling
KW - Control
KW - Performance
ER -

Abstract

This paper proposes a data-driven coupled model for latching control of the Edinburgh Duck wave energy converter (WEC). Irregular wave environments present a complex state space because of their inherent randomness and wide spectrum. To address this, the coupled model combines a deep reinforcement learning (DRL) algorithm based on the Soft Actor-Critic (SAC) method with a numerical wave flume (NWF) implemented using computational fluid dynamics (CFD). The wave dataset used by the algorithm comprises irregular waves generated from the Pierson-Moskowitz (P-M) spectrum. The wave-making capability of the NWF is validated, and the latching control agent is trained by coupling the DRL algorithm with multiple parallel NWF environments. On the irregular wave testing set, the non-predictive DRL method, which is driven by the environmental state, is compared with both predictive and non-predictive benchmark control methods. Under the testing wave, the energy capture efficiency of DRL control is 8.85 % higher than that of the predictive benchmark method and 17.3 % higher than that of the non-predictive benchmark method. A study of the peak angular velocity of the WEC under the different control methods further shows that DRL control has a load advantage over the benchmark methods. Measurements of the control network's output delay confirm the algorithm's real-time performance. The generalization capability of DRL control is further validated under extreme waves and at different water depths. The research demonstrates the effectiveness of a reinforcement learning model with discrete latching actions in irregular wave environments and shows the practicality of the DRL method in terms of both energy capture efficiency and application.
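
The abstract states that the irregular waves are generated from the Pierson-Moskowitz spectrum. As a rough illustration of what that involves, the sketch below builds a surface-elevation signal by superposing random-phase cosine components whose amplitudes follow a P-M spectrum. It is not the authors' CFD wave maker. The parameterization by significant wave height and peak period, the frequency band, the number of components, and every function name and sea-state value here are assumptions chosen for the example.

```python
import math
import random


def pm_spectrum(omega, hs, tp):
    """P-M spectral density S(omega) in m^2 s/rad, written in its Hs/Tp form.

    Assumption: the record does not say how the spectrum was parameterized;
    significant wave height hs (m) and peak period tp (s) are used here.
    """
    omega_p = 2.0 * math.pi / tp
    return (5.0 / 16.0) * hs**2 * omega_p**4 / omega**5 * math.exp(
        -1.25 * (omega_p / omega) ** 4
    )


def wave_components(hs, tp, n_components=200, seed=0):
    """Discretize the spectrum into (omega, amplitude, phase) triples."""
    rng = random.Random(seed)
    omega_p = 2.0 * math.pi / tp
    # Hypothetical frequency band: 0.5 to 4 times the peak frequency.
    low, high = 0.5 * omega_p, 4.0 * omega_p
    d_omega = (high - low) / n_components
    components = []
    for i in range(n_components):
        omega = low + (i + 0.5) * d_omega  # midpoint of each frequency bin
        amplitude = math.sqrt(2.0 * pm_spectrum(omega, hs, tp) * d_omega)
        phase = rng.uniform(0.0, 2.0 * math.pi)  # random phase per component
        components.append((omega, amplitude, phase))
    return components


def elevation(t, components):
    """Surface elevation (m) at time t (s) as a sum of cosine components."""
    return sum(a * math.cos(w * t + p) for w, a, p in components)


def significant_height(components):
    """Recover Hs = 4 * sqrt(m0) from the discretized component amplitudes."""
    m0 = sum(0.5 * a * a for _, a, _ in components)
    return 4.0 * math.sqrt(m0)


if __name__ == "__main__":
    comps = wave_components(hs=2.0, tp=8.0)  # illustrative sea state only
    series = [elevation(0.1 * k, comps) for k in range(600)]  # 60 s at 10 Hz
    print(f"recovered Hs: {significant_height(comps):.3f} m")
    print(f"max elevation in sample: {max(series):.3f} m")
```

Because the band is truncated, the recovered significant wave height comes out slightly below the requested value. A time series built this way could serve as the wave input to a simplified WEC model, whereas the paper reports generating the waves inside a CFD numerical wave flume.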