Towards Real-Time Reinforcement Learning Control of a Wave Energy Converter

Journal Article

Author:

Anderlini, E.; Husain, S.; Parker, G.; Abusara, M.; Thomas, G.

Publication Date:

October 28, 2020

Journal:

Journal of Marine Science and Engineering

Volume:

8

Issue:

11

Pages:

16

Publisher:

MDPI

Affiliation:

University College London, Michigan Technological University, University of Exeter

Technology:

Wave, Point Absorber

Collection Method:

Modeling

Engineering:

Control, Performance

Language:

English

Document Access

Website:

https://www.mdpi.com/2077-1312/8/11/845

Citation

Anderlini, E.; Husain, S.; Parker, G.; Abusara, M.; Thomas, G. (2020). Towards Real-Time Reinforcement Learning Control of a Wave Energy Converter. Journal of Marine Science and Engineering, 8(11), 16. https://doi.org/10.3390/jmse8110845

@article{Anderlini-2020-9871,
author = {Anderlini, E and Husain, S and Parker, G and Abusara, M and Thomas, G},
title = {Towards Real-Time Reinforcement Learning Control of a Wave Energy Converter},
journal = {Journal of Marine Science and Engineering},
year = {2020},
month = {oct},
publisher = {MDPI},
volume = {8},
number = {11},
pages = {16},
doi = {10.3390/jmse8110845},
url = {https://www.mdpi.com/2077-1312/8/11/845},
keywords = {Wave, Point Absorber, Modeling, Control, Performance},
}

TY - JOUR
TI - Towards Real-Time Reinforcement Learning Control of a Wave Energy Converter
AU - Anderlini, E
AU - Husain, S
AU - Parker, G
AU - Abusara, M
AU - Thomas, G
T2 - Journal of Marine Science and Engineering
AB - The levellised cost of energy of wave energy converters (WECs) is not competitive with fossil fuel-powered stations yet. To improve the feasibility of wave energy, it is necessary to develop effective control strategies that maximise energy absorption in mild sea states, whilst limiting motions in high waves. Due to their model-based nature, state-of-the-art control schemes struggle to deal with model uncertainties, adapt to changes in the system dynamics with time, and provide real-time centralised control for large arrays of WECs. Here, an alternative solution is introduced to address these challenges, applying deep reinforcement learning (DRL) to the control of WECs for the first time. A DRL agent is initialised from data collected in multiple sea states under linear model predictive control in a linear simulation environment. The agent outperforms model predictive control for high wave heights and periods, but suffers close to the resonant period of the WEC. The computational cost at deployment time of DRL is also much lower by diverting the computational effort from deployment time to training. This provides confidence in the application of DRL to large arrays of WECs, enabling economies of scale. Additionally, model-free reinforcement learning can autonomously adapt to changes in the system dynamics, enabling fault-tolerant control.
DA - 2020/10/28/
PY - 2020
PB - MDPI
VL - 8
IS - 11
SP - 16
UR - https://www.mdpi.com/2077-1312/8/11/845
DO - 10.3390/jmse8110845
LA - English
KW - Wave
KW - Point Absorber
KW - Modeling
KW - Control
KW - Performance
ER -

Abstract

The levellised cost of energy of wave energy converters (WECs) is not competitive with fossil fuel-powered stations yet. To improve the feasibility of wave energy, it is necessary to develop effective control strategies that maximise energy absorption in mild sea states, whilst limiting motions in high waves. Due to their model-based nature, state-of-the-art control schemes struggle to deal with model uncertainties, adapt to changes in the system dynamics with time, and provide real-time centralised control for large arrays of WECs. Here, an alternative solution is introduced to address these challenges, applying deep reinforcement learning (DRL) to the control of WECs for the first time. A DRL agent is initialised from data collected in multiple sea states under linear model predictive control in a linear simulation environment. The agent outperforms model predictive control for high wave heights and periods, but suffers close to the resonant period of the WEC. The computational cost at deployment time of DRL is also much lower by diverting the computational effort from deployment time to training. This provides confidence in the application of DRL to large arrays of WECs, enabling economies of scale. Additionally, model-free reinforcement learning can autonomously adapt to changes in the system dynamics, enabling fault-tolerant control.