Explainable deep reinforcement learning for resilient and battery-aware microgrid control
Nejati Amiri, Mohammad Hossein and Guéniat, Florimond (2026) Explainable deep reinforcement learning for resilient and battery-aware microgrid control. Energy Conversion and Management, 353. p. 121215. ISSN 0196-8904
Preview |
Text
1-s2.0-S0196890426001846-main.pdf - Published Version Available under License Creative Commons Attribution. Download (9MB) |
Abstract
Microgrids support renewable integration and resilience, but operation is challenged by intermittency, forecast uncertainty, and battery-life constraints. This paper presents an eXplainable Deep Reinforcement Learning (XDRL) framework for hourly microgrid energy management under uncertainty, trained with Proximal Policy Optimisation (PPO). The reward jointly optimises a priority-weighted Resilience Index (RI) and a life-cycle-aware battery term, and robustness is strengthened via curriculum learning over progressively noisier scenarios. Post-hoc explainability using Shapley Additive Explanations (SHAP) and Local Interpretable Model-agnostic Explanations (LIME) is used to interpret how charging, discharging, and load-allocation decisions depend on net-energy balance, recent state of charge, and different load demands. Simulation on a cyclone-prone coastal microgrid in Kothapatnam, India (PV–wind–battery with prioritised loads) shows that, under uncertainty, the proposed policy achieves RI = 0.9956 (0.33% below an MPC benchmark of 0.9989) while increasing expected battery life by about 5% (15.9 vs. 15.1 years) and producing smoother SOC trajectories. From a computational perspective, online inference is about 5800 cheaper than solving MPC at each step, and the 15-year lifetime compute cost (including one-off training) is approximately three times lower. A 4000-run Monte Carlo study confirms robustness (median RI 0.992, 5–95%: 0.984–0.999; median battery life 16 years).
| Item Type: | Article |
|---|---|
| Identification Number: | 10.1016/j.enconman.2026.121215 |
| Dates: | Date Event 8 February 2026 Accepted 16 February 2026 Published Online |
| Uncontrolled Keywords: | Microgrids energy management, Deep reinforcement learning, Explainable AI, Resilience, Battery degradation |
| Subjects: | CAH10 - engineering and technology > CAH10-01 - engineering > CAH10-01-01 - engineering (non-specific) |
| Divisions: | Architecture, Built Environment, Computing and Engineering > Engineering |
| Depositing User: | Gemma Tonks |
| Date Deposited: | 02 Mar 2026 14:12 |
| Last Modified: | 02 Mar 2026 14:12 |
| URI: | https://www.open-access.bcu.ac.uk/id/eprint/16898 |
Actions (login required)
![]() |
View Item |

Tools
Tools