Abstract In this article, a one‐critic Q‐learning algorithm for the adaptive optimization of the distributed microgrids is proposed to improve the charging and discharging management and optimization of batteries in microgrids. Constrained by the random power generation of sustainable energy and the high uncertainty of power load, this article presents a dynamic balance point of the remaining power of the battery pack, and the dual battery pack constitutes a dynamic power game augmented system in the charge‐discharge process. The structure of the one‐critic neural network can effectively simplify the computational complexity, and the value can converge to the optimal solution in finite iterations. Simulation experiments also prove that the Q‐learning can effectively curb the overcharge and over‐discharge behavior of the battery pack, and realize the tracking of the dynamic SOC balance point.
Vahidreza NasirianAli DavoudiFrank L. Lewis
Lei DingQing‐Long HanBoda Ning
Ali BidramAli DavoudiFrank L. LewisShuzhi Sam Ge