JOURNAL ARTICLE

Model-Free Off-Policy Iterative Adaptive Dynamic Programming for Nitrate-Nitrogen Concentration Control

Abstract

To solve the optimal control problem of the nitrate-nitrogen concentration in the wastewater treatment plant (WWTP), a model-free off-policy iterative adaptive dynamic programming algorithm using online data is proposed. Under the actor-critic structure, the developed algorithm approximates the optimal Q-function by minimizing the temporal difference and improves the control law through the policy gradient method. Neural networks are utilized in the proposed scheme. Besides, the experience replay buffer is involved in the off-policy iteration of neural networks. Finally, simulation examples for a nonlinear system and WWTP are presented to verify the effectiveness of the proposed method.

Keywords:
Dynamic programming Computer science Nitrate Artificial neural network Iterative method Control theory (sociology) Mathematical optimization Optimal control Function (biology) Nonlinear system Control (management) Algorithm Artificial intelligence Mathematics

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
14
Refs
0.21
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Adaptive Dynamic Programming Control
Physical Sciences →  Computer Science →  Computational Theory and Mathematics
Mechanical Circulatory Support Devices
Physical Sciences →  Engineering →  Biomedical Engineering
Reinforcement Learning in Robotics
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.