Asymptotically-stable adaptive-optimal control algorithm with saturating actuators and relaxed persistence of excitation.

This paper proposes a control algorithm based on adaptive dynamic programming to solve the infinite-horizon optimal control problem for known deterministic nonlinear systems with saturating actuators and nonquadratic cost functionals. The algorithm is based on an actor/critic framework, where a critic neural network (NN) is used to learn the optimal cost, and an actor NN is used to learn the optimal control policy. The adaptive control nature of the algorithm requires a persistence of excitation condition to be a priori validated, but this can be relaxed using previously stored data concurrently with current data in the update of the critic NN. A robustifying control term is added to the controller to eliminate the effect of residual errors, leading to the asymptotically stability of the closed-loop system. Simulation results show the effectiveness of the proposed approach for a controlled Van der Pol oscillator and also for a power system plant.

Vamvoudakis, K. G., Miranda, M. F. and Hespanha, J.
IEEE Transactions on Neural Networks and Learning Systems
Date: October, 2015
ICB Affiliated Authors: João P Hespanha