Asymptotically-stable adaptive-optimal control algorithm with saturating actuators and relaxed persistence of excitation.

Abstract

This paper proposes a control algorithm based on adaptive dynamic programming to solve the infinite-horizon optimal control problem for known deterministic nonlinear systems with saturating actuators and nonquadratic cost functionals. The algorithm is based on an actor/critic framework, where a critic neural network (NN) is used to learn the optimal cost, and an actor NN is used to learn the optimal control policy. The adaptive control nature of the algorithm requires a persistence of excitation condition to be a priori validated, but this can be relaxed using previously stored data concurrently with current data in the update of the critic NN. A robustifying control term is added to the controller to eliminate the effect of residual errors, leading to the asymptotically stability of the closed-loop system. Simulation results show the effectiveness of the proposed approach for a controlled Van der Pol oscillator and also for a power system plant.

ICB Affiliated Authors

João Hespanha

Authors

Vamvoudakis, K. G., Miranda, M. F. and Hespanha, J.

Date

October 1, 2015

Type

Peer-Reviewed Article

Journal

IEEE Transactions on Neural Networks and Learning Systems

DOI

10.1109/TNNLS.2015.2487972