Adaptive Dynamic Programming Using Lyapunov Function Constraints
Published in: | IEEE Control Systems Letters, 2019-10, Vol. 3 (4), p. 901-906 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Summary: | This letter is concerned with a stabilizing adaptive dynamic programming (ADP) approach to the approximate solution of a given infinite-horizon optimal control problem. Since the latter problem cannot, in general, be solved exactly, ADP introduces a parametrized function approximator of the infinite-horizon cost function, the so-called "critic", whose parameters are adapted. The so-called "actor", in turn, derives the optimal input for the system. Guaranteeing closed-loop stability of ADP is a notoriously hard problem due to the use of approximation structures in the control scheme. Since at least stabilizability is always assumed in the analyses of ADP, it is justified to invoke a corresponding Lyapunov function. The proposed ADP scheme explicitly uses this Lyapunov function to simultaneously optimize the critic and guarantee closed-loop stability. A Hessian-free optimization routine is utilized for the actor and critic optimization problems. Convergence to prescribed vicinities of the optima is shown. A computational study showed a significant performance improvement of the critic-based approach over a nominal stabilizing controller for a range of initial conditions. |
ISSN: | 2475-1456 |
DOI: | 10.1109/LCSYS.2019.2919439 |
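The summary above describes the method only at a high level: a parametrized critic approximates the infinite-horizon cost, an actor derives the input, a Lyapunov function of the (assumed stabilizable) system is imposed as an explicit constraint to guarantee closed-loop stability, and a Hessian-free routine performs the optimization. The sketch below is not the letter's algorithm; it is a minimal illustration of that general actor-critic-with-Lyapunov-constraint pattern under simplifying assumptions. The dynamics `f`, stage cost, nominal gain `K`, Lyapunov matrix `P`, quadratic critic features, decrease rate `alpha`, and the plain random-search update (standing in for a Hessian-free optimizer) are all hypothetical placeholders.

```python
import numpy as np

rng = np.random.default_rng(0)

# --- Hypothetical example system and design choices (not from the letter) ---
A = np.array([[1.0, 0.1], [0.0, 1.0]])
B = np.array([[0.0], [0.1]])

def f(x, u):
    """Discrete-time dynamics x_{k+1} = f(x_k, u_k); a toy double integrator."""
    return A @ x + B @ np.atleast_1d(u)

def stage_cost(x, u):
    return float(x @ x + 0.1 * u**2)

# Nominal stabilizing controller and an assumed-known Lyapunov function (placeholders).
K = np.array([[0.8, 1.5]])
P = np.array([[2.0, 0.5], [0.5, 1.5]])

def u_nominal(x):
    return float(-(K @ x))

def V(x):
    return float(x @ P @ x)

# --- Critic: linearly parametrized approximator of the infinite-horizon cost ---
def features(x):
    x1, x2 = x
    return np.array([x1**2, x1 * x2, x2**2])

def critic(x, w):
    return float(w @ features(x))

def td_residual(x, u, w):
    """Bellman-type residual used as the critic's training objective."""
    return critic(x, w) - stage_cost(x, u) - critic(f(x, u), w)

# --- Actor: choose an input that looks optimal w.r.t. the critic, but accept it
#     only if it satisfies a Lyapunov decrease condition; otherwise fall back to
#     the nominal stabilizing controller. ---
def actor(x, w, alpha=0.01, candidates=np.linspace(-2.0, 2.0, 81)):
    best_u, best_val = None, np.inf
    for u in candidates:
        if V(f(x, u)) - V(x) <= -alpha * V(x):   # stability constraint
            val = stage_cost(x, u) + critic(f(x, u), w)
            if val < best_val:
                best_u, best_val = u, val
    return u_nominal(x) if best_u is None else best_u

# --- Gradient- and Hessian-free critic update: a simple random-search step ---
def critic_update(x, u, w, step=0.05, trials=20):
    best_w, best_err = w, td_residual(x, u, w) ** 2
    for _ in range(trials):
        cand = w + step * rng.standard_normal(w.shape)
        err = td_residual(x, u, cand) ** 2
        if err < best_err:
            best_w, best_err = cand, err
    return best_w

# --- Closed-loop simulation ---
x = np.array([1.0, -0.5])
w = np.zeros(3)
for k in range(50):
    u = actor(x, w)
    w = critic_update(x, u, w)
    x = f(x, u)
print("final state:", x, "critic weights:", w)
```

In this sketch, the Lyapunov decrease condition is checked before any critic-suggested input is applied; if no candidate satisfies it, the nominal stabilizing controller is used, so stability of the toy loop does not hinge on the quality of the critic approximation. This mirrors the general idea of the summary only loosely; the letter itself embeds the Lyapunov function as a constraint in the critic optimization and proves convergence to prescribed vicinities of the optima.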