Loading…

Adaptive dynamic programming for model‐free tracking of trajectories with time‐varying parameters

Summary Recently proposed adaptive dynamic programming (ADP) tracking controllers assume that the reference trajectory follows time‐invariant exo‐system dynamics—an assumption that does not hold for many applications. In order to overcome this limitation, we propose a new Q‐function that explicitly...

Full description

Saved in:
Bibliographic Details
Published in:International journal of adaptive control and signal processing 2020-07, Vol.34 (7), p.839-856
Main Authors: Köpf, Florian, Ramsteiner, Simon, Puccetti, Luca, Flad, Michael, Hohmann, Sören
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Summary Recently proposed adaptive dynamic programming (ADP) tracking controllers assume that the reference trajectory follows time‐invariant exo‐system dynamics—an assumption that does not hold for many applications. In order to overcome this limitation, we propose a new Q‐function that explicitly incorporates a parametrized approximation of the reference trajectory. This allows learning to track a general class of trajectories by means of ADP. Once our Q‐function has been learned, the associated controller handles time‐varying reference trajectories without the need for further training and independent of exo‐system dynamics. After proposing this general model‐free off‐policy tracking method, we provide an analysis of the important special case of linear quadratic tracking. An example demonstrates that our new method successfully learns the optimal tracking controller and outperforms existing approaches in terms of tracking error and cost.
ISSN:0890-6327
1099-1115
DOI:10.1002/acs.3106