PDP: Parallel Dynamic Programming

Deep reinforcement learning is a focus research area in artificial intelligence. The principle of optimality in dynamic programming is a key to the success of reinforcement learning methods. The principle of adaptive dynamic programming(ADP)is first presented instead of direct dynamic programming(DP...

Full description

Saved in:

Bibliographic Details
Published in:	IEEE/CAA journal of automatica sinica 2017, Vol.4 (1), p.1-5
Main Authors:	Wang, Fei-Yue, Zhang, Jie, Wei, Qinglai, Zheng, Xinhu, Li, Li
Format:	Article
Language:	English
Subjects:	Dynamic programming Machine learning
Citations:	Items that cite this one
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Staff View