PDP: Parallel Dynamic Programming

Deep reinforcement learning is a focal research area in artificial intelligence. The principle of optimality in dynamic programming is key to the success of reinforcement learning methods. The principle of adaptive dynamic programming (ADP) is first presented instead of direct dynamic programming (DP...

Bibliographic Details
Published in: IEEE/CAA Journal of Automatica Sinica, 2017, Vol. 4 (1), p. 1-5
Main Authors: Wang, Fei-Yue; Zhang, Jie; Wei, Qinglai; Zheng, Xinhu; Li, Li
Format: Article
Language: English