Search Results - Bernardo Ávila Pires
-
1
-
2
-
3
DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm
Published in arXiv.orgGet full text
Article -
4
-
5
-
6
-
7
-
8
-
9
Understanding Self-Predictive Learning for Reinforcement Learning
Published in arXiv.orgGet full text
Article -
10
-
11
Hierarchical Reinforcement Learning in Complex 3D Environments
Published in arXiv.orgGet full text
Article -
12
-
13
-
14
-
15
-
16
-
17
-
18
-
19
-
20