Search Results - Hussenot, Léonard
-
1
-
2
-
3
CopyCAT: Taking Control of Neural Policies with Constant Attacks
Published in arXiv.orgGet full text
Article -
4
-
5
Show me the Way: Intrinsic Motivation from Demonstrations
Published in arXiv.orgGet full text
Article -
6
Learning Energy Networks with Generalized Fenchel-Young Losses
Published in arXiv.orgGet full text
Article -
7
-
8
WARP: On the Benefits of Weight Averaged Rewarded Policies
Published in arXiv.orgGet full text
Article -
9
Continuous Control with Action Quantization from Demonstrations
Published in arXiv.orgGet full text
Article -
10
MusicRL: Aligning Music Generation to Human Preferences
Published in arXiv.orgGet full text
Article -
11
-
12
-
13
-
14
Offline Reinforcement Learning with Pseudometric Learning
Published in arXiv.orgGet full text
Article -
15
-
16
-
17
-
18
Gemma 2: Improving Open Language Models at a Practical Size
Published in arXiv.orgGet full text
Article -
19
-
20
Gemma: Open Models Based on Gemini Research and Technology
Published in arXiv.orgGet full text
Article