Action-dependent Control Variates for Policy Optimization via Stein Identity

Hao Liu
Yihao Feng
Yi Mao
Jian Peng
Qiang Liu
ICLR (2018)

Abstract