Action-dependent Control Variates for Policy Optimization via Stein Identity

Hao Liu
Yihao Feng
Yi Mao
Jian Peng
Qiang Liu
ICLR(2018)

Abstract