DeepRL1.4A, Eligibility traces for policy gradient and actor-critic

From Wulfram Gerstner  

views comments