04ReinforcementLearning3.4, Log-likelihood trick: from batch to online
From Wulfram Gerstner
views
comments
From Wulfram Gerstner
When going from batch to online, care has to be taken to arrive at the statistical weight of online sampling. The log-likelhood trick solves this issue.
EPFL video portal by SWITCH | Terms of service | Disclaimer | EPFL Privacy policy |