03ReinforcementLearning2.3, TD Learning
From Wulfram Gerstner
views
comments
From Wulfram Gerstner
Temporal Difference Learning (TD learning) is introduced as well as the standard TD(0) algorithm and V-values.
EPFL video portal by SWITCH | Terms of service | Disclaimer | EPFL Privacy policy |