02ReinforcementLearning1.6B, Relation of SARSA and Bellman equation

views comments

Sketch of a proof of the relation between fluctuating Q-values in SARSA and the Bellman equation

Related Media