02ReinforcementLearning1.4A, Exploration versus Exploitation

views comments

In order to find out good action choices you need to explore possibilities. In order to collect rewards you need to play the most rewarding actions. This is the exploration-exploitation dilemma.

(updated on March 11, 2021)