02ReinforcementLearning1.4A, Exploration versus Exploitation
From Wulfram Gerstner
views
comments
From Wulfram Gerstner
In order to find out good action choices you need to explore possibilities. In order to collect rewards you need to play the most rewarding actions. This is the exploration-exploitation dilemma.
(updated on March 11, 2021)
Mediaspace will be updated on Saturday, March 29th. Users may experience minor access and performance restrictions.
(close this alert with x ↓↓ )