Advanced Topics in RL
Published on Aug 23, 2016
4801 Views
Chapter list
Reinforcement Learning: Exploration (00:00)
Exploration/Exploitation (00:36)
Application #1: Internet advertising (01:00)
Application #2: Network server selection (01:37)
Personalized medical treatments (02:06)
The multi-arm bandit - 1 (03:07)
The multi-arm bandit - 2 (04:41)
The multi-arm bandit - 3 (05:52)
ε-greedy action selection (08:15)
Softmax action selection (08:19)
Thompson sampling (1933) (09:05)
Contextual bandits (11:26)
Upper Confidence Bound (UCB) (11:29)
Contextual bandits - 2 (11:42)
Bayesian reinforcement learning (12:25)
Final comments (13:44)
Questions? (15:02)