Advanced Topics in RL thumbnail
Pause
Mute
Subtitles
Playback speed
0.25
0.5
0.75
1
1.25
1.5
1.75
2
Full screen

Advanced Topics in RL

Published on Aug 23, 20164801 Views

Related categories

Chapter list

Reinforcement Learning: Exploration00:00
Exploration/Exploitation00:36
Application #1: Internet advertising01:00
Application #2: Network server selection01:37
Personalized medical treatments02:06
The multi-arm bandit - 103:07
The multi-arm bandit - 204:41
The multi-arm bandit - 305:52
ε-greedy action selection08:15
Softmax action selection08:19
Thompson sampling (1933)09:05
Contextual bandits11:26
Upper Confidence Bound (UCB)11:29
Contextual bandits - 211:42
Bayesian reinforcement learning12:25
Final comments13:44
Questions? 15:02