Advanced Topics in RL

Published on 2016-08-23

4824 Views

Joelle Pineau

Deep Learning Summer School 2016 - Montreal

Related categories

Deep Learning, Reinforcement Learning, Unsupervised Learning

Presentation

Reinforcement Learning: Exploration 00:00

Exploration/Exploitation 00:36

Application #1: Internet advertising 01:00

Application #2: Network server selection 01:37

Personalized medical treatments 02:06

The multi-arm bandit - 1 03:07

The multi-arm bandit - 2 04:41

The multi-arm bandit - 3 05:52

ε-greedy action selection 08:15

Softmax action selection 08:19

Thompson sampling (1933) 09:05

Contextual bandits 11:26

Upper Confidence Bound (UCB) 11:29

Contextual bandits - 2 11:42

Bayesian reinforcement learning 12:25

Final comments 13:44

Questions? 15:02