Transfer of Samples in Batch Reinforcement Learning
Description
The main objective of transfer learning is to reduce the complexity of learning the solution of a target task by effectively reusing the knowledge retained from solving one or more source tasks. In this paper, we introduce a novel algorithm that transfers samples (i.e., experience tuples ) from source to target tasks. Under the assumption that tasks defined on the same environment often have similar transition models and reward functions, we propose a method to select samples from the source tasks that are mostly similar to the target task, and, then, to use them as input for batch reinforcement learning algorithms. As a result, the number of samples that the agent needs to collect from the target task to learn its solution is reduced. We empirically show that, following the proposed approach, the transfer of samples is effective in reducing the learning complexity, even when the source tasks are significantly different from the target task.
| Slides | |
| 0:00 | Transfer of Samples in Batch Reinforcement Learning |
| 0:01 | Outline |
| 0:19 | Transfer in Reinforcement Learning (1) |
| 0:36 | Transfer in Reinforcement Learning (2) |
| 0:52 | Transfer in Reinforcement Learning (3) |
| 1:10 | State of the Art (1) |
| 1:16 | State of the Art (2) |
| 1:34 | State of the Art (3) |
| 1:53 | State of the Art (4) |
| 2:21 | Outline |
| 2:23 | The Goal (1) |
| 2:38 | The Goal (2) |
| 2:57 | The Goal (3) |
| 3:17 | The Scenario (1) |
| 3:38 | The Scenario (2) |
| 3:59 | The Scenario (3) |
| 4:25 | Outline |
| 4:26 | Task Compliance (1) |
| 4:42 | Task Compliance (2) |
| 5:00 | Task Compliance (3) |
| 5:21 | Continuous Model Approximation (1) |
| 5:45 | Continuous Model Approximation (2) |
| 5:54 | Continuous Model Approximation (3) |
| 6:42 | Task Compliance (1) |
| 7:00 | Task Compliance (2) |
| 7:04 | Task Compliance (3) |
| 7:40 | Task Compliance (4) |
| 7:53 | Sample Relevance (1) |
| 8:15 | Sample Relevance (2) |
| 8:33 | Sample Relevance (3) |
| 8:58 | Sample Relevance (4) |
| 9:10 | Sample Relevance (5) |
| 9:18 | Sample Relevance (6) |
| 9:24 | Sample Relevance (7) |
| 9:32 | Sample Relevance (8) |
| 9:42 | Sample Relevance (5) |
| 9:55 | Sample Relevance (9) |
| 10:22 | Sample Relevance (10) |
| 10:31 | Transfer of Samples |
| 10:49 | Outline |
| 10:52 | The Boat Problem (1) |
| 11:54 | The Boat Problem (2) |
| 12:35 | The Boat Problem (3) |
| 12:56 | Outline |
| 12:58 | Transfer from S1 and S2 to T (1) |
| 13:53 | Transfer from S1 and S2 to T (2) |
| 14:23 | Transfer from S1 and S2 to T (3) |
| 14:40 | Transfer from S1 and S2 to T (4) |
| 14:50 | Transfer from S1 and S2 to T (5) |
| 15:10 | Transfer from S1 and S2 to T (6) |
| 15:24 | Transfer from S1 and S2 to T (7) |
| 15:37 | Transfer from S1 and S2 to T (8) |
| 16:01 | Transfer from S1 and S2 to T (9) |
| 16:18 | Transfer from S1 and S2 to T (10) |
| 16:51 | Outline |
| 16:54 | Conclusions (1) |
| 17:12 | Conclusions (2) |
| 17:41 | Conclusions (3) |
| 17:56 | Conclusions (4) |
| 18:05 | Conclusions (5) |
| 18:33 | Conclusions (6) |
| 18:44 | Conclusions (7) |
| 19:02 | Conclusions (8) |
| 19:29 | Thank you! |
| 21:15 | - Questions |
| 23:34 | - Questions |
| 26:05 | - Questions |
Lecture rating
| People found this lecture: | ||
| Worth seeing | ||
| because it is: | ||
| Valuable and informative | ||
| Well presented | ||
| Easily understandable | ||
| Acceptably recorded | ||
| You need to login to cast your vote. | ||
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Related content
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !


