Online Learning and Game Theory

author:Adam Kalai, Toyota Technological Institute at Chicago
published: Feb. 25, 2007,   recorded: May 2005,   views: 2602
Categories
You might be experiencing some problems with Your Video player.

Related content

Visitors who watched this lecture also watched...
51:51
Basic Concepts of Game Theory

1237 views - Toni Jarimo, 2004
05:27:43
Online Learning, Regret Minimization, and Game Theory

990 views - Avrim Blum, 2008
03:32:21
Introduction to Learning Theory

2748 views - Olivier Bousquet, 2006
04:59:19
Machine Learning, Probability and Graphical Models

18458 views - Sam Roweis, 2006
01:43:02
Fuzzy Logic

16728 views - Michael Berthold, 2005
05:02:23
Statistical Learning Theory

8005 views - John Shawe-Taylor, 2004
02:54:53
Some Mathematical Tools for Machine Learning

3714 views - Chris Burges, 2003
02:22:16
Convex Optimization

2951 views - Lieven Vandenberghe, 2007
04:28:21
Online Learning

670 views - Nicolò Cesa-Bianchi, 2007
02:55:49
Game Theory & Clustering

371 views - Marcello Pelillo, 2009

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Lecture popularity: You need to login to cast your vote.

 Watch videos:   (click on thumbnail to launch)

Watch Part 1
Part 1 1:03:51 Flash video Windows Media video
!NOW PLAYING
Watch Part 2
Part 2 0:33:55 Flash video Windows Media video

Description

We consider online learning and its relationship to game theory. In an online decision-making problem, as in Singer's lecture, one typically makes a sequence of decisions and receives feedback immediately after making each decision. As far back as the 1950's, game theorists gave algorithms for these problems with strong regret guarantees. Without making statistical assumptions, these algorithms were guaranteed to perform nearly as well as the best single decision, where the best is chosen with the benefit of hindsight. We discuss applications of these algorithms to complex learning problems where one receives very little feedback. Examples include online routing, online portfolio selection, online advertizing, and online data structures. We also discuss applications to learning Nash equilibria in zero-sum games and learning correlated equilibria in general two-player games.

Link this page  

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Reviews and comments:

Comment1 ram, January 6, 2008 at 6:57 p.m.:

Second video doesn't play past 9:20 seconds


Comment2 ram, January 10, 2008 at 12:45 a.m.:

works now.


Comment3 Praveen, December 26, 2008 at 7:08 p.m.:

Apparently, the stream is not available any more. I get the error "Stream not found". Is it possible to look into the problem?

Write your own review or comment:

make sure you have javascript enabled or clear this field: