event thumbnail image
Workshops

Generalization theory of two-part code MDL estimator

author: Tong Zhang, Yahoo! Research

Description

I will present a finite-sample generalization analysis of two-part code MDL estimator. This method selects a model that minimizes the sum of the model description length plus the data description length given the model. It can be shown that under various conditions, optimal rate of convergence can be achieved through an extended family of two-part code MDL that over-penalize the model description length.

As an example, we apply MDL to learning sparse linear representations when the system dimension is much larger than the number of training examples. This is a problem that has attracted considerable attention in recent years. The generalization performance of a two-part code MDL estimator is calculated based on our theory, and it compares favorably to other methods such as 1-norm regularization.

You might be experiencing some problems with Your Video player.
Slides
0:00 Generalization Theory of Two-part Code MDL Estimator
0:01 Outline
1:45 Notations
4:10 Estimation Problem
5:19 Basic Information Theoretical Inequality (with global KL-complexity)
7:07 Remarks
8:21 Information Theoretic Inequality with localized
9:29 Information Complexity Minimization
10:36 Special Case: Two-part code MDL estimation (1)
12:04 Special Case: Two-part code MDL estimation (2)
13:17 Information Complexity Minimization
13:30 Special Case: Two-part code MDL estimation (2)
13:47 Notation: Distances between Distributions
13:53 Information Theoretic Inequality with localized
13:58 Notation: Distances between Distributions
15:00 MDL Convergence Bound using Global Entropy (λ > 1)
17:08 MDL Convergence Bound under Localized Entropy (λ > 1)
17:35 Sub-optimality of MDL with locally well-specified prior
18:28 Sparse Learning
19:36 Two Part-code MDL Solution (1)
20:44 Sparse Learning
20:48 Two Part-code MDL Solution (1)
21:26 Two Part-code MDL Solution (2)
22:11 Sparse Learning
22:13 Two Part-code MDL Solution (1)
22:18 Two Part-code MDL Solution (2)
22:50 Two Part-code MDL Solution (1)
22:51 Two Part-code MDL Solution (2)
22:53 Global MDL Bound
24:17 Localized MDL Bound
25:16 Comparing to L1 convex relaxation
25:50 Summary

Lecture rating

People found this lecture:
Worth seeing
because it is:
 Valuable and informative
Well presented
Easily understandable
Acceptably recorded
You need to login to cast your vote.

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

make sure you have javascript enabled or clear this field: