Generalization theory of two-part code MDL estimator
Description
I will present a finite-sample generalization analysis of two-part code MDL estimator. This method selects a model that minimizes the sum of the model description length plus the data description length given the model. It can be shown that under various conditions, optimal rate of convergence can be achieved through an extended family of two-part code MDL that over-penalize the model description length. As an example, we apply MDL to learning sparse linear representations when the system dimension is much larger than the number of training examples. This is a problem that has attracted considerable attention in recent years. The generalization performance of a two-part code MDL estimator is calculated based on our theory, and it compares favorably to other methods such as 1-norm regularization.
| Slides | |
| 0:00 | Generalization Theory of Two-part Code MDL Estimator |
| 0:01 | Outline |
| 1:45 | Notations |
| 4:10 | Estimation Problem |
| 5:19 | Basic Information Theoretical Inequality (with global KL-complexity) |
| 7:07 | Remarks |
| 8:21 | Information Theoretic Inequality with localized |
| 9:29 | Information Complexity Minimization |
| 10:36 | Special Case: Two-part code MDL estimation (1) |
| 12:04 | Special Case: Two-part code MDL estimation (2) |
| 13:17 | Information Complexity Minimization |
| 13:30 | Special Case: Two-part code MDL estimation (2) |
| 13:47 | Notation: Distances between Distributions |
| 13:53 | Information Theoretic Inequality with localized |
| 13:58 | Notation: Distances between Distributions |
| 15:00 | MDL Convergence Bound using Global Entropy (λ > 1) |
| 17:08 | MDL Convergence Bound under Localized Entropy (λ > 1) |
| 17:35 | Sub-optimality of MDL with locally well-specified prior |
| 18:28 | Sparse Learning |
| 19:36 | Two Part-code MDL Solution (1) |
| 20:44 | Sparse Learning |
| 20:48 | Two Part-code MDL Solution (1) |
| 21:26 | Two Part-code MDL Solution (2) |
| 22:11 | Sparse Learning |
| 22:13 | Two Part-code MDL Solution (1) |
| 22:18 | Two Part-code MDL Solution (2) |
| 22:50 | Two Part-code MDL Solution (1) |
| 22:51 | Two Part-code MDL Solution (2) |
| 22:53 | Global MDL Bound |
| 24:17 | Localized MDL Bound |
| 25:16 | Comparing to L1 convex relaxation |
| 25:50 | Summary |
Lecture rating
| People found this lecture: | ||
| Worth seeing | ||
| because it is: | ||
| Valuable and informative | ||
| Well presented | ||
| Easily understandable | ||
| Acceptably recorded | ||
| You need to login to cast your vote. | ||
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Related content
SEE ALSO:
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !





