MDL Tutorial
Description
We give a self-contained tutorial on the Minimum Description Length (MDL) approach to modeling, learning and prediction. We focus on the recent (post 1995) formulations of MDL, which can be quite different from the older methods that are often still called 'MDL' in the machine learning and UAI communities. In its modern guise, MDL is based on the concept of a `universal model'. We explain this concept at length. We show that previous versions of MDL (based on so-called two-part codes), Bayesian model selection and predictive validation (a variation of cross-validation) can all be interpreted as approximations to model selection based on 'universal models'. Modern MDL prescribes the use of a certain `optimal' universal model, the so-called `normalized maximum likelihood model' or `Shtarkov distribution'. This is related to (yet different from) Bayesian model selection with non-informative priors. It leads to a penalization of `complex' models that can be given an intuitive differential-geometric interpretation. Roughly speaking, the complexity of a parametric model is directly related to the number of distinguishable probability distributions that it contains. We also discuss some recent extensions such as the 'luckiness principle', which can be used if the Shtarkov distribution is undefined, and the 'switch distribution', which allows for a resolution of the AIC-BIC dilemma.
| Slides | |
| 0:00 | Universal Modeling: Introduction to ‘Modern’ MDL |
| 1:17 | Overview |
| 1:58 | Minimum Description Length Principle (1) |
| 2:52 | Minimum Description Length Principle (2) |
| 5:01 | Minimum Description Length Principle (3) |
| 5:23 | Model Selection |
| 6:25 | Modern’ MDL? |
| 7:35 | Modern MDL! |
| 7:37 | Overview |
| 7:44 | Codes |
| 8:55 | Code Length & Probability |
| 13:04 | Code Lengths ‘are’ probabilities… |
| 18:36 | …and probabilities ‘are’ code lengths! |
| 19:08 | The Most Important Slide! (1) |
| 19:48 | The Most Important Slide! (2) |
| 20:34 | Remarks |
| 21:24 | The Most Important Slide! (2) |
| 22:55 | Remarks |
| 22:57 | Overview |
| 23:11 | Universal Codes (1) |
| 24:04 | Universal Codes (2) |
| 26:36 | Universal Codes (3) |
| 28:06 | Universal Models |
| 30:37 | Terminology |
| 31:18 | Bayesian Mixtures are universal models (1) |
| 32:25 | Bayesian Mixtures are universal models (2) |
| 33:48 | 2-part MDL code is a universal model (code) |
| 36:00 | 2-part vs. Bayes universal models |
| 36:26 | Bayesian Mixtures are universal models (2) |
| 37:15 | 2-part vs. Bayes universal models |
| 37:46 | Optimal Universal Model |
| 40:12 | Optimal Universal Model - II |
| 42:49 | MDL Model Selection (1) |
| 43:46 | MDL Model Selection (2) |
| 45:49 | Four Interpretations |
| 48:05 | Counting Interpretation of MDL (1) |
| 48:27 | Counting Interpretation of MDL (2) |
| 51:44 | Counting Interpretation of MDL (3) |
| 51:46 | Parametric Model Classes |
| 53:51 | Geometric Interpretation of MDL (1) |
| 55:08 | Geometric Interpretation of MDL (2) |
| 55:20 | Bayesian Model Selection vs. MDL (1) |
| 55:23 | Geometric Interpretation of MDL (2) |
| 58:48 | Bayesian Model Selection vs. MDL (1) |
| 61:08 | Bayesian Model Selection vs. MDL (2) |
| 62:03 | Bayes and MDL, remarks |
| 62:57 | Further topics |
| 63:06 | Predictive Interpretation |
| 65:59 | Predictive Interpretation, II |
| 67:12 | Predictive Interpretation, III |
| 70:01 | Predictive Interpretation, IV |
| 71:00 | Comparing infinitely many models (1) |
| 75:10 | Comparing infinitely many models (2) |
| 76:04 | Overview |
| 76:09 | New Developments |
| 80:21 | Luckiness Principle (2) |
Lecture rating
| People found this lecture: | ||
| Worth seeing | ||
| because it is: | ||
| Valuable and informative | ||
| Well presented | ||
| Easily understandable | ||
| Acceptably recorded | ||
| You need to login to cast your vote. | ||
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Related content
SEE ALSO:
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !





