event thumbnail image
The 25th International Conference on Machine Learning (ICML 2008)

Data Spectroscopy: Learning Mixture Models using Eigenspaces of Convolution Operators

author: Mikhail Belkin, Ohio State University

Description

In this paper we develop a spectral framework for estimating mixture distributions, specifically Gaussian mixture models. In physics, spectroscopy is often used for the identification of substances through their spectrum. Treating a kernel function K(x,y) as "light" and the sampled data as "substance", the spectrum of their interaction (eigenvalues and eigenvectors of the kernel matrix K) unveils certain aspects of the underlying parametric distribution p, such as the parameters of a Gaussian mixture. Our approach extends the intuitions and analyses underlying the existing spectral techniques, such as spectral clustering and Kernel Principal Components Analysis (KPCA). We construct algorithms to estimate parameters of Gaussian mixture models, including the number of mixture components, their means and covariance matrices, which are important in many practical applications. We provide a theoretical framework and show encouraging experimental results.

You might be experiencing some problems with Your Video player.
Slides
0:00 Data Spectroscopy: Learning Mixture Models with Eigenspaces of Probability Distributions
0:19 Spectral geometry
2:00 Eigenfunctions
2:16 Spectral geometry
2:55 Eigenfunctions
3:26 Can you hear the shape of a probability distribution?
3:45 Spectrum of a probability distribution p
4:11 Can you hear the shape of a probability distribution?
4:17 Spectrum of a probability distribution p
5:42 How to hear a Gaussian (1)
7:27 How to hear a Gaussian (2)
8:05 How to hear a Gaussian (1)
8:28 How to hear a Gaussian (2)
9:14 From data: mean
9:41 Top eigenfunction
10:07 Classical problem: Gaussian Mixtures
10:14 Gaussian Mixture Distributions
10:25 Expectation Maximization
10:40 How to hear a mixture (1)
12:00 How to hear a mixture (2)
13:32 Perturbation theory
13:48 Example
16:34 Data Spectroscopy
17:15 Example
17:35 Data Spectroscopy
22:05 - Questions
26:27 - Questions

Lecture rating

People found this lecture:
Worth seeing
because it is:
 Valuable and informative
Well presented
Easily understandable
Acceptably recorded
You need to login to cast your vote.

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

make sure you have javascript enabled or clear this field: