Fast Clustering based on Kernel Density Estimation

author:Alexander Hinneburg, Institute for Informatics, Martin-Luther University
published: Oct. 8, 2007,   recorded: September 2007,   views: 557
Categories
You might be experiencing some problems with Your Video player.

Related content

Visitors who watched this lecture also watched...
03:24:20
Lectures on Clustering

5729 views - Ulrike von Luxburg, 2007
03:39:05
Clustering - An overview

1829 views - Marina Meila, 2006
02:06:49
Introduction to kernel methods

2714 views - Bernhard Schölkopf, 2007
48:57
Kernel Representations and Kernel Density Estimation

183 views - Peter J. Bickel, 2008
03:21
K-nearest neighbor classification

4054 views - Antal van den Bosch, 2007
02:14:31
Introduction to kernel methods

2918 views - Alexander J. Smola, 2007
20:00
Statistical Change Detection for Multi-Dimensional Data

430 views - Xiuyao Song, 2007
01:00:47
Gaussian Process Basics

12603 views - David MacKay, 2006
20:36
Learning to align: a statistical approach

2204 views - Elisa Ricci, 2007
04:59:19
Machine Learning, Probability and Graphical Models

18419 views - Sam Roweis, 2006

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Lecture popularity: You need to login to cast your vote.

Description

The Denclue algorithm employs a cluster model based on kernel density estimation. A cluster is defined by a local maximum of the estimated density function. Data points are assigned to clusters by hill climbing, i.e. points going to the same local maximum are put into the same cluster. A disadvantage of Denclue 1.0 is, that the used hill climbing may make unnecessary small steps in the beginning and never converges exactly to the maximum, it just comes close. We introduce a new hill climbing procedure for Gaussian kernels, which adjusts the step size automatically at no extra costs. We prove that the procedure converges exactly towards a local maximum by reducing it to a special case of the expectation maximization algorithm. We show experimentally that the new procedure needs much less iterations and can be accelerated by sampling based methods with sacrificing only a small amount of accuracy.

Link this page  

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

make sure you have javascript enabled or clear this field: