Fast Clustering based on Kernel Density Estimation
Description
The Denclue algorithm employs a cluster model based on
kernel density estimation. A cluster is defined by a local maximum of
the estimated density function. Data points are assigned to clusters by
hill climbing, i.e. points going to the same local maximum are put into
the same cluster. A disadvantage of Denclue 1.0 is, that the used hill
climbing may make unnecessary small steps in the beginning and never
converges exactly to the maximum, it just comes close.
We introduce a new hill climbing procedure for Gaussian kernels,
which adjusts the step size automatically at no extra costs. We prove
that the procedure converges exactly towards a local maximum by reducing
it to a special case of the expectation maximization algorithm.
We show experimentally that the new procedure needs much less iterations
and can be accelerated by sampling based methods with sacrificing
only a small amount of accuracy.
Categories
Top: Computer Science: Machine Learning: ClusteringTop: Computer Science: Machine Learning: Density estimation
Top: Computer Science: Machine Learning: Kernel Methods
| Slides | |
| 0:00 | Fast Clustering Based on Kernel Density Estimation |
| 0:29 | Overview |
| 1:07 | Density-Based Clustering |
| 1:44 | Kernel Density Estimation |
| 3:00 | Denclue 1.0 Framework |
| 4:01 | Problem of Constant Step Size |
| 4:32 | New Hill Climbing Approach |
| 5:11 | New Denclue 2.0 Hill Climbing |
| 5:19 | New Hill Climbing Approach (a) |
| 5:31 | New Denclue 2.0 Hill Climbing (a) |
| 5:50 | Proof of Convergence pt 1 |
| 7:15 | Proof of Convergence pt 2 |
| 7:58 | Identification of Local Maxima |
| 9:40 | Acceleration |
| 11:29 | Experiments pt 1 |
| 12:09 | Experiments pt 2 |
| 12:33 | Experiments pt 3 |
| 13:19 | Experiments pt 4 |
| 14:22 | Conclusion |
| 14:58 | Thank You for Your Attention! |
| 18:52 | Experiments pt 4 (a) |
Lecture rating
| People found this lecture: | ||
| Worth seeing | ||
| because it is: | ||
| Valuable and informative | ||
| Well presented | ||
| Easily understandable | ||
| Acceptably recorded | ||
| You need to login to cast your vote. | ||
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !





