Non-standard Geometries and Data Analysis

author: Suresh Venkatasubramanian, School of Computing, University of Utah
published: Dec. 5, 2008,   recorded: November 2008,   views: 669
Categories
You might be experiencing some problems with Your Video player.

Slides

Slides
0:00 Non-standard Geometries And Data Analysis
1:07 “Standard” Data Mining (1)
1:32 “Standard” Data Mining (2)
1:49 “Standard” Data Mining (3)
2:00 “Standard” Data Mining (4)
2:19 Data Mining: The “Convenient Fiction”
4:30 Data Mining: Algorithms
6:20 Non-standard Geometries I: Surfaces in Rd
7:44 Non-Standard Geometries II: Shape Manifolds
8:51 Non-Standard Geometries III: Information Geometry
9:53 Central Question of Study
10:58 An Aside: Convert Everything To A Metric Space
12:35 Part II
12:45 Shape Manifolds I: Rotations
13:20 Shape Manifolds II: Kendall Shape Space
14:08 Shape Manifolds III: MRI Scans
14:41 Riemannian Manifolds
15:05 Central Points on Manifolds
15:41 Measures of Centrality
16:55 Robustness
17:51 Medians on Manifolds [FVJ08]
18:51 Prior Art (1-median)
20:35 Force-Balance and the Weiszfeld Algorithm (1)
21:55 Force-Balance and the Weiszfeld Algorithm (2)
22:10 Force-Balance and the Weiszfeld Algorithm (3)
22:32 Displacement on Manifolds: Exp and Log
22:58 Weiszfeld Iteration On Manifolds (1)
23:14 Weiszfeld Iteration On Manifolds (2)
23:31 Main Results I
24:08 Main Results II
24:43 Results I: Rotations
25:01 Results II: Hands
25:15 Results III: MRI Scans
25:29 Questions
26:46 Part III
26:46 Information-Theoretic Clustering (1)
27:35 Information-Theoretic Clustering (2)
27:58 Information-Theoretic Clustering (3)
28:13 Information-Theoretic Clustering (4)
28:28 Information-Theoretic Clustering (5)
28:47 Why it works
30:15 Where’s The Geometry ?
31:14 Geometric View of Information-Theoretic Clustering
32:09 Data Integration
33:54 Heterogeneity Testing
34:13 What Is Heterogeneity ? (1)
34:22 What Is Heterogeneity ? (2)
34:27 What Is Heterogeneity ? (3)
34:30 Heterogeneity = Cluster Entropy Of Soft Clustering
34:52 Representing Strings as Distributions
35:00 Data Sets
35:02 Does The Scheme Work ? (1)
35:05 Does The Scheme Work ? (2)
35:06 Does The Scheme Work ? (3)
35:12 Questions
35:57 Central Question of Study
36:20 Acknowledgements
36:31 - questions
38:47 Information-Theoretic Clustering (5)

Related content

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Lecture popularity: You need to login to cast your vote.
 
    Delicious Bibliography

Description

Traditional data mining starts with the mapping from entities to points in a Euclidean space. The search for patterns and structure is then framed as a geometric search in this space. Concepts like principal component analysis, regression, clustering, and centrality estimation have natural geometric formulations, and we now understand a great deal about manipulating such (typically high dimensional) spaces. For many domains of interest however, the most natural space to embed data in is not Euclidean.

Data might lie on curved manifolds, or even inhabit spaces endowed with different distance structures than l_p spaces. How does one do data analysis in such domains ? In this talk, I'll discuss two specific domains of interest that pose challenges for traditional data mining and geometric methods. One space consists of collections of distributions, and the other is the space of shapes. In both cases, I'll present ongoing work that attempts to interpret and understand clustering in such spaces, driven by different applications.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

make sure you have javascript enabled or clear this field: