event thumbnail image
Sessions

Graph Construction and b-Matching for Semi-Supervised Learning

author: Tony Jebara, Department of Computer Science, Columbia University

Description

Graph based semi-supervised learning (SSL) methods play an increasingly important role in practical machine learning systems. A crucial step in graph based SSL methods is the conversion of data into a weighted graph. However, most of the SSL literature focuses on developing label inference algorithms without extensively studying the graph building method and its effect on performance. This article provides an empirical study of leading semi-supervised methods under a wide range of graph construction algorithms. These SSL inference algorithms include the Local and Global Consistency (LGC) method, the Gaussian Random Field (GRF) method, the Graph Transduction via Alternating Minimization (GTAM) method as well as other techniques. Several approaches for graph construction, sparsification and weighting are explored including the popular k-nearest neighbors method (kNN) and the b-matching method. As opposed to the greedily constructed kNN graph, the b-matched graph ensures each node in the graph has the same number of edges and produces a balanced or regular graph. Experimental results on both artificial data and real benchmark datasets indicate that b-matching produces more robust graphs and therefore provides significantly better prediction accuracy without any significant change in computation time.

You might be experiencing some problems with Your Video player.
Slides
0:00 Graph Construction and b-Matching for Semi-Supervised Learning (1)
0:04 Graph Construction and b-Matching for Semi-Supervised Learning (2)
0:51 Semi-Supervised Learning
1:54 Graph Based SSL
3:30 Graph Construction
4:50 Neighborhood Graphs
6:26 k-Nearest Neighbor Graphs
7:34 b-Matching Graphs (1)
7:58 b-Matching Graphs (2)
9:30 Bipartite 1-Matching
10:34 Bipartite b-Matching (1)
11:14 Bipartite b-Matching (2)
12:05 b-Matching (1)
12:30 b-Matching (2)
13:02 b-Matching (3)
13:54 Graph Weighting
15:02 Graph Labeling
16:06 Gaussian Random Fields
16:40 Local and Global Consistency
17:06 Graph Transduction via Alternating Minimization
17:58 Synthetic Experiments (1)
19:22 Synthetic Experiments (2)
20:01 Real Experiment Error Rates
20:46 Real Experiment Error Rates with More Labeling
21:34 Conclusions

Lecture rating

People found this lecture:
Worth seeing
because it is:
 Valuable and informative
Well presented
Easily understandable
Acceptably recorded
You need to login to cast your vote.

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

make sure you have javascript enabled or clear this field: