event thumbnail image
Carnegie Mellon Machine Learning Lunch seminar

Inference Complexity as Learning Bias

author: Pedro Domingos, Dept. of Computer Science & Engineering, University of Washington

Description

Graphical models are usually learned without regard to the cost of doing inference with them. As a result, even if a good model is learned, it may perform poorly at prediction, because it requires approximate inference. We propose an alternative: learning models with a score function that directly penalizes the cost of inference. Specifically, we learn arithmetic circuits with a penalty on the number of edges in the circuit (in which the cost of inference is linear). Our algorithm is equivalent to learning a Bayesian network with context-specific independence by greedily splitting conditional distributions, at each step scoring the candidates by compiling the resulting network into an arithmetic circuit, and using its size as the penalty. We show how this can be done efficiently, without compiling a circuit from scratch for each candidate. Experiments on several real-world domains show that our algorithm is able to learn tractable models with very large treewidth, and yields more accurate predictions than a standard context-specific Bayesian network learner, in far less time. (Joint work with Daniel Lowd.)

You might be experiencing some problems with Your Video player.
Slides
0:00 - Announcement
0:35 Inference Complexity As Learning Bias
0:56 Don’t use model complexity as your learning bias …
1:43 The Goal
4:03 Outline
4:40 Solution 1: Exact Inference
5:38 Solution 2: Approximate Inference
7:11 Solution 3: Learn a tractable model
9:16 Thin junction trees are thin
9:34 Solution 3: Learn a tractable model
10:32 Outline
10:53 Bayesian networks
12:14 …With decision-tree CPDs
12:42 …Compiled to circuits
13:55 Arithmetic circuits
15:21 ACs for Inference
17:57 BN Structure Learning
20:27 Key Idea
21:28 Basic algorithm
22:44 Efficiency
24:07 Algorithm
24:41 Before split
25:23 After split
26:26 After split ...
27:12 How to split a circuit
29:42 Optimizations
32:36 Experiments
34:17 Learning time
35:11 Inference time
36:37 Accuracy: EachMovie
37:27 Inference time
37:29 Accuracy: EachMovie
37:41 Accuracy: MSWeb
37:43 Accuracy: KDD Cup
38:07 Conclusion
41:07 - Questions
43:04 - Questions
43:59 - Questions
44:14 - Questions
44:29 - Questions
44:40 - Questions
45:51 - Questions

Lecture rating

People found this lecture:
Worth seeing
because it is:
 Valuable and informative
Well presented
Easily understandable
Acceptably recorded
You need to login to cast your vote.

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

make sure you have javascript enabled or clear this field: