event thumbnail image
The 13th International Conference on Knowledge Discovery and Data Mining

Predictive Discrete Latent Factor Models for Large Scale Dyadic Data

author: Deepak Agarwal, Yahoo Research

Description

We propose a novel statistical method to predict large scale dyadic response variables in the presence of covariate information. Our approach simultaneously incorporates the effect of covariates and estimates local structure that is induced by interactions among the dyads through a discrete latent factor model. The discovered latent factors provide a predictive model that is both accurate and interpretable.

We illustrate our method by working in a framework of generalized linear models, which include commonly used regression techniques like linear regression, logistic regression and Poisson regression as special cases.

We also provide scalable generalized EM-based algorithms for model fitting using both "hard" and "soft" cluster assignments. We demonstrate the generality and efficacy of our approach through large scale simulation studies and analysis of datasets obtained from certain real-world movie recommendation and internet advertising applications.

You might be experiencing some problems with Your Video player.
Slides
0:03 Predictive Discrete Latent Factor Models
0:37 Internet Advertising: Billion Dollar Industry
1:26 Recommender Systems
1:57 Click Fraud
2:39 Data
4:01 Problem Definition
5:15 Agenda
5:50 Existing Approaches
6:03 Non-Parametric Function Estimation
7:28 Random Effects Model
9:25 Generalized Linear Models
10:30 Unsupervised Approach
11:09 Clustering Animation
11:45 PDLF: High Level Overview
12:42 Model
13:15 Fitting Algorithm: Generalized EM
13:53 EM Algorithm Hard Clustering
14:04 Simulation Study on Movie Lens
14:14 Logistic Regression on Movie Lens
14:27 Experiments: Click Count Data
15:02 Co-Cluster Interactions: Plain Co-Clustering
15:49 Co-Cluster Interactions: PDLF
16:35 Prediction Results
16:51 Summary
17:36 Ongoing Work
18:30 Prediction Results (a)

Lecture rating

People found this lecture:
Worth seeing
because it is:
 Valuable and informative
Well presented
Easily understandable
Acceptably recorded
You need to login to cast your vote.

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment: