Estimating Rates of Rare Events at Multiple Resolutions
Description
We consider the problem of estimating occurrence rates of rare events for extremely sparse data, using pre-existing hierarchies to perform inference at multiple resolutions. In particular, we focus on the problem of estimating click rates for (webpage, advertisement) pairs (called impressions) where both the pages and the ads are classified into hierarchies that capture broad contextual information at different levels of granularity. Typically the click rates are low and the coverage of the hierarchies is sparse. To overcome these difficulties we devise a sampling method whereby we analyze a specially chosen sample of pages in the training set, and then estimate click rates using a two-stage model. The first stage imputes the number of (webpage, ad) pairs at all resolutions of the hierarchy to adjust for the sampling bias. The second stage estimates click rates at all resolutions after incorporating correlations among sibling nodes through a tree-structured Markov model. Both models are scalable and suited to large scale data mining applications. On a real-world dataset consisting of 1/2 billion impressions, we demonstrate that even with 95% negative (non-clicked)events in the training set, our method can effectively discriminate extremely rare events in terms of  heir click propensity.
| Slides | |
| 0:03 | Estimating Rates of Rare Events at Multiple Resolutions |
| 0:10 | Estimation in the “Tail” pt 1 |
| 1:06 | Estimation in the “Tail” pt 2 |
| 2:11 | System Overview |
| 3:34 | Sampling of Webpages |
| 4:34 | Imputation of Impression Volume pt 1 |
| 6:02 | Imputation of Impression Volume pt 2 |
| 6:21 | Imputation of Impression Volume pt 3 |
| 6:42 | Imputing Xij |
| 7:25 | Imputation: Summary |
| 7:46 | System Overview |
| 8:22 | Rare Rate Modeling pt 1 |
| 8:39 | Rare Rate Modeling pt 2 |
| 9:23 | Rare Rate Modeling pt 3 |
| 10:20 | Experiments pt 1 |
| 10:45 | Experiments pt 2 |
| 11:21 | Experiments pt 3 |
| 12:15 | Experiments pt 4 |
| 12:34 | Experiments pt 5 |
| 13:46 | Related Work |
| 14:23 | Conclusions |
| 15:55 | Experiments pt 5 (a) |
Lecture rating
| People found this lecture: | ||
| Worth seeing | ||
| because it is: | ||
| Valuable and informative | ||
| Well presented | ||
| Easily understandable | ||
| Acceptably recorded | ||
| You need to login to cast your vote. | ||
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Related content
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !





