Large Scale Ranking Problem: some theoretical and algorithmic issues
Description
The talk is divided into two parts. The first part focuses on web-search ranking, where I discuss training relevance models by optimizing DCG (discounted cumulative gain). Under this metric, the quality of the system output is determined mainly by its performance near the top of the ranked list. I will focus mainly on theoretical issues for this learning problem. The second part discusses related algorithmic issues in the context of optimizing the scoring function of a statistical machine translation system according to the BLEU metric (a standard measure of translation quality). Our approach treats the machine translation system as a black box and can optimize millions of system parameters automatically, which has not been attempted before in this context. I will present our method and some initial results.
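To make the point about top-of-list performance concrete, here is a minimal sketch of a DCG computation. The abstract does not state the exact gain and discount functions used in the talk, so this sketch assumes one widely used form: gain `2**rel - 1` and discount `1 / log2(position + 1)`. The function names `dcg` and `rank_by_score` are illustrative, not taken from the lecture.

```python
import math


def dcg(relevances, k=None):
    """DCG of graded relevance labels listed in ranked order (top first).

    Assumed form: sum over positions of (2**rel - 1) / log2(position + 1).
    If k is given, only the top k positions contribute.
    """
    if k is None:
        k = len(relevances)
    return sum(
        (2 ** rel - 1) / math.log2(pos + 1)
        for pos, rel in enumerate(relevances[:k], start=1)
    )


def rank_by_score(scores, relevances):
    """Reorder the relevance labels by descending model score."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return [relevances[i] for i in order]


# The same single relevant document, placed first versus last:
top = dcg([3, 0, 0])
bottom = dcg([0, 0, 3])
```

Under these assumptions, placing the one relevant document at position 1 gives twice the DCG of placing it at position 3, which is why errors near the top of the list dominate the metric.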
Slides
| Time | Title |
| 0:05 | Advertizing |
| 2:04 | Ranking Problems |
| 4:06 | Earlier Work on Statistical Ranking |
| 7:38 | Theoretical Results on Ranking |
| 9:39 | Web-Search Problem |
| 12:32 | Relevance Ranking: Statistical Learning Formulation |
| 13:49 | Measuring Ranking Quality |
| 15:23 | Subset Ranking Model |
| 16:40 | Some Theoretical Questions |
| 17:42 | Bayes Optimal Scoring |
| 18:36 | Simple Regression |
| 20:38 | Importance Weighted Regression |
| 22:11 | Relationship of Regression and Ranking |
| 22:58 | Appropriate Parameter Choice (for previous Theorem) |
| 23:03 | Generalization Performance with Square Regularization |
| 23:41 | Interpretation of Results |
| 24:55 | Another Ranking Example: spelling correction in web-search |
| 25:49 | Some Conclusions |