Automatic Labeling of Multinomial Topic Models
Description
Multinomial distributions over words are frequently used to model topics in text collections. A common, major challenge in applying all such topic models to any text mining problem is to label a multinomial topic model accurately so that a user can interpret the discovered topic. So far, such labels have been generated manually in a subjective way. In this paper, we propose probabilistic approaches to automatically labeling multinomial topic models in an objective way. We cast this labeling problem as an optimization problem involving minimizing Kullback-Leibler divergence between word distributions and maximizing mutual information between a label and a topic model. Experiments with user study have been done on two text data sets with different genres. The results show that the proposed labeling methods are quite effective to generate labels that are meaningful and useful for interpreting the discovered topic models. Our methods are general and can be applied to labeling topics learned through all kinds of topic models such as PLSA, LDA, and their variations.
Categories
Top: Computer Science: Text MiningTop: Computer Science: Machine Learning: Human Language Technology
| Slides | |
| 0:03 | Automatic Labeling of Multinomial Topic Models |
| 0:16 | Outline |
| 0:48 | Statistical Topic Models for Text Mining |
| 2:01 | Topic Models: Hard to Interpret |
| 2:57 | What is a Good Label? |
| 4:26 | Our Method |
| 6:23 | Relevance (Task 2): the Zero-Order Score |
| 7:44 | Relevance (Task 2): the First-Order Score |
| 9:32 | Discrimination and Coverage (Tasks 3 & 4) |
| 10:31 | Variations and Applications |
| 12:03 | Experiments |
| 12:33 | Result Summary |
| 13:36 | Results: Sample Topic Labels |
| 14:54 | Results: Context-Sensitive Labeling |
| 15:46 | Summary |
| 16:37 | Thanks |
Lecture rating
| People found this lecture: | ||
| Worth seeing | ||
| because it is: | ||
| Valuable and informative | ||
| Well presented | ||
| Easily understandable | ||
| Acceptably recorded | ||
| You need to login to cast your vote. | ||
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Related content
SEE ALSO:
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !





