Large Scale High-Precision Topic Modeling on Twitter

Published on 2014-10-07 · 7262 views

Shuang-Hong Yang

We are interested in organizing a continuous stream of sparse and noisy texts, known as "tweets", in real time into an ontology of hundreds of topics with measurable and stringently high precision. Th

Industry & Government Sessions


Presentation

Large Scale High-Precision Topic Modeling on Twitter 00:00

Topic modeling of Tweets 00:42

Many Use Cases 02:09

Quality requirement 02:56

Existing approaches? 03:42

Meet Jubjub 05:56

Technical solutions 06:10

Architecture overview 06:15

Labeled data acquisition 07:42

Tweet text classification 08:35

Label correlation 08:58

Two stage learning 09:43

Diagnosis & corrective learning 10:40

Decision Rejection 11:00

Quality evaluation 11:21

Beyond text 11:23

Derive topics from other signals 11:34

Integrative inference 11:47

Summary 11:56

Thank you! 12:17