BlackOut: Speeding up Recurrent Neural Network Language Models With Very Large Vocabularies

Published on May 27, 2016 · 2,115 views

We propose BlackOut, an approximation algorithm to efficiently train massive recurrent neural network language models (RNNLMs) with million-word vocabularies. BlackOut is motivated by using a discriminative loss, and we describe a weighted sampling strategy which significantly reduces computation while improving stability, sample efficiency, and rate of convergence. One way to understand BlackOut is to view it as an extension of the DropOut strategy to the output layer, wherein we use a discriminative training loss and a weighted sampling scheme. We also establish close connections between BlackOut, importance sampling, and noise contrastive estimation (NCE).
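To make the estimator in the abstract concrete, here is a minimal NumPy sketch of a BlackOut-style loss for a single time step: score only the target word plus K negatives drawn from a proposal q proportional to the unigram distribution raised to a power alpha, reweight each score by 1/q, and apply a discriminative loss over that sampled set. The helper name blackout_loss, the value of alpha, and the sampling details are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

def blackout_loss(h, W, target, unigram, K=5, alpha=0.4):
    """BlackOut-style training loss for one position (hypothetical sketch).

    h        : (d,)  hidden state from the RNN at this time step
    W        : (V, d) output word embedding matrix
    target   : index of the true next word
    unigram  : (V,)  empirical unigram frequencies
    K        : number of negative samples
    alpha    : proposal exponent, q proportional to unigram**alpha (assumed value)
    """
    V = W.shape[0]
    q = unigram ** alpha
    q /= q.sum()

    # Draw K negatives from the proposal q; drop the target if it was drawn.
    neg = rng.choice(V, size=K + 1, replace=False, p=q)
    neg = neg[neg != target][:K]

    idx = np.concatenate(([target], neg))   # sampled set: target + negatives
    logits = W[idx] @ h                     # scores only for the sampled words
    weights = 1.0 / q[idx]                  # importance weights 1/q_j

    # Weighted softmax restricted to the sampled set (the BlackOut estimator).
    z = weights * np.exp(logits - logits.max())
    p = z / z.sum()

    # Discriminative loss: reward the target, penalize each negative.
    return -(np.log(p[0]) + np.log(1.0 - p[1:]).sum())
```

The payoff is that each update touches K+1 rows of W instead of all V, which is what makes million-word vocabularies tractable; the 1/q reweighting corrects for the biased sampling.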

Chapter list

00:00  BlackOut: Speeding up RNNLMs with Very Large Vocabularies
00:22  Prevalence of Large Softmax Output Layers
01:35  Case Study: RNNLM
04:50  System Optimization
05:55  Strategies to Speed up Softmax
06:51  BlackOut Training
09:28  Connection to Importance Sampling
10:45  Connection to Noise Contrastive Estimation (NCE)
12:04  Comparison to DropOut
13:34  Experiments on Small Datasets - 1
14:34  Experiments on Small Datasets - 2
14:41  Experiments on the 1-Billion-Word Benchmark
15:15  Comparison to the State of the Art
16:33  Conclusion