event thumbnail image
Workshops

Deep Learning via Semi-Supervised Embedding

author: Jason Weston, NEC Laboratories America, Inc.

Description

We show how nonlinear embedding algorithms popular for use with shallow semi-supervised learning techniques such as kernel methods can be applied to deep multi-layer architectures, either as a regularizer at the output layer, or on each layer of the architecture. This provides a simple alternative to existing approaches to deep learning whilst yielding competitive error rates compared to those methods, and existing shallow semi-supervised techniques.

We then go on to generalize this approach to take advantage of sequential data: for images, and text.

For images, we take advantage of the temporal coherence that naturally exists in unlabeled video recordings. That is, two successive frames are likely to contain the same object or objects. We demonstrate the effectiveness of this method in a semi-supervised setting on some pose invariant object and face recognition tasks.

For text, we describe a unified approach to tagging: a single convolutional neural network architecture that, given a sentence, outputs a host of language processing predictions: part-of-speech tags, chunks, named entity tags, and semantic roles. State-of-the-art performance is attained by learning word embeddings using a text specific semi-supervised task called a language model.

Joint work with: Ronan Collobert, Frederic Ratle, Hossein Mobahi, Pavel Kuksa and Koray Kavukcuoglu.

You might be experiencing some problems with Your Video player.
Slides
0:00 DEEP LEARNING VIA SEMI-SUPERVISED EMBEDDING
0:26 Summary
2:26 Deep Learning with Neural Networks
3:21 Some Deep Training Methods That Exist
3:42 Deep and Shallow Research
5:01 Deep NNs: Multitask with auxiliary unsupervised tasks
6:46 Existing Embedding Algorithms
8:42 Siamese Network: functional embedding
10:07 Shallow Semi-supervision
10:36 New regularizer for NNs: Deep Embedding
11:53 Deep Semi-Supervised Embedding
12:11 Pairwise Example Prior: more general than using k-NN
12:57 Some Perspectives
13:33 Some Experiments: Small Semi-Supervised Setup
13:56 Deep Semi-Supervised Results - 1
14:09 Deep Semi-Supervised Results - 2
14:50 Really Deep Results
15:34 Conclusion (so far)
15:58 DEEP LEARNING FOR VIDEO
16:04 APPLICATION: LEARNING FROM VIDEO
16:23 COIL
16:48 Experimental setup
16:57 Test Accuracy Performance on COIL100 in various settings
17:34 DEEP LEARNING FOR TEXT
17:56 NLP Tasks
18:03 The Brain Way
18:25 The Deep Learning Way
18:37 Using Unlabeled Data
18:59 Language Model: Embedding
19:16 Deep Text Results
19:45 Final Conclusion (really)
21:20 Deep Text Results

Lecture rating

People found this lecture:
Worth seeing
because it is:
 Valuable and informative
Well presented
Easily understandable
Acceptably recorded
You need to login to cast your vote.

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

make sure you have javascript enabled or clear this field: