Learnable Representations for Natural Language

author: Alexander Clark, Royal Holloway, University of London
published: Jan. 19, 2010,   recorded: December 2009,   views: 490
Categories
You might be experiencing some problems with Your Video player.

Slides

Slides
0:00 Acknowledgements
0:13 Summary
2:20 Table of contents
3:09 Problem
3:52 How to study learnability? - 1
4:29 How to study learnability? - 2
5:30 Reasonable Research Strategy - 1
6:06 Reasonable Research Strategy - 2
6:30 Reasonable Research Strategy - 3
7:29 Empiricist models
9:09 Example
9:44 Transitions
10:46 Learning
11:06 Learning DFAs and PDFAs
12:03 Outline
12:08 Beyond regular languages
13:30 Observable structure
14:51 Example - 1
15:15 Example - 2
17:09 Elementary results
18:18 Limitations
19:00 NP - DT N - 1
19:37 NP - DT N - 2
19:57 Lattice
21:13 Contextual Features
22:08 Representation - 1
22:46 Representation - 2
23:47 Lattice rules - 1
24:07 Lattice rules - 2
24:44 Contextual Binary Feature Grammars
25:52 Recursive computation - 1
26:13 Recursive computation - 2
26:21 Power of BFGs
26:27 Writing down a grammar
27:40 Search
27:59 Monotonicity of K
28:30 Monotonicity of F
29:09 Dyck language - 1
29:45 Dyck language - 2
30:01 Algorithm
30:25 Diagram
30:42 Finite Context Property - 1
31:18 Finite Context Property - 2
31:22 Fiduciality
32:11 Scope of the FCP
32:58 Result
33:16 Outline
33:35 Concept lattices
34:06 Syntactic Concept Lattice
34:49 Example
35:23 L = (ab)
35:59 Concatenation
36:18 Residuated lattice
36:36 Concatenation
36:39 Learnable
36:56 Constructing a representation - 1
37:01 Constructing a representation - 2
37:39 Regular languages
38:12 Non-regular languages
38:39 Finite feature set
39:00 L = {anbn|n ≥ 0}
39:17 Two problems
39:45 Compute an upper bound 
40:29 Representation
40:56 Power of Representation
41:29 Monotonicity lemma
41:52 Writing down a grammar - 1
42:21 Writing down a grammar - 2
42:33 Monotonicity lemma - 1
43:25 Monotonicity lemma - 2
44:44 Search problem is trivial
45:01 Chomsky Hierarchy
46:12 Statistical Modelling
47:08 Conclusion

Related content

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Lecture popularity: You need to login to cast your vote.
 
    Delicious Bibliography

Description

The Chomsky hierarchy was explicitly intended to represent the hypotheses from distributional learning algorithms; yet these standard representations are well known to be hard to learn, even under quite benign learning paradigms, because of the computationally complexity of inferring rich hidden structures like trees.

There is a lot of interest in unsupervised learning of natural language -- current approaches (e.g. Klein and Manning, Johnson's Adaptor Grammars) use modifications of existing models such as tree or dependency structures together with sophisticated statistical models in order to recover structures that are as close as possible to gold standard manual annotations.

This tutorial will cover a different approach: recent algorithms for the unsupervised learning of representations of natural language based on distributional learning (Clark & Eyraud 2007; Clark, Eyraud and Habrard, 2008; Clark 2009). This research direction involves abandoning the standard models and designing new representation classes for formal languages that are richly structured but where the structure is not hidden but based on observable structures of the language -- the syntactic monoid or a lattice derived from that monoid. These representation classes are as a result easy to learn.

We will look briefly at algorithms for learning deterministic automata, and then move on to algorithms for learning context free and context sensitive languages. These algorithms explicitly model the distribution of substrings of the language: they are efficient (polynomial update time) and provably correct for a class of languages that includes all regular languages, many context free languages and a few context sensitive languages. This class may be rich enough to represent natural language syntax.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

make sure you have javascript enabled or clear this field: