Corpus Design Principles and Challenges in the COST Action 'Distant Reading for European Literary History'
Published on Nov 12, 2019 (42 views)
Chapter list
00:00 COST Action Distant Reading for European Literary History
01:24 Outline - 1
01:37 Schedule - 1
02:21 Schedule - 2
02:57 Introduction
03:32 Outline - 2
03:37 COST Actions are research networks
04:23 Distant reading
05:56 COST Action Distant Reading
06:18 WG1 Scholarly Resources
07:00 ELTeC
07:46 Outline - 3
07:49 Romanian Language collection
08:18 Introduction to Romanian novels / literary context
08:45 Table - 1
10:57 Table - 2
11:48 Table - 3
12:54 Cultural context - 1
15:43 Cultural context - 2
17:27 Cultural context - 3
18:54 “Birth certificates” of Romanian novels
19:30 The Romanian Novel in figures relevant for ELTeC sampling
22:02 Corpus
23:17 Corpus design
24:43 Corpus design – Action’s purpose
25:48 Corpus design – challenges
26:25 Corpus design – Action’s approach
28:27 ELTeC – sampling criteria
30:13 ELTeC – balancing criteria
35:14 ELTeC – current state
38:12 Research data management for ELTeC - 1
39:05 Research data management for ELTeC - 2
40:24 Is that possible?
40:37 How to... Build a National novel collection?
41:32 Drawback I: DIGITIZATION from scratch
42:49 Samples I: printing popular books in 19th-century Romania
43:13 Samples II: take a close look at the glyphs
44:06 OCR output: really untidy
44:22 Samples III: take a close look at the glyphs
44:40 OCR output: absolutely unreliable
44:49 Some normal and, of course, normalized texts
45:31 Drawback II: TEI HEADER – particularities - 1
46:29 Drawback III: The Splendors and Miseries of Digital Literary Studies in Romania in 2019
47:33 Treatment of metadata rather than data
48:10 Low digitization of both resources and library metadata
50:47 Briefly...
51:27 Output of Romanian novels vs. ELTeC groups (Time Slots)
54:22 How many of them actually available in Romanian libraries after
55:07 Female authors: bettering the scores
56:38 Sampling 9-11 authors with exactly 3 books
57:33 Open questions