
0.25
0.5
0.75
1.25
1.5
1.75
2
Corpus Design Principles and Challenges in the COST Action 'Distant Reading for European Literary History
Published on 2019-11-1247 Views
Related categories
Presentation
COST Action Distant Reading for European Literary History00:00
Outline - 101:24
Schedule - 101:37
Schedule - 202:21
Introduction02:57
Outline - 203:32
COST Actions are research networks03:37
Distant reading04:23
COST Action Distant Reading05:56
WG1 Scholarly Resources06:18
ELTeC07:00
Outline - 307:46
Romanian Language collection07:49
Introduction to Romanian novels / literary context08:18
Table - 108:45
Table - 210:57
Table - 311:48
Cultural context - 112:54
Cultural context - 215:43
Cultural context - 317:27
“Birth certificates” of Romanian novels18:54
The Romanian Novel in figures relevant for ELTeC sampling19:30
Corpus22:02
Corpus design23:17
Corpus design – Action’s purpose24:43
Corpus design – challenges25:48
Corpus design – Action’s approach26:25
ELTeC – sampling criteria28:27
ELTeC – balancing criteria30:13
ELTeC – current state35:14
Research data management for ELTeC - 138:12
Research data management for ELTeC - 239:05
Is that possible?40:24
How to... Build a National novel collection?40:37
Drawback 1: DIGITIZATION from scratch41:32
Samples I: printing popular books in 19th-century Romania42:49
Samples II: take a close look at the glyphs43:13
OCR output: really untidy44:06
Samples III: take a close look at the glyphs44:22
OCR output: absolutely unreliable44:40
Some normal and, of course, normalized texts44:49
Drawback II: TEI HEADER – particularities - 145:31
Drawback III: The Splendors and Miseries of Digital Literary Studies in Romania in 201946:29
Treatment of metadata rather than data47:33
Low digitization of both resources and library metadata48:10
Briefly...50:47
Output of Romanian novels vs. ELTeC groups (Time Slots)51:27
How many of them actually available in Romanian libraries after54:22
Female authors: bettering the scores55:07
Sampling 9-11 authors with exactly 3 books56:38
Open questions57:33