Extending Tables with Data from over a Million Websites

Published on Dec 19, 2014
4476 Views

This Big Data Track submission demonstrates how the BTC 2014 dataset, Microdata annotations from thousands of websites, as well as millions of HTML tables are used to extend local tables with additi

Chapter list

00:00 Extending Tables with Data from over a Million Websites
00:27 Goal
00:44 Operation 1: Extend Local Table with Single Column
01:08 Operation 2: Extend Local Table with Many Columns
01:57 Types of Web Data Used
02:20 Billion Triple Challenge Dataset 2014
02:36 Web Data Commons – Microdata Corpus
02:59 Web Data Commons – Web Tables Corpus - 1
03:49 Web Data Commons – Web Tables Corpus - 2
04:39 WikiTables
04:58 Internal Data Model: Entity-Attributes-Tables
06:59 Indexed Tables
07:26 The Mannheim Search Join Engine (MSJE)
08:18 The Search Operator
08:38 Multi-Join Operator
08:43 Consolidation Operator
09:29 http://searchjoins.webdatacommons.org
09:52 Result: Extend with Single Column
10:18 Provenance Summary
10:48 Provenance Details
11:21 Evaluation Results
12:55 Result: Extend with Many Columns
13:20 Provenance Summary
13:43 Conclusion