0.25
0.5
0.75
1.25
1.5
1.75
2
Extending Tables with Data from over a Million Websites
Published on Dec 19, 20144476 Views
This Big Data Track submission demonstrates how the BTC 2014 dataset, Microdata annotations from thousands of websites, as well as millions of HTML tables are used to extend local tables with additi
Related categories
Chapter list
Extending Tables with Data from over a Million Websites00:00
Goal00:27
Operation 1: Extend Local Table with Single Column00:44
Operation 2: Extend Local Table with Many Columns01:08
Types of Web Data Used01:57
Billion Triple Challenge Dataset 201402:20
Web Data Commons-Microdata Corpus02:36
Web Data Commons –Web Tables Corpus - 102:59
Web Data Commons –Web Tables Corpus - 203:49
WikiTables04:39
Internal Data Model: Entity-Attributes-Tables04:58
Indexed Tables06:59
The Mannheim Search JoinsEngine (MSJE)07:26
The Search Operator08:18
Multi-Join Operator08:38
Consolidation Operator08:43
http://searchjoins.webdatacommons.org09:29
Result: Extend with Single Column09:52
Provenance Summary10:18
Provenance Details10:48
Evaluation Results11:21
Result: Extend with Many Columns12:55
Provenance Summary13:20
Conclusion13:43