About
Good decision-making depends on comprehensive, accurate knowledge. But the information relevant to many important decisions in areas such as business, government, medicine, and scientific research is massive and growing at an accelerating pace. Relevant raw data is widely available on the web and in other data sources, but to be useful it usually must be gathered, extracted, organized, and normalized into a knowledge base.
Hand-built knowledge bases such as Wikipedia have made us all better decision-makers. However, human editing alone will not be enough to create a wide variety of domain-specific, deeply comprehensive, more highly structured knowledge bases.
A variety of automated methods have begun to reach levels of accuracy and scalability that make them applicable to automatically constructing useful knowledge bases from text and other sources. These capabilities have been enabled by research in areas including natural language processing, information extraction, information integration, databases, search and machine learning. There are substantial scientific and engineering challenges in advancing and integrating such relevant methodologies.
This workshop gathered researchers from a variety of fields that contribute to the automated construction of knowledge bases.
There has recently been a tremendous amount of new work in this area, some of it in traditionally disconnected communities. The organizers aimed to bring these communities together.
Topics of interest include:
- information extraction; open information extraction, named entity extraction; entity resolution, relation extraction.
- information integration; schema alignment; ontology alignment; ontology construction.
- monolingual alignment, alignment between knowledge bases and text.
- joint inference between text interpretation and knowledge base.
- pattern analysis, semantic analysis of natural language, reading the web, learning by reading.
- databases; distributed information systems; probabilistic databases.
- scalable computation; distributed computation.
- information retrieval; search on mixtures of structured and unstructured data; querying under uncertainty.
- machine learning; unsupervised, lightly-supervised and distantly-supervised learning; learning from naturally-available data.
- human-computer collaboration in knowledge base construction; automated population of wikis.
- dynamic data, online/on-the-fly adaptation of knowledge.
- inference; scalable approximate inference.
- languages, toolkits and systems for automated knowledge base construction.
- demonstrations of existing automatically-built knowledge bases.
Videos
Systems and Overviews

- Building Structured Web Databases: A Midterm Report from the Cimple Project (Jun 7, 2010, 3774 views)

Databases, evidence, provenance, inference

- PrDB: Increasing the Representational Power and Scaling Reasoning in Probabilist... (Jun 7, 2010, 3281 views)
- DBToaster: Aggressive Compilation Techniques for Online Aggregation (Jun 7, 2010, 4360 views)
- Timely Knowledge (Jun 7, 2010, 3461 views)
- MCMC Inference Inside the DB for Extraction, Resolution, Alignment, Provenance a... (Jun 7, 2010, 4314 views)

Schema/ontology alignment, data resources, querying

- WWT: A system for query-driven relation extraction from the semi-structured web (Jun 7, 2010, 4495 views)
- Table Search (Jun 7, 2010, 2690 views)
- Worth its Weight in Gold or Yet Another Resource — A Comparative Study of Wiktio... (Jun 7, 2010, 4540 views)
- Entity Disambiguation using Relations extracted from Wikipedia (Jun 7, 2010, 4032 views)
- Query-Driven Integration: The Q System (Jun 7, 2010, 2991 views)
- ProbaMap: a scalable tool for discovering probabilistic mappings between taxonom... (Jun 7, 2010, 3049 views)
- Combined Structured and Keyword-Based Search in Textually Enriched Entity-Relati... (Jun 7, 2010, 3694 views)
- Aligning Sense Inventories in Wikipedia and Wordnet (Jun 7, 2010, 3414 views)

Information Extraction

- Automatic Extraction of Human Activity Knowledge from Method-Describing Web Arti... (Jun 7, 2010, 4105 views)
- Mining Commonsense Knowledge From Personal Stories in Internet Weblogs (Jun 7, 2010, 4076 views)
- Reinforcement Learning for Structured Data Labeling (Jun 7, 2010, 3566 views)
- Finding Frequent and Interesting Triples in Text (Jun 7, 2010, 3800 views)
- The Web’s Many Models (Jun 7, 2010, 3234 views)
- Meaning Propagation (Jun 7, 2010, 3448 views)
- Robust Web Extraction, A Principled Approach (Jun 7, 2010, 3667 views)
- Welcome Speech at the First Workshop on Automated Knowledge Based Construction 2... (Jun 7, 2010, 3246 views)