Boilerplate Detection Using Shallow Text Features

Published on 2010-10-0723873 Views

Christian Kohlschütter

In addition to the actual content Web pages consist of navigational elements, templates, and advertisements. This boilerplate text typically is not related to the main content, may deteriorate search

WSDM 2010 - New York

Related categories

Boilerplate Detection Using Shallow Text Features

Christian Kohlschütter

WSDM 2010 - New York

Related categories

Presentation