Robust Web Extraction, A Principled Approach
Published on Jun 07, 2010 · 3662 Views
On script-generated web sites, many documents share a common HTML tree structure, which allows wrappers to extract the information of interest effectively. Of course, the scripts and thus the tree structure e
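
As a rough illustration of the wrapper idea in the abstract, the Python sketch below applies one path expression to two pages that share the same tree shape. The page snippets, the `WRAPPER` path, and the `extract` helper are hypothetical and are not taken from the talk. The snippets are assumed to be well-formed XHTML so that the standard-library parser can read them.

```python
import xml.etree.ElementTree as ET

# Two hypothetical pages generated by the same script:
# identical tree structure, different data values.
PAGE_A = """<html><body>
<div class="header"><h1>Shop</h1></div>
<div class="content"><table>
<tr><td>Name</td><td>Blue Kettle</td></tr>
<tr><td>Price</td><td>19.99</td></tr>
</table></div>
</body></html>"""

PAGE_B = """<html><body>
<div class="header"><h1>Shop</h1></div>
<div class="content"><table>
<tr><td>Name</td><td>Red Mug</td></tr>
<tr><td>Price</td><td>5.50</td></tr>
</table></div>
</body></html>"""

# A wrapper here is simply a path through the shared tree structure
# (assumed layout: the price sits in the second cell of the second row).
WRAPPER = "./body/div[@class='content']/table/tr[2]/td[2]"


def extract(page, wrapper=WRAPPER):
    """Return the text of the node selected by the wrapper, or None if no node matches."""
    root = ET.fromstring(page)
    node = root.find(wrapper)
    return node.text if node is not None else None


if __name__ == "__main__":
    for page in (PAGE_A, PAGE_B):
        print(extract(page))
```

Because both pages come from the same template, a single path is enough to pull the price out of each. The `None` return makes a non-matching path visible to the caller instead of raising an exception.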