Website Privacy Preservation for Query Log Publishing
Description
In this work we study privacy preservation for the publi-
cation of search engine query logs. In particular, we introduce a new
privacy concern, which is that of website privacy (or business privacy).
We define the possible adversaries that could be interested in disclosing
website information and the vulnerabilities found in the query log, from
which they could benefit. We also detail anonymization techniques to
protect website information, and explore the different types of attacks
that an adversary could use. We then present a graph-based heuristic to
validate the effectiveness of our anonymization method, and perform an
experimental evaluation of this approach. Our experimental results show
that the query log can be appropriately anonymized against a specific
attack for website exposure, by only removing approximately 9% of the
total volume of queries and clicked URLs.
| Slides | |
| 0:00 | mmdss07_poblete_wpp_Page_01 |
| 0:19 | Introduction (1) |
| 2:17 | Introduction (2) |
| 4:14 | Introduction (3) |
| 5:08 | Introduction (4) |
| 6:29 | Introduction (5) |
| 7:17 | Introduction (6) |
| 8:44 | Introduction (7) |
| 10:37 | Outline |
| 11:24 | Scope of Our Work |
| 11:58 | Why Website Privacy Preservation? |
| 13:29 | Why is Website Privacy Preservation Difficult? |
| 16:17 | The Data Sources |
| 16:57 | Related Work |
| 17:56 | A Few Word on Query Log Anonymization |
| 20:17 | Adversaries - Two types |
| 21:27 | Outline (1) |
| 21:36 | Outline (2) |
| 22:06 | Query Log Anonymization (1) |
| 23:13 | Query Log Anonymization (2) |
| 24:57 | Attacks using Vulnerable Queries (1) |
| 25:18 | Attacks using Vulnerable Queries (2) |
| 27:18 | Attacks using Vulnerable Queries (3) |
| 28:10 | Attacks using Vulnerable Queries (4) |
| 29:45 | Attacks using Vulnerable Queries (5) |
| 29:55 | Heuristic Approach Against Attack 3 (1) |
| 30:43 | Heuristic Approach Against Attack 3 (2) |
| 31:47 | Heuristic Approach Against Attack 3 (3) |
| 32:32 | Other Attacks: Using Website Logs |
| 32:47 | Heuristic Approach Against Attack 3 (1) |
| 32:56 | Other Attacks: Using Website Logs |
| 34:56 | Other Attacks: User Identification |
| 36:01 | Outline |
| 36:07 | Evaluation |
| 37:24 | Degree Distribution |
| 37:54 | Density Vs. Number of Nodes Removed |
| 38:19 | Conclusions |
Lecture rating
| People found this lecture: | ||
| Worth seeing | ||
| because it is: | ||
| Valuable and informative | ||
| Well presented | ||
| Easily understandable | ||
| Acceptably recorded | ||
| You need to login to cast your vote. | ||
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Related content
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !




