Cost-effective Outbreak Detection in Networks
Description
Which blogs should we read to avoid missing important information? Where should we place sensors in a water distribution network to quickly detect contaminants? These seemingly different problems share common structure: Outbreak detection can be modeled as a problem of selecting nodes (blogs, sensor locations, ...) in a network, in order to detect the spreading of a virus or information as quickly as possible. We present a general methodology for near optimal sensor placement in these and related problems. We demonstrate that many realistic outbreak detection objectives (e.g., detection likelihood, population affected) exhibit the property of “submodularity’’. We exploit submodularity to develop an efficient algorithm that scales to large problems, provably achieving near optimal placements, while being 700 times faster than a simple greedy algorithm. We evaluate our approach on several large real-world problems, including a model of a water distribution network, and real blog data. We also show how the approach leads to deeper insights in both applications, answering multicriteria trade-off, cost-sensitivity and generalization questions. Joint work with: Andreas Krause, Carlos Guestrin, Christos Faloutsos, Jeanne VanBriesen and Natalie Glance Recepient of best student paper award at ACM SIGKDD ‘07 conference.
| Slides | |
| 0:00 | Cost-effective Outbreak Detection in Networks |
| 0:42 | Diffusion in Social Networks - Page 1 |
| 1:04 | Diffusion in Social Networks - Page 2 |
| 1:40 | Empirical Studies of Diffusion |
| 2:34 | Diffusion in Networks |
| 3:14 | Scenario 1: Water Network |
| 3:45 | Scenario 2: Online media |
| 4:16 | Cascade Detection: General Problem |
| 5:18 | Two Parts to the Problem |
| 6:46 | Problem Setting |
| 7:37 | Structure of the Problem |
| 8:31 | Analysis |
| 9:21 | An Approximation Result |
| 11:08 | Reward functions: Submodularity |
| 12:18 | Reward Functions are Submodular |
| 13:01 | Background: Submodular functions |
| 14:43 | Towards a New Algorithm |
| 15:51 | Benefit‐Cost: More Problems |
| 16:55 | Solution: CELF Algorithm |
| 18:00 | How good is the solution? |
| 18:47 | Scaling up CELF algorithm |
| 19:34 | Scaling up CELF - page 1 |
| 20:07 | Scaling up CELF - page 2 |
| 21:59 | Experiments: 2 Case Studies |
| 22:51 | Case study 1: Cascades in Blogs |
| 23:27 | Diffusion in Blogs |
| 25:37 | Q1: Blogs: Solution Quality |
| 27:04 | Q2: Blogs: Cost of a Blog - part 1 |
| 27:49 | Q2: Blogs: Cost of a Blog - part 2 |
| 29:10 | Q4: Blogs: Heuristic Selection |
| 31:03 | Blogs: Generalization to Future |
| 34:01 | Q5: Blogs: Scalability |
| 35:16 | Case study 2: Water Network |
| 37:01 | Water: Solution Quality |
| 37:22 | Water: Heuristic Placement |
| 38:46 | Water: Placement Visualization |
| 39:41 | Water: Algorithm Scalability |
| 40:05 | Results of BWSN competition |
| 41:27 | Other results |
| 42:56 | Conclusion |
| 43:30 | Conclusion and Connections |
| 43:55 | Topic Diffusion: what blogs to read? |
| 45:29 | Diffusion in Blogs |
| 47:18 | Water: Placement Visualization |
| 48:08 | Further Connections |
| 48:45 | References |
Lecture rating
| People found this lecture: | ||
| Worth seeing | ||
| because it is: | ||
| Valuable and informative | ||
| Well presented | ||
| Easily understandable | ||
| Acceptably recorded | ||
| You need to login to cast your vote. | ||
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Related content
SEE ALSO:
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !





