How to discount Information: Information flow in sensing-acting systems and the emergence of hierarchies
published: Oct. 16, 2012, recorded: September 2012, views: 3992
We argue that a consistent formulation of optimal sensing and control must include information terms, yielding an extension of the standard POMDP setting. To make the standard reward/cost terms consistent with the information terms while still allowing tractable computation, the standard uniformity of time must be altered. We argue that this can be done by successive refinement of the information-value tradeoff, which also leads to the emergence of hierarchies and reverse hierarchies for both perception and planning.
Download slides: cyberstat2012_tishby_bellman_equation_01.pdf (3.4 MB)