Information Geometry: Duality, Convexity and Divergences
Description
In this talk, I explore the mathematical relationships between duality in information geometry, convex analysis, and divergence functions.
First, from the fundamental inequality of a convex function, a family of divergence measures can be constructed, which specializes to the familiar Bregman divergence, Jenson difference, beta-divergence, and alpha-divergence, etc. Second, the mixture parameter turns out to correspond to the alpha <-> -alpha duality in information geometry (which I call "referential duality", since it is related to the choice of a reference point for computing divergence). Third, convex conjugate operation induces another kind of duality in information geometry, namely, that of biorthogonal coordinates and their transformation (which I call "representational duality", since it is related to the expression of geometric quantities, such as metric, affine connection, curvature, etc of the underlying manifold). Under this analysis, what is traditionally called "+1/-1 duality" and "e/m duality" in information geometry reflect two very different meanings of duality that are nevertheless intimately interwined for dually flat spaces.
| Slides | |
| 0:00 | Information Geometry: Duality, Convexity, and Divergences |
| 0:22 | Lecture Plan |
| 2:33 | Bregman Divergence |
| 5:00 | Canonical Divergence and Fenchel Inequality |
| 7:09 | Convex Inequality and a-Divergence Induced by it |
| 10:15 | Significance of Bregman Divergence Among a-Divergence Family |
| 11:45 | Statistical Manifold Structure Induced From Divergence Function (Eguchi, 1983) |
| 13:09 | a-Hessian Geometry (of Finite-Dimension Vector Space) (1) |
| 15:05 | a-Hessian Geometry (of Finite-Dimension Vector Space) (2) |
| 15:58 | From Vector Space to Function Space |
| 17:38 | A Special Case of DClassic a-Divergence |
| 18:21 | Other Examples ofD(a) |
| 19:08 | From Vector Space to Function Space |
| 19:22 | A Short Detour: Monotone Scaling (1) |
| 20:43 | A Short Detour: Monotone Scaling (2) |
| 21:48 | Parameterized Functions as Forming a Submanifold under Monotone Scaling (1) |
| 24:12 | Parameterized Functions as Forming a Submanifold under Monotone Scaling (2) |
| 25:51 | An Application: the (a,ß)-Divergence |
| 28:05 | Information Geometry on Banach Space (1) |
| 30:04 | Information Geometry on Banach Space (2) |
| 31:08 | Information Geometry on Banach Space (3) |
| 32:10 | Information Geometry on Banach Space (4) |
| 33:54 | Summary of Current Approach |
| 34:28 | References |
| 34:35 | Questions? |
Lecture rating
| People found this lecture: | ||
| Worth seeing | ||
| because it is: | ||
| Valuable and informative | ||
| Well presented | ||
| Easily understandable | ||
| Acceptably recorded | ||
| You need to login to cast your vote. | ||
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Related content
SEE ALSO:
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !




