Category Theoretic Probability Theory
Posted by David Corfield
Having noticed (e.g., here and here) that what I do in my day job (statistical learning theory) has much to do with my hobby (things discussed here), I ought to be thinking about probability theory in category theoretic terms. What would seem to be the most promising approach is described by Prakash Panangaden in Probabilistic Relations.
The category SRel (stochastic relations) has as objects sets equipped with a -field. Morphisms are conditional probability densities or stochastic kernels. So, a morphism from to is a function such that
- is a bounded measurable function,
- is a subprobability measure on .
If is a morphism from to , then from to is defined as .
Apparently this is based on work by Michele Giry, which in turn was based on earlier work by Lawvere. This definition differs from Giry’s in the second clause where subprobability measures are allowed, rather than ordinary probability measures.
Panangaden notes that something very similar to the way that the category of relations can be constructed from the powerset functor is at stake. Just as the category of relations is the Kleisli category of the powerset functor over the category of sets, SRel is the Kleisli category of the functor over the category of measurable spaces and measurable functions which sends a measurable space, , to the measurable space of subprobability measures on . This functor gives rise to a monad.
Now I’d like to know
- What is gained by the move from probability measures to subprobability measures?
- How to put this work into relation with the fact that there is a natural choice of metric with which to give a riemannian geometry to a space of probability distributions, and also to a space of conditional distributions (see section 6, p. 39, of this).
Posted at February 7, 2007 11:50 AM UTC
TrackBack URL for this Entry: http://golem.ph.utexas.edu/cgi-bin/MT-3.0/dxy-tb.fcgi/1152
Re: Category Theoretic Probability Theory
Can you tell us what a subprobability measure is? It doesn’t seem to be mentioned in Panangaden’s paper.
Also, you might like to take a look at Alex Simpson’s very interesting talk notes, Probabilistic Observations and Valuations.
Re: Category Theoretic Probability Theory
David Mumford (yes, that one) has an interesting paper about refounding mathematics using probability measures rather than sets. (Experts will surely complain about my phrasing there; see the paper for yourself.) Apparently it settles the Continuum Hypothesis and the Axiom of Choice.
It’s been a while since I read it but I don’t recall it being particularly categorical.
Re: Category Theoretic Probability Theory
Re: Category Theoretic Probability Theory
For the CS folk out there, Avi Pfeffer and Norman Ramsey make nice application of probability monads in their Stochastic lambda calculus paper. Pfeffer has a full-fledged probabilistic modelling language, IBAL, based on the calculus.
Another implementation, in which the monads are used directly (rather than implicitly, as in IBAL) is by Martin Erwig in his Probabilistic Functional Programming project (in Haskell). What I like about Erwig’s implementation is that you can take the same monadic description of a model, and use it in two different semantic contexts, one corresponding to direct Bayesian updates, the other corresponding to sampling.
This degree of reuse is rarely seen in statistical software, and reminds me of a point made by John Langford in his blog that Bayesianism, seen as an extension of logic, naturally interpolates from the statistical setting all the way through to pure software engineering/computer programming. This point seemed very deep to me when I first read it — not n-categorically deep maybe, but enough for me!
Re: Category Theoretic Probability Theory
I am unfortunately very much out of my mathematical depth here, but convinced there is something in it.
Here is a relevant old reference:
Statistical Isomorphism.
Norman Morse. Richard Sacksteder. The Annals of Mathematical Statistics, Vol. 37, No. 1, 203-214. Feb., 1966.
I myself built on this (and related work by Le Cam) in:
Dawid, A. P. (1980). Conditional independence for statistical operations. Ann. Statist. 8, 598–617.
Another important book-length work is:
Statistical decision rules and optimal inference. N.N.Čencov (Chentsov).
Amer. Math. Soc. (1982) (Translated from Russian: 1972 - Nauka, Moscow).
In particular Chentsov develops information geometry from a category-theoretic angle.
Read the post
Category Theoretic Probability Theory II
Weblog: The n-Category Café
Excerpt: Encouraged by the comment of a statistician as eminent as Phil Dawid, I shall continue with what category theory has to say about probability theory. Recall that we were considering a monad on the category of measurable spaces, Meas....
Tracked: February 13, 2007 6:58 PM
Read the post
Category Theory in Machine Learning
Weblog: The n-Category Café
Excerpt: Does category theory have a future in machine learning?
Tracked: September 5, 2007 4:04 PM
Re: Category Theoretic Probability Theory
Can you tell us what a subprobability measure is? It doesn’t seem to be mentioned in Panangaden’s paper.
Also, you might like to take a look at Alex Simpson’s very interesting talk notes, Probabilistic Observations and Valuations.