Möbius Inversion for Categories
Posted by Tom Leinster
Every small category has a classifying space . It’s a topological space, and it can be constructed as the geometric realization of the nerve of .
The homotopy type of depends on every aspect of : its underlying directed graph, its composition, and its identities. But the Euler characteristic of does not: it only depends on the underlying graph. What’s going on?
Last week I gave a talk about this at a conference in Louvain-la-Neuve, Belgium. I’ll also be doing a shorter version at the major annual category theory meeting in Vancouver. So, I’d be grateful for any comments, suggestions or thoughts. The slides are here.
(If the title slides mystify you, what you need to know is that the conference marked the appointment of Ross Street to a visiting chaire.)
As you’ll have surmised from the title, my answer was all about Möbius inversion. Let me explain…
In 1832, August Ferdinand Möbius introduced the number-theoretic Möbius inversion that crops up in elementary (and not so elementary) number theory. In modern language, we might say that the original Möbius inversion takes place in the poset of positive integers, ordered by divisibility.
In the mid-20th century, Möbius inversion was generalized to arbitrary posets. Several people came up with this idea, but it’s usually associated with the name of Gian-Carlo Rota, who demonstrated its importance in enumerative combinatorics.
(I tell a small lie: instead of ‘arbitrary posets’, I should say ‘arbitrary posets satisfying a suitable finiteness condition’. I told a similar lie when speaking of the Euler characteristic of , since it won’t have a well-defined Euler characteristic unless satisfies some finiteness condition.)
The stage was then set for generalizing from posets to (suitably finite) categories. This was done in two distinct ways.
First, there’s what I refer to in the slides as fine Möbius inversion. You can find the definition there. Here I just want to mention a couple of its features. For a category to have fine Möbius inversion means that there’s a fine Möbius function, that is, a function
satisfying certain equations. Here is the set of arrows of and is a commutative ring over which we’ve chosen to work. Whether a category has fine Möbius inversion does depend on its composition and identities, as does the fine Möbius function (if it exists).
Second, there’s what I call coarse Möbius inversion. For a category to have coarse Möbius inversion means that there’s a coarse Möbius function, that is, a function
satisfying certain equations. Here is the set of objects of . Whether a category has coarse Möbius inversion does not depend on its composition and identities, and nor does the coarse Möbius function (if it exists).
All of this is described in more detail in the slides. References are on the final page.
What has this got to do with Euler characteristic? Well, it’s a theorem that if has coarse Möbius inversion (and is suitably finite) then
This result appeared in my first paper on Euler characteristic of categories. Since the right-hand side is independent of the composition and identities in , so is the left-hand side.
(I should add that it’s also easy to show directly that is independent of composition and identities, at least under the usual finiteness assumptions. None of the proofs here are in any way difficult; it’s just a matter of trying to arrange things for maximum insight.)
But also, coarse and fine Möbius inversion are related as follows. If a category has fine Möbius inversion then it also has coarse Möbius inversion. Moreover, the coarse Möbius function is determined by the fine one:
So for a category with fine Möbius inversion, we also have
The right-hand side appears to depend on the composition and identities in , but, being equal to the left-hand side, does not.
Re: Möbius Inversion for Categories
What a coincidence! Yesterday I happened to be thinking about Möbius inversion in the context of entropy! It turns out that mutual information can be generalized to interaction information, and this latter concept arises from a Möbius inversion on a poset. Since mutual information is of central importance in many applications of information theory, this might be an interesting example of Möbius inversion.
So let be a finite set of random variables. The relevant poset is the power set . Any subset of variables has a joint entropy , which is the ordinary entropy of the joint distributions of the variables in . For any , we define the conditional entropy
and then is an element of the incidence algebra of the poset . If we now consider and take the first argument to be the empty set, we recover precisely the formula for interaction information:
Now on to the question: given any commutative diagram in (in terms of a functor from some finite category ), is there a way to talk about the entropy associated to the diagram?
The mutual information should be a special case of such a construction where the diagram consists of the two morphisms and . Here, is the joint distribution of the random variables and , and I don’t distinguish between distributions of random variables and probability spaces.
Okay, as John just said there are many possible routes for continuing on the entropy business, so this is just yet another question wanting to be pursued…
Your talk and Wikipedia call the constant function the zeta function. Is there a particular reason for this term?