Integrating Against the Euler Characteristic
Posted by Tom Leinster
The Euler characteristic of topological spaces behaves something like a measure. For example, under suitable hypotheses,
One of the main things you can do with a measure is integrate with respect to it — or ‘against’ it, as they say.
So: what happens if you try to integrate against the Euler characteristic?
I don’t completely understand the answer myself, but I’ll explain as well as I can. Along the way, we’ll see:
- how this train of thought helps us to define Euler characteristic
- how it also leads to the notion of curvature.
Simple functions on the line
Let’s begin in one dimension. Our aim is to define the integral for suitable functions .
Whatever we think Euler characteristic is, the Euler characteristic of a compact, nonempty interval should be . So, writing for the indicator function (or characteristic function) of , we should have
Since integration is supposed to be linear, this tells us how we must integrate any finite linear combination of indicator functions of compact nonempy intervals. I’ll call these simple functions on . So, for a simple function
where each is a compact nonempty interval, we should have
Do I hear you sigh? If you’ve seen this kind of thing before, you’ll recognize the standard problem: the definition isn’t obviously consistent, since can be expressed as a combination of indicator functions in multiple ways, and maybe these give multiple different values for the integral. In that case, you’ll also know that the standard solution, involving common refinements, is pretty tedious work.
Happily, we can avoid it. Here’s how. For a simple function , put
where means . This quantity is well-defined, linear in , and takes value when is the indicator function of a compact nonempty interval. So, whenever . We therefore put
and the consistency problem evaporates.
Let’s have some examples. What, for instance, is the integral against the Euler characteristic of this function?
The solid and empty circles indicate that . So
and
Since , this tells us that we’re treating the Euler characteristic of the open interval as . That might strike you as wrong if you’re used to Euler characteristic being invariant under homotopy equivalence. But as James Propp pointed out long ago, there’s a tension between the requirement that Euler characteristic is homotopy-invariant and the requirement that it behaves like a finitely additive measure. You can’t have both at once. (See also John Baez’s excellent talk on the mysteries of counting.) Here we’re not worrying about homotopy invariance; we’re using what Propp would call ‘Euler measure’.
What about this function?
It’s not hard to see that , whatever the unlabelled values on the axes might happen to be. I chose the letter to stand for ‘jump’: is the total vertical jump occurring at jump discontinuities from the left.
One more example: what is the integral against the Euler characteristic of the following function?
Here
so
(regardless of the values of , , and ). Alternatively, you can calculate via the formula for , giving again.
More general functions on the line
Classical measure theory also involves things called ‘simple functions’ (with a different but analogous meaning). There, defining integration for simple functions is just a prelude to defining integration for a larger class of functions. Can we do something similar here, extending our integral to a larger class of functions?
We can. Indeed, the formula
for immediately makes sense for more than just the simple functions.
But before we charge ahead and generalize, let’s correct the ugly asymmetry you see here. In everything so far, we could equally well have used the formula
It makes no difference: this is still equal to for simple functions . Of course, it’s just as asymmetric. However, taking the average of the two formulas, we also have
This is now symmetric, and therefore more likely to give us a useful definition for more general functions .
(Maybe you can see a hint of how curvature is going to enter the story: this looks like the expression for a second derivative, and second derivatives have something to do with curvature.)
So: let be a function such that the limits and exist for all , and are equal to for all but finitely many . In other words, is continuous except for a finite number of jump discontinuities. The integral against the Euler characteristic of such a function is defined by the formula above:
Be warned: this has some properties that you might not expect of an integral. For instance, the integral of any continuous function is , the integral of a function that is everywhere strictly positive can be strictly negative, and changing the value of a function at a single point can change the value of the integral. On the other hand, this integral has some interesting properties too, as we’ll see when we get to higher dimensions.
Simple functions in higher dimensions
Let’s now consider functions on , for . The role of intervals will be played by convex sets. For brevity, I’ll use ‘convex’ to mean ‘compact, nonempty and convex’.
A function is simple if it can be expressed as a finite linear combination of indicator functions of convex sets. Again, we want to ‘define’
whenever for some convex sets , and again we might groan at the prospect of having to do those tedious consistency checks.
But once more, the jump functional comes to the rescue. I’ll explain in the case ; the strategy for higher dimensions should be clear. Let be a simple function. For each , the function
is simple, and so we can define a function by . This function , too, is simple, so we get a real number . We define to be this number: .
Thus, we have defined for every simple function on . It is linear in , with whenever is convex. So just as in the one-dimensional case, when . This solves the consistency problem, and .
I learned this from Chapter 5 of Klain and Rota’s Introduction to Geometric Probability (source of so many wonderful things). It is remarkably little effort, and is even based on a very standard technique: reducing a multivariable integral to a sequence of single-variable integrals. But it has an immediate nontrivial consequence: the definition of Euler characteristic for a large class of subsets of .
Indeed, call a subset of polyconvex if it can be expressed as a finite union of convex sets. For example, any picture on a black and white television is polyconvex (assuming that each pixel is convex). And quite simply, we define
You can prove, as Klain and Rota do, that this coincides with the usual definition.
More general functions in higher dimensions, and curvature
The rough idea now is that given a function whose discontinuities are no worse than those of a simple function, we should be able to define by repeating verbatim the definition for simple functions.
In the one-dimensional case, we first had to deal with the pesky problem that the formula
isn’t symmetric, so probably wouldn’t generalize well. To fix that, we considered the formula obtained by reversing the orientation of the line, and then we averaged over the two orientations to get something symmetric.
In higher dimensions, the asymmetry problem can no longer be brushed aside with this algebraic flick of the wrist. In fact, some interesting geometry comes in here. This asymmetry issue, which looked like a nuisance distracting us from the main business, turns out to be exactly the reason why integration against the Euler characteristic is closely related to curvature.
Again I’ll stick to , leaving higher dimensions to your imagination.
In defining for simple functions , we used the standard coordinate system on . When is simple, the choice of basis does not affect the value of (which is always equal to , if ). But for more general functions , it certainly does make a difference. What we should do is consider all ordered orthonormal bases of , calculate with respect to each basis, and define to be the average.
(You can see that this generalizes what we did for : there we took the average of two things, and there are two orthonormal bases of .)
I don’t want to make this post any longer by explaining exactly what, for instance, ‘average’ means. Nor will I say much about which functions this will be a reasonable definition for. Instead, I’ll focus on the geometric interpretation of .
Let’s think about a function of the form , where is continuous and is convex. Thus, is supported on and continuous everywhere except perhaps on the boundary of , where it might jump in value as it crosses the boundary. I’ve just been reading something that talks about functions `suffering a jump discontinuity’. The suffering of is limited to .
What is ? How can we understand it?
Well, is the average over all orthonormal bases of the quantity “ with respect to that basis”. So, take an orthonormal basis — a coordinate system — and let’s consider .
Recall that the definition of was slicewise. For each , we take the function and put . In the picture, the value of shown is in the vertical shadow of , so is equal to .
Now since is continuous, is too, except that has a jump discontinuity at each end of the shadow. So, . Finally, by definition, , and so .
So in the end, is something really trivial:
is the value of at the leftmost point of
where ‘leftmost’ refers to the basis concerned. (I’m assuming for simplicity that the boundary of is smooth and contains no line segments. This isn’t crucial.)
But this isn’t the same as . To get that, we have to average over all orthonormal bases of . That is, we slowly rotate our coordinate axes through , at each moment recording the value of at the ‘leftmost’ point of (with respect to the current axes), then taking the mean. As we rotate, that leftmost point works itself around the whole boundary of , never backtracking. And here’s the important thing:
It spends more time at some boundary points than others.
To see why, consider a convex set like this:
For almost all choices of axes, the leftmost point of will be in one of the red parts of the boundary. Only rarely will it be elsewhere.
So as we rotate the axes, the leftmost point with respect to the axes moves quickly over parts of the boundary with low curvature, and lingers where the curvature is high. We’d therefore imagine that would be the integral of over the boundary of with respect to some kind of curvature measure on .
This turns out to be true. In fact,
where is the angle that the tangent makes with some fixed, arbitrarily chosen reference line:
This is good, but there’s another way to put it too. When we integrate along a curve, we usually do it with respect to the arclength measure, typically written as . And the rate of change of the angle per unit arclength is nothing but the classical curvature . That is, . Putting this together with the last equation, we get
So, the integral against the Euler characteristic of a function of this type is naturally expressed in terms of curvature.
You can go further down this track. If you know about intrinsic volumes, you can ask and answer the question: what does it mean to integrate against an intrinsic volume? You’ll see that each intrinsic volume corresponds to a different curvature measure; in dimensions, we get different curvature measures on the boundary of a convex set.
What I’d like to know is how much of this story is well-known. Curvature measures are extremely well-studied, and connections between curvature and Euler characteristic go back to the Gauss–Bonnet theorem at least. On the other hand, I’ve never heard anyone talking explicitly about integration against the Euler characteristic. Does anyone know where this stuff is written up?
Re: Integrating Against the Euler Characteristic
Well, what it makes me think of isn’t terribly new (I don’t suppose), except that the later formulas look like inventing a change-of-variables from the geometric measure theorem writing the -homogeneous invariant volume of polyconvex (or very smooth, or …) object , as an integral over an affine Grassmannian. It’s a different change-of-variables that leads to those formulas involving Second Fundamental Forms and all that. Intriguing! I will have to re-read soon.