Benoît Mandelbrot
Posted by Tom Leinster
Benoît Mandelbrot died last week, aged 85. To mark the occasion, I will say something about the set that bears his name.
Tim Gowers, in the introduction to his wonderful little book Mathematics: A Very Short Introduction, writes:
I do presuppose some interest on the part of the reader rather than trying to drum it up myself. For this reason I have done without anecdotes, cartoons, exclamation marks, jokey chapter titles, or pictures of the Mandelbrot set. I have also avoided topics such as chaos theory and Gödel’s theorem, which have a hold on the public imagination out of proportion to their impact on current mathematical research
Mandelbrot was enormously successful in popularizing aspects of his work, as this quotation indicates. Unfortunately, the popular appeal of the Mandelbrot set, and fractals more generally, has sometimes made life hard for mathematicians who do serious work on such things. I don’t read Gowers’s statement as a criticism of those people or their work, but I think he’s expressing, at least, the feeling that the Mandelbrot set has become a kind of cliché.
Here I want to do a little bit to alter perceptions of the Mandelbrot set, by explaining how it fits into a wider mathematical context.
Preamble and some history
Most of this story is about complex dynamics: what happens when you take a rational function over and apply it repeatedly. By the end of the 1920s, the theory had been developed to a very sophisticated level, largely by the French mathematicians Pierre Fatou and Gaston Julia. They weren’t collaborators. In fact, according to John Milnor’s book Dynamics in One Complex Variable (early version here),
The most fundamental and incisive contributions were those of Fatou himself. However Julia was a determined competitor, and tended to get more credit because of his status as a wounded war hero.
Anyway, complex dynamics was mostly dormant for the mid-20th century, until the computing power became available to depict the sets that Fatou and Julia had been studying. Then there was a reawakening, and in the late 1970s, various people started generating computer graphics of the kind of space that we now know as the Mandelbrot set. The details of who did what are in an appendix of Milnor’s book, but here I’ll just list the people he mentions: Brooks, Matelski, Hubbard, Douady, and, of course, Mandelbrot.
The Mandelbrot set is connected, but contains some ‘islands’ only attached to the main body by delicate filaments. The first computer images were too crude to show those filaments. So when Mandelbrot wrote his first paper on the subject, showing pictures of , he conjectured that it had many connected-components. Anyone who’s experienced the frustration of having their carefully Latexed article messed up by a journal will sympathize with what happened next:
The editors of the journal thought that his islands were specks of dirt, and carefully removed them from the pictures.
(Milnor, Appendix G).
There are at least two different definitions of the Mandelbrot set. First I’ll give you the one that you’ll see in popular books. Then I’ll spend most of the rest of the post building up to the one you’ll see in unpopular books (I mean, textbooks on complex dynamics). That, I hope, will illustrate how the Mandelbrot set sits in a wider context.
The simple definition is this. For , let be the map The Mandelbrot set is the set of for which the sequence is bounded. Here is the -fold composite .
It’s an easy exercise to show that the sequence is bounded if and only if it stays inside the disk . (It follows that is a subset of that disk, and that is closed.) It’s also easy to show that if the sequence ever escapes that disk then as .
So, for each there are two possibilities: either the sequence stays inside that bounded disk, or it converges to . This is the dichotomy defined by the Mandelbrot set.
Now I’ll begin the story of the not-so-simple definition. It starts with Julia sets.
Julia sets
For every Riemann surface and holomorphic self-map , there is an associated ‘Julia set’ . Informally, it is the part of on which is unstable under iteration.
The only case I’ll discuss here is where is the Riemann sphere . A holomorphic self-map of is a rational function with complex coefficients.
I said that the Julia set was the part of on which is unstable under iteration. By ‘stable’ I mean something like this: if and are close together then and stay close together for all .
Formally, let be a set of functions from one metric space to another, . Let . We say that is equicontinuous at if
for all there exists such that
for all and ,
.
(Here is the open ball of radius about .) In other words, a family of functions is equicontinuous if not only each of its members is continuous, but for any , the same will do for the whole family.
The Riemann sphere can be metrized in at least a couple of ways, e.g. the length of the chord between two points or the length of a geodesic between them. It doesn’t make any difference in what follows (or elsewhere in complex dynamics) which you choose, because they’re Lipschitz equivalent.
Let be a complex rational function. The Fatou set of is the set of such that the family is equicontinuous on some neighbourhood of . In other words, it is the largest open subset of on which this family is equicontinuous.
I’ll say the same thing again in a different way. Let and . Since (our rational function) is continuous, there is some neighbourhood of such that for all , Since is also continuous, there is a smaller neighbourhood of such that for all , And so on. So we have an ever-decreasing, infinite sequence of neighbourhoods of . The question is whether this sequence is doomed to ‘shrink down to nothing’, meaning that has empty interior. If is in the Fatou set then it is not so doomed—in fact, you could take , for a judicious choice of .
The Julia set of is the complement of the Fatou set: If is in the Fatou set then, for any , there is some neighbourhood of such that the iterated images never have diameter bigger than . If is in the Julia set, there is no such guarantee. In fact, there are theorems saying that for , no how matter how small a neighbourhood of you pick, the iterated images become arbitrarily large. (I won’t make that precise unless someone asks.)
Here are some properties of the Julia set. It’s closed, because the Fatou set is open. It’s nonempty, as long as has degree . It always has empty interior, except in some rare cases where it is the whole Riemann sphere.
Example There are few really easy examples, because the Julia set is almost always a fractal. When has degree or , things are a bit too easy. The simplest nontrivial example is .
If is in the open disk then as . This seems like uncomplicated, uniform behaviour, and indeed a short calculation shows that the family is equicontinuous on this disk. The same goes for the open disk centred at . So, the Fatou set is the whole of the Riemann sphere except possibly the unit circle; that is, is a subset of the unit circle.
Now, if is on the unit circle then there are points arbitrarily close to that converge to under iterated application of , and other points arbitrarily close that converge to . So the family is certainly not equicontinuous at . Hence the Julia set is precisely the unit circle.
I like to think of Julia sets as analogous to kernels in linear algebra. I can’t offer any convincing justification for that. All I can say is: the kernel of a linear endomorphism is a subset of canonically associated to , and the Julia set of a holomorphic endomorphism is a subset of canonically associated to . Both are closed, and both are usually rather insubstantial, the kernel having smaller dimension than and the Julia set having empty interior.
I’ll develop this unconvincing analogy later. I’m not even convinced myself, and actually I suspect that the eventual kernel would be a better analogue, though for what I’m going to say the difference doesn’t matter.
This definition of the Julia set is one of many, all equivalent. I won’t give any of the other definitions, but I will give a characterization that might appeal to Café readers:
Theorem Let be a rational function of degree . Then is the smallest closed, completely invariant subset of containing at least three points.
A subset is completely invariant if (which, since is surjective, implies that ). The theorem states that is a subset of any closed, completely invariant set containing at least three points, and is such a set itself.
That’s tantalizingly close to the kind of statement you might phrase categorically—maybe something about initial algebras. But there’s the pesky matter of those three points. That condition has to be there: consider and the set .
(The relevance of the three-point condition is that the triply-punctured sphere is a hyperbolic Riemann surface. Milnor’s book takes great advantage of this fact to prove results in complex dynamics. Compare the Picard theorems.)
Now let’s stop peering at the anatomy of Julia sets, and step back.
Parameter space
In the abstract, we have defined a map You can ask how the Julia set of varies as moves around in the space of rational functions.
This is a very vague, general question, so let’s try to be more specific. You could ask, for instance, for a description of the subspace of the domain consisting of those for which satisfies some property—say, that it’s the whole Riemann sphere. (It’s hard to construct even one such , but there are in fact many.)
Questions of the form ‘which rational functions satisfy such-and-such?’ can still be difficult, because the space of rational functions is large. So you could cut down further, and stick to the polynomials, or just the polynomials of fixed degree. If even that’s too hard then you could stick to degree 2. (Almost everything in complex dynamics is trivial in degrees 0 and 1, but degree is generally much harder.) So we’d be studying the map A quadratic can, of course, be represented or ‘parametrized’ as That’s not the only parametrization of quadratics: you might prefer , or, perversely, , etc. But once you’ve chosen a parametrization, in this case with three variables, you’re essentially studying a map In this context, gets called parameter space.
Now this three-variable parametrization of quadratics turns out to be very inefficient. Just one parameter will do: by a change of coordinates that preserves everything we care about, we may assume that and . (I’ll gloss over what exactly that means.) So now our parameter space is and, writing as we did much earlier, the general question is how varies with .
For example, we could ask for which values of the set is the whole Riemann sphere. (Answer: none.) But what turns out to be really interesting is to ask for which values of the Julia set is connected. In fact:
Theorem The set is the Mandelbrot set .
This, then, is the sophisticated definition of the Mandelbrot set.
Example From the original, simple definition, it’s clear that . So according to this theorem, the Julia set of must be connected. But is the function , whose Julia set we worked out earlier: it’s the unit circle. This is indeed connected, so we’ve confirmed the theorem in one little case.
Why is this such an important definition? I don’t really know. I understand why Julia sets are fundamental in complex dynamics, but I don’t understand much about why connectedness of Julia sets is so important.
Maybe I can say one useful thing, though. You might imagine drawing a coloured picture of the plane with one colour for values of for which has one connected-component (that is, the Mandelbrot set), another colour for two connected-components, another for three, and so on. But that would be pointless, because there’s a theorem:
a Julia set is either connected or has connected-components.
(Those components needn’t be points, or even contractible.) So for Julia sets, the difference between connected and not connected is really dramatic. Suppose you pick a path in parameter space , starting inside the Mandelbrot set and ending outside it. As you travel along the path, the corresponding Julia set will start off connected, then as you get close the boundary of it will look more and more distressed (though still connected), until the instant you step outside it will disintegrate into uncountably many pieces.
Nils Baas once said to me that he thinks of the Mandelbrot set as a ‘second order fractal’, because it’s a fractal parametrizing the properties of other fractals (Julia sets, which would then be first order fractals).
I like to think of the Mandelbrot set as the spectrum of the map . This is a continuation of my unconvincing analogy, and doesn’t get any more convincing now, but again I’ll give my meagre reasons.
Let be a vector space (over , say) and let be a linear endomorphism of . There are three ingredients needed in order to define the spectrum ( set of eigenvalues) of :
- gives rise to the family of endomorphisms of
- any endomorphism of has a kernel, a vector subspace of
- any vector subspace of can be classified as nontrivial or trivial.
The spectrum is the set of parameters for which the kernel of is nontrivial.
Now in the world of complex dynamics:
- the rational function ‘gives rise to’ the family of rational functions on
- any rational function on has a Julia set, a subset of
- any subset of can be classified as connected or disconnected.
Recall that Julia sets were supposed to be analogous to kernels. And now, the Mandelbrot set is the set of parameters for which the Julia set of is connected. This is why I want to think of the Mandelbrot set as a kind of spectrum of the function .
Re: Benoît Mandelbrot
Much sadness at my alma mater, Caltech, where he’d received a Distinguished Alumnus Award.
“Benoit B. Mandelbrot M.S. ‘48 (Aeronautics) Engineer ‘49 (Aeronautics)
Awarded 1988
Benoit B. Mandelbrot, IBM Fellow at the IBM Thomas J. Watson Research Center, in the acknowledged creator of fractal geometry, a field of mathematics dealing with the irregular shapes of natural objects. After studying at Caltech, Mandelbrot completed work on his PhD in mathematics at the University of Paris in 1952. Dr. Mandelbrot’s first positions were with the French Research Council, the School of Mathematics at the Institute for Advanced Study, and the University of Geneva. Immediately before joining IBM, he was a junior professor of applied mathematics at the University of Lille and of mathematical analysis at Ecole Polytechnique. A Fellow of the American Academy of Sciences and Foreign Associate of the US National Academy of Sciences, Mandelbrot was awarded the 1985 F. Barnard Medal by the US National Academy of Sciences and Columbia University, and the 1986 Franklin medal for Signal and Eminent Service in Science by the Franklin Institute of Philadelphia. He has been a Fellow of the Guggenheim Foundation, a Trumbull Lecturer at Yale, Samuel Wilks Lecturer at Princeton, Abraham Wald Lecturer at Columbia, Goodwin-Richards Lecturer at the University of Pennsylvania, and National Lecturer of Sigma Xi, the Scientific Research Society. Mandlebrot is the author of numerous articles and books, the best known being Les Objets Fractals, 1975 and 1984 (translated into Italian, Spanish, and Hungarian) and The Fractal Geometry of Nature, 1982 (translated into Japanese and German).”