Algebraic Closure
Posted by Tom Leinster
This semester I’ve been teaching an undergraduate course on Galois theory. It was all online, which meant a lot of work, but it was also a lot of fun: the students were great, and I got to know them individually better than I usually would.
For a category theorist, Galois theory is a constant provocation: very little is canonical or functorial, or at least, not in the obvious sense (for reasons closely related to the nontriviality of the Galois group). One important not-obviously-functorial construction is algebraic closure. We didn’t get to it in the course, but I spent a while absorbed in an expository note on it by Keith Conrad.
Proving that every field has an algebraic closure is not entirely trivial, but the proof in Conrad’s note seems easier and more obvious than the argument you’ll find in many algebra books. As he says, it’s a variant on a proof by Zorn, which he attributes to “B. Conrad” (presumably his brother Brian). It should be more widely known, and now I find myself asking: why would you prove it any other way?
What follows is a somewhat categorical take on the Conrad–Zorn proof.
The reason why many constructions in field theory fail to be functorial is that they involve arbitrary choices, often of roots or irreducible factors of polynomials. For instance, to construct the splitting field of a polynomial, we have to make a finite number of arbitrary choices.
The algebraic closure of a field is something like the splitting field of all polynomials, so you’d expect it to require an infinite number of arbitrary choices (at least if the field is infinite). It’s no surprise, then, that it cannot be done without some form of the axiom of choice. Actually, it doesn’t require the full-strength axiom, but it does need a small dose: there are models of ZF in which some fields have no algebraic closure.
That explains why Zorn’s name came up, and it also suggests why no proof of the existence of algebraic closures can be entirely trivial.
An algebraic closure of a field is, by definition, an extension of that is algebraically closed (every polynomial over splits in ) and is minimal as such. So to construct an algebraic closure of , it looks as if we have to manufacture an extension in which it’s possible to split not only every polynomial over , but also every polynomial over itself.
Fortunately, there’s a lemma that saves us from that: any algebraic extension of in which every polynomial over splits is, in fact, an algebraic closure of . This is proved by a standard field-theoretic argument that I won’t dwell on here, except to say that the main ingredient is the “transitivity of algebraicity”: given extensions and , if is algebraic over and is algebraic over then is algebraic over .
So: an algebraic closure of is an algebraic extension of in which every polynomial over splits. That’s the formulation I’ll use for the rest of this post.
Colin McLarty’s paper The rising sea: Grothendieck on simplicity and generality begins with a joke I find very resonant:
In 1949, André Weil published striking conjectures linking number theory to topology and a striking strategy for a proof [Weil, 1949]. Around 1953, Jean-Pierre Serre took on the project and soon recruited Alexander Grothendieck. Serre created a series of concise elegant tools which Grothendieck and coworkers simplified into thousands of pages of category theory.
How do we construct an algebraic closure of a given field? If you just want a concise presentation, I can’t do better than point you to Conrad’s note. But here I want to come at it a bit differently.
In my mind, the key to the Conrad–Zorn approach is to work in the world of rings for as long as possible and only descend to fields at the last minute. Behind this is the fact that the theory of rings is algebraic and the theory of fields is not, which is ultimately why constructions with fields often fail to be functorial.
(Here I mean “algebraic theory” in the technical sense. One could argue that little is more algebraic than the theory of fields, just as one could argue that “algebraic group” is a tautology…)
So we’re going to use rings, by which I mean commutative rings with . A field extension of our field is just a homomorphism of fields , but we’re going to be interested in “ring extensions” of , by which I mean homomorphisms , where is a ring. Such a thing is just a (commutative) -algebra, which is why you don’t hear anyone talking about “ring extensions”.
Given a polynomial over , we could consider the splitting field of over . But instead, let’s consider what I’ll call its splitting ring . Informally, is the -algebra you get by freely adjoining to the right number of roots, say
and imposing relations to say that they are the roots of . Write
where is the leading coefficient of . The relations should make and be equal as polynomials over .
Formally, then, we define to be the -algebra
where identifies the th coefficient of with the th coefficient of for each . So is the free -algebra in which splits.
For example, take . The splitting field of over is just itself. But the splitting ring is another story. It’s the polynomial ring quotiented out by the relations that identify the coefficients of with those of
Comparing constant terms gives , and comparing coefficients of degree gives . So
or equivalently,
This is bigger than the splitting field! A field can’t have nontrivial idempotents: forces . But a ring can, and does.
More generally, we can form the splitting ring of not just one polynomial but any set of them (providing they’re nonzero). For , put
where identifies the coefficients of with the corresponding coefficients of for each . Equivalently, is the coproduct in the category of -algebras. It’s the free -algebra in which all the polynomials in split.
The Conrad proof goes like this: put , then (i) show that is nontrivial, (ii) quotient out by a maximal ideal to get a field , and (iii) show that is an algebraic closure of . Step (i) is where the substance of the argument lies. Step (ii) calls on the fact that every nontrivial ring has a maximal ideal (a routine application of Zorn’s lemma). And step (iii) is pretty much immediate by the lemma I started with, since by construction every polynomial over splits in , hence in .
But you can come at it a bit differently, as follows. First let me list some apparent trivialities.
A ring is trivial if and only if . (Triviality is an equational property.)
The forgetful functor from rings to sets preserves filtered colimits. This is equivalent to the statement that the theory of rings is finitary: the ring operations (such as addition) take only finitely many inputs.
A filtered colimit of nontrivial rings is nontrivial. This isn’t as obvious as it might seem: for instance, the analogous statement for abelian groups is false. It follows from the previous two bullet points and the way filtered colimits work in .
A ring is nontrivial if and only if there is a homomorphism from to some field, if and only if there is a surjective homomorphism from to some field. Indeed, the first bullet point together with the nontriviality of fields shows that if admits a homomorphism to a field then is nontrivial. On the other hand, if is nontrivial then by Zorn’s lemma, we can find a maximal ideal of , and the natural homomorphism is then a surjection to a field.
Given a homomorphism of -algebras and a polynomial over , if splits in then splits in .
Now, here’s a proof that every field has an algebraic closure.
Put , where the colimit is over finite subsets of . Here I’m implicitly using the fact that is functorial in with respect to inclusion. This is a filtered colimit.
For each , the polynomial has a splitting field . Then every polynomial in splits in , so by the universal property of , there is a homomorphism . It follows that is nontrivial.
So the -algebra is a filtered colimit of nontrivial rings, and is therefore nontrivial.
Hence there is a surjective homomorphism from to some field . Composing the natural maps makes into an extension of .
For each nonzero , we have homomorphisms of -algebras , and splits in , so splits in . Hence the field is an extension of in which every polynomial over splits.
Each of the -algebras is generated by elements algebraic over , so the same is true of . Hence is algebraic over , and is, therefore, an algebraic closure of .
In any construction of algebraic closures, the challenge is to find some extension of in which every polynomial over splits. It may be that is wastefully big, but you can cut it down to size by taking the subfield of consisting of the elements algebraic over , and is then an algebraic closure of . This provides an alternative to the last step above. But I like it less, because the fact is that the we constructed is already the algebraic closure.
Whether one uses the approach in Conrad’s note or the slight variant I’ve just given, I prefer this to the Artin (?) proof that appears in many books and is also summarized on page 2 of Conrad’s note. It seems much more direct and appealing.
There’s another proof I also like: the existence of algebraic closures follows very quickly from the compactness theorem of model theory. Like the proof above, this argument relies on the knowledge that for any finite collection of polynomials, there’s some field in which they all split. In other words, it uses the existence of splitting fields.
And there’s one more thing I can’t resist saying. Although algebraic closure isn’t functorial in the obvious sense, I believe it’s functorial in a non-obvious sense — what I’d think of as a “Hakim-type” sense.
Let me explain what I mean, taking a bit of a run-up.
The category of rings has a subcategory consisting of the local rings. The inclusion has no left adjoint: there’s no way to take the free local ring on a ring. But there is if you allow the ambient topos to vary! In other words, consider all pairs where is a topos and is a ring in . Given such a pair, one can look for the universal map from to some pair where is local.
Monique Hakim proved, among other things, that if we start with a ring in the topos , then this construction gives the topos of sheaves on the prime spectrum of , together with the structure sheaf , which is a sheaf of local rings on , or equivalently a local ring in . This wonderful result appeared in her 1972 book Topos Annelés et Schémas Relatifs.
Now if I’m not mistaken, there’s a similar description of algebraic closure. Consider all pairs where is a topos and is a field in . Given such a pair, one can look for the universal map from it to a pair where is algebraically closed. And I believe that if we start with a field in , the resulting pair is the topos of sets equipped with a continuous action by the absolute Galois group of , together with the algebraic closure with its natural -action. There are lots of details I’ve skipped here, but perhaps someone can tell me whether that’s more or less right and, if so, where this is all written up.
Re: Algebraic Closure
Back in the day, we’d sometimes hear around these parts of the groupoid of algebraic closures of a field (and again). Is that not a way to get around making arbitrary choices?