The Univalence Principle
Posted by Mike Shulman
(guest post by Dimitris Tsementzis, about joint work with Benedikt Ahrens, Paige North, and Mike Shulman)
The Univalence Principle is the informal statement that equivalent mathematical structures are indistinguishable. There are various ways of making this statement formally precise, and a long history of work that does so. In our recently-published (but long in the making!) book we proved to our knowledge the most general version of this principle, which applies to set-based, categorical, and higher-categorical structures defined in a non-algebraic and space-based style, as well as models of higher-order theories such as topological spaces.
This work achieves three main goals. Firstly, it greatly extends the “Structure Identity Principle” from the original HoTT book, to include any (finite) level of structure, instead of just set-based structures, thus establishing in the strongest sense yet made precise that the Univalent Foundations provide an equivalence-invariant foundation for higher-categorical mathematics. Secondly, it provides very general novel definitions of equivalence between structures and between objects in a given structure that “compile” to most known notions of equivalence in known cases, but which can also be used to suggest notions in new settings; in doing so it extends M. Makkai’s classic work on First Order-Logic with Dependent Sorts (FOLDS). Thirdly, the setting in which our result is proved (a form of Two-Level Type Theory) provides a framework in which to do metamathematics in the Univalent Foundations/HoTT, i.e. carry out the mathematical study of how mathematics is formalized in UF/HoTT. The Univalence Principle we prove is a foundational metamathematical result in that sense.
Setting the Stage
Any “Univalence Principle” type of result has the following form:
The result gains in strength if the class of is as large as possible, and the notion of equivalence between them coincides with known notions of equivalence in practice (where is a placeholder for a notion of signature in terms of which and are structures). Such a result also gains in strength if the middle notion of equivalence is as “structural” as possible, ensuring that not only properties of the structures are preserved, but also constructions built on top of them. This last feature can only really be pursued within a univalent type theory, like HoTT.
In our work, we define: diagrammatic signatures as inverse categories of finite height, -structures as Reedy-fibrant functors , and a notion of indiscernibility relating objects within structures, which yields a derived notion of equivalence between structures (essentially structure-preserving bijections up to indiscernibility). The primitive notions and are given by equivalence and equality in 2LTT, appropriately defined.
Signatures, Structures (and where they live)
Two-level type theory (2LTT)
In 2LTT, there is an external level for exo-types and other exo-gizmos (allowing strictly functorial constructions), and the usual fibrant (HoTT/UF) level where mathematical objects live. The external level is the metamathematical or syntactic level, where a strict equality exists that allows us to define syntax and symbols. Our exo-gizmos are analogous to sets of syntactical symbols used to define signatures in first-order logic.
Such syntax consists of categories with strict equality on composition, which we call exo-categories. Equalities here are strict (exo-equalities), while the internal world has homotopical paths. The 2LTT setting is convenient for type-theoretic reasoning, and allows us to neatly separate the various notions of equality at play.
Diagram signatures are inverse categories of finite height
A diagram signature is an inverse exo-category of finite height equipped with a rank functor that reflects identities. Objects of are the sorts (analogous to “sorts” in first-order logic); morphisms encode dependencies (“an arrow depends on a pair of objects,” etc.). Inverse-ness gives matching objects and allows Reedy methods to apply.
To illustrate the idea, take the example of a reflexive graph. The diagram signature for reflexive graphs would be given by the following inverse (exo-)category
where . The intuition is that we have a sort of “arrows” between any two objects, and a predicate (“identity”) that can be used to select which arrows are identity arrows with the relation ensuring that this predicate can only be “asked” of loops.
Structures are Reedy fibrant diagrams over these signatures
Given this notion of a signature, a structure in our sense is simply a (Reedy fibrant) functor . In more detail, a raw -diagram is an exo-functor For each sort , the matching object collects the compatible lower-rank data needed to specify the “boundary” for an element of . The Reedy fibrancy condition is: the canonical boundary map is a fibration (i.e., a dependent type) for each . The category of such Reedy-fibrant diagrams then forms a fibrant type whose points are the -structures.
To illustrate with the example of from above, an -structure in our sense would be given by the following data, in type-theoretic notation:
- A type
- A family dependent on
- A family for the “identity”
The trick here is to ensure the repeated in the definition of , obeying the relations of the signature. This is what the matching object machinery achieves for arbitrary signatures.
Other examples of -structures include: categories (with sorts for objects and morphisms), groupoids, -categories for any , preorders, and models of higher-order theories like topological spaces and sup-lattices. Furthermore, all of what we say applies to theories with axioms (not just structures), which we define in the book too.
Indiscernibility and local univalence
With our formal set-up complete, we address the central question: for arbitrary signatures , when are two “objects” in an -structure equivalent? Such an “object” could be a categorical (or higher-categorical) gadget itself when has height , e.g. the “objects” of are themselves categories, which are -structures for an of lower height. The key idea is: two “objects” are indiscernible if nothing in the signature can distinguish them…up to indiscernibility!
Indiscernibilities and Local Univalence
Fix a signature , an -structure , a rank (“bottom”) sort , and elements . Think of as a “set” of objects (e.g. of a category) on top of which properties and structures are defined.
To define when and are indiscernible, we package together everything that could distinguish them. The intuition is: if there is an equivalence between “everything you can say about ” and “everything you can say about ” (outside of directly referencing or themselves), then they are indiscernible. We achieve this through a formal definition of a “boundary” , which one can think of as the -structure that contains everything that could distinguish in from anything else in . Which naturally leads us to the following definitions.
Definition (Indiscernibility). We say that are indisernible, written , iff there is a levelwise equivalence that is coherent in the right way.
Definition (Univalent Structure). We say that is locally univalent at K if the canonical map is an equivalence. We then say a structure is univalent if this holds at all rank-0 sorts and (recursively) for all sorts of higher rank.
We prove our main results for univalent -structures. These are quite special in that they are “saturated” in their equivalence information: two indistinguishable gizmos are actually equal. Or, put differently: when there is not “enough” structure to distinguish two gizmos, there is always enough to prove them equivalent. Some examples to illustrate (see Part 2 of the book for many more!):
- In a reflexive graph structure, two nodes and are indiscernible iff they have the same number of arrows coming in and out, and the same number of loops that are identities.
- In a category, two objects are indiscernible iff they are isomorphic. A univalent category is precisely a category (precategory in UF) that is locally univalent at the sort of its objects.
- In an appropriately defined bicategory, an indiscernibility amounts to a pair of coherent adjoint equivalenes, as expected.
Equivalences of structures
We are almost there! On to the final missing piece: equivalences between structures themselves. Let be a morphism of -structures, defined in the expected way (as a natural transformation between the corresponding -valued functors). Then, for each sort , we have a matching square
and for each “context” an induced fiber map Write for indiscernibility at sort (as above). Then, what we really want to say now is: , are equivalent if they are “level-wise equivalent up to indiscernibility”. This idea gives us the from the beginning, and we define it as follows:
Definition (Equivalence of -structures). is an equivalence if, for every sort , every , and every , there exists a specified with i.e., is essentially split surjective up to indiscernibility on each fiber. We write for the type of equivalences.
The key innovation is the “up to indiscernibility” part; it makes our notion significantly weaker than usual notions, and hence the final result stronger. Note that we have not been able to prove our result without a splitness condition in the definition of equivalence, and to our mind this remains an open problem.
Our definition is related to Makkai’s original notion of FOLDS equivalence, but Makkai was not able to define a general notion of non-surjective equivalence directly, relying instead on spans. Our notion of indiscernibility circumvents this difficulty and allows us to consider the whole structure of equivalences between structures.
The Univalence Principle
With all our apparatus in place we prove our main result, a very general form of a univalence principle, as promised.
Theorem (“The Univalence Principle”). For a signature and univalent -structures , the canonical map is an equivalence of types. In other words, .
The proof proceeds by induction on the height of . The key insight is that level-wise equivalences between univalent structures must “reflect indiscernibilities”: if doesn’t preserve the ability to distinguish elements, then whatever structure distinguishes them in the source would transfer to structure distinguishing their images in the target, contradicting the equivalence. With the splitness assumption in the map and the assumption of univalence (of our -structure), we are able to achieve the lifting of the indiscernibility.
Our result is actually proved for an even more general class of signatures called functorial signatures, which strictly extends diagram signatures and covers “higher-order” situations (topological spaces, suplattices, DCPOs, etc.). We have stuck to the diagrammatic view in this post for intuition, but all results and definitions carry over to this more general notion.
Conclusion
In the course of this work there were quite a few questions we tried to answer, but could not. To mention a couple: Can we define a Rezk completion for arbitrary structures, providing a universal way to turn any structure into a univalent one? Can we remove the splitness condition from our definition of equivalence between structures? We list more open problems at the end of the book.
Beyond our specific result, the framework in which it is proven establishes a way to answer metamathematical questions around the univalence axiom in a precise and fruitful way. It is important to emphasize that carrying out this type of mathematical study does not require choosing one foundation over the other. In any setting that interprets the fibrant part of 2LTT, the univalence principle will hold, including in set theory.
The metamathematics of UF is the mathematical study of formalizing mathematics in terms of a hierarchy of -levels vs. a cumulative hierarchy of sets. Formalizing mathematics in this way has all sorts of unique mathematical properties. The Univalence Principle is one of them.
