Skip to the Main Content

Note:These pages make extensive use of the latest XHTML and CSS Standards. They ought to look great in any standards-compliant modern browser. Unfortunately, they will probably look horrible in older browsers, like Netscape 4.x and IE 4.x. Moreover, many posts use MathML, which is, currently only supported in Mozilla. My best suggestion (and you will thank me when surfing an ever-increasing number of sites on the web which have been crafted to use the new standards) is to upgrade to the latest version of your browser. If that's not possible, consider moving to the Standards-compliant and open-source Mozilla browser.

February 21, 2008

Logicians Needed Now

Posted by John Baez

Mike Stay and I are writing a paper for a book Bob Coecke is editing: New Structures for Physics. The deadline is coming up soon, and we need your help!

We’d really love comments on the ‘logic’ section, because neither of us are professional logicians. I haven’t included the ‘computation’ section in this draft, since it’s embarrassingly far from finished… but Mike knows computation.

You’ll probably notice if you take a look at this draft: we’re not trying to give a conventional treatment of proof theory… we’re just trying to sketch how a few ideas in that subject are connected to category theory. So, we expect that what we say will sound weird and sort of superficial to expert logicians. But, we don’t want to say anything too embarrassing!

In particular, we only discuss an incredibly impoverished form of the propositional calculus, without ‘or’ or ‘false’. This is because we’re only trying to discuss what can be done in a symmetric monoidal closed category… the paper is long enough just doing that! We can’t get into the richer categorical structures beloved by logicians.

Worse, we never mention sequents like this:

A 1 ,,A nB

We only mention these:

AB

As Jon Cohen pointed out, this means our inference rules lack the ‘subformula property’: namely that every formula about the line is a subformula of some formula below the line.

Alas, I don’t know what to do about this problem without significantly modifying our setup. Josh mentioned multicategories as one approach, but I have no idea how this would work for the ‘quantum logic’ systems our setup is designed to include. So, I guess I’ll just mention the problem.

In a way it doesn’t really matter, since we’re just trying to set up tenuous bridges between various subjects… just enough to get more people talking to each other. But, it’s good to know about the hot water we’re getting into.

Posted at February 21, 2008 3:03 AM UTC

TrackBack URL for this Entry:   http://golem.ph.utexas.edu/cgi-bin/MT-3.0/dxy-tb.fcgi/1607

70 Comments & 1 Trackback

Re: Logicians Needed Now

You probably know that chemistry (p. 35 of your paper) and logic have been related before. I recall Charles Peirce writing on this. More recently we have the chemical abstract machine.

Hmm, the connection goes back at least as far as Hegel.

Posted by: David Corfield on February 21, 2008 11:34 AM | Permalink | Reply to this

Re: Logicians Needed Now

David Corfield:

You probably know that chemistry (p. 35 of your paper) and logic have been related before.

I’d forgotten this, though now I seem to recall you mentioning it. Thanks — that’ll make a nice touch of erudition!

(I wonder if other people explaining linear logic have mentioned the analogy to chemical reactions: it seems pretty obvious, but I haven’t seen it in that context before.)

Now I’m reminded of how Kummer compared ‘ideal numbers’ to the element fluorine, whose existence was only known indirectly at the time, because it hadn’t been isolated: it existed only in combination with other elements.

Hey!

There’s a talk by Thierry Coquand that mentions Kummer, prime ideals, fluorine and cut elimination, all together! They’re a lot more deeply related than I thought!

He says:

Prime ideals were introduced by Kummer by analogy with chemistry. “These ideal complex numbers are comparable to hypothetical radicals that do not exist by themselves, but only in their combinations.” Kummer gave an example of an element (fluorine) that, at the time, existed only hypothetically comparable to a prime ideal (this element was isolated later).

But then he goes much deeper, showing how the Nullstellensatz and related results in algebra are connected to proof theory and ultimately (though I don’t understand this yet) Gentzen’s cut elimination theorem!

Interesting bunch of ideas!

Posted by: John Baez on February 22, 2008 4:51 AM | Permalink | Reply to this

Re: Logicians Needed Now

I need to reply to Todd and Robin’s important comments on cut elimination and symmetric monoidal closed categories, but it’s late, and I’m tired, and for now I just feel like saying a bit more about Kummer’s thoughts on prime ideals and fluorine, and Coquand’s talk… in part because what I’ve said so far was so sketchy it must seem almost deranged to someone who hasn’t thought about this stuff before.

Suppose we have a set of ‘kinds of atoms’: for example, the usual kinds of atoms you see on the periodic table. These freely generate a commutative monoid M. Every molecule or collection of molecules determines an element of M.

But, it’s possible that at some stage in the development of chemistry, only certain guys in M will have been observed in nature! For example, until quite late in the history of chemistry, nobody ever saw fluorine except in molecules containing other elements.

So, there is a subset IM consisting of ‘observed collections of atoms’. And, it’s quite reasonable to assume this is a submonoid.

So, there’s an interesting math problem: given a monoid I that is a submonoid of some free commutative monoid, find some free commutative monoid M containing I.

The solution to that problem is far from unique. But in fact, we want to find some M that’s the ‘smallest’ free commutative monoid containing I. For example, in our chemistry example, we can always enlarge M by positing extra never-observed atoms… but that would be useless! On the other hand, real-world chemists sensibly postulated the existence of fluorine before ever seeing it, because it was one of the generators of the ‘smallest’ free commutative monoid M containing the observed I.

This business of finding the ‘smallest’ free commutative monoid M for a given I is closely related to the study of monomial ideals in polynomial algebras — that is, ideals generated by polynomials. After all, free commutative monoids map to polynomial algebras under the ‘free vector space on a set’ functor. In the paper I just linked to, you’ll see some charismatic ways of drawing monomial ideals.

(However, monomial ideals are in 1-1 correspondence, not with general submonoids of a free commutative monoid, but to submonoids I with IMI. This extra property simplifies things immensely, but is not typical in the examples from chemistry. For example, once upon a time, chemists could get ahold of hydrogen fluoride, but not hydrogen and two fluorine atoms.)

Anyway, Coquand takes these ideas and goes in a different direction, more related to logic… but now it’s bed time.

Posted by: John Baez on February 22, 2008 7:44 AM | Permalink | Reply to this

Re: Logicians Needed Now


There’s a talk by Thierry Coquand that mentions Kummer, prime ideals, fluorine and cut elimination, all together! They’re a lot more deeply related than I thought!

Thierry taught me these ideas a couple of years ago. They may be seen as a partial realization of Hilbert’s program to (mechanically) eliminate ideal object’s from mathematical proofs. Hilbert tried to use such an elimination to justify the use of these ideal objects and of non-constructive proofs. These ideas are developing in all kinds of directions and I am still trying to see the full scope of them (and combine them with my own). Thierry used cut elimination to obtain proofs of various Stone and Gelfand dualities. We recently used this to obtain Gelfand duality for C*-algebras valid in any topos. These ideas may be seen as concrete versions of the fundemental work by Banachewski and Mulvey who again used logical methods (Barr’s theorem) to generalize
Gelfand’s theorem
to Grothendieck toposes. We discussed these ideas before.
Hilbert’s program is also crucial in our work on topos theory in quantum mechanics.

Bas

Posted by: Bas Spitters on February 22, 2008 10:26 AM | Permalink | Reply to this

Re: Logicians Needed Now

Bas wrote:

Thierry taught me these ideas a couple of years ago.

Cool! Could you say what some of the basic ideas are, in very simple terms? I think there are some very simple big ideas lurking around here, which could be explained with almost no jargon… but I don’t quite see what they are.

Coquand mentions the Nullstellensatz. Behind the Nullstellensatz there seems to be some idea like “If you can’t derive a contradiction of a specific form from assuming that a bunch of polynomials has a simultaneous solution, then there must be a solution”. Which is like saying “If you can’t derive a contradiction from some axioms, these axioms must have a model.”

In general, in the absence of any other machinery, if someone handed me some axioms that led to no contradictions, and told me to find a model of them, I’d try building this model syntactically… since what else could I do?

For example, this is how people invented the complex numbers. They posited 1 , and noticed you could play around with it and never get in trouble… so, they were able to ‘create’ the complex numbers in a purely syntactical way, as expressions of form a+b1 .

The Nullstellensatz seems deeper, because you’re not just making up new numbers to solve your system of polynomial equations: you’re finding solutions using the existing number system. I guess the Fundamental Theorem of Algebra is the basic result of this type.

But what does cut elimination have to do with any of this? Coquand writes: “Introduction to some recent developments in constructive algebra and abstract functional analysis. This approach can be seen as an application of some basic results in proof theory: the main one being the completeness of cut-free provability. It can also be seen as a partial realisation of Hilbert’s program and uses the idea of ‘replacing’ an infinite object by a syntactical one: a logical theory which describes this object. It has close connections with formal topology: cut-elimination can be expressed as forcing/topological model over a formal space.”

But what does this really mean? I wish I understood it.

Could there actually be some significance to what at first seems like a mere pun: the relation of ‘ideals’ to ‘ideal objects’? Coquand’s talk seems to relate the two.

(I guess I’ve already seen that a ‘point at infinity’ — a classic example of an ‘ideal object’ — can be an ideal in a C *-algebra.)

Posted by: John Baez on February 22, 2008 8:09 PM | Permalink | Reply to this

Re: Logicians Needed Now


Cool! Could you say what some of the basic ideas are, in very simple terms?

I’ll try to say a few things.

A geometric theory, one using only finite conjuction and arbitrary disjunction, describes a topological space, its Lindenbaum algebra, or syntactic category. This is a locale, or complete Heyting algebra. As you know a locale is a generalization of a topological space. We do not insist that the Heyting algebra is an algebra of sets. The points of a locale are its complete prime filters. These points correspond to the models of the geometric theory.

Often these points are “constructed” using non-constructive methods: the axiom of choice for instance. Consider the Gelfand spectrum of a C*-algebra. There one uses the axiom of choice to “construct” the points. However, these points are almost never needed in applications. The locale structure is what matters. Moreover, because the spectrum as a topological space is non-constructive it does not generalize to other toposes then Set. In applications where we have a continuous field of C*-algebras we need the localic version of the theorem. All of this leads to the idea that the use of the axiom of choice is a choice (it is not needed) and usually a bad choice (theorems obtained in this way do not generalize). Many of the standard applications of the axiom of choice in functional analysis have been removed in this way.

All of this may be seen as a shift from model theory, the existence of points, to proof theory, the locale is non-trivial. So, instead of proving that a point exists one proves that the corresponding theory is consistency. By an application of a non-constructive completeness theorem one goes from one to the other. However, this last move is not needed in practice. This is also what is going on in the Nullstellensatz. [When reading Tao’s post on this, I got the impression that his ideas may be related.]

Typically, when using these methods one did not analyse the corresponding constructive proofs, but was happy with their non-constructive existence, using a Goedel-style completeness theorem. Hence it gives a classical proof of the existence of a constructive proof. Surprisingly, once found, these constructive proofs are usually very elegant.

Thierry’s observation is that in case the space is compact Hausdorff. One can actually consider the finitary covering relation and hence have a finite proof system instead of the infinitary one used in geometric logic. This often allows one to extract very simple algebraic proofs from highly non-constructive arguments.

The connection between an ideal and an ideal element is very clear in the case of the Gelfand spectrum. The points/models are the maximal ideals, which require Zorn for its construction. Hence they are clearly idealized.

Finally, for the relation with cut-elimination: Suppose that we want to prove that something holds for all the points of the spectrum of a C*-algebra. Then, going to the logical side, this means that we want to prove that truth is derivable in the corresponding theory. In order to prove this it suffices to consider only cut-free proofs. This often makes it possible to find the very concrete proofs hidden in abstract arguments.

It’s time to go to bed…

Bas

Posted by: Bas Spitters on February 22, 2008 10:02 PM | Permalink | Reply to this

Re: Logicians Needed Now

Knowing almost nothing about much of the mathematics involved here (cut elimination) I risk a simpleminded comment.

“In general, in the absence of any other machinery, if someone handed me some axioms that led to no contradictions, and told me to find a model of them, I’d try building this model syntactically… since what else could I do?”

But syntactics, as in computer science, isn’t that just asking for what you want to be able to do (with a say a programming language), which is more or less what the axioms do already? Don’t you need a map to a concrete semantics, where the syntactical rules are implemented and where you can actually perform the calculations?

“It can also be seen as a partial realisation of Hilbert’s program and uses the idea of ‘replacing’ an infinite object by a syntactical one: a logical theory which describes this object.”

This sounds very much like what a lazy functional programming language (like Haskell) does - but again in concrete semantics - the calculations can be done on a real computer.

Curious if these comments makes sense.

Posted by: Anders Bengtsson on February 24, 2008 8:20 AM | Permalink | Reply to this

Re: Logicians Needed Now

“This sounds very much like what a lazy functional programming language (like Haskell) does - but again in concrete semantics - the calculations can be done on a real computer.”

Indeed, Thierry Coquand also uses this analogy in the slides mentioned above.

Bas

Posted by: Bas Spitters on February 24, 2008 8:28 PM | Permalink | Reply to this

Re: Logicians Needed Now

Anders wrote:

Don’t you need a map to a concrete semantics, where the syntactical rules are implemented and where you can actually perform the calculations?

My point was that sometimes, when we’re desperate to find a model of some axioms, we resort to a “purely syntactical” model which is really just built from symbolic expressions.

For example, suppose I hand you the axioms describing a gizmo with one binary operation × satifying the equations

a×b=b×a

a×(a×b)=b×b

and I tell you to find some examples of such a gizmo — that is, some models for these axioms. You could look around for examples in nature, but this might be tough — except for the empty set or the 1-element set, which are always good things to try.

So, you might get desperate and start trying to build models syntactically. For example, you might say “how about the free gizmo on 3 generators x, y and z?” This would have elements x, y and z and also all formal products of them, like (x×z)×(z×x), modulo the relations imposed by the equations above. So, for example, x×(x×x) would equal x×x.

Starting from these ‘free’ models, you can then build lots of other models using various constructions.

Using this sort of trick, we instantly know that any ‘algebraic theory’ — any theory describing a gizmo with a bunch of n-ary operations satisfying equational laws — has plenty of models.

I was quite vaguely imagining using some trick like this in some other context…

Posted by: John Baez on March 1, 2008 6:33 AM | Permalink | Reply to this

Re: Logicians Needed Now

Alas, I don’t know what to do about this problem without significantly modifying our setup. Josh mentioned multicategories as one approach, but I have no idea how this would work for the ‘quantum logic’ systems our setup is designed to include. So, I guess I’ll just mention the problem. In a way it doesn’t really matter, since we’re just trying to set up tenuous bridges between various subjects… just enough to get more people talking to each other. But, it’s good to know about the hot water we’re getting into.

I think the subsection on proof theory will leave many scratching their heads about the point of it all, because here all proof theory appears to be is an especially cumbersome way of presenting the construction of proofs/morphisms.

To my mind, the crucial point underlying proof theory is that body of results that are called cut elimination theorems, which for a category theorist not accustomed to thinking of these things, says something quite incredible: that you can dispense with composition (your rule ())!! That’s really a crude way of putting it, so let me say a bit more.

Imagine that you’re setting out to prove a difficult theorem, AB. Along the way you will need to prove a bunch of propositions which will get you from point A to point B: AA 1 , A 1 A 2 , …, A nB. (The cut rule, or rule (), would then allow you to conclude AB.) In the case of a nontrivial theorem, there is no doubt that the intermediate formulas A i could be quite complicated. If you actually wanted to mechanize the process (of deciding which A i you will need), the problem would seem to be hopeless: the search space for the requisite intermediate formulas is infinite, unbounded. So having to rely on the cut rule would seem to be a very risky proposition, from that point of view.

Nevertheless, Gentzen and his successors showed that for deductive systems of carefully specified type, one can get around this fundamental difficulty: he proved a meta-theorem to the effect that any entailment (or sequent) AB that can be proven with the cut or composition rule, can also be proven without it! For example, as an application to the types of monoidal categories you and Mike are discussing in your paper, one can decide if there exists a morphism from one formula A built up from and (your lollipop connective – sorry, I don’t know the LaTeX code for that) to another B, that is, a morphism built out of the data for the particular type of structured category being studied (monoidal, monoidal closed, symmetric monoidal closed, etc.). In effect, in the system left after the cut rule is removed, the only way one can infer an entailment or morphism below the line is by starting with entailment(s) above the line with a simpler logical structure. In a lot of such deductive systems which have been studied by categorists, the entailment AB one is after is built up from simpler ones, one connective at a time.

[There is also a more refined, categorified cut elimination, which allows one to decide not only if there exists a morphism AB, but whether two such morphisms built from syntactically different deductions are the same on the basis of the axioms (commutative diagrams) for the type of structured monoidal category being studied. This is the crucial step toward solving a coherence problem for that type of structure, i.e., deciding which diagrams commute. The fact that Gentzen cut elimination admits a categorified form is one of the great insights due to Lambek.]

But it should come as no surprise that to get away with a cut elimination theorem, one has to be very careful in just how the deductive system is set up – sometimes a delicate matter. In multiplicative intuitionistic linear logic (the logic for symmetric monoidal closed categories), the Gentzen method gives rules which introduce connectives one at a time, either on the left or right of the entailment symbol (the turnstyle). For example, there is a rule

Γ,A,B,ΔCΓ,AB,ΔC

which introduces a connective on the left (here Γ, Δ denote lists of formulas). There is a separate rule

ΓA;ΔBΓ,ΔAB

for introducing on the right. The various rules come in pairs like this, balanced “just so”, so that a cut elimination theorem can go through. In particular, I think one really does need sequents of the form

A 1 ,,A nB

in order to get cut elimination to come out right. (Just FYI, the cut rule would take a form like

ΔA;Γ,A,ΣBΓ,Δ,ΣB).

Multicategories are one semantic device for interpreting such sequents. Another is to think of the “commas” as the monoidal product for a monoidal strictification (a formal way of associating a strict monoidal category equivalent to a given monoidal category). I spoke a little about this back here; see also the comments which follow (including where Tom Leinster chipped in with a remark on unbiased monoidal categories).

I may have more comments later.

Posted by: Todd Trimble on February 21, 2008 4:06 PM | Permalink | Reply to this

Re: Logicians Needed Now

I’ll slowly work my way up to some more substantial issues, but for starters…

Todd writes:

In multiplicative intuitionistic linear logic (the logic for symmetric monoidal closed categories)…

Does anyone know how standard it is to use the term ‘multiplicative intuitionistic linear logic’ (or ‘MILL’) to mean precisely ‘the logic for symmetric monoidal categories’?

Right now about the only place I can find this usage clearly documented is in a paper by Hasegawa. Originally Mike and I used the term ‘MILL’ all over our paper — but then I got scared that it might mean different things to different people. For example, Benton, Bierman, de Paiva and Hyland seem to include the connective ! in their concept of MILL — see the rules listed on page 2.

Posted by: John Baez on February 23, 2008 5:30 AM | Permalink | Reply to this

Re: Logicians Needed Now

I thought it was very standard. Certainly if you leave off the word “intuitionistic” and talk about MLL, then you will find a lot more references, and just about all that I am aware of take the primitives to be , ¬, (this funny ‘par’ connective, De Morgan dual to ), with constants and (units for and respectively). In particular, no modalities ! and ?; if those are included, people generally say ‘MLL + exponentials’ or something similar.

I’m not sure what the best reference for the system MILL (as we know it) would be. But I too am surprised by their inclusion of !; I honestly think you’re on safe ground sticking to your guns here.

Posted by: Todd Trimble on February 25, 2008 2:26 PM | Permalink | Reply to this

Re: Logicians Needed Now

Okay, good — thanks. Elsewhere the same Benton–Bierman–de Paiva–Hyland gang use the term ‘MELL’ (‘multiplicative exponential linear logic) to stand for the fragment of linear logic containing ,, and that exponential !. I like that better. So, I’ll stick to my guns.

Posted by: John Baez on February 25, 2008 8:14 PM | Permalink | Reply to this

Re: Logicians Needed Now

(your lollipop connective – sorry, I don’t know the LaTeX code for that)

It’s \multimap.

Back here you said,

I like to think of the comma as denoting the tensor product of the monoidal strictification of the smc category where these sequent deductions are interpreted.

I’m trying to understand: we’ve got a symmetric monoidal closed category A, and we “interpret it in B” by taking a symmetric monoidal functor F:AB. Then we use the fact that B is equivalent to its skeleton C; calling that equivalence E, we get EF:AC. Then we pull the monoidal structure on C back to A to get the comma. Is that right?

Also, I’m sure I don’t understand cut elimination: the wikipedia page says that if you have proofs of sequents AB and BC, then anywhere in a proof of AC mentioning B you can inline the proof of B instead. But isn’t that just what the cut rule does!?

Posted by: Mike Stay on February 23, 2008 11:04 PM | Permalink | Reply to this

Re: Logicians Needed Now

No, actually, I think that’s the Deep Magic here. The cut rule doesn’t inline the proof.

I think this is the right analogy (please, logicians, smack me if I’m wrong). You could write a procedure in a C program called square:


float square(float x)
{
return x*x;
}

And then you could find the squared length with:


float squaredlength1(float x, float y, float z)
{
return square(x)+square(y)+square(z);
}

This way of doing things is like using the cut rule. The Cut Elimination Theorem tells us that there’s another way of writing this function that doesn’t use a subroutine call:



float squaredlength2(float x, float y, float z)
{
return x*x+y*y+z*z;
}

Is that more or less accurate?

Posted by: John Armstrong on February 24, 2008 1:02 AM | Permalink | Reply to this

Re: Logicians Needed Now

OK, I can see how syntactic subtleties can differentiate these. In lambda calculus, the term

(1)(square(x,y,z)square(x)+square(y)+square(z))(xx*x)

is the same morphism (via beta reduction) as

(2)(x,y,z)x*x+y*y+z*z.

The cut rule, on the other hand, talks about taking a pair of morphisms

(3)(square(x,y,z)square(x)+square(y)+square(z),xx*x)

and spitting out the morphism above.

Posted by: Mike Stay on February 25, 2008 12:46 AM | Permalink | Reply to this

Re: Logicians Needed Now

Yes! And I’m very glad you mentioned beta reduction, because that is a key for one way of thinking about cut elimination. (I wish I had thought to say this earlier, because not long ago there was this course on classical and quantum computation, when you guys were discussing things like lambda calculus and beta reduction.)

As I’m sure you know, there is this so-called “Curry-Howard” correspondence, by which one can inter-translate between formulas and types, and between proofs and programs (or lambda terms). I’m going to put it a bit roughly, but it can all be made quite precise.

The way I think of it is: a deduction f of a sequent

A 1 ,,A nB

corresponds to a term f(a 1 ,,a n) of type B, where each a i is a term of type A i which occurs freely. Then, for example, one can interpret a deduction ending with the rule

Γ,AfBΓAB

in lambda terms as an instance of lambda abstraction, which binds a hitherto free variable a of type A: the term for this deduction would look like

λa.f(x 1 ,,x m,a)

where now the free variables are those of the types X 1 , …, X m appearing in Γ.

Well, this much may have been obvious already, so let’s continue. Consider then a sequent deduction which introduces on the left of the turnstyle:

Δ,BgC;ΩαAΔ,AB,ΩC

Here, to get the corresponding lambda term, we need a fresh variable ϕ of function space type AB. It occurs freely in the term

g(y 1 ,,y n,ϕ(α(z 1 ,,z p)))

[where the y i are variables of types occurring in Δ, and the z j are of types occurring in Ω]. This is the term associated with the previous deduction.

Now suppose we try to “cut” the previous two deductions at the formula AB. This corresponds to substituting the term

[λa.f(x 1 ,,x m,a)]

(of type AB) in for the free variable ϕ. (I guess this is the “function call”.) But the computation then involves evaluating the expression

[λa.f(x 1 ,,x m,a)](α(z 1 ,,z p))

which is where we use beta reduction; it reduces to

f(x 1 ,,x m,α(z 1 ,,z p)).

What this reduction has done is “push back” [or inline?] a substitution of a term for a variable ϕ of type AB, in favor of substitutions of terms of simpler type. One is the substitution of α(z 1 ,,z p) for the variable a of type A. The other substitutes the term in the last display (a term of type B) for the variable b in g(y 1 ,,y n,b).

At the level of sequent deductions, this inlining step pushes back the cuts [where we make the substitutions], from the cut at the formula AB, to cuts at the simpler formulas A and B. In fact, this is exactly the case 6 of the cut elimination algorithm that I was trying to point you to in the comment below, on page 106 of Proofs and Types. This is the case which corresponds precisely to beta reduction.

Posted by: Todd Trimble on February 25, 2008 3:29 AM | Permalink | Reply to this

Re: Logicians Needed Now

It’s \multimap.

Thank you!

I’m trying to understand: we’ve got a symmetric monoidal closed category A, and we “interpret it in B” by taking a symmetric monoidal functor F:AB. Then we use the fact that B is equivalent to its skeleton C; calling that equivalence E, we get EF:AC. Then we pull the monoidal structure on C back to A to get the comma. Is that right?

Sorry; I didn’t mean to be so terse or cryptic. I had something slightly different in mind.

One can think of the sequent calculus for MILL as giving a means of presenting the free symmetric monoidal closed categories F[X,Y,Z,] (generated by a set or discrete category of positive literals X, Y, Z, …). The objects of the free structure are MILL formulas, and the morphisms AB are equivalence classes of MILL-valid deductions of sequents AB. So the question I was trying to give an answer to was: what would the corresponding meaning of equivalence classes of MILL deductions of more general sequents A 1 ,,A nB?

One answer has already been suggested: that these equivalence classes of deductions could be considered the multimorphisms of a certain multicategory; actually the “underlying” multicategory of the free structure F[X,Y,Z] (as a monoidal category). For details on this underlying multicategory construction, see Tom Leinster’s book, page 84 (his V functor). But since your intended audience might not be familiar with multicategories (and you might not want to get into all that), I was suggesting slightly different language with essentially the same content.

Namely, there’s a way of formally “strictifying” a monoidal category M to get a strict monoidal category M monoidally equivalent to M. I believe the construction is given by Joyal and Street in their Geometry of Tensor Calculus paper (and it can also be extracted from Tom’s discussion). The objects of M are finite lists of objects of M. Then, to each finite list Γ=A 1 ,,A n one can associate a “clique” which is the diagram of all ways of tensoring up the A i in order, possibly with copies of I thrown in, and isomorphisms between them based on the monoidal structure in M. (One thinks of each n-fold tensoring as tagged by a planar tree τ with n leaves – a “classical tree” in Tom’s language, where each node has 0 or 2 incoming edges corresponding to the unit or an application of tensor. The tensoring itself might be denoted τΓ. Then, given two trees τ and τ , Mac Lane’s coherence then specifies a well-determined isomorphism f τ,τ : τΓ τ Γ between the two tensorings in M.)

Then, a morphism ΓΔ in the strictification M is just the obvious notion of clique morphism: a collection of maps in M of the form

ϕ τ,σ: τΓ σΔ

(ranging over trees σ, τ) which are compatible with all the f τ,τ , f σ,σ in the sense of making all the triangles commute. For example, f τ,τ followed by ϕ τ ,σ yields ϕ τ,σ. These equations mean that the whole collection of ϕ τ,σ is determined by any single one of them; the device of cliques is just a way of ensuring that no way of tensoring is favored over any other; it’s completely egalitarian.

Hopefully it’s clear the way in which M becomes strict monoidal; on objects the tensor product is concatenation of lists. M is embedded inside M , where objects are interpreted as lists of length 1. If you think a little further about it, this embedding is a monoidal equivalence (for example, any object Γ is isomorphic to an object of M, namely to any of its tensorings τΓ). So, that’s the formal strictification.

Hence, (equivalence classes of) deductions of sequents ΓA can be interpreted as morphisms ΓA in the monoidal strictification of the free symmetric monoidal closed category. You and John are doing something slightly different from free structures in your hom-set approach, but that can also be adapted to this strictification interpretation of sequents (if you want).

Also, I’m sure I don’t understand cut elimination: the wikipedia page says that if you have proofs of sequents AB and BC, then anywhere in a proof of AC mentioning B you can inline the proof of B instead. But isn’t that just what the cut rule does!?

Sadly, I’m not the right person to ask since I don’t speak “programming” (I think Robin does though). I have a sense of what this inlining business is saying, and I think John Armstrong’s response is in the right spirit, but my attempt to speak the lingo would be inarticulate – let Robin or someone else answer.

But maybe I can point you to a special but critical case of the cut-elimination algorithm for MILL which I hope helps. Go to Proofs and Types, page 106, and look at case 6, where one cuts at the formula CD. (Girard et al. are dealing with classical = non-intuitionistic sequents; to remove clutter and put yourself in MILL, take underlineB and underlineB to be empty, and underlineB to be a singleton.) Here the deduction (which ends with cutting at CD) is replaced by a deduction involving two cuts, one at C and one at D. So in effect, in cut-elimination, applications of cut are pushed back to simpler and simpler formulas until one pushes them all the way back to the level of axioms (identities XX on literals X), where they are at last eliminated (since composition with an identity does nothing). That’s the rough idea, although the careful multiple inductions are a little bit fussy – Proofs and Types gives plenty of detail.

Posted by: Todd Trimble on February 24, 2008 2:46 AM | Permalink | Reply to this

Re: Logicians Needed Now

One can think of the sequent calculus for MILL as giving a means of presenting the free symmetric monoidal closed categories F[X,Y,Z,] (generated by a set or discrete category of positive literals X,Y,Z,). The objects of the free structure are MILL formulas, and the morphisms AB are equivalence classes of MILL-valid deductions of sequents AB.

OK, at first glance I thought this was different from what we’re doing, but I think I understand: our objects are generated by the literals and the two connectives and , and we consider AB to be the set of morphisms hom(A,B). Then inference rules are dinatural transformations between special functors from

(1)C n×(C op) mSet.

The coherence laws on the natural transformations assert that two different deductions (two ways of going around the diagram) are “equal.”

Your objects are the functors (lines in a deduction) and your morphisms are the dinatural transformations (coherence-law-equivalence classes of deductions).

Ah! Yes, you say so:

You and John are doing something slightly different from free structures in your hom-set approach, but that can also be adapted to this strictification interpretation of sequents (if you want).

John Baez wrote:

The trick is to create a new category where objects are lists of objects in our old category, and define a new tensor product such that

(2)(X 1 ,,X n)(Y 1 ,,Y m)=(X 1 ,,X n,Y 1 ,,Y m)

Then you can set the associator equal to the identity and see that this new monoidal category is monoidally equivalent to the original one!

This is the trick Todd is calling “monoidal strictification”, and in applications to logic, the commas are supposed to remind you of sequents.

So how does the subformula property play into all this?

Posted by: Mike Stay on February 25, 2008 12:36 AM | Permalink | Reply to this

Re: Logicians Needed Now

Mike wrote:

I like to think of the comma as denoting the tensor product of the monoidal strictification of the smc category where these sequent deductions are interpreted.

I’m trying to understand: we’ve got a symmetric monoidal closed category A, and we “interpret it in B” by taking a symmetric monoidal functor F:AB. Then we use the fact that B is equivalent to its skeleton C; calling that equivalence E, we get EF:AC. Then we pull the monoidal structure on C back to A to get the comma. Is that right?

I don’t know where you got that ‘skeleton’ stuff from, but it makes me fear you’re succumbing to a classic beginner’s mistake: trying to make a monoidal category strict by making it skeletal.

In case you were making this mistake, let me say what it is, and what to do instead.

In a monoidal category we have an associator

a X,Y,Z:XY)ZX(YZ)

Switching to a skeletal subcategory ensures that

(XY)Z=X(YZ)

but this doesn’t help at all if you’re trying to make the associator equal to the identity!

In fact, to make the associator equal to the identity, we need to go in the opposite direction. We need to throw in a lot more isomorphic objects.

The trick is to create a new category where objects are lists of objects in our old category, and define a new tensor product such that

(X 1 ,,X n)(Y 1 ,,Y m)=(X 1 ,,X n,Y 1 ,,Y m)

Then you can set the associator equal to the identity and see that this new monoidal category is monoidally equivalent to the original one!

This is the trick Todd is calling ‘monoidal strictification’, and in applications to logic, the commas are supposed to remind you of sequents.

So: if you want to make a monoidal category strict, don’t make it skeletal: add flab!

Posted by: John Baez on February 24, 2008 4:48 AM | Permalink | Reply to this

Re: Logicians Needed Now

Yes, that’s the mistake I was making. Thanks!

Posted by: Mike Stay on February 24, 2008 9:44 PM | Permalink | Reply to this

Re: Logicians Needed Now

In the programming analogy: if the cut rule is like a function call, eliminating the cut corresponds to inlining the function.

As this analogy suggests, in practice it is not generally wise to eliminate all cuts from a proof: if you do, the size of the proof can blow up exponentially. George Boolos has a fun paper Don’t eliminate cut! which gives a simple example where this happens.

Posted by: Robin on February 24, 2008 9:37 AM | Permalink | Reply to this

Re: Logicians Needed Now

Todd wrote:

But it should come as no surprise that to get away with a cut elimination theorem, one has to be very careful in just how the deductive system is set up — sometimes a delicate matter.

I can believe that. But, to the extent to which the consequences of cut elimination can be phrased as theorems about symmetric monoidal closed categories, I’d naively hope we could phrase these consequences in a way that doesn’t give a darn whether we’re talking about a symmetric monoidal closed category or its strictification. After all, they’re equivalent!

I’m not denying that to prove cut elimination, this strictification business could be crucial — but that’s a matter of ‘technique’ rather than ‘results’. (There’s some rule of thumb in category theory saying that your theorems should be stated in a way that’s invariant under equivalence, but you’re allowed to use tricks like skeletons or strictifications to prove these theorems.)

If my hope is true, I could protect my delicate physicist readers from those scary ‘sequents’

A 1 ,,A nB

and just stick to the formalism where we write

A 1 A nB

as faux-logician’s notation for the homset

hom(A 1 A n,B)

You seem to be confirming my hopes here:

There is also a more refined, categorified cut elimination, which allows one to decide not only if there exists a morphism AB, but whether two such morphisms built from syntactically different deductions are the same on the basis of the axioms (commutative diagrams) for the type of structured monoidal category being studied. This is the crucial step toward solving a coherence problem for that type of structure, i.e., deciding which diagrams commute. The fact that Gentzen cut elimination admits a categorified form is one of the great insights due to Lambek.

It sounds, ironically, that this ‘more refined’ cut elimination theorem may take less work to explain — the result, that is, not the proof.

Could I say it amounts to a decision procedure for telling 1) when a homset in a freely generated SMCC is nonempty and 2) when two elements of the homset, described syntactically in terms of certain basic ‘generators’, are equal?

And, by ‘freely generated’ do we only mean free on a set of objects, or can we throw in some extra generating morphisms as well?

This is probably all in your thesis. I want it! I hope you send it to me and let me scan it in and put it on my website.

Posted by: John Baez on February 24, 2008 9:26 PM | Permalink | Reply to this

Re: Logicians Needed Now

Yes, you’re right – you don’t really need Gentzen sequent calculus in order to prove a cut elimination theorem, nor do you particularly have to mention strictifications per se. What I really meant (and what I should have said) is that cut-elimination is above all a syntactic theorem which is sensitive to just how the syntax is presented.

To prove the point, Kelly and Mac Lane, in their famous paper Coherence in Closed Categories [JPAA 1 (1971), 97-140], proved a cut elimination theorem in purely categorical language – no sequents, no strictifications, no nuthin’. Roughly, they showed that each canonical transformation that is constructible in the theory of smc categories is one of four types (corresponding to the four ‘logical’ rules of MILL) – the crux is to show that the class of transformations of these four types is closed under composition. That’s their cut elimination theorem.

See, I don’t think they liked this Gentzen sequent business any more than most category theorists – as I found out, very quickly, when I began giving seminar talks on my thesis in Australia! [I myself find the ideas clearer when expressed in sequent calculus, but hey, that’s me.] But anyway, the point remains that the cut-elimination theorem is syntactic in nature and a bit sensitive to how the syntax is presented.

But I say let’s not worry about it. I think it would be quite enough just to say a few words on what sequents (as long as you’re talking about them!) are good for; maybe add for the sake of honesty that the way you’re presenting it is a slight simplification and not exactly the way (categorical) logicians do it for their purposes, but anyway you’re not trying to prove a theorem of logic for goodness’ sake! So yes, I think your hopes are confirmed.

Could I say it amounts to a decision procedure for telling 1) when a homset in a freely generated SMCC is nonempty and 2) when two elements of the homset, described syntactically in terms of certain basic ‘generators’, are equal?

Cut-elimination is in any event a crucial step toward such a decision procedure. The traditional concern of logicians was 1), the issue of provability of formulas or sequents. Of perhaps greater concern to category theorists and many others is 2): deciding if a diagram commutes.

And, by ‘freely generated’ do we only mean free on a set of objects, or can we throw in some extra generating morphisms as well?

First, let me say that I think you can probably soft-pedal this business of free structures (since it’s mathematical physicists you’re addressing), if you want, with words like “deciding when a diagram commutes on the basis of the axioms/commutative diagrams of the theory”, or something like that. But anyway, “freely generated” can mean on any category. If you have an oracle who can tell you when two arrows of a category C are equal, then there’s a procedure for deciding equality in F[C] relative to equality in C.

Have we talked about clubs and wreath constructions much at the Café? In fact, what Max Kelly showed is that the free structure F[C] is equivalent to a kind of “wreath product” F[1 ]C. So really, “all” you need to know is the structure of F[1 ] (where 1 is the 1-object 1-morphism category); then, with the help of your C-oracle, you know all about F[C]. The same type of thing holds whenever the theory or doctrine is expressible as a so-called “club”.

Posted by: Todd Trimble on February 24, 2008 11:01 PM | Permalink | Reply to this

Re: Logicians Needed Now

John,

This looks like a very interesting paper!

I haven’t reached the logic part yet, but I’m not at all sure that your Definition 15 (defining a compact monoidal category) is equivalent to the usual definition.

Rather trivially, does it even follow from your definition that * is an equivalence? Less trivially, even if it does (or if you add that as an extra condition), I don’t know how to show that the zig-zag conditions hold. Do you have an argument that they do?

(I think I raised this here once before, with Todd.)

Posted by: Robin on February 21, 2008 4:14 PM | Permalink | Reply to this

Re: Logicians Needed Now

The definition you mention is our definition of a ‘compact monoidal category’, not necessarily symmetric or even braided. It seems most people don’t use the term ‘compact’ until the category is at least braided. What we’re calling a ‘compact monoidal category’ is supposed to match what some people call an autonomous category, or more precisely a ‘right autonomous category’: a monoidal category where every object has a right dual. But, let me continue to use the term ‘compact monoidal category’ here.

Please tell me if the following seems correct, or if you want more details on some point, or if something seems wrong!

In a monoidal category C where every object has a right dual, we can randomly pick a right dual X * for every object X. This choice then extends uniquely to a contravariant functor * R:CC.

There is, however, no obvious reason why this functor need be an equivalence unless every object also has a left dual! In that case, we also get a contravariant functor * L:CC, and both * L* R and * R* L are naturally isomorphic to the identity.

If C is braided, every right dual automatically becomes a left dual, so * R is automatically an equivalence in a braided monoidal category with right duals.

In our paper we take a slightly different tack, defining a (right) compact monoidal category to be a (right) closed monoidal category C equipped with a contravariant functor *:CC such that the internal hom is (naturally isomorphic to) X *Y.

Given this, you should want to see how we get the unit and counit and zig-zag identities that make X * into a right dual for X. By the above remarks, you should not expect *:CC to be an equivalence. But, it will be automatically as soon as C is braided — or more generally, as soon as C is also left compact.

We describe how to get the unit i:1 X *X and counit e:XX *1 . (Note our conventions, which are backwards compared to those I normally use.) We do not describe how to get the zig-zag identities for i and e. But, these should follow from the fact that the functor X * is right adjoint to X. The unit and counit of adjoint functors always satisfy certain zig-zag identities, and these should imply the zig-zag identities for i and e. But now you’ve got me worried. You are, after all, the prince of compact closed categories!

Posted by: John Baez on February 23, 2008 4:06 AM | Permalink | Reply to this

Re: Logicians Needed Now

The unit and counit of adjoint functors always satisfy certain zig-zag identities, and these should imply the zig-zag identities for i and e. But now you’ve got me worried.

Yes, this is precisely the right place to get a little worried. The idea is quite seductive, but it doesn’t seem to work out when you look at the details. Just try to do it, and you’ll see what I mean.

Suppose we have a natural isomorphism ϕ A,B,C:(AB,C)(B,A*C), as in your definition. If you want the zig-zag identities to come out right, you need an extra compatibility condition. Specifically: ask that for any arrow f:ABC and object X, we have ϕ(fX)=ϕ(f)X. (Of course you need to add some associators in there really; I omitted them to avoid clutter.)

Just to confuse things, I should add that I’ve never seen or heard of a category that satisfies your condition but is not compact closed, so it is possible that your definition is equivalent to the usual one after all. However, if it is true then it’s true for some as-yet unknown reason!

Posted by: Robin on February 23, 2008 11:30 AM | Permalink | Reply to this

Re: Logicians Needed Now

I’d actually like to come back to this issue, because I think I have an abstract argument that the Baez-Stay definition is alright after all at least in the symmetric case, but it should first be run past Robin’s eagle eye, and perhaps also shaken down to a simpler proof.

The argument goes like this: it suffices to show that the counit of the adjunction (X)(X *), call it ϵ, is given componentwise by the composite

XX *YϵIYIYY

(and similarly for the unit), i.e., that the counit and unit are given by what they do at I.

Now, in the Stay-Baez definition, the counit ϵ is given by transporting the structure of the counit of the adjunction (X)(X) across the isomorphism (X *)(X). This counit is just the evaluation map

X(XY)Y

in the symmetric monoidal closed category, call it V. Now this map, like every definable map in the theory of smc categories, is certainly V-natural in the sense of enriched category theory. It follows by transport of structure that ϵ carries V-natural structure.

We get from the enriched transformation

ϵ:XX *YY

a transformation

XX *YY

which is V-dinatural in Y. Hence this map factors uniquely through the V-enriched end (if it exists). But

yyyI

by the enriched Yoneda lemma (which holds true in any smc). Tracing through that, we find the factorization is the obvious composite

XX *ϵIIyy

and this proves the desired claim.

Possibly I’ve overlooked some subtlety, but so far I’m not seeing anything wrong with this argument (although it may be needlessly fancy).

Posted by: Todd Trimble on February 25, 2008 6:05 PM | Permalink | Reply to this

Re: Logicians Needed Now

Todd,

This is awesome! I’ve suspected for ages that this might be true, but I wasn’t clever enough to think of this way of looking at it.

If I’m understanding aright, you almost got this last time we talked about it. You pointed out that the extra condition on the adjunction can be expressed as “a compatibility between the unit and counit … and the canonical strengths of these functors”; but it apparently didn’t occur to either of us, at the time, that this compatibility is already a standard fact about closed categories!

I’m going to work through this some more now, to make sure I understand it properly, but it looks wonderful so far.

Robin

Posted by: Robin on February 26, 2008 12:39 PM | Permalink | Reply to this

Re: Logicians Needed Now

Just to put a little gloss on it: it’s not just that we have e.g., a counit of (X)(X *):

XX *YY;

we also have the adjunction (Y)(Y *) which allows us to “decouple” X and Y (whereupon we can apply the Yoneda reduction trick, or some other device). I think maybe the two of us might hav