Octonions and the Standard Model (Part 1)

Posted by John Baez

$MathML-enabled post (click for more details).$

I want to talk about some attempts to connect the Standard Model of particle physics to the octonions. I should start out by saying I don’t have any big agenda here. It’d be great if the octonions — or for that matter, anything — led to new insights in particle physics. But I don’t have such insights, and for me particle physics is just a hobby. I’m not trying to come up with a grand unified theory. I just want to explain some patterns linking the Standard Model to the octonions.

Understanding these patterns requires knowing a bit of physics and a bit of math. I’ll focus on the math side of things: mainly, I’ll be polishing up some existing ideas and trying to make them more pretty. I’ll assume you either know the physics or can fake it: either way, it won’t be the main focus.

In writing this first post, my attempt to explain an octonionic description of the strong force led me to a construction of the octonions that makes them look very much like the quaternions. I don’t know if it’s new, but I’d never seen it before. The basic idea is that octonions are to $\mathbb{C}^3$ as quaternions are to $\mathbb{R}^3$ .

$MathML-enabled post (click for more details).$

The group of automorphisms of the octonions that preserve any chosen octonion $i$ with $i^2 = -1$ happens to be isomorphic to $\mathrm{SU}(3)$ . In physics this is interesting because $\mathrm{SU}(3)$ is the gauge group of the strong nuclear force: the force that holds quarks together. Furthermore, when we look at how $\mathrm{SU}(3)$ acts on the octonions, it matches how $\mathrm{SU}(3)$ acts on quarks and leptons.

This idea has been around for a long time, going back at least to here:

Murat Günaydin and Feza Gürsey, Quark structure and octonions, Journal of Mathematical Physics 14 (1973), 1651–1667.

But if you look at that paper, you’ll see swarms of indices, which in my opinion conceal the beauty of what’s going on.

We could try to simply compute the group of automorphisms preserving some octonion $i$ with $i^2 = -1$ . But it turns out there’s a description of the octonions that makes it obvious that this group at least contains a copy of $\mathrm{SU}(3)$ .

First remember how the quaternions work. The quaternions are numbers like

$a + b i + c j + d k$

with $a,b,c,d$ real. In the late 1800s Josiah Willard Gibbs decided to chop quaternions into a scalar and vector part; if we do that we can write a quaternion as

$a + \vec{a}$

where $a \in \mathbb{R}$ and $\vec{a} \in \mathbb{R}^3$ is the vector $b i + c j + d k$ . Of course, these days kids are discouraged from adding scalars and vectors, but if you use quaternions it’s okay. Here’s how you multiply quaternions:

$(a + \vec{a})(b + \vec{b}) = a b - \vec{a} \cdot \vec{b} + a \vec{b} + b \vec{a} + \vec{a} \times \vec{b}$

So, all four ways of multiplying scalars and vectors are unified in quaternion multiplication:

multiplication of scalars (the ordinary multiplication of real numbers)
multiplication of vectors by scalars (scalar multiplication)
the dot product of vectors
the cross product of vectors

This is no coincidence: the dot product and cross product were invented by Gibbs when he chopped quaternions into a scalar and vector part, and in 1901 these operations were popularized in Edwin Wilson’s book Vector Analysis: A Textbook for the Use of Students of Mathematics, which was based on some lectures by Gibbs.

Now suppose we want to generalize all this using the complex numbers. We’ll take our scalars to live in $\mathbb{C}$ and our vectors to live in $\mathbb{C}^3$ .

One thing we can do — not the thing I’m interested in, but I have to mention it — is to copy everything that we just did, word for word, replacing $\mathbb{R}$ by $\mathbb{C}$ everywhere. We don’t get the octonions: we get the biquaternions, which is a 4-dimensional complex algebra, or 8-dimensonal real algebra. The biquaternions turn out to be isomorphic to the algebra of $2 \times 2$ complex matrices.

In copying everything ‘word for word’, we defined a dot product on $\mathbb{C}^3$ just as we did for $\mathbb{R}^3$ :

$\vec{v} \cdot \vec{w} = \sum_{i=1}^3 v_i w_i$

But another alternative is to use the inner product

$\langle \vec{v}, \vec{w}\rangle = \sum_{i=1}^3 \overline{v}_i w_i$

Just as the dot product on $\mathbb{R}^3$ is invariant under the group $\mathrm{O}(3)$ of orthogonal $3 \times 3$ real matrices, this inner product on $\mathbb{C}^3$ is invariant under the group $\mathrm{U}(3)$ of unitary $3 \times$ complex matrices. Since we’re shooting for a connection to $\mathrm{SU}(3)$ , let’s use this!

It turns out that this change forces us to use a modified cross product. The cross product on $\mathbb{R}^3$ is invariant under the group $\mathrm{SO}(3)$ of orthogonal $3 \times 3$ real matrices with determinant 1. (So: not reflections, but rotations.) Unfortunately, the cross product on $\mathbb{C}^3$ is not invariant under the group $\mathrm{SU}(3)$ of unitary $3 \times 3$ complex matrices with determinant 1.

What’s invariant under $\mathrm{SU}(3)$ is a mutant version of the cross product, where we first take the usual cross product and then take the complex conjugate of each component. Let’s call this

$\overline{\times} \colon \mathbb{C}^3 \times \mathbb{C}^3 \to \mathbb{C}^3$

In coordinates, it’s given by

$(\vec{v} \overline{\times} \vec{w})_i = \overline{(\vec{v} \times \vec{w})_i}$

Later I’ll give a coordinate-free definition which makes it clear why this operation is invariant under $\mathrm{SU}(3)$ . Some people would say ‘equivariant’ but that sounds too fancy to me right now. I just mean that if $U \in \mathrm{SU}(3)$ we have

$U \vec{v} \overline{\times} U \vec{w} = U(\vec{v} \overline{\times} \vec{w})$

Now we’ve got all the machinery to describe the octonions in a way that makes them look a lot like quaternions. The quaternions were built from real scalars and real vectors:

$\mathbb{H} \cong \mathbb{R} \oplus \mathbb{R}^3$

So now we’ll build the octonions from complex scalars and complex vectors:

$\mathbb{O} \cong \mathbb{C} \oplus \mathbb{C}^3$

To multiply guys in here we’ll copy the formula for multiplying quaternions, but we’ll make sure the multiplication is preserved by the obvious action of $\mathrm{SU}(3)$ — namely, the trivial action on scalars and the action by matrix multiplication on vectors. This action clearly preserves $i \in \mathbb{C} \subseteq \mathbb{O}$ , which is another thing we want. Then, we’ll put the most obvious norm on this space, the one with

$\| a + \vec{a} \|^2 = |a|^2 + \langle \vec{a}, \vec{a} \rangle$

and we’ll show that multiplication gets along with this norm:

$\|(a + \vec{a})(b + \vec{b})\| = \|a + \vec{a}\| \|b + \vec{b}\|$

So, we’ll have an 8-dimensional normed division algebra — which implies that we’ve got the octonions!

Since our formula for multiplication will look almost like the formula for multiplying quaternions, the proof that multiplication gets along with the norm will be almost like the proof that works for the quaternions.

Even better, just as the quaternions have $\mathrm{SO}(3)$ acting as automorphisms since their multiplication was built from $\mathrm{SO}(3)$ -invariant operations, the octonions will have $\mathrm{SU}(3)$ acting as automorphisms that preserve $i$ , because we’ll build their multiplication from $\mathrm{SU}(3)$ -invariant operations that preserve $i$ !

Here’s the result:

Theorem 1. If we define multiplication on $\mathbb{C} \oplus \mathbb{C}^3$ by $(a + \vec{a})(b + \vec{b}) = a b - \langle \vec{a}, \vec{b}\rangle + \overline{a} \vec{b} + b \vec{a} + \vec{a} \overline{\times} \vec{b}$ then we obtain a normed division algebra, since $1 \in \mathbb{C}$ acts as the multiplicative identity and $\|(a + \vec{a})(b + \vec{b})\| = \|a + \vec{a}\| \|b + \vec{b}\|$ This normed division algebra is isomorphic to the octonions.

Note the formula for multiplication is exactly like the formula for multiplying quaternions… except for three things. First, the dot product has been replaced by the inner product. Second, the cross product has been replaced by the mutant cross product $\overline{\times}$ . These changes are not weird, because we want operations that are invariant under $\mathrm{SU}(3)$ . Third, the term $a \vec{b}$ has been replaced by $\overline{a} \vec{b}$ , while the term $b \vec{a}$ remains unchanged. This is weird. Of course complex conjugation of scalars is invariant under $\mathrm{SU}(3)$ . But the reason we need it is to get a normed division algebra.

I’ll give the proof at the end of this post. In reading this proof, the main thing to keep an eye on is the overlines for complex conjugation. I checked my calculations a few times, but if I got them wrong then maybe the $\overline{a} \vec{b}$ business isn’t needed. That would make me very happy.

Now let me tell you how I came up with this stuff. This part is more fancy than the story so far, but it may appeal to certain people.

Let’s start by assuming the octonions $\mathbb{O}$ are an 8-dimensional normed division algebra such that the automorphisms preserving any octonion $i$ with $i^2 = -1$ form a copy of $\mathrm{SU}(3)$ , and use this to try to guess how multiplication in the octonions works.

What is $\mathbb{O}$ like, as a representation of $\mathrm{SU}(3)$ ? Initially, at least, it’s a real representation. It has a 1-dimensional invariant subspace spanned by the element $1$ , since $1$ is fixed by every automorphism. And it has a 1-dimensional invariant subspace spanned by $i$ , by assumption. Taking the direct sum of these, we get a 2-dimensional trivial subrepresentation which is also a subalgebra: a copy of $\mathbb{C}$ in $\mathbb{O}$ .

Since $\mathrm{SU}(3)$ is compact there must be a complementary 6-dimensional subspace

$V \subseteq \mathbb{O}$

that’s also preserved by $\mathrm{SU}(3)$ . We can understand this better if we break it into irreducible representations of $\mathrm{SU}(3)$ . The lowest-dimensional real irreps of $\mathrm{SU}(3)$ are the 1-dimensional trivial rep and the 6-dimensional irrep coming from the usual action of $\mathrm{SU}(3)$ on $\mathbb{C}^3$ . Note: $\mathrm{SU}(3)$ has two inequivalent complex irreps of dimension 3, namely its usual rep on $\mathbb{C}^3$ and the dual of this — but these are equivalent as real reps, as always for a unitary rep and its dual.

It follows that $V \subseteq \mathbb{O}$ is the 6-dimensional real irrep of $\mathrm{SU}(3)$ . Indeed, the only alternative is that $V$ is the direct sum of 6 copies of the 1-dimensional trivial rep! But in that case $\mathrm{SU}(3)$ would act trivially on $\mathbb{O}$ , so it would not be a group of automorphisms of $\mathbb{O}$ . So that alternative is out.

So $V$ must be isomorphic to the usual rep on $\mathbb{C}^3$ , and we get

$\mathbb{O} \cong \mathbb{C} \oplus \mathbb{C}^3$

as real reps of $\mathrm{SU}(3)$ .

Now we can try to figure out the options for multiplying octonions. Since the multiplication is invariant under $\mathrm{SU}(3)$ , it must be built from $\mathrm{SU}(3)$ -invariant bilinear maps involving $\mathbb{C}$ and $\mathbb{C}^3$ .

This should instantly make you think of the quaternions! The quaternions

$\mathbb{H} \cong \mathbb{R} \oplus \mathbb{R}^3$

have multiplication built from $\mathrm{SO}(3)$ -invariant bilinear maps involving $\mathbb{R}$ and $\mathbb{R}^3$ . These maps are familiar:

multiplication of scalars
multiplication of vectors by scalars
the dot product of vectors
the cross product of vectors

So, we should think about how to adjust these in the complex case, to get $\mathrm{SU}(3)$ -invariant maps. Multiplication of scalars remains unchanged, as does the multiplication of vectors by scalars, but the dot product and cross product need to be adjusted.

First remember what’s going on with the dot product and cross product in $\mathbb{R}^3$ . The usual inner product on $\mathbb{R}^3$ is invariant under $\mathrm{O}(3)$ just by definition. The cross product is more complicated. To build it, we start with the wedge product.

$\wedge \colon \mathbb{R}^3 \times \mathbb{R}^3 \to \Lambda^2 \mathbb{R}^3$

which is invariant under all of $\mathrm{GL}(3,\mathbb{R})$ . Then we apply an isomorphism

$\alpha \colon \Lambda^2 \mathbb{R}^3 \to (\mathbb{R}^3)^\ast$

This is invariant under $\mathrm{SL}(3,\mathbb{R})$ , since it’s defined by

$\alpha(\omega)(\vec{v}) \; \mathrm{vol} = \omega \wedge \vec{v}$

where $\mathrm{vol}$ is the usual volume 3-form and $\omega$ is anything in $\Lambda^2 \mathbb{R}^3$ . Then we apply the so-called musical isomorphism:

$\sharp \colon (\mathbb{R}^3)^* \to \mathbb{R}^3$

which invariant under $\mathrm{O}(3)$ , since it’s defined by

$\sharp(f) \cdot \vec{v} = f(\vec{v})$

Putting all this crud together, we get the cross product

$\times \colon \mathbb{R}^3 \times \mathbb{R}^3 \to \mathbb{R}^3$

defined by

$\vec{v} \times \vec{w} = \sharp \alpha(\vec{v} \wedge \vec{w})$

The cross product is invariant under $\mathrm{SO}(3)$ , since that’s the intersection of all the groups I just mentioned!

Whew. It’s amazing kids can learn the cross product.

Now let’s copy all this stuff using complex numbers and get the octonions. We’ll replace the dot product with the inner product on $\mathbb{C}^3$ :

$\langle \vec{v}, \vec{w} \rangle = \sum_{i=1}^3 \overline{v}_i w_i$

This is invariant under $\mathrm{U}(3)$ , but it’s sesquilinear complex-linear in the second argument but conjugate-linear in the first. To define a mutant version of the wedge product, we’ll start with the complex-bilinear wedge product

$\wedge \colon \mathbb{C}^3 \times \mathbb{C}^3 \to \Lambda^2 \mathbb{C}^3$

This is invariant under all of $\mathrm{GL}(3,\mathbb{C})$ . Then we apply an isomorphism

$\alpha \colon \Lambda^2 \mathbb{C}^3 \to (\mathbb{C}^3)^\ast$

This is complex-linear and invariant under $\mathrm{SL}(3,\mathbb{C})$ , since it’s defined by

$\alpha(\omega)(\vec{v}) \; \mathrm{vol} = \omega \wedge \vec{v}$

where $\mathrm{vol}$ is the usual volume 3-form. Then we apply an isomorphism

$\sharp \colon (\mathbb{C}^3)^\ast \to \mathbb{C}^3$

This is now conjugate-linear and invariant under $\mathrm{U}(3)$ , since it’s defined by

$\langle \sharp(f) , \vec{v} \rangle = f(\vec{v})$

Putting all this together, we get

$\overline{\times} \colon \mathbb{C}^3 \times \mathbb{C}^3 \to \mathbb{C}^3$

defined by

$\vec{v} \overline{\times} \vec{w} = \sharp \alpha(\vec{v} \wedge \vec{w})$

This is conjugate-linear and invariant under $\mathrm{SU}(3)$ . If we work it out in coordinates we see

$(\vec{v} \overline{\times} \vec{w})_i = \overline{(\vec{v} \times \vec{w})_i}$

So, it’s clear that we should define octonion multiplication using these maps, and that we should make it look like quaternion multiplication so we can prove we get a normed division algebra. The one unpredictable twist is that now we can take complex conjugates of scalars: this operation is also $\mathrm{SU}(3)$ -invariant. We have to do this in the right places to get a normed division algebra!

So let’s see why, by proving this theorem:

Theorem 1. If we define multiplication on $\mathbb{C} \oplus \mathbb{C}^3$ by $(a + \vec{a})(b + \vec{b}) = a b - \langle\vec{a}, \vec{b}\rangle + \overline{a} \vec{b} + b \vec{a} + \vec{a} \overline{\times} \vec{b}$ then we obtain a normed division algebra, since $1 \in \mathbb{C}$ acts as the multiplicative identity and $\|(a + \vec{a})(b + \vec{b})\| = \|a + \vec{a}\| \|b + \vec{b}\|$ This normed division algebra is isomorphic to the octonions.

Proof. It’s enough to show that we have a normed division algebra, since the octonions are the only 8-dimensional normed division algebra over the reals. We’ll use the norm with

$\|a + \vec{a}\|^2 = |a|^2 + \langle \vec{a} , \vec{a} \rangle$

and show that

$\|(a + \vec{a})(b + \vec{b})\| = \|a + \vec{a}\| \|b + \vec{b}\|$

It’s a calculation — and if we take all our complex numbers to be real, this calculation will also show that that $\mathbb{R} \oplus \mathbb{R}^3$ is a normed division algebra with the same multiplication formula: it’s the quaternions.

We start with

$\|(a + \vec{a})(b + \vec{b})\|^2 = \|a b - \langle\vec{a}, \vec{b}\rangle + \overline{a} \vec{b} + b \vec{a} + \vec{a} \overline{\times} \vec{b} \|^2$

and use the definition of the norm to break this up into two terms:

$|a b - \langle \vec{a}, \vec{b} \rangle|^2 + \| \overline{a} \vec{b} + b \vec{a} + \vec{a} \overline{\times} \vec{b} \|^2$

We can expand the first term:

$|a b - \langle \vec{a}, \vec{b} \rangle|^2 = |a b|^2 - 2 \mathrm{Re}(a b \langle \vec{b} , \vec{a} \rangle) + |\langle \vec{a}, \vec{b}\rangle|^2$

We can expand the second term:

$\|\overline{a} \vec{b} + b \vec{a} + \vec{a} \overline{\times} \vec{b} \|^2 =$ $\| \overline{a} \vec{b} \|^2 + \|b \vec{a} \|^2 + \|\vec{a} \overline{\times} \vec{b}\|^2 + 2 \mathrm{Re} \big( a b \langle \vec{b}, \vec{a} \rangle + a \langle \vec{b}, \vec{a} \overline{\times} \vec{b} \rangle + \overline{b} \langle \vec{a}, \vec{a} \overline{\times} \vec{b} \rangle \big)$

But note that

$\langle \vec{a}, \vec{a} \overline{\times} \vec{b} \rangle = \overline{\vec{a} \cdot (\vec{a} \times \vec{b})} = 0$

by a well-known vector identity which works for complex vectors just as for real ones. Similarly $\langle \vec{b}, \vec{a} \overline{\times} \vec{b} \rangle = 0$ . So, the expanded second term simplifies, and when we add it to the expanded first term the bits involving $\mathrm{Re}(a b \langle \vec{b} , \vec{a} \rangle)$ cancel out. We’re left with this:

$|a b|^2 + |\langle \vec{a}, \vec{b}\rangle|^2 + \| \overline{a} \vec{b} \|^2 + \|b \vec{a} \|^2 + \|\vec{a} \overline{\times} \vec{b}\|^2$

We need to show this equals

$\begin{array}{ccl} \|a + \vec{a} \|^2 \| b + \vec{b}\|^2 &=& (|a|^2 + \|\vec{a}\|^2) (|b|^2 + \|\vec{b}\|^2) \\ \\ &=& |a|^2 |b|^2 + |a|^2 \|\vec{b}\|^2 + |b|^2 \|\vec{a}\|^2 + \|\vec{a}\|^2 \|\vec{b}\|^2 \end{array}$

And if you look, you’ll see 3 terms match, so it’s enough to show

$|\langle \vec{a}, \vec{b}\rangle|^2 + \|\vec{a} \overline{\times} \vec{b}\|^2 = \|\vec{a}\|^2 \|\vec{b}\|^2$

This would be a familiar vector identity if we were working in $\mathbb{R}^3$ with the usual cross product instead of the mutant one. But notice, the mutant cross product is obtained from the ordinary one by componentwise complex conjugation, so

$\|\vec{a} \overline{\times} \vec{b}\|^2 = \|\vec{a} \times \vec{b}\|^2$

Thus, we just need to show

$|\langle \vec{a}, \vec{b}\rangle|^2 + \|\vec{a} \times \vec{b}\|^2 = \|\vec{a}\|^2 \|\vec{b}\|^2$

This works for $\mathbb{C}^3$ and its inner product exactly as it does for $\mathbb{R}^3$ and its dot product: you can either write out both sides using components and see they agree, or do a geometrical argument using the law of cosines and the law of sines. █

Note: after developing the above approach to the octonions, I found it here:

Ichiro Yokota, Exceptional Lie Groups, Section 1.5.

My approach is more ‘geometrical’, less computational, but it amounts to the same thing.

Part 1. How to define octonion multiplication using complex scalars and vectors, much as quaternion multiplication can be defined using real scalars and vectors. This description requires singling out a specific unit imaginary octonion, and it shows that octonion multiplication is invariant under $\mathrm{SU}(3)$ .
Part 2. A more polished way to think about octonion multiplication in terms of complex scalars and vectors, and a similar-looking way to describe it using the cross product in 7 dimensions.
Part 3. How a lepton and a quark fit together into an octonion — at least if we only consider them as representations of $\mathrm{SU}(3)$ , the gauge group of the strong force. Proof that the symmetries of the octonions fixing an imaginary octonion form precisely the group $\mathrm{SU}(3)$ .
Part 4. Introducing the exceptional Jordan algebra $\mathfrak{h}_3(\mathbb{O})$ : the $3 \times 3$ self-adjoint octonionic matrices. A result of Dubois-Violette and Todorov: the symmetries of the exceptional Jordan algebra preserving their splitting into complex scalar and vector parts and preserving a copy of the $2 \times 2$ adjoint octonionic matrices form precisely the Standard Model gauge group.
Part 5. How to think of $2 \times 2$ self-adjoint octonionic matrices as vectors in 10d Minkowski spacetime, and pairs of octonions as left- or right-handed spinors.
Part 6. The linear transformations of the exceptional Jordan algebra that preserve the determinant form the exceptional Lie group $\mathrm{E}_6$ . How to compute this determinant in terms of 10-dimensional spacetime geometry: that is, scalars, vectors and left-handed spinors in 10d Minkowski spacetime.
Part 7. How to describe the Lie group $\mathrm{E}_6$ using 10-dimensional spacetime geometry.
Part 8. A geometrical way to see how $\mathrm{E}_6$ is connected to 10d spacetime, based on the octonionic projective plane.
Part 9. Duality in projective plane geometry, and how it lets us break the Lie group $\mathrm{E}_6$ into the Lorentz group, left-handed and right-handed spinors, and scalars in 10d Minkowski spacetime.
Part 10. Jordan algebras, their symmetry groups, their invariant structures — and how they connect quantum mechanics, special relativity and projective geometry.
Part 11. Particle physics on the spacetime given by the exceptional Jordan algebra: a summary of work with Greg Egan and John Huerta.
Part 12. The bioctonionic projective plane and its connections to algebra, geometry and physics.

Posted at July 17, 2020 1:34 AM UTC

TrackBack URL for this Entry: https://golem.ph.utexas.edu/cgi-bin/MT-3.0/dxy-tb.fcgi/3233

Re: Octonions and the Standard Model (Part 1)

I want to point out that quaternionic analysis is believed to be related to some special Feynman diagrams of QED (Quantum Electrodynamics), in “On the work of Igor Frenkel”(Perspectives in Representation Theory, A conference celebrating the 60th birthday of Igor Frenkel).~

Posted by: Fen Zuo on July 19, 2020 3:24 AM | Permalink | Reply to this

Looking for precedents to this approach, I stumbled upon this old answer on Math StackExchange, though not all the conjugates seem to be in the same places. (There could have been a typo, or I might be missing something.)

I’m vaguely disappointed that the term “musical isomorphism” was apparently invented because of the notation, rather than some analogy to esoteric music theory.

I don’t want to sidetrack too much, but I don’t see biquaternions mentioned every day, so I figured I’d share a pointer to some notes (that never quite became a paper) about biquaternions and quantum entanglement.

Posted by: Blake Stacey on July 19, 2020 9:42 AM | Permalink | Reply to this

Blake wrote:

Looking for precedents to this approach, I stumbled upon this old answer on Math StackExchange, though not all the conjugates seem to be in the same places.

Thanks! The question here concerns what I called the “mutant cross product”: the variant of the cross product that you get when you have a 3d complex vector space with a positive-definite inner product. Somehow Marek Mitros thought to define this inner product using octonion multiplication — a feat of brilliance that can only be explained by him having spent too much time thinking about the octonions.

But when he goes on to write octonion multiplication in terms of the mutant cross product and other operations, I think he leaves out some necessary complex conjugates. He writes:

Now going to “vice versa”. Let’s define octonions as pairs $(a,\mathbf v)$ where $a$ is complex number and $\mathbf v$ vector in $\mathbb{C}^3$ . Then octonion multiplication can be defined as

$(a,\mathbf v)(b,\mathbf w)=(a b-\mathbf {v\cdot w},a\mathbf w+b \mathbf v + \mathbf {v \times w})$

I hope that above argument with double Cayley-Dickson formula can be used to prove it although I have not done myself this calculation. The reader is urged to do it as an exercise :)

I think he’s using $\mathbf{v}\times\mathbf{w}$ here to denote the mutant cross product which I’m calling $\mathbf{v}\overline{\times}\mathbf{w}$ . I’ll give him the benefit of the doubt and assume he’s using $\mathbf{v}\cdot\mathbf{w}$ to mean the hermitian inner product which I’m calling $\langle \mathbf{v}, \mathbf{w} \rangle$ . But I think he’s leaving out the ‘tricky’ complex conjugate that should appear in one of two terms of $a\mathbf w+b \mathbf v$ . (It doesn’t really matter which one.) And since he didn’t check his guess, he didn’t notice this wrinkle.

I didn’t expect this complex conjugation either: I realized it had to be there when trying to prove we get a normed division algebra.

Anyway, thanks for spotting this!

Well, it’s non-esoteric music theory. In Riemannian geometry, when we “lower indices” using the metric on a manifold $M$ , e.g.

$a_i = g_{i j} a^j$

we’re really using a map $T_x M \to T_x^\ast M$ , and Marcel Berger decided to call this the “flat map”

$\flat \colon T_x M \to T_x^\ast M$

because when you flat a note it’s a bit lower. Similarly for the “sharp map”

$\sharp \colon T_x^\ast \to T_x M$

which “raises indices”.

But maybe this is just what you meant: it’s more about musical notation than music theory proper.

Posted by: John Baez on July 19, 2020 6:00 PM | Permalink | Reply to this

A slight change of basis gets you that instead of $\overline{a}\vec{b} + b\vec{a}$ , you use $a\vec{b}+\overline{b}\vec{a}$ and then you say that moving $b$ past $\vec{a}$ causes a conjugation. I like this version because you keep all of the $a$ factors on the left of each term and all of the $b$ factors on the right and only deal with the fact that we usually scalar-multiply on the left for the end, using the fact that $\vec{a}b = \overline{b}\vec{a}$ . I think this also gives a cleaner embedding of $\mathbb{C}$ as both scalars acting on $\mathbb{C}^3$ and as a subalgebra of the octonions. Unfortunately it doesn’t quite match the standard formula for Cayley-Dickson multiplication, but it does give the octonions unless I screwed up a sign somewhere.

Posted by: Layra on July 19, 2020 10:24 AM | Permalink | Reply to this

Hi, Layra! Thanks for checking this stuff.

A slight change of basis gets you that instead of $\overline{a}\vec{b} + b\vec{a}$ , you use $a\vec{b}+\overline{b}\vec{a}$ .

I noticed that I could put the complex conjugate in either place, and I picked one randomly. I like yours better now!

At first I was confused when you said

using the fact that $\vec{a}b =\overline{b}\vec{a}$ .

I thought “I wouldn’t call this a ‘fact’, more like a definition. Is he making $\mathbb{C}^3$ into a $\mathbb{C}$ -bimodule where left and right multiplication obey this rule?”

Then I realized what you meant. To put it another way: we’re treating $\mathbb{O}$ as $\mathbb{C} \oplus \mathbb{C}^3$ using your variant of my formula, and this gives rise to a $\mathbb{C}$ -bimodule structure on $\mathbb{C}^3$ , which has this property

$\vec{a}b =\overline{b}\vec{a}$

By the way, for anyone watching: it’s not immediately obvious that octonion multiplication on $\mathbb{C} \oplus \mathbb{C}^3$ makes $\mathbb{C}^3$ into a $\mathbb{C}$ -bimodule. It gives a way to multiply vectors in $\mathbb{C}^3$ on the left and on the right by $\mathbb{C}$ . But octonion multiplication is not associative, so we have to think a bit to see why the bimodule axiom

$(a \vec{v}) b = a (\vec{v} b)$

holds. It holds because the octonions are an alternative algebra: they have enough associativity to prove this.

Posted by: John Baez on July 19, 2020 6:18 PM | Permalink | Reply to this

I rewrote the comment a bit and during that process I lost the exact context for $\vec{a}b = \overline{b}\vec{a}$ , which is that the equation is true for $\vec{a}$ and $b$ being octonions, with $b$ being from the embedding of $\mathbb{C}$ and $\vec{a}$ from the orthogonal imaginary space.

So rather than $\mathbb{C}$ acting on $\mathbb{C}^3$ as a $\mathbb{C}$ -bimodule, it’s just multiplication as subspaces of the octonions. This happens to look like a bimodule action, but for me the compatibility of the left and right actions was mostly irrelevant.

Posted by: Layra on July 19, 2020 9:53 PM | Permalink | Reply to this

Hmm, I did some calculations and now I think

$(a + \vec{a})(b + \vec{b}) = a b - \langle \vec{a}, \vec{b}\rangle + a \vec{b} + \overline{b} \vec{a} + \vec{a} \overline{\times} \vec{b}$

does not make $\mathbb{C} \oplus \mathbb{C}^3$ into a normed division algebra. When I calculate I get

$\|(a + \vec{a})(b + \vec{b})\|^2 = \|a b - \langle\vec{a}, \vec{b}\rangle + a \vec{b} + \overline{b} \vec{a} + \vec{a} \overline{\times} \vec{b} \|^2$

$= |a b|^2 - 2 \mathrm{Re}(a b \langle \vec{b} , \vec{a} \rangle) + |\langle \vec{a}, \vec{b}\rangle|^2 +$ $\| a \vec{b} \|^2 + \|\overline{b} \vec{a} \|^2 + \|\vec{a} \overline{\times} \vec{b}\|^2 +$ $2 \mathrm{Re} \big( \overline{a} \overline{b} \langle \vec{b}, \vec{a} \rangle + a \langle \vec{b}, \vec{a} \overline{\times} \vec{b} \rangle + \overline{b} \langle \vec{a}, \vec{a} \overline{\times} \vec{b} \rangle \big)$

and the terms involving real parts no longer cancel as they did in my blog post. By the way, to understand this calculation you need to know I’m using an inner product that’s conjugate-linear in the first argument.

I believe this would work, but I haven’t checked it:

$(a + \vec{a})(b + \vec{b}) = a b - \langle \vec{b}, \vec{a}\rangle + a \vec{b} + \overline{b} \vec{a} + \vec{a} \overline{\times} \vec{b}$

Or alternatively: switch to an inner product that’s conjugate-linear in the second argument.

Posted by: John Baez on July 21, 2020 9:09 PM | Permalink | Reply to this

Yeah, you need to do something about the inner product. I would personally go with making the second argument conjugate linear, because that’s again, describable as moving a scalar coefficient past a vector to get it out of the inner product and thus picking up a conjugation.

Posted by: Layra on July 22, 2020 12:37 AM | Permalink | Reply to this

That makes me sad because in all of my work I always follow the physicist’s convention of making the first argument in the inner product conjugate-linear. I’ve never been in a situation before where this becomes awkward.

Posted by: John Baez on July 22, 2020 12:51 AM | Permalink | Reply to this

A very interesting perspective of the octonions. I think I finally got a grasp on the octonions geometrically. I would like to point out this construction can be done via the Cayley-Dickson perspective and that explains why $\overline b \vec a= \vec a b$ . You can define octonions to be $2\times 2$ matrices of quaternions (the Cayley-Dickson construction via matrices). So, we have

(1)

\begin{pmatrix} u & v\\ -\overline v & \overline u \end{pmatrix}

where $u$ and $v$ are quaternions. If we write $u=u_0+u_1i+u_2j+u_3k$ , then we obtain the above matrix can be written as

(2)

(u_0-v_0E_2E_1)+(v_1+u_1E_2E_1)E_1i+(v_2+u_2E_2E_1)E_1j+(v_3+u_3E_1E_2)E_1k

where

(3)

E_1=\begin{pmatrix} 1 & 0\\ 0 & -1\end{pmatrix}

and

(4)

E_2=\begin{pmatrix} 0 & 1\\ 1 & 0\end{pmatrix}.

Defining the symbols $\mathbf i:=E_2E_1$ , $\mathbf I:=E_1i$ , $\mathbf J:=E_1j$ and $\mathbf k:=E_1k$ , we obtain

(5)

\mathbf I \mathbf J = \mathbf K,\quad\mathbf I^2 = \mathbf K^2 = \mathbf J^2 = -1,

as in the quaternionic world, and the extra relations

(6)

\mathbf I\mathbf i = - \mathbf I\mathbf i,\quad \mathbf J\mathbf i = - \mathbf i\mathbf J, \quad \mathbf K\mathbf i = - \mathbf i\mathbf K.

So, we can write octonions as

(7)

a+ b\mathbf I+c\mathbf J+d\mathbf K,

where $a,b,c,d \in \mathbb{C} := \mathbb{R} \oplus \mathbf i \mathbb{R}$ (observe $\mathbf i^2 = -1$ ). Note a complex number $z$ does not commute with a pure octonion $u=u_1\mathbf I+u_2\mathbf J+u_3\mathbf K$ . Indeed, $z u = u \overline z$ , and that explain the $\overline b$ in your multiplication formula (if I didn’t make any mistakes).

Posted by: Hugo on July 19, 2020 1:56 PM | Permalink | Reply to this

Hello again! It seems I’ve made lots of mistakes. In my head, the Cayley-Dickson construction would work with the usual matrix multiplication. So, let me try to fix my argument. In order to think about octonions as matrices of quaternions

\begin{pmatrix} u & v\\ -\overline v & \overline u \end{pmatrix}

I believe we have to redefine the multiplication of matrices to be

\begin{pmatrix} u & v\\ -\overline v & \overline u \end{pmatrix} \begin{pmatrix} u' & v'\\ -\overline {v'} & \overline {u'} \end{pmatrix} := \begin{pmatrix} u u' - \overline{v'}v & v'u+v\overline{u'}\\ -\overline{u}\overline{v'}-u'\overline{v} &\overline{u'}\overline{u} - \overline{v}v' \end{pmatrix}

Observe that for real matrices the operation is the usual one, and the operations I did previously seems legal. This multiplication coincides with the Cayley-Dickson construction because it’s essentially the operation

(u,v)\cdot (u',v') = (u u' - \overline{v'}v , v'u+v\overline{u'}),

determined by the first line.

Well, things seems to work now.

Errata: Mistake in line $(2)$ . Correction:

\begin{pmatrix} u & v\\ -\overline v & \overline u \end{pmatrix} =(u_0-v_0E_2E_1)+(u_1+v_1E_2E_1)E_1i+(u_2+v_2E_2E_1)E_1j+(u_3+v_3E_2E_1)E_1k

Posted by: Hugo on July 19, 2020 5:22 PM | Permalink | Reply to this

Nice! Yes, my formula amounts to reinterpreting the Cayley–Dickson construction to reduce the emphasis on “ordered pairs of quaternions” and increase the emphasis on “an analogue of quaternions built starting with complex rather than real numbers”.

Posted by: John Baez on July 19, 2020 7:21 PM | Permalink | Reply to this

Something looks to be missing from this sentence in the original post:

Since $\mathrm{SU}(3)$ there must be a complementary 6-dimensional subspace

Posted by: Blake Stacey on July 20, 2020 12:40 AM | Permalink | Reply to this

There’s also a typo between “We can expand the second term:” and “But note that”, with $|$ instead of $||$

Posted by: David Roberts on July 20, 2020 2:59 AM | Permalink | Reply to this

Whoops — thanks, Blake and David! I’ll fix things up.

I meant “since $\mathrm{SU}(3)$ is compact”. Given a continuous rep of a compact group on a finite-dimensional real or complex vector space, we can pick an invariant inner product, and then every invariant subspace has a complementary invariant subspace, namely the orthogonal complement.

Posted by: John Baez on July 20, 2020 6:01 AM | Permalink | Reply to this

Jeffery

Perhaps you can talk about Geoffery Dixon’s work relating octonions to the Standard Model.

Geoffrey Dixon has a theory of physics based on the algebra C x H x O. He claims that this simple starting point predicts the mathematical structure of the Standard Model. So it’s just interesting that you can start with something as basic as the four normed division algebras, and from that, get something recognizable as the Standard Model.

http://www.7stones.com/Homepage/10Dnew.pdf

http://www.7stones.com/Homepage/10parity.pdf

http://www.7stones.com/Homepage/14mix.pdf

http://www.7stones.com/Homepage/6x6.pdf

http://www.7stones.com/Homepage/123cho.pdf

Posted by: Jeffery Winker on July 22, 2020 3:32 PM | Permalink | Reply to this

Re: Jeffery

Geoffrey Dixon is an old friend of mine and I have old notebooks where I took his ideas and tried to polish up the mathematics. That stuff is lurking behind the scenes of my posts here. In particular, he used the algebra $\mathbb{C} \otimes \mathbb{H} \otimes \mathbb{O}$ to try to get ahold of the $\mathrm{U}(1) \times \mathrm{SU}(2) \times \mathrm{SU}(3)$ group of the Standard Model. I’m focusing on $\mathbb{O}$ right now. To get from the automorphism group of $\mathbb{O}$ down to $\mathrm{SU}(3)$ , he restricts attention to automorphisms that preserve a unit imaginary octonion $i$ . I explain exactly why that works in Part 3. But the observation that

$\mathrm{SU}(3) \cong \{g \in \mathrm{Aut}(\mathbb{O}) : \; g i = i \}$

is old, and even its application to the strong force predate his work, I’m pretty sure. He goes further than that.

I need to learn the history of these ideas better. I think a lot is due to Murat G¼naydin.

Posted by: John Baez on August 2, 2020 9:37 PM | Permalink | Reply to this

Your definition of the inner product on $\mathbb{C}^3$ must have a conjugation in it, and I can believe it does if I squint, but, at least in Safari 13.1.1, it’s so close to the $v$ that it’s almost invisible, and I wouldn’t have noticed it if I hadn’t been looking for it. Maybe it’s just another one of Safari’s math rendering quirks ….

The n-Category Café

Skip to the Main Content

July 17, 2020