Octonions and the Standard Model (Part 2)

Posted by John Baez

$MathML-enabled post (click for more details).$

My description of the octonions in Part 1 raised enough issues that I’d like to talk about it a bit more. I’ll show you a prettier formula for octonion multiplication in terms of $\mathbb{C} \oplus \mathbb{C}^3$ … and also a very similar-looking formula for it in terms of $\mathbb{R} \oplus \mathbb{R}^7$ .

$MathML-enabled post (click for more details).$

Last time I took the usual construction of the quaternions from $\mathbb{R}^3$ and modified it to construct the octonions from $\mathbb{C}^3$ . The key step was replacing the dot product on $\mathbb{R}^3$ by the inner product on $\mathbb{C}^3$ , which is conjugate-linear in one slot.

Layra Idarani pointed out some other conventions that make the formula for multiplication more beautiful, so let me explain those here. This stuff deserves to be made beautiful.

First, we’ll make the inner product conjugate-linear in the second slot:

$\langle \vec{a} , \vec{b} \rangle = \sum_{i=1}^3 a_i \overline{b}_{i}$

It breaks my heart to do this: physicists do it the other way, so this is the convention preferred only by mathematicians who have never studied quantum mechanics. But life is cruel: sticking with the other convention would force me to do other unpleasant things.

Next, we define the hermitian cross product of vectors in $\mathbb{C}^3$ by

$(\vec{a} \overline{\times} \vec{b})_i = \overline{(\vec{a} \times \vec{b})_{i}}$

We saw last time that this operation is invariant under $\mathrm{SU}(3)$ , though the usual cross product is not, nor is the operation of taking the conjugate of each component of a vector.

Lastly, we’ll define a funny way of multiplying vectors in $\mathbb{C}^3$ on the right by complex numbers:

$\vec{a} a = \overline{a} \vec{a}$

for all $a \in \mathbb{C}, \vec{a} \in \mathbb{C}^3$ .

The payoff for all these contortions is a very pretty formula for octonion multiplication. We’ll take elements of $\mathbb{C} \oplus \mathbb{C}^3$ and write them like $a + \vec{a}$ , where the ‘scalar part’ $a$ lives in $\mathbb{C}$ and the ‘vector part’ $\vec{a}$ lives in $\mathbb{C}^3$ . And these will be our octonions!

Theorem 2. If we define multiplication on $\mathbb{C} \oplus \mathbb{C}^3$ by $(a + \vec{a})(b + \vec{b}) = a b - \langle \vec{a}, \vec{b}\rangle + a \vec{b} + \vec{a} b + \vec{a} \overline{\times} \vec{b}$ then this space becomes a 8-dimensional normed division algebra over the real numbers, since $1 \in \mathbb{C}$ acts as the multiplicative identity and $\|(a + \vec{a})(b + \vec{b})\| = \|a + \vec{a}\| \|b + \vec{b}\|$ This normed division algebra is isomorphic to the octonions.

I’ll prove this at the end of this post. It’s almost exactly like the proof of the theorem last time, but I want everyone to be able to check it.

The idea of making $\mathbb{C}^3$ into a right $\mathbb{C}$ -module in a nontrivial way, ‘twisted by conjugation’, may seem weird. Pragmatically speaking we do this to hide the appearance of conjugation in our formula for multiplication. But it’s actually quite natural. Let me explain why.

First of all, note that if we pick any element $i \in \mathbb{O}$ with $i^2 = -1$ , we get a copy of $\mathbb{C}$ sitting in $\mathbb{O}$ as a subalgebra. $\mathbb{O}$ then becomes a left $\mathbb{C}$ -module by left multiplication. This is not as obvious as it sounds, since it’s saying that

$a(b x) = (a b)x$

for all $a, b \in \mathbb{C} \subseteq \mathbb{O}$ and $x \in \mathbb{O}$ . The octonions are not associative! But they are alternative — the subalgebra generated by any two elements is associative — and in the above equation $a$ and $b$ are in the subalgebra generated by $i$ , while $x$ is in the subalgebra generated by $x$ . So the desired equation holds!

We can similarly make $\mathbb{O}$ into a right $\mathbb{C}$ -bimodule by right multiplication. Even better, $\mathbb{O}$ is then a $\mathbb{C},\mathbb{C}$ -bimodule. The reason is that

$(a x)b = a(x b)$

for all $a, b \in \mathbb{C} \subseteq \mathbb{O}$ and $x \in \mathbb{O}$ , again because the octonions are alternative.

The left and right actions of $\mathbb{C}$ on the octonions are really different:

$a x \ne x a$

for most $a \in \mathbb{C}$ , $x \in \mathbb{O}$ , because the octonions are noncommutative. But we can say more than just this negative statement. There’s an inner product on the octonions that’s invariant under all automorphisms, so we can pick an invariant complement $V$ to $\mathbb{C} \subseteq \mathbb{O}$ . We then have

$a v = v \overline{a}$

for $a \in \mathbb{C}$ , $v \in V$ , because orthogonal square roots of $-1$ in the octonions anticommute.

$V$ is a $\mathbb{C},\mathbb{C}$ -bimodule. As a left $\mathbb{C}$ -module it’s isomorphic to $\mathbb{C}^3$ . So, we can use this idea to describe $\mathbb{O}$ as $\mathbb{C} \oplus \mathbb{C}^3$ , and it then becomes very natural to have $\mathbb{C}$ act on $\mathbb{C}^3$ in a ‘twisted’ way, obeying $a v = v \overline{a}$ . This is exactly what we need to make $\mathbb{C}^3$ isomorphic to $V$ as a $\mathbb{C},\mathbb{C}$ -bimodule!

It’s probably good to summarize this:

Theorem 3. Any element $i \in \mathbb{O}$ with $i^2 = -1$ generates a subalgebra of the octonions that can be canonically identified with $\mathbb{C}$ . Left and right multiplication by this copy of $\mathbb{C}$ makes $\mathbb{O}$ into a $\mathbb{C},\mathbb{C}$ -bimodule. As a $\mathbb{C},\mathbb{C}$ -bimodule we then have $\mathbb{O} \cong \mathbb{C} \oplus \mathbb{C}^3$ where the left action of $\mathbb{C}$ on $\mathbb{C} \oplus \mathbb{C}^3$ is the usual one, but the right action is given by $(b , \vec{b})a = (a b, \overline{a}\vec{b})$ for $a,b \in \mathbb{C}, \vec{b} \in \mathbb{C}^3$ .

Finally, one more thing for today — a cute fact I’m not sure what to do with.

We can also take the octonions and split them as $\mathbb{R} \oplus \mathbb{R}^7$ where $\mathbb{R}$ contains $1$ and $\mathbb{R}^7$ contains all the ‘imaginary’ octonions: those orthogonal to $1$ . We can thus write any octonion as $a + \vec{a}$ where now $a \in \mathbb{R}$ and $\vec{a} \in \mathbb{R}^7$ . And now octonion multiplication looks like this:

$(a + \vec{a})(b + \vec{b}) = a b - \vec{a} \cdot \vec{b} + a \vec{b} + \vec{a} b + \vec{a} \times \vec{b}$

This formula looks almost exactly like the one in the theorem! But now $\vec{a} \cdot \vec{b}$ is defined using the usual dot product on $\mathbb{R}^7$ , and $\vec{a} \times \vec{b}$ is defined using a funny cross product in 7 dimensions, which unfortunately is easiest to define using the octonions. So, this formula is less useful than the one in terms of $\mathbb{C} \oplus \mathbb{C}^3$ if you’re trying to get off the ground and explain the octonions from scratch.

Oh, and by the way: I wrote $\vec{a} b$ in this formula just to be cute: in this real context it’s just the same as $b \vec{a}$ : no sneaky tricks with complex conjugation.

That’s all for today, except proving this:

Proof. It’s enough to show that we have a normed division algebra, since the octonions are the only 8-dimensional normed division algebra over the reals. We’ll use the norm with

$\|a + \vec{a}\|^2 = |a|^2 + \langle \vec{a} , \vec{a} \rangle$

and show that

$\|(a + \vec{a})(b + \vec{b})\| = \|a + \vec{a}\| \|b + \vec{b}\|$

We start with

$\|(a + \vec{a})(b + \vec{b})\|^2 = \|a b - \langle\vec{a}, \vec{b}\rangle + a \vec{b} + \vec{a} b + \vec{a} \overline{\times} \vec{b} \|^2$

and use the definition of the norm to break this up into two terms:

$|a b - \langle \vec{a}, \vec{b} \rangle|^2 + \| a \vec{b} + \vec{a} b + \vec{a} \overline{\times} \vec{b} \|^2$

We can expand the first term:

$|a b - \langle \vec{a}, \vec{b} \rangle|^2 = |a b|^2 - 2 \mathrm{Re}(a b \langle \vec{b} , \vec{a} \rangle) + |\langle \vec{a}, \vec{b}\rangle|^2$

We can expand the second term:

$\|a \vec{b} + \vec{a} b + \vec{a} \overline{\times} \vec{b} \|^2 =$ $\| a \vec{b} \|^2 + \|\vec{a} b\|^2 + \|\vec{a} \overline{\times} \vec{b}\|^2 + 2 \mathrm{Re} \big(a b \langle \vec{b}, \vec{a} \rangle + \overline{a} \langle \vec{b}, \vec{a} \overline{\times} \vec{b} \rangle + b \langle \vec{a}, \vec{a} \overline{\times} \vec{b} \rangle \big)$

But note that

$\langle \vec{a}, \vec{a} \overline{\times} \vec{b} \rangle = \vec{a} \cdot (\vec{a} \times \vec{b}) = 0$

by a well-known vector identity which works for complex vectors just as for real ones. Similarly $\langle \vec{b}, \vec{a} \overline{\times} \vec{b} \rangle = 0$ . So, the expanded second term simplifies, and when we add it to the expanded first term the bits involving $\mathrm{Re}(a b \langle \vec{b} , \vec{a} \rangle)$ cancel out. We’re left with this:

$|a b|^2 + |\langle \vec{a}, \vec{b}\rangle|^2 + \| a \vec{b} \|^2 + \|\vec{a} b\|^2 + \|\vec{a} \overline{\times} \vec{b}\|^2$

We need to show this equals

$\begin{array}{ccl} \|a + \vec{a} \|^2 \| b + \vec{b}\|^2 &=& (|a|^2 + \|\vec{a}\|^2) (|b|^2 + \|\vec{b}\|^2) \\ \\ &=& |a|^2 |b|^2 + |a|^2 \|\vec{b}\|^2 + |b|^2 \|\vec{a}\|^2 + \|\vec{a}\|^2 \|\vec{b}\|^2 \end{array}$

Three terms match, so it’s enough to show

$|\langle \vec{a}, \vec{b}\rangle|^2 + \|\vec{a} \overline{\times} \vec{b}\|^2 = \|\vec{a}\|^2 \|\vec{b}\|^2$

This would be a familiar vector identity if we were working in $\mathbb{R}^3$ with the usual cross product instead of the mutant one. But notice, the mutant cross product is obtained from the ordinary one by componentwise complex conjugation, so

$\|\vec{a} \overline{\times} \vec{b}\|^2 = \|\vec{a} \times \vec{b}\|^2$

Thus, we just need to show

$|\langle \vec{a}, \vec{b}\rangle|^2 + \|\vec{a} \times \vec{b}\|^2 = \|\vec{a}\|^2 \|\vec{b}\|^2$

This works for $\mathbb{C}^3$ and its inner product exactly as it does for $\mathbb{R}^3$ and its dot product: you can either write out both sides using components and see they agree, or do a geometrical argument using the law of cosines and the law of sines. █

Part 1. How to define octonion multiplication using complex scalars and vectors, much as quaternion multiplication can be defined using real scalars and vectors. This description requires singling out a specific unit imaginary octonion, and it shows that octonion multiplication is invariant under $\mathrm{SU}(3)$ .
Part 2. A more polished way to think about octonion multiplication in terms of complex scalars and vectors, and a similar-looking way to describe it using the cross product in 7 dimensions.
Part 3. How a lepton and a quark fit together into an octonion — at least if we only consider them as representations of $\mathrm{SU}(3)$ , the gauge group of the strong force. Proof that the symmetries of the octonions fixing an imaginary octonion form precisely the group $\mathrm{SU}(3)$ .
Part 4. Introducing the exceptional Jordan algebra $\mathfrak{h}_3(\mathbb{O})$ : the $3 \times 3$ self-adjoint octonionic matrices. A result of Dubois-Violette and Todorov: the symmetries of the exceptional Jordan algebra preserving their splitting into complex scalar and vector parts and preserving a copy of the $2 \times 2$ adjoint octonionic matrices form precisely the Standard Model gauge group.
Part 5. How to think of $2 \times 2$ self-adjoint octonionic matrices as vectors in 10d Minkowski spacetime, and pairs of octonions as left- or right-handed spinors.
Part 6. The linear transformations of the exceptional Jordan algebra that preserve the determinant form the exceptional Lie group $\mathrm{E}_6$ . How to compute this determinant in terms of 10-dimensional spacetime geometry: that is, scalars, vectors and left-handed spinors in 10d Minkowski spacetime.
Part 7. How to describe the Lie group $\mathrm{E}_6$ using 10-dimensional spacetime geometry.
Part 8. A geometrical way to see how $\mathrm{E}_6$ is connected to 10d spacetime, based on the octonionic projective plane.
Part 9. Duality in projective plane geometry, and how it lets us break the Lie group $\mathrm{E}_6$ into the Lorentz group, left-handed and right-handed spinors, and scalars in 10d Minkowski spacetime.
Part 10. Jordan algebras, their symmetry groups, their invariant structures — and how they connect quantum mechanics, special relativity and projective geometry.
Part 11. Particle physics on the spacetime given by the exceptional Jordan algebra: a summary of work with Greg Egan and John Huerta.
Part 12. The bioctonionic projective plane and its connections to algebra, geometry and physics.

Posted at July 22, 2020 7:36 PM UTC

TrackBack URL for this Entry: https://golem.ph.utexas.edu/cgi-bin/MT-3.0/dxy-tb.fcgi/3234

Re: Octonions and the Standard Model (Part 2)

Typo:

$\mathbb{O}$ then becomes [a] left $\mathbb{C}$ -module by left multiplication.

Also, there seems to be one too many hyphens after “alternative”.

Posted by: Blake Stacey on July 23, 2020 8:53 AM | Permalink | Reply to this

Thanks, fixed! I might someday turn this stuff into a paper or at least a bunch of webpages, so it’s nice to get everything corrected.

Posted by: John Baez on July 23, 2020 11:03 PM | Permalink | Reply to this

Right action on complex vector space

Here’s another take on why you should define the right action as you did.

Let $A$ be a $k$ -algebra (i.e. $k$ central in $A$ ), and $M$ a left $A$ -module. Let $M^\ast := Hom_k(A,k)$ . Then one easily makes $M^*$ into a right $A$ -module. If we want it to be a left $A$ -module, we can use an anti-homomorphism $\iota: A \to A$ , and precompose the natural right action with that. But now $(M^\ast)^\ast$ has two left module structures: the natural one (dual of a right module) and the new one (which involves $\iota$ twice). These will only agree if $\iota^2 = Id$ .

One reason we’d want to make the right module $M$ into a left module is to be able to define module morphisms $M\to M^*$ , i.e. inner products on $M$ .

If $A$ is commutative, then there’s an obvious anti-involution - the identity. But usually this isn’t available. If $A =$ $kG$ is a group algebra, there’s another one people like, taking $g \mapsto g^{-1}$ , which is how we usually define “the dual of a rep of $G$ ”. If $A$ is a ring of matrices, there’s transpose.

Now consider $A =$ $\mathbb{H}$ , $k =$ $\mathbb{R}$ . Up to conjugation, the only anti-involution is quaternion conjugate. If we think of the complex numbers as living in $\mathbb{H}$ , then it’s unnatural to use $\iota=Id$ on $\mathbb{C}$ - that won’t extend to $\mathbb{H}$ . Similarly, if we use the usual $2\times 2$ real picture of $\mathbb{C}$ , we run into transpose as the natural anti-involution. The view here is that $\mathbb{C}$ is “accidentally” commutative - why should matrices commute?

Using complex conjugate for $\iota$ , when we try to define inner products on $\mathbb{C}$ -modules, they are automatically sesquilinear. (They need not be positive definite; that’s not an algebraic condition.)

Posted by: Allen Knutson on July 23, 2020 1:14 PM | Permalink | Reply to this

The chatty discursive style of a blog article is fun to read, but sometimes you don’t have the patience to pick out the theorems lurking in the chatter, so I decided to add a theorem to my post above:

Posted by: John Baez on July 23, 2020 11:00 PM | Permalink | Reply to this

A bit tangentially, I’m pretty sure that you can’t construct the sedenions as $\mathbb{H}\oplus \mathbb{H}^3$ in the same manner, even accounting for all of the nonsense you have to do to get “scalar multiplication” to work out. The problem is that there is a distinguished copy of $\mathbb{C}$ that you can pick out in the sedenions using only the associator, and unless you’re really willing to break things, $\mathbb{H}\oplus \mathbb{H}^3$ has too much symmetry going on to single out a particular copy of $\mathbb{C}$ .

I haven’t tested out whether you can construct the sedenions as $\mathbb{C}\oplus \mathbb{C}^7$ as a twisted complexification of the octonion construction, but at least this one has a distinguished embedding of $\mathbb{C}$ in it.

Posted by: Layra on July 24, 2020 12:21 AM | Permalink | Reply to this

Interesting norm identity?

To me it feels like there is also something interesting going one in the final form of the product of two octonions in the final proof - can’t put my finger on it exactly, but the fact that the norm sort of threads over the terms of the product terms seems vaguely significant, does it not?

Posted by: Isabel Pirsic on July 25, 2020 8:15 PM | Permalink | Reply to this

Re: Interesting norm identity?

I’m not sure what you mean by “the norm sort of threads over the terms of the product terms”. But if you’re talking about this formula

$(a + \vec{a})(b + \vec{b}) = a b - \langle \vec{a}, \vec{b}\rangle + a \vec{b} + \vec{a} b + \vec{a} \overline{\times} \vec{b}$

what’s nice is that each scalar or vector is getting multiplied by each scalar or vector in each possible way.

Posted by: John Baez on July 26, 2020 2:01 AM | Permalink | Reply to this

I’m slightly confused by the rule

$\vec{a} b = \bar{b} \vec{a}$

How does it actually work in components?

Because on one hand I might have ( $a_j$ and $b$ are reals, $i$ is the imaginary unit)

$\begin{pmatrix} a_1 \\ a_2 \\ a_3 \end{pmatrix} b i = \begin{pmatrix} a_1 b i \\ a_2 b i \\ a_3 b i \end{pmatrix}$

And on the other hand,

$-i \begin{pmatrix} a_1 \\ a_2 \\ a_3 \end{pmatrix} b = \begin{pmatrix} - i a_1 b \\ - i a_2 b \\ - i a_3 b \end{pmatrix}$

but those are supposed to be the same thing, right?

Is the right rule simply, that, if I pull an $i$ out of a vector, it switches sign if it’s pulled out to the left but stays the same if it’s pulled out to the right? (And in reverse if it’s pulled in)

i.e would it actually be:

$-i \begin{pmatrix} a_1 \\ a_2 \\ a_3 \end{pmatrix} b = \begin{pmatrix} i a_1 b \\ i a_2 b \\ i a_3 b \end{pmatrix} = \begin{pmatrix} a_1 b i \\ a_2 b i \\ a_3 b i \end{pmatrix} = \begin{pmatrix} a_1 \\ a_2 \\ a_3 \end{pmatrix} b i$

Posted by: kram1032 on August 3, 2020 2:05 PM | Permalink | Reply to this

Kram wrote:

I’m slightly confused by the rule

$\vec{a} b = \bar{b} \vec{a}$

How does it actually work in components?

I was defining the left hand side, which is something mathematicians hardly ever write, in terms of the right-hand side, which is a standard thing. People multiply vectors on the left by scalars all the time. Here I’m using that to define a way to multiply vectors on the right by scalars.

In components I’d write $\vec{a} b$ as

$(a_1,a_2,a_3) b$

and I’m defining this to mean

$(\overline{b} a_1, \overline{b}a_2, \overline{b} a_3)$

which is often written $\bar{b} \vec{a}$ .

Is the right rule simply, that, if I pull an $i$ out of a vector, it switches sign if it’s pulled out to the left but stays the same if it’s pulled out to the right?

Close, but no: it stays the same if it’s pulled out to the left but switches sign if it’s pulled out to the right.

… I might have ( $a_j$ and $b$ are reals, $i$ is the imaginary unit)

$\begin{pmatrix} a_1 \\ a_2 \\ a_3 \end{pmatrix} b i = \begin{pmatrix} a_1 b i \\ a_2 b i \\ a_3 b i \end{pmatrix}$

No, you are not following my definition of what it means to multiply a vector on the right by $b i$ . When $b$ is real, my definition says we multiply a vector on the right by $b i$ by multiplying it on the left by $- bi$ . So you get

$\begin{pmatrix} a_1 \\ a_2 \\ a_3 \end{pmatrix} b i = \begin{pmatrix} - b i a_1 \\ -b i a_2 \\ -b i a_3 \end{pmatrix}$

which is the negative of what you said.

(You’re using column vectors, I was using row vectors since they’re quicker to type, but that doesn’t make any difference here.)

Posted by: John Baez on August 3, 2020 8:52 PM | Permalink | Reply to this

The n-Category Café

Skip to the Main Content

July 22, 2020