Re: Unitary Representations of the Poincaré Group
Evan Jenkins wrote:
OK, since everybody else has gone quiet, I’ll step up. I’ve been cheating and looking at Shlomo Sternberg’s delightful Group Theory and Physics, so I think I can more or less explain the derivation of the Klein-Gordon equation.
Great! Todd has said he isn’t giving up, but he’s busy just now. So thanks, Evan, for jumping in and pushing the ball forward!
Everything you say looks right to me. So, all I can offer is a little bit of material that connects what you’re saying to some of what I wrote. A while back, I wrote something like this:
Stone’s theorem says:
Any strongly continuous unitary representation of on a Hilbert
space is of the form for a unique (possibly unbounded)
self-adjoint operator on this Hilbert space. Conversely, any such
operator gives a strongly continuous unitary representation of
by this formula.
The spectral theorem says:
Suppose is a (possibly unbounded) self-adjoint operator on a Hilbert
space. Then this Hilbert space is isomorphic to for some measure
space , and making use of this isomorphism, becomes a multiplication
operator:
where
is some measurable function. And in this situation,
Now, stick these theorems together — perhaps with a little glue — and see what happens!
If we do this, we get:
Let be a measure space and let
a measurable function. Then there is a strongly continuous unitary representation of on given as follows:
Moreover, every strongly continuous unitary representation of is unitarily equivalent to one of this form!
So, this is a nice concrete description of all the strongly continuous unitary representations of the very first Lie group to be born at the beginning of time: the real line.
Let’s look at a baby example. Let’s take to be single point! Then the function is really just a real number, and
So, we’re getting a 1-dimensional representation of , given by
Moreover, our theorem assures that every strongly continuous unitary representation of is of this form!
If we leave out that annoying adjective ‘strongly continuous’, there are many more — at least if you believe in the Axiom of Choice. But let’s not go there. From now, in this thread I’ll use rep to mean ‘strongly continuous unitary representation’.
Okay, now let’s return to the case of a general measure space . If we pick a measurable function , we get a rep of the real line thanks to our theorem. But each point of is giving us a one-dimensional rep of the real line. And, I hope you see there’s some sense in which our rep is ‘built’ from all these 1-dimensional reps.
What is this sense, exactly? If is a finite set, our rep is a direct sum of these 1-dimensional reps , one for each point . But in general, is a direct integral of these 1-dimensional reps.
You may not know what a ‘direct integral’ is, but the point is that is built from lots of copies of , one for each point of , in a way that involves integrals. And the same is true of our rep of the real line on ! It’s fun to work out exactly what’s going on here… and then direct integrals will lose their terror, because they’re all pretty much like this.
Now, maybe someone can step up the plate and tackle some of these:
-
Take the rep of the real line on that acts by translating functions:
Hit it with our classification of reps of the real line and see what you get!
-
Guess which 1-dimensional reps is a direct integral of. What famous thing are we secretly talking about here?
-
Generalize Stone’s theorem and the spectral theorem from to . What’s the classification of reps of the additive group ? What do the one-dimensional reps look like?
-
Generalize problem 1 to . Take this rep of on :
where now . Hit it with your classification of reps of and see what it looks like.
-
Generalize problem 2 to . What famous thing are we talking about now?
-
Why is all this incredibly closely linked to what Evan was just talking about? How does the hyperboloid
get into the game? How does the Klein–Gordon equation get into the game? Can you see why is a direct integral of Hilbert spaces of the form ?
Re: Unitary Representations of the Poincaré Group
The article described below is a complete redo of Wigner’s sections 1-4 and much of 5-7. It was originally intended for this forum, but is too long. Even the summary to follow is.
Plus, everything is in PDF or in HTML making use of HTML’s math and layout facilities that are not supported in n-Category Cafe.
Instead, the entire thread, as well as my reply, have been translated into HTML and placed under the web link attached to my name on the blog header. For reference, it is:
The Wigner Classification for Galilei Poincare and Euclid
and its supplements:
Poincare Representations - n-Category Cafe (the thread)
The Wigner Classification for General 4-Space Signatures (the reply)
Related reading includes:
A. P. Balachandra, G. Marmo, B.-S. Skagersam, A. Stern,
Gauge Symmetries and Fibre Bundles, Applications in Particle Dynamics, Lecture Notes in Physics 188;
whose coordinatization of Non-Helical Sector 1L (described below) goes beyond what I describe; and
N. P. Landsman, Mathematical Topics Between Classical and Quantum Mechanics, 1998 Springer (particularly sections III.1 and I.2.5).
Related links that tie into the article are listed at the end of the reply and include:
The Jordan Decomposition in the Unified Group
the detailed analysis that subsumes Wigner’s section 4B.
Towards a General Theory of Signature and Signature Change,
Dimension and Signature
both of these lie at the root of the analysis carried out.
Poisson Algebras, Poisson Manifolds, Symplectic Manifolds, Poisson Bracket
these cover the foundation of the analysis used to subsume Wigner. The articles significantly expand on (and clean up) the Wikipedia originals, with running examples involving the Heisenberg and Spin algebras.
Unification of Galilei, Poincare and Euclidean Symmetry
(UI-Chicago, 2008 October 7)
An on-line version of a talk given at a seminar at UIC, hosted by Kauffman; this includes a vastly expanded account of NOVA’s two Einstein bio series, including not just the respective web sites but also transcribed and annotated copies of several of the original papers by Einstein and Lorentz.
The Newton-Wigner Position Operator - derived from T. F. Jordan
The coordinatization of the class 1L (and the no-go result for carrying out the same for non-Helical class 2L).
The Missing Heisenberg Relation
What happened to the 4th Heisenberg relation?
Definition of Mass
Mass defined from first principles; from the 11th generator of the Unified Group
This is a summary of the sections.
0. Unravelling and Subsuming Wigner
1. Wigner’s Section 2: Linear Representation Theory vs. Symplectic Reduction
Symplectic reduction and the Poisson-Lie manifold over the Lie Group.
2. If Irreps and Particles are Synonymous, then where is the Vacuum Particle?
The concept of irrep as particle has obvious gaps (as indicated by the title of the section). A more comprehensive account of what an irrep actually is - one that includes the Wigner class 3 (and class 4) sectors - starts from an entirely different viewpoint: irreps corresponding to elementary systems that are to be thought of as media, not particles; e.g., an isotropic medium is one which has the rotation generator J as an invariant; a quasi-vacuum, one which has the boost generator K as an invariant. An isotropic quasi-vacuum is a vacuum. A vacuon (which is Wigner’s class 3) is a translation invariant medium.
3. Wigner’s Section 3: The Wigner Sectors for General Signatures
This signficantly expands Wigner’s discussion in Section 3.
The Wigner/von Neumann classes include the following: the Tardion (classes 1/4E, 1L, 1G) which are media with a Staton state; Statons (class 1/2A), which are Archimedia at Absolute Rest, Luxons (class 2L), which are the null media, Lorentzian Tachyons (class 4L), which are Lorentzian media with a Synchron frame, Archimedean Tachyons (class 4A), which are Archimedia in Absolute Motion and Synchrons (class 2/4G) which are Galilean media which support instantaneous action at a distance transfers of impulse across space.
4. Wigner’s Section 4A: Symmetry and Signature
This is an expansion of Wigner’s section 4A which starts out from a general theory of (possibly-degenerate) signatures, replacing the orthogonal group with the bi-orthogonal groups. In addition, treatment by linear representation theory is expanded (and yet, simplified) by its generalization to the non-linear representation theory of Poisson-Lie manifolds and symplectic reduction.
5. Wigner Section 5: Central Extensions
This covers much of what Wigner’s section 5 is dealing with, with a surprising twist, that puts to the lie the notion that the 10 generators of Poincare are enough even when retricting focus to the Lorentzian case.
6. Wigner’s Section 4B-D: The Jordan Decomposition
Wigner botched this part of the proof, along with the rest of section 4 by working with the group, itself, rather than just its Lie algebra. Initially, I thought he did this because he was treating global issues. But a close reading of section 4 shows that he’s only working with the connected subgroup - which completely defeats the point of his analysis!
The correct way to the analysis is thus is with the Lie algebra, not the Lie group! As a bonus, the simplicity of this approach allows one to run through the general cases: all signatures, in one fell swoop; and to even treat the inhomogeneous group (which Wigner does not do, either).
The subsections include
6.1. The Characteristic Equations of the Infinitesimal Generators.
The Jordan classes of the transformations are easily determined on the Lie algebra. Jordan decomposition remains invariant under exponentiation, therefore this applies to the Lie group as well.
6.2. Wigner Section 4B: The Jordan Decomposition
The classes are [(1111)], the identity transformation (all signatures); [(211)] the Galilean boost (Galilean) and Poincare shift (Archimedean); [(31)] the Null Boost (Lorentzian); [(11)zz*] Rotations (all signatures); [zz*zz*] general Euclidean transformations; [(11)zz*] general Lorentz transformations; [(11)11] general Galilean transformation and [(11)11] general Archimedean transformations.
Historically speaking…
All of the Archimedean members of the family are rooted all the way back in the Hellenistic Era, except the Poincare shift. The Galilean boost is, historically, the first bona fide space-time symmetry and is rooted in early modern Europe; while the Poincare shift (and its relativization to the Lorentz boost) are from the late 19th century, dating from Poincare’ study in global time synchronization. (He was one of those involved in the standardization into our present-day time zones).
6.3. Wigner’s Section 4C: Uniqueness of the Boost
This is trivially handled if doing this with the Lie algebra, rather than the Lie group.
The analysis is done here.
6.4. Wigner’s Section 4D: Simplicity of the Lorentz Group
Similarly, this is much more easily dealt with in the Lie algebra, instead of the Lie group. Here, the result generalizes: the Unified Group is simple for all signatures, except the Euclidean. The decomposition in the Euclidean sector is well-known.
This analysis is also done here.
7. Wigner’s Section 6-7: Symplectic Decomposition
No analysis in terms of linear spaces. Instead: a far more general analysis in terms of non-linear representation. Irrep is replaced by symplectic leaf.
7.1. The Vacuon Sectors
Covers the 3 classes of translation-invariant media: the generic vacuon, the quasi-vacuum and the vacuum.
7.2. The Archimedean Tachyon Sectors
Covers a special subclass of translation non-invariant media: those specific to the Archimedean signature, corresponding to systems in motion.
7.3. Wigner’s Section 6: Translation Invariants
The translation-invariant sectors are the vacuons; while the translation non-invariant sectors subdivide into the Archimeden Tachyon, the Helical and Non-Helical sectors.
Part of Wigner’s section 6 deals with the translation-invariants. Here, the analysis is done more simply. Out of it naturally emerge the Pauli-Lubanski 4-vector.
7.4. The Non-Helical Sectors
This cross-classifies with the von Neumann–Wigner classification to yield the non-helical Tardions, Tachyons, Luxons, Synchrons and Statons.
7.5. The Helical Sectors
Neither this, nor the following sections have yet been fully written up here, but a link to a parallel analysis for the non-Archimdean cases is provided.
The helical subdivision, like the non-helical subdivision, also cross-categories with respect to the Tardion, Tachyon, Luxon/Synchron/Staton classification.
8. Mass-Energy-Momentum vs. Velocity for the Translation Non-Invariant Sectors
9. Coordinatization of the Translation Non-Invariant Sectors
Once you get a coordinatization (which is an application of the Darboux Theorem), you then have the canonical 1-form and 2-forms, as well as the basis for defining the invariant measures on both the phase space and the configuration space. The general non-helical translation non-invariant sector has 4 Darboux pairs. In the tardion and helical cases, this decomposes into a Heisenberg triple plus a complementary pair for spin.
It is the Darboux coordinates that you then proceed to quantize by whatever favorite method you have at your disposal. In turn, it is this which governs the linear space representations.
This is the part of the analysis from Wigner which has not (yet) been included. But for the most part, it is fairly routine – as opposed to the much more complex problem of trying to quantize the whole Lie group!
10. Schroedinger Equation for Arbitrary Signatures
The Schroedinger equation is just an expression of the quadratic mass-shell invariant specialized to the Galilean sector … combined with the linear invariant that comes from the 11th generator. Both specialize across all the signature types. In the Lorentzian sector, the resulting equation is equivalent to the Klein-Gordon equation, up to a Foldy-Woutheusen transformation.
11. The Dirac Equation for Arbitrary Signatures
Similarly, the Dirac equation can be done across the board to all signature types and all sectors (including the tachyons and synchrons).
12. Position and Time Operators
This carries out the analysis of all the possibilities, devising a system of differential equations for a prospective position operator and time operator, built on the Poisson-Lie manifold that contains the symmetry group. Time operators exist for the Synchrons and (apparently) one of the Archimedean Tachyon sectors; while position operators exist for the non-helical tardion and helical sectors.
13. Many Body States and the No-Interaction Theorem
If I write this up, my intent is to combine BOTH the Leutweiler and Haag theorems into one; the former being the classical case of the general result, the latter the quantized version.
In addition, this can be generalized to arbitrary signatures (i.e. non-relativistic forms of the Haag and Leutweiler theorems).
Re: Unitary Representations of the Poincaré Group
This thread has been for me a very useful and helpful discussion. After a fair amount of time spent going back and forth over it, I think the fog in my brain is at last beginning to clear, after (sadly) years of confusion about what is, after all, very basic quantum physics.
I’d like to try to summarize what I’ve so far understood. (Looking back over it, I acknowledge that it’s pretty long-winded, and that it repeats some points already made by Greg Egan, Bruce Westbury, Evan Jenkins, and – of course – John Baez. I owe a debt of gratitude to them all; the fact I haven’t linked back to their comments is due to sheer laziness on my part.)
We are trying to understand unitary irreducible representations of the Poincaré group , aka “elementary particles” in quantum mechanics. A unitary representation consists of a Hilbert space and a continuous group homomorphism to its unitary group with the strong operator topology, and “irreducible” has the usual meaning.
(At this first pass, we won’t worry about technical details like “strong operator topology”. There are various spots in the outline below, particularly every place where the words “direct integral” appear, that implicitly require as formal background some form of spectral theory and Stone’s theorem, and there details of operator topologies will certainly be involved.)
At the outset, John told us that each unitary irrep is uniquely determined by two parameters:
-
A continuous parameter called the mass. This has something to do with how the irrep restricts to a representation of the normal subgroup of translations , a noncompact group.
-
A discrete parameter called the helicity (or “spin”). This specifies a unitary irrep of a compact subgroup of called a “little group”. If the mass is positive, this little group will be a copy of [whose unitary irreps are parametrized by half-integers, called spin numbers]. Other possibilities for the little group arise if the mass is zero.
By using the method of induced bundles, we can reconstruct the unitary irrep of from its mass and helicity. I’d like to try to outline how I think the whole thing works (in some cases minor twists on things others have already said).
Let’s start by discussing “mass”. This has to do with how the restricted representation
decomposes into a “direct sum” of irreducible unitary representations of . Except that in the representation theory of a noncompact Lie group like , we won’t use a direct sum exactly – we use a slightly more elaborate construction called a direct integral of representations. We’ll come to this later; for now let’s start with irreps of .
We know what unitary irreps of look like. Because is a locally compact commutative group, its unitary irreps are 1-dimensional and are specified by characters
or in other words by elements of the Pontryagin dual of , which is isomorphic to . This isomorphism takes an element , which physicists call a 4-momentum, to the character defined by
Here the in the exponent refers to a bilinear form based on the Minkowski form of signature . (If I were paying more mind to the physical dimensions, I guess I ought to have a factor in the exponent as well, referring here to Planck’s constant.)
The mass (or should we say mass squared?) of a momentum is defined by ; in other words, if we write , then
Mass is a relativistic invariant; let’s see what that implies. Consider the action of on the character group, by pulling back the action of on the translation group :
(abusing language here: denotes both an element of and the linear transformation it induces on ). By the Pontryagin isomorphism, this gives an action on momenta via the definition
The relativistic invariance of mass means that this right action carries a 4-momentum to another 4-momentum of the same mass.
Let us return now to a unitary irrep of . We restrict the irrep to a rep of . This decomposes into a direct sum (or direct integral) of a whole bunch of little 1-dimensional reps of indexed by 4-momenta:
where indexes the 4-momenta which occur as irreducible components, and the is the eigenspace attached to a given (i.e., the sum of the 1-dim components whose character is ).
Now I think an important point is that if is irreducible as a representation over , then the 4-momenta that occur in all have the same mass. (That is, if and occur in the direct integral but have different masses, then they would have to belong to different irreducible components of the original representation of .) So whatever the domain of integration is, it ought to be contained in the locus of momentum space given by the equation .
That mass is what we mean by the mass of the unitary irrep of .
Even better, under the assumption of irreducibility, the domain should be exactly the orbit of any one of these under the Poincaré group action. (If has more than one orbit, this would contradict irreducibility.) For example, in the relatively simple case of massive particles (), I believe acts transitively on that component of the locus contained in the “forward light cone” – one of the two sheets of the hyperboloid – and this will be our .
The transitive action of on defines a connected groupoid whose objects are points of and where morphisms are elements such that . The mapping defines a linear representation of this groupoid, and the action of on is completely determined from this groupoid representation (we retrieve the action by taking a direct integral – a little bit about this below). Since the groupoid is connected, this representation is determined by its restriction to any one of the automorphism groups in the groupoid; in fact we have an identification of as a homogeneous space
where is the stabilizer of a point . The subgroup acts trivially on , so if we form the quotient in the exact sequence
then we are really interested in the action of on . The group is then the semidirect product .
The quotient is the so-called “little group” at . It is a compact subgroup of the spin cover of the Lorentz group; by definition it is the stabilizer subgroup of in . When , we may choose as a representative momentum, and see that is fixed by any element in (the spin cover of) the spatial rotation group . In other words, the little group here would be the spin cover . Any other little group would be a subgroup conjugate to .
Summarizing then: if we know how a chosen little group acts on , then there is a unique extension of this action to a groupoid representation as above, and from there we retrieve the action of on by a direct integral construction on the groupoid representation. Both of these processes are additive functors. Therefore, if the action of on were reducible, so must then be the action of on .
Hence a necessary condition for the unitary representation to be irreducible is that any (and therefore every) accompanying little group representation also be irreducible. But we know about the irreducible representations in a case like ; as we mentioned, they are parametrized by a discrete parameter called spin (usually presented as a half-integer: ). For massless particles, other types of little groups can arise (e.g., ), and ‘helicity’ is the general term for a discrete parameter which parametrizes their unitary irreps.
So, this gives the way in which each unitary irrep of gives rise to a mass and a helicity, and we have some idea of how to go the other way, putting a mass and helicity together to form a unitary irrep. Let’s go into some more detail on this.
Above I said that we form a representation of a groupoid. This groupoid is really a topological groupoid. Its underlying span is of the form
where is the coset space and is the canonical action on cosets. The representation of the groupoid is exactly what Greg Egan was telling us about before, the bundle induced from the representation of over . I would write this induced bundle as
where the tensor product refers to modding out by a relation of the form . Hopefully that helps make manifest the way the groupoid is supposed to act: it boils down to the fact that acts naturally on the bundle.
Then, when I referred to a direct integral construction being performed on the groupoid representation, I am referring to something equivalent to taking the sections of the induced bundle. This should give back the Hilbert space that underlies the original unitary irrep, if I understand things correctly.
I have some concerns about what I have written, but I’ll pause for now, and think about it some more, and maybe ask some questions later.
Re: Unitary Representations of the Poincaré Group
Heh – thanks, John! (I was actually pestering John with some email questions about this today. But he thought it would be better to have this out in the open.)
Personally, what I want is a “nice” (meaning written in clean modern mathematical language and notation) statement and proof of what Wigner did. I’ve seen it sketched in a physics book or two, and it could be made a lot nicer I think.
Just to try to get the ball rolling, let me comment on John’s slightly cryptic mention of , and how it’s supposed to act on Minkowski spacetime .
The trick is to identify with the space of Hermitian matrices. Recall that a complex matrix ,
is Hermitian if its equals its conjugate transpose : concretely, if , , . Apparently, then, every Hermitian matrix can be written in the form
for a unique choice of real . Notice the nice fact that the determinant of this matrix is precisely
the Minkowski norm of the spacetime vector .
Now, let act on Hermitian matrices by
Since matrices in have determinant 1, it is clear that
which, under the identification , just means that the induced action of on preserves the Minkowski norm, or the metric on spacetime used in relativity theory.
It’s not hard to see that the topological group is connected. Thus, if is the connected component of the Lorentz group (the group of automorphisms of which preserve the Minkowski metric), we have produced a map
It turns out this map is onto, and that the kernel is . This quotient map is in fact the universal cover; it restricts to another famous universal cover:
There are a number of details which ought to be written up, but this gives the basic idea. In studying continuous group representations, it’s usually convenient to pass to the universal cover (which I’m sure is why John introduced it). All of this is to be related furthermore to spinors, and to the distinction between bosons and fermions.