December 15, 2024

Random Permutations (Part 14)

Posted by John Baez

I want to go back over something from Part 11, but in a more systematic and self-contained way.

Namely, I want to prove a wonderful known fact about random permutations, the Cycle Length Lemma, using a bit of category theory. The idea here is that the number of $k$-cycles in a random permutation of $n$ things is a random variable. Then comes a surprise: in the limit as $n \to \infty$, this random variable approaches a Poisson distribution with mean $1/k$. And even better, for different choices of $k$ these random variables become independent in the $n \to \infty$ limit.

I’m stating these facts roughly now, to not get bogged down. But I’ll state them precisely, prove them, and categorify them. That is, I’ll state equations involving random variables — but I’ll prove that these equations come from equivalences of groupoids!

First I’ll state the Cycle Length Lemma, which summarizes a lot of interesting facts about random permutations. Then I’ll state and prove a categorified version of the Cycle Length Lemma, which asserts an equivalence of groupoids. Then I’ll derive the original version of the lemma from this categorified version by taking the cardinalities of these groupoids. The categorified version contains more information, so it’s not just a trick for proving the original lemma.

What do groupoids have to do with random permutations? You’ll see, but it’s an example of the ‘principle of indifference’, especially in its modern guise, called the ‘principle of transformation groups’: the idea that outcomes related by a symmetry should have the same probability. This sets up a connection between groupoids and probability theory — and as we’ll see, we can “go down” from groupoids to probabilities using the theory of groupoid cardinalities.

The Cycle Length Lemma

In the theory of random permutations, we treat the symmetric group $S_n$ as a probability measure space where each element has the same measure, namely $1/n!$. Functions $f \colon S_n \to \mathbb{R}$ then become random variables, and we can study their expected values:

$$E(f) = \frac{1}{n!} \sum_{\sigma \in S_n} f(\sigma).$$

An important example is the function

$$C_k \colon S_n \to \mathbb{N}$$

that counts, for any permutation $\sigma \in S_n$, its number of cycles of length $k$, also called $k$-cycles. A well-known but striking fact about random permutations is that whenever $k \le n$, the expected number of $k$-cycles is $1/k$:

$$E(C_k) = \frac{1}{k}$$

For example, a random permutation of any finite set has, on average, one fixed point!

Another striking fact is that whenever $j \ne k$ and $j + k \le n$, so that it’s possible for a permutation $\sigma \in S_n$ to have both a $j$-cycle and a $k$-cycle, the random variables $C_j$ and $C_k$ are uncorrelated in the following sense:

$$E(C_j C_k) = E(C_j) E(C_k).$$

You might at first think that having lots of $j$-cycles for some large $j$ would tend to inhibit the presence of $k$-cycles for some other large value of $k$, but that’s not true unless $j + k \gt n$, when it suddenly becomes impossible to have both a $j$-cycle and a $k$-cycle!
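
Both facts are easy to check exhaustively for small $n$. Here is a minimal brute-force sketch in Python; the helper names and the choice $n = 6$ are mine, purely for illustration.

```python
# Exact check of E(C_k) = 1/k and E(C_2 C_3) = E(C_2) E(C_3) over all of S_6.
from itertools import permutations
from fractions import Fraction

def num_k_cycles(perm, k):
    """Number of k-cycles of perm, a tuple with perm[i] = image of i."""
    seen, total = [False] * len(perm), 0
    for i in range(len(perm)):
        if not seen[i]:
            length, j = 0, i
            while not seen[j]:
                seen[j] = True
                j = perm[j]
                length += 1
            total += (length == k)
    return total

n = 6
perms = list(permutations(range(n)))

def expectation(f):
    """Average of f over all permutations, as an exact fraction."""
    return Fraction(sum(f(p) for p in perms), len(perms))

for k in range(1, n + 1):
    assert expectation(lambda p: num_k_cycles(p, k)) == Fraction(1, k)   # E(C_k) = 1/k

# C_2 and C_3 are uncorrelated, since 2 + 3 <= 6:
assert (expectation(lambda p: num_k_cycles(p, 2) * num_k_cycles(p, 3))
        == expectation(lambda p: num_k_cycles(p, 2))
           * expectation(lambda p: num_k_cycles(p, 3)))
```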

These two facts are special cases of the Cycle Length Lemma. To state this lemma in full generality, recall that the number of ordered $p$-tuples of distinct elements of an $n$-element set is the falling power

$$n^{\underline{p}} = n(n-1)(n-2) \cdots (n-p+1).$$

It follows that the function

$$C_k^{\underline{p}} \colon S_n \to \mathbb{N}$$

counts, for any permutation in $S_n$, its ordered $p$-tuples of distinct $k$-cycles. We can also replace the word ‘distinct’ here by ‘disjoint’, without changing the meaning, since distinct cycles must be disjoint.

The two striking facts mentioned above generalize as follows:

1) First, whenever $p k \le n$, so that it is possible for a permutation in $S_n$ to have $p$ distinct $k$-cycles, then

$$E(C_k^{\underline{p}}) = \frac{1}{k^p}.$$

If you know about the moments of a Poisson distribution, here’s a nice equivalent way to state this equation: when $p k \le n$, the $p$th moment of the random variable $C_k$ equals that of a Poisson distribution with mean $1/k$.
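
For completeness, here is the standard calculation behind this reformulation. If $X$ is Poisson-distributed with mean $\lambda$, its $p$th factorial moment is

$$E\bigl(X^{\underline{p}}\bigr) = \sum_{m \ge p} m(m-1)\cdots(m-p+1) \, e^{-\lambda} \frac{\lambda^m}{m!} = e^{-\lambda} \lambda^p \sum_{j \ge 0} \frac{\lambda^j}{j!} = \lambda^p,$$

so taking $\lambda = 1/k$ gives $1/k^p$, matching the formula above. Since the ordinary $p$th moment is an integer combination of factorial moments of order at most $p$, and $q k \le n$ for all $q \le p$, matching factorial moments up to order $p$ forces matching ordinary moments up to order $p$.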

2) Second, the random variables $C_k$ are better and better approximated by independent Poisson distributions. To state this precisely we need a bit of notation. Let $\vec{p}$ denote an $n$-tuple $(p_1, \dots, p_n)$ of natural numbers, and let

$$|\vec{p}| = p_1 + 2p_2 + \cdots + n p_n.$$

If $|\vec{p}| \le n$, it is possible for a permutation $\sigma \in S_n$ to have a collection of distinct cycles, with $p_1$ cycles of length 1, $p_2$ cycles of length 2, and so on up to $p_n$ cycles of length $n$. If $|\vec{p}| \gt n$, this is impossible. In the former case, where $|\vec{p}| \le n$, we always have

$$E\left( \prod_{k=1}^n C_k^{\underline{p}_k} \right) = \prod_{k=1}^n E\left( C_k^{\underline{p}_k} \right).$$

Taken together, 1) and 2) are equivalent to the Cycle Length Lemma, which may be stated in a unified way as follows:

Cycle Length Lemma. Suppose $p_1, \dots, p_n \in \mathbb{N}$. Then

$$E\left( \prod_{k=1}^n C_k^{\underline{p}_k} \right) = \left\{ \begin{array}{cl} \displaystyle{ \prod_{k=1}^n \frac{1}{k^{p_k}} } & \mathrm{if} \; |\vec{p}| \le n \\ \\ 0 & \mathrm{if} \; |\vec{p}| \gt n \end{array} \right.$$

This appears, for example, in Ford’s course notes on random permutations and the statistical properties of prime numbers [Lemma 1.1, F]. The most famous special case is when $|\vec{p}| = n$. Apparently this goes back to Cauchy, but I don’t know where he proved it. I believe he would have phrased it in terms of counting permutations, not probabilities.

I won’t get into details of precisely the sense in which random variables $C_k$ approach independent Poisson distributions. For that, see Arratia and Tavaré [AT].
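
Before categorifying the lemma, here is a quick brute-force sanity check of it for a single small value of $n$, looping over all tuples $\vec{p}$ with small entries. This sketch is mine, not part of the lemma or its proof.

```python
# Check the Cycle Length Lemma exhaustively for n = 5 and all p with p_k in {0, 1, 2}.
from itertools import permutations, product
from fractions import Fraction
from math import prod

def cycle_counts(perm):
    """List whose (k-1)st entry is the number of k-cycles of perm."""
    n = len(perm)
    counts, seen = [0] * n, [False] * n
    for i in range(n):
        if not seen[i]:
            length, j = 0, i
            while not seen[j]:
                seen[j] = True
                j = perm[j]
                length += 1
            counts[length - 1] += 1
    return counts

def falling(x, p):
    """Falling power x(x-1)...(x-p+1); the empty product (p = 0) is 1."""
    return prod(x - i for i in range(p))

n = 5
all_counts = [cycle_counts(p) for p in permutations(range(n))]
for pvec in product(range(3), repeat=n):
    lhs = Fraction(sum(prod(falling(c[k], pvec[k]) for k in range(n)) for c in all_counts),
                   len(all_counts))
    if sum((k + 1) * pvec[k] for k in range(n)) <= n:
        rhs = prod(Fraction(1, (k + 1) ** pvec[k]) for k in range(n))
    else:
        rhs = Fraction(0)
    assert lhs == rhs
```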

The Categorified Cycle Length Lemma

To categorify the Cycle Length Lemma, the key is to treat a permutation as an extra structure that we can put on a set, and then consider the groupoid of $n$-element sets equipped with this extra structure:

Definition. Let $\mathsf{Perm}(n)$ be the groupoid in which

  • an object is an $n$-element set $X$ equipped with a permutation $\sigma \colon X \to X$

and

  • a morphism from $\sigma \colon X \to X$ to $\sigma' \colon X' \to X'$ is a bijection $f \colon X \to X'$ that is permutation-preserving in the following sense:

$$f \circ \sigma \circ f^{-1} = \sigma'.$$

We’ll need this strange fact below: if $n \lt 0$ then $\mathsf{Perm}(n)$ is the empty groupoid (that is, the groupoid with no objects and no morphisms).

More importantly, we’ll need a fancier groupoid where a set is equipped with a permutation together with a list of distinct cycles of specified lengths. For any $n \in \mathbb{N}$ and any $n$-tuple of natural numbers $\vec{p} = (p_1, \dots, p_n)$, recall that we have defined

$$|\vec{p}| = p_1 + 2p_2 + \cdots + n p_n.$$

Definition. Let $\mathsf{A}_{\vec{p}}$ be the groupoid of $n$-element sets $X$ equipped with a permutation $\sigma \colon X \to X$ that is in turn equipped with a choice of an ordered $p_1$-tuple of distinct $1$-cycles, an ordered $p_2$-tuple of distinct $2$-cycles, and so on up to an ordered $p_n$-tuple of distinct $n$-cycles. A morphism in this groupoid is a bijection that is permutation-preserving and also preserves the ordered tuples of distinct cycles.

Note that if $|\vec{p}| \gt n$, no choice of disjoint cycles with the specified property exists, so $\mathsf{A}_{\vec{p}}$ is the empty groupoid.

Finally, we need a bit of standard notation. For any group $G$ we write $\mathsf{B}(G)$ for its delooping: that is, the groupoid that has one object $\star$ and $\mathrm{Aut}(\star) = G$.

The Categorified Cycle Length Lemma. For any $\vec{p} = (p_1, \dots, p_n) \in \mathbb{N}^n$ we have

$$\mathsf{A}_{\vec{p}} \simeq \mathsf{Perm}(n - |\vec{p}|) \; \times \; \prod_{k = 1}^n \mathsf{B}(\mathbb{Z}/k)^{p_k}$$

Proof. Both sides are empty groupoids when $n - |\vec{p}| \lt 0$, so assume $n - |\vec{p}| \ge 0$. A groupoid is equivalent to any full subcategory of that groupoid containing at least one object from each isomorphism class. So, fix an $n$-element set $X$ and a subset $Y \subseteq X$ with $n - |\vec{p}|$ elements. Partition $X - Y$ into subsets $S_{k \ell}$, where $S_{k \ell}$ has cardinality $k$, $1 \le k \le n$, and $1 \le \ell \le p_k$. Every object of $\mathsf{A}_{\vec{p}}$ is isomorphic to the chosen set $X$ equipped with some permutation $\sigma \colon X \to X$ that has each subset $S_{k \ell}$ as a $k$-cycle. Thus $\mathsf{A}_{\vec{p}}$ is equivalent to its full subcategory containing only objects of this form.

An object of this form consists of an arbitrary permutation $\sigma_Y \colon Y \to Y$ and a cyclic permutation $\sigma_{k \ell} \colon S_{k \ell} \to S_{k \ell}$ for each $k, \ell$ as above. Consider a second object of this form, say $\sigma'_Y \colon Y \to Y$ equipped with cyclic permutations $\sigma'_{k \ell}$. Then a morphism from the first object to the second consists of two pieces of data. First, a bijection

$$f \colon Y \to Y$$

such that

$$\sigma'_Y = f \circ \sigma_Y \circ f^{-1}.$$

Second, for each $k, \ell$ as above, bijections

$$f_{k \ell} \colon S_{k \ell} \to S_{k \ell}$$

such that

$$\sigma'_{k \ell} = f_{k \ell} \circ \sigma_{k \ell} \circ f_{k \ell}^{-1}.$$

Since $Y$ has $n - |\vec{p}|$ elements, while $\sigma_{k \ell}$ and $\sigma'_{k \ell}$ are cyclic permutations of $k$-element sets, and the permutation-preserving bijections from a cyclic permutation of a $k$-element set to itself form the cyclic group $\mathbb{Z}/k$ that it generates, it follows that $\mathsf{A}_{\vec{p}}$ is equivalent to

$$\mathsf{Perm}(n - |\vec{p}|) \; \times \; \prod_{k = 1}^n \mathsf{B}(\mathbb{Z}/k)^{p_k}.$$   ▮

The case where $|\vec{p}| = n$ is especially pretty, since then our chosen cycles completely fill up our $n$-element set and we have

$$\mathsf{A}_{\vec{p}} \simeq \prod_{k = 1}^n \mathsf{B}(\mathbb{Z}/k)^{p_k}.$$

Groupoid Cardinality

The cardinality of finite sets has a natural extension to finite groupoids, and this turns out to be the key to extracting results on random permutations from category theory. Let’s briefly recall the idea of ‘groupoid cardinality’ [BD, BHW]. Any finite groupoid $\mathsf{G}$ is equivalent to a coproduct of finitely many one-object groupoids, which are deloopings of finite groups $G_1, \dots, G_m$:

$$\mathsf{G} \simeq \sum_{i = 1}^m \mathsf{B}(G_i),$$

and then the cardinality of $\mathsf{G}$ is defined to be

$$|\mathsf{G}| = \sum_{i = 1}^m \frac{1}{|G_i|}.$$

This concept of groupoid cardinality has various nice properties. For example it’s additive:

$$|\mathsf{G} + \mathsf{H}| = |\mathsf{G}| + |\mathsf{H}|$$

and multiplicative:

$$|\mathsf{G} \times \mathsf{H}| = |\mathsf{G}| \times |\mathsf{H}|$$

and invariant under equivalence of groupoids:

$$\mathsf{G} \simeq \mathsf{H} \implies |\mathsf{G}| = |\mathsf{H}|.$$

But none of these three properties require that we define $|\mathsf{G}|$ as the sum of the reciprocals of the cardinalities $|G_i|$: any other power of these cardinalities would work just as well. What makes the reciprocal cardinalities special is that if $G$ is a finite group acting on a set $S$, we have

$$|S \sslash G| = |S|/|G|$$

where the groupoid $S \sslash G$ is the weak quotient or homotopy quotient of $S$ by $G$, also called the action groupoid. This is the groupoid with elements of $S$ as objects and one morphism from $s$ to $s'$ for each $g \in G$ with $g s = s'$, with composition of morphisms coming from multiplication in $G$.
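
Here is a small computational illustration of this formula (my own sketch, not from the post): take $G = S_3$ acting on itself by conjugation, which is exactly the situation appearing in the Lemma below, and compute the cardinality of $S \sslash G$ by summing $1/|\mathrm{Aut}|$ over isomorphism classes.

```python
# Groupoid cardinality of S ⫽ G for G = S_3 acting on itself by conjugation:
# sum 1/|stabilizer| over one representative per orbit, and compare with |S|/|G|.
from itertools import permutations
from fractions import Fraction

def compose(a, b):
    """(a ∘ b)(i) = a(b(i)) for permutations stored as tuples."""
    return tuple(a[b[i]] for i in range(len(b)))

def inverse(a):
    inv = [0] * len(a)
    for i, ai in enumerate(a):
        inv[ai] = i
    return tuple(inv)

G = list(permutations(range(3)))   # the group S_3
S = G                              # the set it acts on, by conjugation

card, seen = Fraction(0), set()
for s in S:
    if s not in seen:
        orbit = {compose(compose(g, s), inverse(g)) for g in G}
        stabilizer = [g for g in G if compose(compose(g, s), inverse(g)) == s]
        card += Fraction(1, len(stabilizer))   # each iso class contributes 1/|Aut|
        seen |= orbit

print(card, Fraction(len(S), len(G)))   # both equal 1
```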

The groupoid of $n$-element sets equipped with a permutation, $\mathsf{Perm}(n)$, has a nice description in terms of weak quotients:

Lemma. For all $n \in \mathbb{N}$ we have an equivalence of groupoids

$$\mathsf{Perm}(n) \simeq S_n \sslash S_n$$

where the group $S_n$ acts on the underlying set of $S_n$ by conjugation.

Proof. We use the fact that $\mathsf{Perm}(n)$ is equivalent to any full subcategory of $\mathsf{Perm}(n)$ containing at least one object from each isomorphism class. For $\mathsf{Perm}(n)$ we can get such a subcategory by fixing an $n$-element set, say $X = \{1, \dots, n\}$, and taking only objects of the form $\sigma \colon X \to X$, i.e. $\sigma \in S_n$. A morphism from $\sigma \in S_n$ to $\sigma' \in S_n$ is then a permutation $\tau \in S_n$ such that

$$\sigma' = \tau \sigma \tau^{-1}.$$

But this subcategory is precisely $S_n \sslash S_n$.       ▮

Corollary. For all $n \in \mathbb{N}$ we have

$$|\mathsf{Perm}(n)| = 1$$

Proof. We have $|\mathsf{Perm}(n)| = |S_n \sslash S_n| = |S_n|/|S_n| = 1$.       ▮

It should now be clear why we can prove results on random permutations using the groupoid $\mathsf{Perm}(n)$: this groupoid is equivalent to $S_n \sslash S_n$, a groupoid with one object for each permutation $\sigma \in S_n$, and with each object contributing $1/n!$ to the groupoid cardinality.

Now let us use this idea to derive the original Cycle Length Lemma from the categorified version.

Cycle Length Lemma. Suppose $p_1, \dots, p_n \in \mathbb{N}$. Then

$$E\left( \prod_{k=1}^n C_k^{\underline{p}_k} \right) = \left\{ \begin{array}{cl} \displaystyle{ \prod_{k=1}^n \frac{1}{k^{p_k}} } & \mathrm{if} \; |\vec{p}| \le n \\ \\ 0 & \mathrm{if} \; |\vec{p}| \gt n \end{array} \right.$$

Proof. We know that

$$\mathsf{A}_{\vec{p}} \simeq \mathsf{Perm}(n - |\vec{p}|) \; \times \; \prod_{k = 1}^n \mathsf{B}(\mathbb{Z}/k)^{p_k}$$

So, to prove the Cycle Length Lemma it suffices to show three things:

$$|\mathsf{A}_{\vec{p}}| = E\left( \prod_{k=1}^n C_k^{\underline{p}_k} \right)$$

$$|\mathsf{Perm}(n - |\vec{p}|)| = \left\{ \begin{array}{cl} 1 & \mathrm{if} \; |\vec{p}| \le n \\ \\ 0 & \mathrm{if} \; |\vec{p}| \gt n \end{array} \right.$$

and

$$|\mathsf{B}(\mathbb{Z}/k)| = 1/k$$

The last of these is immediate from the definition of groupoid cardinality. The second follows from the Corollary above, together with the fact that $\mathsf{Perm}(n - |\vec{p}|)$ is the empty groupoid when $|\vec{p}| \gt n$. Thus we are left needing to show that

$$|\mathsf{A}_{\vec{p}}| = E\left( \prod_{k=1}^n C_k^{\underline{p}_k} \right).$$

We prove this by computing the cardinality of a groupoid equivalent to $\mathsf{A}_{\vec{p}}$. This groupoid is of the form

$$Q(\vec{p}) \sslash S_n$$

where $Q(\vec{p})$ is a set on which $S_n$ acts. As a result we have

$$|\mathsf{A}_{\vec{p}}| = |Q(\vec{p}) \sslash S_n| = |Q(\vec{p})| / n!$$

and to finish the proof we will need to show

$$E\left( \prod_{k=1}^n C_k^{\underline{p}_k} \right) = |Q(\vec{p})| / n!.$$

What is the set $Q(\vec{p})$, and how does $S_n$ act on this set? An element of $Q(\vec{p})$ is a permutation $\sigma \in S_n$ equipped with an ordered $p_1$-tuple of distinct $1$-cycles, an ordered $p_2$-tuple of distinct $2$-cycles, and so on up to an ordered $p_n$-tuple of distinct $n$-cycles. Any element $\tau \in S_n$ acts on $Q(\vec{p})$ in a natural way, by conjugating the permutation $\sigma \in S_n$ to obtain a new permutation, and mapping the chosen cycles of $\sigma$ to the corresponding cycles of this new permutation $\tau \sigma \tau^{-1}$.

Recalling the definition of the groupoid $\mathsf{A}_{\vec{p}}$, it is clear that any element of $Q(\vec{p})$ gives an object of $\mathsf{A}_{\vec{p}}$, and any object is isomorphic to one of this form. Furthermore any permutation $\tau \in S_n$ gives a morphism between such objects, all morphisms between such objects are of this form, and composition of these morphisms is just multiplication in $S_n$. It follows that

$$\mathsf{A}_{\vec{p}} \simeq Q(\vec{p}) \sslash S_n.$$

To finish the proof, note that

$$E\left( \prod_{k=1}^n C_k^{\underline{p}_k} \right)$$

is $1/n!$ times the number of ways of choosing a permutation $\sigma \in S_n$ and equipping it with an ordered $p_1$-tuple of distinct $1$-cycles, an ordered $p_2$-tuple of distinct $2$-cycles, and so on. This is the same as $|Q(\vec{p})| / n!$.       ▮

References

[AT] Richard Arratia and Simon Tavaré, The cycle structure of random permutations, The Annals of Probability (1992), 1567–1591.

[BD] John C. Baez and James Dolan, From finite sets to Feynman diagrams, in Mathematics Unlimited—2001 and Beyond, vol. 1, eds. Björn Engquist and Wilfried Schmid, Springer, Berlin, 2001, pp. 29–50.

[BHW] John C. Baez, Alexander E. Hoffnung and Christopher D. Walker, Higher-dimensional algebra VII: groupoidification, Theory and Applications of Categories 24 (2010), 489–553.

[F] Kevin Ford, Anatomy of Integers and Random Permutations—Course Lecture Notes.

Posted at December 15, 2024 12:00 PM UTC


Re: Random Permutations (Part 14)

Wow, this is lovely. It may be a silly question, but could the cycle length lemma imply relations in some kind of Burnside ring?

Posted by: jack on December 17, 2024 1:18 AM | Permalink | Reply to this

Re: Random Permutations (Part 14)

Hmm, I don’t know! Since the cycle decomposition of a permutation in $S_n$, described by an $n$-box Young diagram, specifies its conjugacy class, and the Cycle Length Lemma says (among many other things!) how many permutations are in each conjugacy class, one thing it does is describe the “integration over $S_n$” map from the ring of class functions on $S_n$ to $\mathbb{Q}$.

There’s also a map from the Burnside ring of $S_n$ to its representation ring, which at least over $\mathbb{Q}$ is the same as the ring of class functions. That’s as close as I can immediately get!

Posted by: John Baez on December 17, 2024 6:08 AM | Permalink | Reply to this

Re: Random Permutations (Part 14)

Oh, and I should say that any finite $S_n$-set $X$ gives not only an element of the Burnside ring of $S_n$, but also a groupoid $X \sslash S_n$, which gets us into the realm of the games I’m playing here.

Posted by: John Baez on December 17, 2024 6:14 AM | Permalink | Reply to this

Re: Random Permutations (Part 14)

I love the idea of proving stuff about random permutations using groupoid cardinality.

But first, a comment on something from earlier in the series. Your posts about random permutations always seem to get me thinking about random endomorphisms. Maybe that’s because they’re easier: when $f$ is a random endomorphism of a finite set $X$ (chosen uniformly at random), the random variables $f(x)$ ($x \in X$) are independent, unlike for permutations.

You said, among other things, that the expected number of $k$-cycles in a random permutation of $n$ elements converges to $1/k$ as $n \to \infty$. What about a random endomorphism?

It seems to me that the answer is the same for endomorphisms as permutations, and very easy to prove.

As a warm-up, consider $k = 1$: fixed points. For a random endomorphism $f$ of an $n$-element set, the probability that a given element $x$ is fixed by $f$ is $1/n$. Put another way, the expected number of 1-cycles that $x$ belongs to is $1/n$ (since the number of 1-cycles that $x$ belongs to is either $0$ or $1$!). Summing over all $x$ in our $n$-element set $X$ gives the expected number of 1-cycles as $\sum_{x \in X} 1/n = 1$.

Now take any $k \geq 1$, and take a random endomorphism $f$ of an $n$-element set $X$. For a given element $x \in X$, the probability that $x$ belongs to a $k$-cycle is the probability that $f(x), f^2(x), \ldots, f^{k-1}(x)$ are all different from $x$ and $f^k(x) = x$. This is

$$\Bigl(\frac{n - 1}{n}\Bigr)^{k - 1} \cdot \frac{1}{n} = \frac{1}{n} \cdot \Bigl( 1 - \frac{1}{n} \Bigr)^{k - 1}.$$

Again, this expression can be interpreted as the expected number of $k$-cycles that $x$ belongs to. Summing over all $x \in X$ gives the number of $k$-cycles in $f$, multiplied by $k$ because each cycle gets counted $k$ times. Hence the expected number of $k$-cycles in a random endomorphism of an $n$-element set is

$$\frac{1}{k} \cdot \Bigl( 1 - \frac{1}{n} \Bigr)^{k - 1}.$$

Here I’m using the fact that the expected value of a sum of random variables is the sum of the expected values, even if they’re not independent. (They’re not, in this case. E.g. if we have some $x \in X$ and we know that none of the elements other than $x$ belong to a 2-cycle, then $x$ can’t either.)

If we let $n \to \infty$, it converges to $1/k$. So that’s the expected number of $k$-cycles in a random endomorphism of a large finite set.

This is the same as the answer for random permutations. So I’d now like to wave a wand and deduce the result on permutations from the result on endomorphisms.

Unfortunately, I don’t know how, but maybe I have an inkling. The picture I have in my head is something like this:

$$\mathrm{colim}_X \, \mathrm{Sym}(X) \leftrightarrows \mathrm{colim}_X \, \mathrm{End}(X)$$

Here $\mathrm{End}(X)$ and $\mathrm{Sym}(X)$ are the sets of endomorphisms and permutations of a finite set $X$, and the colimits are over the category of finite sets and injections. Any permutation is an endomorphism, which gives one of the two arrows. The other one comes from the fact that any endomorphism of a finite set $X$ restricts canonically to a permutation of a subset $Y \subseteq X$:

[Image: an endomorphism of a finite set, with its eventual image shown as a yellow subset.]

It would be nice to show that when we choose endomorphisms uniformly at random, then the resulting permutations are distributed uniformly too. Then we could deduce the result on expected number of cycles in a random permutation from the analogous result for endomorphisms. But right now, I don’t even know how to make sense of this statement about uniform distributions, because different endomorphisms of $X$ restrict to permutations on subsets of different sizes.

I’m being lazy and not looking up the proof that the expected number of $k$-cycles in a random permutation is $1/k$. How hard is it anyway?

Posted by: Tom Leinster on December 17, 2024 10:26 AM | Permalink | Reply to this

Re: Random Permutations (Part 14)

I don’t have time to think through how good the analogy is (I’m procrastinating from grading exams right now), but your comment reminds me of a common trick (or technique, or whatever — like you, I mean to cast no shade with that term) from geometric probability, sometimes called (de-)Poissonization.

Say you want to prove something about random collections of points in a fixed set $K \subseteq \mathbb{R}^d$. The first thing you have to do is decide what you mean by a random collection of points. Assuming that $K$ has finite positive volume, perhaps the most obvious thing to do is fix a positive integer $n$, then choose $X_1, \ldots, X_n$ uniformly and independently in $K$.

If you don’t want to be tied to a specific $n$, you could first pick $N$ at random, say with a Poisson distribution, and then pick $N$ points uniformly and independently. If $N$ is a Poisson random variable with mean $n$, then the resulting random collection of points is called a Poisson point process (PPP) in $K$ with intensity $n / \mathrm{vol}(K)$. This probably sounds like a more complicated thing to deal with than a fixed number of random points, but it magically turns out to have a very special property: if $A$ and $B$ are disjoint subsets of $K$, then the number of random points in $A$ and the number of random points in $B$ are independent random variables. (In fact, the collections of random points in those sets are themselves independent PPPs.) This makes certain things easier to prove about the PPP model.
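
A quick Monte Carlo illustration of this independence property (my own sketch, not from Mark’s comment): with a fixed number of points, the counts in two disjoint halves of $[0, 1]$ are perfectly anticorrelated, since they must sum to $n$; after Poissonizing the number of points, they become independent.

```python
# Compare the correlation of point counts in [0, 1/2) and [1/2, 1] under the
# fixed-n model and under the Poissonized (PPP) model.
import numpy as np

rng = np.random.default_rng(0)
n, trials = 20, 20_000

def counts(num_points):
    """Counts in the left and right halves of [0, 1] for uniform points."""
    x = rng.random(num_points)
    left = int(np.count_nonzero(x < 0.5))
    return left, num_points - left

fixed = np.array([counts(n) for _ in range(trials)])
poissonized = np.array([counts(rng.poisson(n)) for _ in range(trials)])

print(np.corrcoef(fixed[:, 0], fixed[:, 1])[0, 1])              # exactly -1: counts sum to n
print(np.corrcoef(poissonized[:, 0], poissonized[:, 1])[0, 1])  # ≈ 0: independent Poissons
```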

Now maybe you are actually interested in the fixed-$n$ model, or more realistically, in the large-$n$ asymptotics of that model. Depending on what you’re doing, the extra independence property may make it easier to do what you want to do for the PPP model. So you “Poissonize” the problem and prove results for the PPP model instead.

Then if you’re lucky, you can transfer large-$n$ results for the PPP model back to the fixed-$n$ model, using (among other things) the fact that a Poisson random variable with large mean is tightly concentrated around its mean. That’s the “de-Poissonization” step.

The appearance of the Poisson distribution in this stuff is (as far as I see anyway) completely different from its appearance in the original post. This is all really to say that it’s a familiar situation to me to see two different, but conceptually related, random constructions, each with different technical advantages, that lead to similar results in some asymptotic regime.

Posted by: Mark Meckes on December 17, 2024 1:33 PM | Permalink | Reply to this

Re: Random Permutations (Part 14)

I see; very interesting! So in this kind of situation, it can be easier to work with a Poisson random variable with mean $n$ than it is to work with the plain old number $n$ itself. I like it.

To implement a similar idea here — that is, to relate the statistics of random permutations and random endomorphisms — we’d surely need to know something about the number of elements in the eventual image of a random endomorphism. By the eventual image, I mean the set of periodic points of the endomorphism. In the diagram above, it’s the five-element yellow subset.

Five years ago, we had a conversation about the expected number of periodic points of a random endomorphism. For an endomorphism of a set with a large number $n$ of elements, it’s $\sim \sqrt{\pi n/2}$. From what John wrote in that conversation, I suspect that lots more is known about the distribution of the number of periodic points than merely its mean.

Posted by: Tom Leinster on December 17, 2024 4:30 PM | Permalink | Reply to this

Re: Random Permutations (Part 14)

Tom wrote:

I’m being lazy and not looking up the proof that the expected number of $k$-cycles in a random permutation is $1/k$. How hard is it anyway?

I’m also being lazy and neither looking it up nor thinking about it, but I believe it should be a straightforward application of the method of indicators: you just need to compute the probability that one fixed $k$-cycle appears in your permutation, and count the number of $k$-cycles.

Posted by: Mark Meckes on December 17, 2024 1:36 PM | Permalink | Reply to this

Re: Random Permutations (Part 14)

Ah, thanks. Following your hints, I see that it is easy! In fact, it’s easier than the proof for endomorphisms (so my whole previous comment was misguided), and gives a cleaner answer too: it’s exactly $1/k$, for all $n$.

I dug back through John’s posts and found a different proof in Part 6.

By the way, I didn’t know the method you alluded to was called the “method of indicators”. And thanks for using British English; presumably Americans actually call it the “method of turn signals”.

Posted by: Tom Leinster on December 17, 2024 4:18 PM | Permalink | Reply to this

Re: Random Permutations (Part 14)

Oops, I messed up that calculation of the expected number of k-cycles in a random endomorphism. Never mind.

Posted by: Tom Leinster on December 17, 2024 6:46 PM | Permalink | Reply to this

Re: Random Permutations (Part 14)

I’m trying to read your comment, Tom, and I bumped into this:

Now take any $k \ge 1$, and take a random permutation $f$ of an $n$-element set $X$.

I think you meant “endomorphism” here, not “permutation”. Do you want me to fix it?

Posted by: John Baez on December 17, 2024 9:36 PM | Permalink | Reply to this

Re: Random Permutations (Part 14)

You’re right, I meant endomorphism; thanks. I’ve fixed it. However, as per my last comment, I got that calculation wrong anyway.

Posted by: Tom Leinster on December 17, 2024 10:36 PM | Permalink | Reply to this

Re: Random Permutations (Part 14)

Tom wrote:

I’m being lazy and not looking up the proof that the expected number of $k$-cycles in a random permutation is $1/k$. How hard is it anyway?

There are a bunch of proofs — for example, my blog article gave a proof, since it’s a special case of the Cycle Length Lemma that $E(C_k) = 1/k$ — but if this result is all you want, here’s a much more efficient approach based on Sridhar Ramesh’s comment on Part 6.

Suppose $1 \le k \le n$.

Theorem. Given a random permutation of an $n$-element set, the probability that any given point lies on a $k$-cycle is $1/n$.

Proof. Choose a particular point in an $n$-element set. How many permutations of this set have this point in a $k$-cycle? To choose such a permutation we have to choose the other $k - 1$ elements in the cycle and put an ordering on them. Then we have to permute the remaining $n - k$ elements. There are

$$\binom{n - 1}{k - 1} \times (k - 1)! \times (n - k)! = (n-1)!$$

ways of doing this. Dividing by the total number of permutations of our $n$-element set we obtain the probability $1/n$.   ■

Corollary. Given a random permutation of an $n$-element set, the expected number of $k$-cycles is $1/k$.

Proof. Suppose the expected number of $k$-cycles is $E$. Then the expected number of points lying on $k$-cycles is $k E$, and the probability that a point lies on a $k$-cycle is $k E / n$. But this equals $1/n$ by the Theorem, so $E = 1/k$.   ■

Also, even simpler to state:

Corollary. Given a random permutation of an $n$-element set, the expected number of points lying on $k$-cycles is $1$.

It would be nice if there were a proof of this last corollary that made it instantly obvious, rather than detouring through the fact that

$$\binom{n - 1}{k - 1} \times (k - 1)! \times (n - k)! = (n-1)!$$
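
A tiny exact check of the Theorem (again my own snippet, not part of the argument): for $n = 7$, the fraction of permutations in which a chosen point lies on a $k$-cycle is $1/n$ for every $k$.

```python
# For each k, count permutations of {0, ..., 6} in which the point 0 lies on a k-cycle.
from itertools import permutations
from fractions import Fraction

def cycle_length_through_zero(perm):
    """Length of the cycle of perm containing the point 0."""
    length, j = 0, 0
    while True:
        j = perm[j]
        length += 1
        if j == 0:
            return length

n = 7
perms = list(permutations(range(n)))
for k in range(1, n + 1):
    hits = sum(cycle_length_through_zero(p) == k for p in perms)
    assert Fraction(hits, len(perms)) == Fraction(1, n)
```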

Posted by: John Baez on December 17, 2024 10:01 PM | Permalink | Reply to this

Re: Random Permutations (Part 14)

John wrote:

Corollary. Given a random permutation of an $n$-element set, the expected number of points lying on $k$-cycles is $1$

and

It would be nice if there were a proof of this last corollary that made it instantly obvious.

You probably know this, but an equivalent challenge is to find an “instantly obvious” proof that the expected number of points lying on $k$-cycles is independent of the cardinality $n$ of the ambient set, for $n$ in the range $k, k + 1, \ldots$.

For suppose we know that the expected number of points lying on $k$-cycles is independent of $n \geq k$, for all $k$. Call this expected number $e_k$. For each $n$ and permutation $\sigma$ of $n$ letters,

$$\sum_{k = 1}^n (\text{number of points lying in a } k\text{-cycle of } \sigma) = n.$$

Taking the mean over all $\sigma \in S_n$ gives

$$\sum_{k = 1}^n e_k = n.$$

The same argument applied to $n - 1$ in place of $n$ gives

$$\sum_{k = 1}^{n - 1} e_k = n - 1.$$

Subtracting the last equation from the second-to-last equation gives $e_n = 1$. This holds for all $n$, so we’re done.

Posted by: Tom Leinster on December 21, 2024 5:07 PM | Permalink | Reply to this

Re: Random Permutations (Part 14)

Nice! I’m not sure the relation “P instantly follows from Q” is transitive, but I will accept this as a zero-effort step.

Posted by: John Baez on December 21, 2024 8:29 PM | Permalink | Reply to this

Re: Random Permutations (Part 14)

I’m not sure it is either!

In situations like this, I like to ask myself whether a proof could be explained to a mathematical friend while you’re out for a walk together. That means no pen and paper, no furious feats of concentration, and the possibility of occasional interruptions to cross a road or look at a tree.

It would be great to have a proof like that for the fact that the expected number of $k$-cycles in a random permutation (or equivalently, the expected number of elements belonging to a $k$-cycle) is independent of the size of the ambient set. As long as it’s at least $k$, obviously.

Posted by: Tom Leinster on December 21, 2024 11:16 PM | Permalink | Reply to this

Re: Random Permutations (Part 14)

This is all very cool. Presumably the groupoids you consider can be generalized to $G$-sets equipped with an element of $G$, potentially further equipped with distinguished orbits (for $G$ a fixed finite group). There is a whole literature on probability on groups that could maybe be approached from this angle.

Posted by: Mark Meckes on December 17, 2024 3:50 PM | Permalink | Reply to this

Re: Random Permutations (Part 14)

Thanks! I can indeed categorify a bunch of general abstract nonsense results along the lines you mention. For some category theorists, stripping off all the specifics might be enjoyable. But for me, the concreteness of the results being categorified is part of the fun. So I guess I should learn what people have done with other finite groups.

It’s really tempting to look at groups $GL(n, \mathbb{F}_q)$, since they are ‘$q$-deformed versions’ of the symmetric groups $S_n$, and we might get similar formulas showing up but with $q$-factorials replacing factorials, etc.

The phrase “a whole literature” is simultaneously intriguing yet terrifying. Do you know a nice review article or something?

Posted by: John Baez on December 17, 2024 10:16 PM | Permalink | Reply to this

Re: Random Permutations (Part 14)

There are a lot of mostly disconnected facets to probability on groups and I don’t know a good review for the whole area generally. But here is a not-too-old review of one important facet of this literature (random walks on groups), here is a recent paper I happen to know of that feels closer in spirit to the kind of stuff you’re doing here, and (since you mentioned $GL(n, \mathbb{F}_q)$) here is an older survey about random matrices over finite fields.

Posted by: Mark Meckes on December 18, 2024 10:54 PM | Permalink | Reply to this

Re: Random Permutations (Part 14)

Thanks! That survey on random matrices over finite fields is just what I want.

Posted by: John Baez on December 21, 2024 1:27 AM | Permalink | Reply to this

Re: Random Permutations (Part 14)

Nice post, as usual.

A typo: Following “is the falling power”, $x$ should be $n$.

Posted by: Kevin Walker on December 20, 2024 5:10 PM | Permalink | Reply to this

Re: Random Permutations (Part 14)

Thanks! Fixed.

Posted by: John Baez on December 21, 2024 1:26 AM | Permalink | Reply to this

Re: Random Permutations (Part 14)

I’ve polished up this blog article and turned it into a paper:

It’s the shortest paper I’ve written in… forever? I really like the idea of writing short papers now.

The last section is new, not in the blog article. It explains how whenever you have a finite group $G$, a functor

$$F \colon G \sslash G \to \mathsf{FinSet}$$

describes a kind of structure you can put on elements of $G$, which is equivariant under conjugation. Counting the number of structures you can put on an element of $G$, you get a function

$$|F| \colon G \to \mathbb{N}$$

where the bars mean the cardinality of a finite set. The expected value of $|F|$ is

$$E(|F|) = \frac{1}{|G|} \sum_{g \in G} |F(g)|$$

But we can compute this as the cardinality of a groupoid! Just apply the Grothendieck construction to $F$, or in other words form its category of elements, to get a groupoid $\int F$. This is the groupoid of “elements $g \in G$ equipped with a structure $x \in F(g)$”. Then I show

$$E(|F|) = \bigl|\textstyle{\int} F\bigr|$$

where the bars at right mean groupoid cardinality.

This equation is cute, but it gets even cuter if we remember that the expected value of $|F|$ is its integral with respect to the normalized Haar measure on $G$. Then we can write the equation as

$$\textstyle{\int} |F| = \bigl|\textstyle{\int} F\bigr|$$
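
A concrete sanity check of this equation, for one hypothetical example chosen just for illustration: $G = S_3$ and $F(g)$ the set of fixed points of $g$ (a conjugation-equivariant structure). We compute $E(|F|)$ directly and compare it with the groupoid cardinality of $\int F$, which is an action groupoid, so we sum $1/|\mathrm{stabilizer}|$ over orbit representatives.

```python
# Check E(|F|) = |∫F| for G = S_3 and F(g) = the fixed-point set of g.
from itertools import permutations
from fractions import Fraction

def compose(a, b):
    return tuple(a[b[i]] for i in range(len(b)))

def inverse(a):
    inv = [0] * len(a)
    for i, ai in enumerate(a):
        inv[ai] = i
    return tuple(inv)

G = list(permutations(range(3)))
F = {g: [x for x in range(3) if g[x] == x] for g in G}        # fixed points of g

expected_value = Fraction(sum(len(F[g]) for g in G), len(G))  # E(|F|)

# Objects of ∫F are pairs (g, x) with x in F(g); h in G acts by (g, x) -> (h g h⁻¹, h(x)).
objects = [(g, x) for g in G for x in F[g]]

def act(h, obj):
    g, x = obj
    return (compose(compose(h, g), inverse(h)), h[x])

card, seen = Fraction(0), set()
for obj in objects:
    if obj not in seen:
        orbit = {act(h, obj) for h in G}
        stabilizer = [h for h in G if act(h, obj) == obj]
        card += Fraction(1, len(stabilizer))
        seen |= orbit

print(expected_value, card)   # both equal 1
```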

Posted by: John Baez on December 21, 2024 1:40 AM | Permalink | Reply to this
