How to Count n-Ary Trees

July 7, 2025

Posted by John Baez

How do you count rooted planar $n$-ary trees with some number of leaves? For $n = 2$ this puzzle leads to the Catalan numbers. These are so fascinating that the combinatorist Richard Stanley wrote a whole book about them. But what about $n \gt 2$?

I’ll sketch one way to solve this puzzle using generating functions. This will give me an excuse to talk a bit about something called ‘Lagrange inversion’.

You can define a rooted planar binary tree recursively by this equation:

$B \cong x + B^2$

This means “to make a set into the leaves of a binary tree, it should either be the one-element set or the leaves of a pair of binary trees”. You can then solve this using the quadratic formula and get

$B \cong \frac{1 - \sqrt{1 - 4x}}{2} = x + x^2 + 2 x^3 + 5 x^4 + 14 x^5 + \cdots$

where the numbers here are the Catalan numbers. For example, there are 5 binary trees with 4 leaves. For more details and more rigor go here.
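The series above can be checked without the quadratic formula. Here is a minimal sketch that reads the equation $B \cong x + B^2$ as a recurrence on coefficients: a tree with $m \ge 2$ leaves is a pair of trees whose leaf counts add up to $m$. The function name `binary_tree_counts` is made up for this illustration.

```python
def binary_tree_counts(max_leaves):
    """Return b where b[m] is the coefficient of x^m in B, with B = x + B^2."""
    b = [0] * (max_leaves + 1)
    if max_leaves >= 1:
        b[1] = 1  # the one-leaf tree, i.e. the term x
    for m in range(2, max_leaves + 1):
        # The term B^2: i leaves in the left subtree, m - i in the right one.
        b[m] = sum(b[i] * b[m - i] for i in range(1, m))
    return b


print(binary_tree_counts(5)[1:])  # [1, 1, 2, 5, 14]
```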

We can define rooted planar $n$-ary trees by a similar equation:

$T \cong x + T^n$

But how do we solve this when $n \gt 2$? Even for $n = 3$ or $n = 4$, where we could use the cubic or quartic formula, we don’t really want to.

The trick is to use Lagrange inversion. Given a formal power series

$\displaystyle{ f(z) = \sum_{k=1}^\infty f_k \frac{z^k}{k!} }$

normalized with $f_1 = 1$ to make things simple, Lagrange inversion tells us a formal power series $g$ that’s the inverse of $f$ with respect to composition:

$g(f(z)) = z, \qquad f(g(z)) = z$

Here’s how it works. Let’s write

$\displaystyle{ g(z) = \sum_{j=1}^\infty g_j \frac{z^j}{j!} }$

Then $g_1 = 1$, and the Lagrange inversion theorem says that for $j \ge 2$ we have

$\displaystyle{ g_j = \sum_{k=1}^{j-1} (-1)^k \; j^{\overline{k}} \; B_{j-1,k}(\hat{f}_1,\hat{f}_2,\ldots,\hat{f}_{j-k}) }$

where

$\displaystyle{ \hat{f}_k = \frac{f_{k+1}}{(k+1)} }$

and the rising powers $j^{\overline{k}}$ are defined by

$\displaystyle{ j^{\overline{k}} = j(j+1)\cdots (j+k-1) }$

But the main ingredient in this formula is the Bell polynomials $B_{j,k}$. These are named after Eric Temple Bell, author of a science fiction series about a planet where mathematics is done only by men.

It seems easiest to explain the Bell polynomials with an example. $B_{j,k}$ is a polynomial in $j-k+1$ variables. It keeps track of all the ways you can partition a $j$-element set into $k$ disjoint nonempty subsets, called blocks. For example:

$B_{6,2}(x_1,x_2,x_3,x_4,x_5) = 6x_5x_1+15x_4x_2+10x_3^2$

This says that if we partition a 6-element set into 2 blocks there are:

6 ways to partition it into a block of size 5 and a block of size 1
15 ways to partition it into a block of size 4 and a block of size 2
10 ways to partition it into two blocks of size 3.

and those are all the ways!
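These three coefficients can be confirmed by brute force. The sketch below lists every way to split an $n$-element set into two nonempty blocks and tallies the splits by block sizes; the function name `two_block_partitions_by_size` is invented for this example.

```python
from collections import Counter
from itertools import product


def two_block_partitions_by_size(n):
    """Count partitions of {0, ..., n-1} into 2 nonempty blocks, keyed by block sizes."""
    counts = Counter()
    # Element 0 always sits in block A, so each unordered partition appears once.
    for labels in product((0, 1), repeat=n - 1):
        size_b = sum(labels)  # how many of the other elements go to block B
        if size_b == 0:
            continue  # block B would be empty
        size_a = n - size_b
        counts[tuple(sorted((size_a, size_b), reverse=True))] += 1
    return dict(counts)


print(two_block_partitions_by_size(6))  # {(5, 1): 6, (4, 2): 15, (3, 3): 10}
```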

So, if you know all the Bell polynomials, you know how many ways there are to partition any finite set into some chosen number of blocks of some chosen sizes.
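With Bell polynomials in hand, the inversion formula above can be tried numerically. What follows is only a sketch under stated assumptions: it evaluates $B_{j,k}$ with the standard recurrence $B_{n,k} = \sum_i \binom{n-1}{i-1} x_i B_{n-i,k-1}$ (a swapped-in technique, not something from the post), it assumes $f_1 = 1$, and the names `partial_bell`, `rising` and `lagrange_inverse` are hypothetical. The test case $f(z) = e^z - 1$, whose inverse is $\ln(1+z)$, is also my choice of example.

```python
from fractions import Fraction
from functools import lru_cache
from math import comb


def partial_bell(n, k, xs):
    """Evaluate the partial Bell polynomial B_{n,k} at xs = (x_1, x_2, ...)."""
    xs = tuple(xs)

    @lru_cache(maxsize=None)
    def bell(n, k):
        if n == 0 and k == 0:
            return Fraction(1)
        if n == 0 or k == 0:
            return Fraction(0)
        total = Fraction(0)
        # i is the size of the block containing one fixed element.
        for i in range(1, n - k + 2):
            x_i = xs[i - 1] if i <= len(xs) else 0  # missing variables count as 0
            total += comb(n - 1, i - 1) * x_i * bell(n - i, k - 1)
        return total

    return bell(n, k)


def rising(j, k):
    """Rising power j (j+1) ... (j+k-1)."""
    result = 1
    for i in range(k):
        result *= j + i
    return result


def lagrange_inverse(f, max_j):
    """Given f = [f_1, f_2, ...] with f_1 == 1, return [g_1, ..., g_max_j]."""
    assert f[0] == 1, "this sketch assumes the normalization f_1 = 1"
    # f_hat[k-1] holds f_{k+1} / (k+1); coefficients beyond the list are 0.
    f_hat = [Fraction(f[k], k + 1) if k < len(f) else Fraction(0)
             for k in range(1, max_j)]
    g = [Fraction(1)]
    for j in range(2, max_j + 1):
        g.append(sum((-1) ** k * rising(j, k) * partial_bell(j - 1, k, f_hat)
                     for k in range(1, j)))
    return g


# f(z) = e^z - 1 has f_k = 1 for all k; its inverse ln(1+z) has g_j = (-1)^(j-1) (j-1)!.
print([int(g) for g in lagrange_inverse([1] * 6, 5)])  # [1, -1, 2, -6, 24]
```

Exact `Fraction` arithmetic is used so that the division in $\hat{f}_k$ introduces no rounding.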

The formula for Lagrange inversion is intimidating yet intriguing, and that’s what I really want to understand. But for now let’s just apply it to count $n$-ary trees. We’re trying to solve

$T = x + T^n$

or in other words

$x = T - T^n$

for $T$. So we make up a function

$f(z) = z - z^n$

and we seek the inverse power series $g$: this will give us $T$ as a power series in $x$.

The first coefficient of $f$ is $1$, so the formula for Lagrange inversion applies as it stands. The only other nonzero coefficient, that of $z^n$, is negative, and luckily this will just get rid of the sign $(-1)^k$ that appeared in that formula.

Actually I’ll skip the detailed calculation, which is much less fun to read than to do yourself. The main point is that Lagrange inversion does the job. I’ll just give you the answer:

$\displaystyle{ g(z) = \sum_{k=0}^\infty \binom{n k}{k} \frac{z^{(n-1)k+1} }{(n-1)k+1} }$

So, the number of rooted planar $n$-ary trees with $(n-1)k + 1$ leaves should be

$\binom{n k}{k} \frac{1}{(n-1)k+1}$
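For readers who want a glimpse of the skipped calculation, here is one possible sketch of it, not necessarily the route the author had in mind. It assumes we invert $f(z) = z - z^n$, so that $f_1 = 1$, $f_n = -n!$ and every other $f_k$ vanishes. Then the only nonzero $\hat{f}_i$ is $\hat{f}_{n-1}$, the Bell polynomial only sees partitions of a $(j-1)$-element set into $k$ blocks that all have size $n-1$, and so $g_j$ vanishes unless $j = (n-1)k + 1$:

```latex
% Assumption: f(z) = z - z^n, so f_1 = 1, f_n = -n!, and all other f_k are zero.
\begin{align*}
\hat{f}_{n-1} &= \frac{f_n}{n} = -(n-1)!, \qquad \hat{f}_i = 0 \text{ for } i \ne n-1, \\
B_{j-1,k}(\hat{f}_1,\hat{f}_2,\ldots)
  &= \frac{(j-1)!}{k!\,((n-1)!)^k}\,\bigl(-(n-1)!\bigr)^k
   = (-1)^k \frac{(j-1)!}{k!}
   \quad \text{when } j-1 = (n-1)k, \\
g_j &= (-1)^k \, j^{\overline{k}} \, (-1)^k \frac{(j-1)!}{k!}
     = \frac{(j+k-1)!}{k!} = \frac{(nk)!}{k!}, \\
\frac{g_j}{j!} &= \frac{(nk)!}{k!\,((n-1)k+1)!}
     = \binom{nk}{k} \frac{1}{(n-1)k+1}.
\end{align*}
```

The two factors of $(-1)^k$ cancel, which is why every coefficient comes out positive.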

As a sanity check, note that an $n$-ary tree always has $(n-1)k + 1$ leaves for some natural number $k \ge 0$, because it can either have $1$ leaf (the root), or we can stick a sprout with $n$ leaves on an existing leaf, thus adding $n-1$ new leaves.

Also note that when $n = 2$ we get our friends the Catalan numbers!
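The closed formula can also be tested against the defining equation directly. This sketch iterates $T \mapsto x + T^n$ on truncated power series until the coefficients stop changing, then compares with $\binom{nk}{k}\frac{1}{(n-1)k+1}$. The helper names `poly_mul` and `nary_tree_counts` are invented here, and fixed-point iteration is just one convenient way to get the coefficients.

```python
from math import comb


def poly_mul(a, b, size):
    """Multiply two coefficient lists, keeping only degrees below size."""
    out = [0] * size
    for i, a_i in enumerate(a):
        if a_i == 0:
            continue
        for j, b_j in enumerate(b):
            if i + j >= size:
                break
            out[i + j] += a_i * b_j
    return out


def nary_tree_counts(n, max_leaves):
    """Return t where t[m] is the coefficient of x^m in T, with T = x + T^n."""
    size = max_leaves + 1
    t = [0] * size
    # Each pass fixes at least one more coefficient, so max_leaves passes suffice.
    for _ in range(max_leaves):
        power = [1] + [0] * (size - 1)
        for _factor in range(n):
            power = poly_mul(power, t, size)  # builds T^n, truncated
        t = power
        t[1] += 1  # add the term x
    return t


for n in (2, 3, 4):
    t = nary_tree_counts(n, 13)
    for m in range(1, 14):
        k, remainder = divmod(m - 1, n - 1)
        expected = comb(n * k, k) // m if remainder == 0 else 0
        assert t[m] == expected

print(nary_tree_counts(3, 7)[1:])  # [1, 0, 1, 0, 3, 0, 12]
```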

So this is pretty cool, and it raises tons of interesting questions, mostly about the deep inner meaning of Lagrange inversion. Richard Stanley wrote about this in Section 5.4 of the second volume of Enumerative Combinatorics, André Joyal wrote about it in Theorem 2 of Une théorie combinatoire des séries formelles (with a partially finished English translation here), and Flajolet and Sedgewick wrote about it in Appendix A.6 of Analytic Combinatorics. So there’s a lot of material to read! But I find all these discussions puzzling, so I’m trying to dig deeper and find an explanation that’s easier to grasp. Luckily Todd Trimble knows a lot about this subject, and it’s very beautiful! So stay tuned.

Posted at July 7, 2025 8:24 AM UTC

7 Comments & 0 Trackbacks

Re: How to Count n-Ary Trees

Three OEIS hubs for notes and references on compositional inversion of analytic functions and formal power series, all related to variants of the Lagrange inversion formula, are OEIS A145271, A133437 (an e.g.f. variant of the o.g.f. A111785), and A134264. There are numerous combinatorial models: see, e.g., Guises of the Stasheff polytopes, associahedra for the Coxeter $A_n$ root system? for A111785, and Guises of the noncrossing partitions (NCPs) / refined Narayana polynomials for A134264. The tree model dates back to Arthur Cayley (1857; see simple core cases in my pdf Mathemagical Forests), based on observations of a relation between compositional inversion and iterated derivatives by Charles Graves (1853; see two of my Math Overflow answers here and here). The degenerate case of inverting $y = f(x) = x + x^n$ with $n$ any integer is sketched in my answer to the MO-Q A combinatorial interpretation for n-ary-trees for-negative n.

Posted by: Tom Copeland on July 7, 2025 6:10 PM | Permalink | Reply to this

Re: How to Count n-Ary Trees

Thanks! Maybe Todd and I can figure out more about $n$-ary trees for negative $n$, since we’re getting good at ‘combinatorics with negative sets’. But these may be harder than trees with a negative number of leaves!

Posted by: John Baez on July 7, 2025 9:04 PM | Permalink | Reply to this

Re: How to Count n-Ary Trees

Should the reciprocity for the rising and falling factorials

$\binom{-n}{k} = (-1)^k \binom{n+k-1}{k}$

arise in your machinations and/or the infinite dihedral group rep given in my MO-Q Multivariate polynomial representations of the infinite dihedral group, please alert me with a comment below my MO-Q.

Posted by: Tom Copeland on July 8, 2025 12:06 AM | Permalink | Reply to this

Re: How to Count n-Ary Trees

The classic Lagrange inversion partition polynomials, giving the inverse of formal Taylor series, or exponential generating functions, are presented in OEIS A134685 with a simple combinatorial interpretation of balls in bins illustrated in my pdf A short note on Lagrange inversion. These are related to phylogenetic trees and tropical Grassmannians.

The set of compositional inversion partition polynomials of OEIS A133437 gives the exponential generating function that is the inverse of a formal power series, or ordinary generating function. The corresponding o.g.f. is essentially given by A111785 (divide the polynomials of A133437 by the factorials). The first few of these partition polynomials can be found in a letter to Oldenburg, the first Secretary of England’s Royal Society, written in 1676 by Isaac Newton. The coefficients of these polynomials count the distinct faces of the associahedra, an infinite set of progressively higher dimensional polytopes first realized in the 1950s and 60s.

Posted by: Tom Copeland on July 7, 2025 9:39 PM | Permalink | Reply to this

Re: How to Count n-Ary Trees

Here is perhaps one way to interpret Lagrange inversion from a different (deeper?) perspective, namely generalized permutahedra:

Marcelo Aguiar and Federico Ardila, Hopf monoids and generalized permutahedra.

and

Federico Ardila, Algebraic structures on polytopes.

(for an accessible talk).

Posted by: Kenneth on July 8, 2025 6:32 PM | Permalink | Reply to this

Re: How to Count n-Ary Trees

I blogged about Aguiar and Ardila’s work here:

More secrets of the associahedra, January 25, 2018.

But now I am trying to understand that work better, hence my new post!

Thanks for the link to the video: I’d heard about it but couldn’t find it.

Posted by: John Baez on July 8, 2025 6:40 PM | Permalink | Reply to this

Re: How to Count n-Ary Trees

I keep meaning to expand this article a bit so it shows how we get from the general formula for Lagrange inversion to the formula for counting $n$-ary trees. But I keep being too busy!

Posted by: John Baez on July 25, 2025 8:34 AM | Permalink | Reply to this

The n-Category Café
