(BT) Diversity from (LC) Diversity

Posted by Tom Leinster

Around 2010, in papers that both appeared in print in 2012, two different mathematical notions were introduced and given the name “diversity”.

One, introduced by Tom Leinster and Christina Cobbold, is already familiar to regular readers of this blog. Say $X$ is a finite set, and for each $x,y \in X$ we have a number $Z(x,y) = Z(y,x) \in [0,1]$ that specifies how “similar” $x$ and $y$ are. (Typically we also assume $Z(x,x) = 1$.) Fix a parameter $q \in [0,\infty]$. If $p$ is a probability distribution on $X$, then the quantity $D_q^Z(p) = \left(\sum_{x\in \operatorname{supp}(p)} \left( \sum_{y\in \operatorname{supp}(p)} Z(x,y) p(y)\right)^{q-1} p(x)\right)^{1/(1-q)}$ (with the cases $q=1,\infty$ defined by taking limits) can be interpreted as the “effective number of points” in $X$, taking into account both the similarities between points as quantified by $Z$ and the weights specified by $p$. Its logarithm $\log D_q^Z(p)$ is a refinement of the $q$-Rényi entropy of $p$. The main motivating example is when $X$ is a set of species of organisms present in an ecosystem, and $D_q^Z(p)$ quantifies the “effective number of species” in $X$, accounting for both similarities between species and their relative abundances. This family of quantities turns out to subsume many of the diversity measures previously introduced in the theoretical ecology literature, and they are now often referred to as Leinster–Cobbold diversities.
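As a rough illustration of the formula (a sketch, not code from either paper), here is one way to evaluate $D_q^Z(p)$ numerically. The function name `lc_diversity` is made up for this example, and the limiting cases are the standard ones: $q = 1$ gives $\exp(-\sum_x p(x) \log (Zp)(x))$ and $q = \infty$ gives $1/\max_x (Zp)(x)$, with $x$ ranging over the support of $p$. The code assumes $Z(x,x) = 1$, so that $(Zp)(x) > 0$ on the support.

```python
import numpy as np

def lc_diversity(Z, p, q):
    """Similarity-sensitive diversity D_q^Z(p) of a distribution p (hypothetical helper)."""
    Z = np.asarray(Z, dtype=float)
    p = np.asarray(p, dtype=float)
    support = p > 0                      # only points with positive weight contribute
    Zs = Z[np.ix_(support, support)]
    ps = p[support]
    zp = Zs @ ps                         # (Zp)(x): expected similarity of x to a random point
    if q == 1:                           # limiting case q -> 1
        return float(np.exp(-np.sum(ps * np.log(zp))))
    if np.isinf(q):                      # limiting case q -> infinity
        return float(1.0 / np.max(zp))
    return float(np.sum(ps * zp ** (q - 1)) ** (1.0 / (1.0 - q)))

# Four completely dissimilar, equally abundant species: effectively four species.
print(lc_diversity(np.eye(4), [0.25] * 4, q=2))

# Two points at distance 1 with Z = exp(-d) and uniform weights.
Z2 = np.exp(-np.array([[0.0, 1.0], [1.0, 0.0]]))
for q in (0, 0.5, 1, 2, np.inf):
    print(q, lc_diversity(Z2, [0.5, 0.5], q))
```

With $Z$ the identity matrix this reduces to the classical Hill numbers, which is one sense in which the family refines Rényi entropy.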

The parameter $q$ determines how much $D_q^Z(p)$ counts the very “rare” points (those for which $p(x)$ is very small). An interesting question from an ecological point of view is, given $X$ and $Z$, which probability distribution $p$ maximizes the diversity $D_q^Z(p)$? It turns out that the answer is independent of $q$. Moreover, if $X$ is a metric space and $Z(x,y) = e^{-d(x,y)}$, this maximum diversity $D(X) := \max_p D_q^Z(p)$ is an isometric invariant closely related to the magnitude of $X$. It also extends in a natural way to compact metric spaces.
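As a quick illustration (a worked example added here, not taken from the paper), let $X = \{x, y\}$ with $d(x,y) = t > 0$, so that $Z(x,y) = e^{-t}$. Taking $q = 2$ and $p = (s, 1-s)$, the formula gives $D_2^Z(p) = \left(s^2 + (1-s)^2 + 2 s (1-s) e^{-t}\right)^{-1} = \left(1 - 2 s (1-s)(1 - e^{-t})\right)^{-1}$, which is largest at $s = 1/2$. For the uniform distribution, $\sum_{y} Z(x,y) p(y) = (1 + e^{-t})/2$ at both points, so $D_q^Z(p) = 2/(1 + e^{-t})$ for every $q$, and hence $D(X) = \frac{2}{1 + e^{-t}} = 1 + \tanh(t/2)$. This increases from $1$ to $2$ as $t$ goes from $0$ to $\infty$: two points very close together count as effectively one point, and two points far apart count as two.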

Independently, David Bryant and Paul Tupper defined a diversity on a set $X$ to be a $[0,\infty)$-valued function $\delta$ on the finite subsets of $X$ which satisfies:

$\delta(A) = 0$ if $A$ has at most one element, and
$\delta(A\cup B) \le \delta(A \cup C) + \delta(C \cup B)$ whenever $C \neq \emptyset$.

I will refer to a diversity in this sense as a BT diversity. If $\delta$ were defined only on sets with at most two elements, this would amount to the definition of a metric. In fact, if $d$ is a metric on $X$, then $\delta(A) = \operatorname{diam}(A) := \max_{a,b \in A} d(a,b)$ defines a BT diversity on $X$, so BT diversities are actually a generalization of metrics.
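The two axioms are easy to test by brute force on a small finite set. The following is a minimal sketch, assuming points on the real line with the usual distance; the helper names `all_subsets`, `diameter` and `is_bt_diversity` are invented for this example. It confirms that the diameter passes, and shows that the squared diameter does not.

```python
from itertools import combinations

def all_subsets(points):
    """All subsets of a finite list of points, as frozensets."""
    return [frozenset(c) for k in range(len(points) + 1)
            for c in combinations(points, k)]

def diameter(A):
    """Diameter of a finite subset of the real line (0 for the empty set)."""
    return max(A) - min(A) if A else 0.0

def is_bt_diversity(points, delta, tol=1e-12):
    """Check both BT axioms for delta on every subset of `points`."""
    subsets = all_subsets(points)
    value = {S: delta(S) for S in subsets}
    # Values are nonnegative, and sets with at most one element get 0.
    if any(v < 0 for v in value.values()):
        return False
    if any(value[S] != 0 for S in subsets if len(S) <= 1):
        return False
    # Triangle inequality: delta(A u B) <= delta(A u C) + delta(C u B) for C nonempty.
    return all(value[A | B] <= value[A | C] + value[C | B] + tol
               for A in subsets for B in subsets for C in subsets if C)

points = [0.0, 1.0, 2.0, 3.5]
print(is_bt_diversity(points, diameter))                      # True
print(is_bt_diversity(points, lambda A: diameter(A) ** 2))    # False
```

The squared diameter fails already for $A = \{0\}$, $B = \{2\}$, $C = \{1\}$, where the left-hand side is $4$ and the right-hand side is $2$.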

Here as well, the motivation for the name “diversity” comes from an example in theoretical ecology: suppose $X$ is a set of species in a phylogenetic tree $T$. Define $\delta(A)$ to be the length of the smallest subtree of $T$ containing $A$. Then $\delta$ is a BT diversity, known in the literature as phylogenetic diversity. However, just as with the maximum diversity discussed above, most of the subsequent work on BT diversities has focused on geometric examples.

So we now have two seemingly quite different geometric notions, introduced about the same time, going by strikingly similar names for conceptually similar reasons. One can’t help wondering, do they have something to do with each other? In particular, could maximum (LC) diversity be an example of a BT diversity?

In a new paper with Gautam Aishwarya, Dongbin Li, and Mokshay Madiman, we show that, after a minor tweak, maximum diversity does give rise to a BT diversity. The minor tweak is necessary to handle the first condition in the definition of BT diversity: if $X$ is a metric space and $x \in X$, it’s easy to check that $D(\{x\}) = 1$, whereas a BT diversity must satisfy $\delta(\{x\}) = 0$. This can be dealt with in the simplest imaginable way:

Theorem 1 Let $X$ be a metric space. For each nonempty finite $A \subseteq X$ set $\delta(A) = D(A) - 1$, and define also $\delta(\emptyset) = 0$. Then $\delta$ is a BT diversity on $X$.
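Theorem 1 can be sanity-checked numerically on a tiny example. The sketch below is only an illustration under stated assumptions, not the method of the paper: it takes distinct points on the real line (where the matrices $Z_B = (e^{-|x-y|})_{x,y \in B}$ are invertible) and uses the characterization, from earlier work on maximizing diversity, that $D$ is the largest total weight $\sum_x w(x)$ over nonempty subsets $B$ whose weighting $w$ (the solution of $Z_B w = 1$) is nonnegative. The names `max_diversity`, `delta` and `check_theorem_1` are hypothetical.

```python
from itertools import combinations
import numpy as np

def max_diversity(points):
    """Brute-force maximum diversity of distinct points on the real line."""
    pts = sorted(set(points))
    best = 0.0
    for k in range(1, len(pts) + 1):
        for B in combinations(pts, k):
            b = np.array(B)
            Z = np.exp(-np.abs(b[:, None] - b[None, :]))   # similarity matrix on B
            w = np.linalg.solve(Z, np.ones(k))             # weighting: Z w = 1
            if np.all(w >= -1e-12):                        # keep nonnegative weightings only
                best = max(best, float(w.sum()))
    return best

def delta(A):
    """Candidate BT diversity from Theorem 1: D(A) - 1, with delta(empty set) = 0."""
    return max_diversity(A) - 1.0 if A else 0.0

def check_theorem_1(points, tol=1e-9):
    """Check both BT axioms for delta on all subsets of `points`."""
    subsets = [frozenset(c) for k in range(len(points) + 1)
               for c in combinations(points, k)]
    value = {S: delta(S) for S in subsets}                 # cache delta on every subset
    small_ok = all(abs(value[S]) <= tol for S in subsets if len(S) <= 1)
    triangle_ok = all(value[A | B] <= value[A | C] + value[C | B] + tol
                      for A in subsets for B in subsets for C in subsets if C)
    return small_ok and triangle_ok

points = [0.0, 0.3, 1.0, 2.5]
print(max_diversity(points))
print(check_theorem_1(points))
```

The enumeration over subsets is exponential in the number of points, so this is only meant for toy examples.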

(In the paper itself, we adopt the term complexity when referring to the quantities $\log D_q^Z(p)$ and $\log D(X)$, and state most of the results in terms of complexity instead of maximum diversity; we further deduce from Theorem 1 that the complexity $\log D(X)$ is also a BT diversity. This terminology is used partly to cut down on the potential confusion created by using “diversity” in multiple ways. It also alludes to the relationship between $\log D_q^Z(p)$ and Rényi entropy, which is widely understood as a measure of “complexity”. Further connections between LC complexity and Rényi entropy are the subject of forthcoming work that I hope to be able to tell you more about soon! But for the remainder of this blog post I’ll stick to the maximum diversity formulation.)

Interestingly, maximum diversity has some properties that are quite nice and natural, but turn out to make it intriguingly different from the heretofore most thoroughly studied BT diversities. For example, $D = 1 + \delta$ has the following subadditivity property, which is not shared by the functional $1 + \operatorname{diam}$:

Theorem 2 Let $X$ be a metric space, and let $A_1, \ldots, A_n \subseteq X$ be compact subsets. Then $D\left(\bigcup_{i=1}^n A_i \right) \le \sum_{i=1}^n D(A_i).$
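To see why $1 + \operatorname{diam}$ fails this, a two-point example suffices (an illustrative sketch). Take $A_1 = \{x\}$ and $A_2 = \{y\}$ with $d(x,y) = t$. Then $1 + \operatorname{diam}(A_1 \cup A_2) = 1 + t$, while $(1 + \operatorname{diam}(A_1)) + (1 + \operatorname{diam}(A_2)) = 2$, so subadditivity fails as soon as $t > 1$. By contrast, $D(\{x\}) = D(\{y\}) = 1$, and optimizing over probability distributions on two points gives $D(\{x,y\}) = 2/(1 + e^{-t}) < 2$ for every $t$, consistent with Theorem 2.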

Maximum diversity actually satisfies a much stronger property called fractional subadditivity, which arises naturally in inequalities for entropy. Another special case of fractional subadditivity is the following.

Theorem 3 Let $X = \{x_1, \ldots, x_n\}$ be a finite metric space. Then $\frac{D(X)}{n} \le \frac{1}{n} \sum_{i=1}^n \frac{D(X \setminus \{x_i\})}{n-1}.$

Theorem 3 can be interpreted as saying that the “complexity per element” of $X$ is at most the average complexity per element of a randomly chosen subset of cardinality $n-1$. This captures the natural intuition that as the size of a metric space increases, its complexity per element decreases.

In the setting of $\mathbb{R}^n$, many examples of BT diversities are homogeneous, in the sense that $\delta(\lambda A) = \lambda \delta(A)$ for all $\lambda \ge 0$ and nonempty finite $A \subseteq \mathbb{R}^n$, and either sublinear, meaning homogeneous and also satisfying $\delta(A + B) \le \delta(A) + \delta(B)$, or else linear, where we have equality in the condition above. For example, the diameter is a sublinear diversity. (Diversities with these properties are the focus of a recent work by Bryant and Tupper.)
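For a concrete feel for these conditions, here is a small numerical sketch using the Euclidean diameter of finite point sets in $\mathbb{R}^2$; the helpers `diam` and `minkowski_sum` are names chosen for this example. It illustrates homogeneity and shows that the sublinearity inequality can be strict, so the diameter is sublinear but not linear.

```python
import numpy as np

def diam(A):
    """Euclidean diameter of a finite point set given as an (m, n) array."""
    A = np.asarray(A, dtype=float)
    diffs = A[:, None, :] - A[None, :, :]          # all pairwise differences
    return float(np.linalg.norm(diffs, axis=-1).max())

def minkowski_sum(A, B):
    """Minkowski sum {a + b : a in A, b in B} of two finite point sets."""
    A = np.asarray(A, dtype=float)
    B = np.asarray(B, dtype=float)
    return (A[:, None, :] + B[None, :, :]).reshape(-1, A.shape[1])

A = np.array([[0.0, 0.0], [1.0, 0.0]])   # horizontal unit segment (endpoints)
B = np.array([[0.0, 0.0], [0.0, 1.0]])   # vertical unit segment (endpoints)

print(diam(3.0 * A), 3.0 * diam(A))                  # homogeneity
print(diam(minkowski_sum(A, B)), diam(A) + diam(B))  # strict sublinearity: sqrt(2) < 2
```

Here $A + B$ is the set of corners of the unit square, whose diameter is $\sqrt{2}$.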

By contrast, maximum diversity has no simple homogeneity property; in fact its complex behavior with respect to scaling is part of what gives it such rich geometric interest. And at least in one dimension, the diversity $\delta = \log D$ satisfies the following superlinearity properties.

Theorem 4 Let $\delta$ be the diversity $\delta = \log D$ defined on compact subsets of $\mathbb{R}$. Then $\delta(A + B) \ge \delta(A) + \delta(B)$ and $\delta(\lambda A + (1-\lambda)B) \ge \lambda \delta(A) + (1-\lambda) \delta(B)$ for every $0 \le \lambda \le 1$ and nonempty compact $A,B \subseteq \mathbb{R}$.

The first inequality in Theorem 4 can be regarded as a generalization of the Cauchy–Davenport inequality in $\mathbb{R}$, and the second as a version of the Brunn–Minkowski inequality in $\mathbb{R}$. (In fact, since Lebesgue measure can be recovered from maximum diversity, it implies the Brunn–Minkowski inequality in $\mathbb{R}$.) It is an open question, for which we know some partial results, whether Theorem 4 can be extended to higher dimensions.

In conclusion, our results make (at least) the following points:

The seemingly independent mathematical notions of diversity introduced by Leinster and Cobbold on the one hand, and Bryant and Tupper on the other hand, are actually closely connected.
Maximum diversity, in the sense of LC diversities, leads to a geometrically interesting example of a BT diversity whose behavior is quite different from many of the previously studied examples of BT diversities.
Maximum diversity, at least in certain contexts, satisfies a number of inequalities which extend important classical inequalities, and it would be especially interesting to push this line of inquiry further.

Please read the paper itself for more detail and other remarks (it’s short!).

Posted at August 5, 2025 4:34 PM UTC

Re: (BT) Diversity from (LC) Diversity

That’s super nice!

In the remarks after Theorem 1 of your post, and also in Theorem 1.7 of your paper, you point out that LC maximum diversity $D(X)$ gives rise to a BT diversity in two ways. You can either use $D(X) - 1$ or $\log D(X)$.

First question: is $\phi(D(X))$ a BT diversity for any function $\phi$ satisfying $\phi(1) = 0$ and $\phi'(1) = 1$? And maybe we also want $\phi$ to be increasing.

I’m asking not only because that’s a common generalization, and not only with the idea that $x \mapsto x - 1$ is the linear approximation to any such function $\phi$, but also because of stuff about “Tsallis” vs Rényi entropies. Maybe I’ll elaborate on that if the answer to the first question is yes :-)

Second, vaguer, question: do you have a clear sense of whether we should view $D - 1$ or $\log D$ as the primary player here?

Posted by: Tom Leinster on August 5, 2025 5:22 PM

Regarding your first question, what we do in the paper is first show that $D-1$ is a BT diversity (using a reformulation of the definition contained in Lemma 2.2 in the paper). Then we apply Lemma 2.1, which states that if $\delta$ is a BT diversity, then so is $\log (\delta + 1)$. What the proof of Lemma 2.1 needs is that the function $\psi:[0,\infty) \to [0,\infty)$ given by $\psi(x) = \log (x+1)$ satisfies

$\psi(0) = 0$,
$\psi$ is (weakly) increasing, and
$\psi$ is subadditive ($\psi(x+y) \le \psi(x) + \psi(y)$).

Those hypotheses could perhaps be weakened, but I think the hypotheses you suggest for $\phi$ don’t give enough control for large arguments to substitute for the subadditivity property.
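For the record, the subadditivity of $\psi(x) = \log(x+1)$ is a one-line check: for $x, y \ge 0$ we have $(1+x)(1+y) = 1 + x + y + xy \ge 1 + x + y$, and taking logarithms gives $\psi(x+y) \le \psi(x) + \psi(y)$. Here is a plausible sketch of how the three properties combine (the actual proof of Lemma 2.1 may be organized differently): if $\delta$ is a BT diversity and $C \neq \emptyset$, then $\psi(\delta(A \cup B)) \le \psi(\delta(A \cup C) + \delta(C \cup B)) \le \psi(\delta(A \cup C)) + \psi(\delta(C \cup B))$, using monotonicity and then subadditivity, while $\psi(0) = 0$ takes care of sets with at most one element.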

As for the second question, I think the answer is very context-dependent.

I don’t think it’s possible to deduce that $D-1$ is a BT diversity from the fact that $\log D$ is. So from the point of view of Theorem 1 in the post and Theorem 1.7 of the paper, I’d say $D-1$ is the primary player.

On the other hand, I just spotted a typo in Theorem 4 of the post and the line above it (maybe you could fix this, Tom!): in that result, I should have said $\delta = \log D$. One could argue that the result amounts to a similar result for $D$ itself with products instead of sums on the right hand side, but given the interest in linearity properties for BT diversities, $\log D$ is the version that has a more directly comparable (in the sense of almost opposite!) behavior to more classical examples. So it perhaps fits into the existing world of BT diversities more naturally.

And in the forthcoming work that I alluded to, focusing more on relationships between diversity and entropy, $\log D$ will definitely be the primary player.

Posted by: Mark Meckes on August 5, 2025 6:32 PM

Thanks for the information on what’s needed of $\psi$. Evidently the situation is different from the one I had in mind, so it’s probably not worth elaborating on what I was thinking.

“I just spotted a typo in Theorem 4 of the post”

That should be fixed now.

“And in the forthcoming work that I alluded to, focusing more on relationships between diversity and entropy, $\log D$ will definitely be the primary player.”

Looking forward to it!

For those not following closely, “$\log D$” is a metric-sensitive maximum Rényi entropy.

Posted by: Tom Leinster on August 5, 2025 6:39 PM

While thinking about an upcoming talk I will give about this work, I suddenly realized what I should have titled this post:

A Tale of Two Diversities

My sincere apologies to all lovers of puns and of Dickens for this oversight.

Posted by: Mark Meckes on September 1, 2025 11:55 AM

The n-Category Café
