(BT) Diversity from (LC) Diversity
Posted by Tom Leinster
Guest post by Mark Meckes
Around 2010, in papers that both appeared in print in 2012, two different mathematical notions were introduced and given the name “diversity”.
One, introduced by Tom Leinster and Christina Cobbold, is already familiar to regular readers of this blog. Say is a finite set, and for each we have a number that specifies how “similar” and are. (Typically we also assume .) Fix a parameter . If is a probability distribution on , then the quantity (with the cases defined by taking limits) can be interpreted as the “effective number of points” in , taking into account both the similarities between points as quantified by and the weights specified by . Its logarithm is a refinement of the -Rényi entropy of . The main motivating example is when is a set of species of organisms present in an ecosystem, and quantifies the “effective number of species” in , accounting for both similarities between species and their relative abundances. This family of quantities turns out to subsume many of the diversity measures previously introduced in the theoretical ecology literature, and they are now often referred to as Leinster–Cobbold diversities.
The parameter determines how much counts the very “rare” points (those for which is very small). An interesting question from an ecological point of view is, given and , which probability distribution maximizes the diversity ? It turns out that the answer is independent of . Moreover, if is a metric space and , this maximum diversity is an isometric invariant closely related to the magnitude of . It also extends in a natural way to compact metric spaces.
Independently, David Bryant and Paul Tupper defined a diversity on a set to be a -valued function on the finite subsets of which satisfies:
if has at most one element, and
whenever .
I will refer to a diversity in this sense as a BT diversity. If were defined only on sets with at most two elements, this would amount to the definition of a metric. In fact, if is a metric on , then defines a BT diversity on , so BT diversities are actually a generalization of metrics.
Here as well, the motivation for the name “diversity” comes from an example in theoretical ecology: suppose is a set of species in a phylogenetic tree . Define to be the length of the smallest subtree of containing . Then is a BT diversity, known in the literature as phylogenetic diversity. However, just as with the maximum diversity discussed above, most of the subsequent work on BT diversities has focused on geometric examples.
So we now have two seemingly quite different geometric notions, introduced about the same time, going by strikingly similar names for conceptually similar reasons. One can’t help wondering, do they have something to do with each other? In particular, could maximum (LC) diversity be an example of a BT diversity?
In a new paper with Gautam Ashwarya, Dongbin Li, and Mokshay Madiman, we show that, after a minor tweak, maximum diversity does give rise to a BT diversity. The minor tweak is necessary to handle the first condition in the definition of BT diversity: if is a metric space and , it’s easy to check that , whereas a BT diversity must satisfy . This can be dealt with in the simplest imaginable way:
Theorem 1 Let be a metric space. For each nonempty finite set , and define also . Then is a BT diversity on .
(In the paper itself, we adopt the term complexity when referring to the quantities and , and state most of the results in terms of complexity instead of maximum diversity; we further deduce from Theorem 1 that the complexity is also a BT diversity. This terminology is used partly to cut down on the potential confusion created by using “diversity” in multiple ways. It also alludes to the relationship between and Rényi entropy, which is widely understood as a measure of “complexity”. Further connections between LC complexity and Rényi entropy are the subject of forthcoming work that I hope to be able to tell you more about soon! But for the remainder of this blog post I’ll stick to the maximum diversity formulation.)
Interestingly, maximum diversity has some properties that are quite nice and natural, but turn out to make it intriguingly different from the heretofore most thoroughly studied BT diversities. For example, has the following subadditivity property, which is not shared by the functional :
Theorem 2 Let be a metric space, and let be compact subsets. Then
Maximum diversity actually satisfies a much stronger property called fractional subadditivity, which arises naturally in inequalities for entropy. Another special case of fractional subadditivity is the following.
Theorem 3 Let be a finite metric space. Then
Theorem 3 can be interpreted as saying that the “complexity per element” of is at most the average complexity per element of a randomly chosen subset of cardinality . This captures the natural intuition that as the size of a metric space increases, its complexity per element decreases.
In the setting of , many examples of BT diversities are homogeneous, in the sense that for all and nonempty finite , and either sublinear, meaning homogeneous and also satisfying or else linear, where we have equality in the condition above. For example, the diameter is a sublinear diversity. (Diversities with these properties are the focus of a recent work by Bryant and Tupper.)
By contrast, maximum diversity has no simple homogeneity property; in fact its complex behavior with respect to scaling is part of what gives it such rich geometric interest. And at least in one dimension, the diversity satisfies the following superlinearity properties.
Theorem 4 Let be the diversity defined on compact subsets of . Then and for every and nonempty compact .
The first inequality in Theorem 4 can be regarded as a generalization of the Cauchy–Davenport inequality in , and the second as a version of the Brunn–Minkowski inequality in . (In fact, since Lebesgue measure can be recovered from maximum diversity, it implies the Brunn–Minkowski inequality in .) It is an open question, for which we know some partial results, whether Theorem 4 can be extended to higher dimensions.
In conclusion, our results make (at least) the following points:
The seemingly independent mathematical notions of diversity introduced by Leinster and Cobbold on the one hand, and Bryant and Tupper on the other hand, are actually closely connected.
Maximum diversity, in the sense of LC diversities, leads to a geometrically interesting example of a BT diversity whose behavior is quite different from many of the previously studied examples of BT diversities.
Maximum diversity, at least in certain contexts, satisfies a number of inequalities which extend important classical inequalities, and it would be especially interesting to push this line of inquiry further.
Please read the paper itself for more detail and other remarks (it’s short!).
Re: (BT) Diversity from (LC) Diversity
That’s super nice!
In the remarks after Theorem 1 of your post, and also in Theorem 1.7 of your paper, you point out that LC maximum diversity gives rise to a BT diversity in two ways. You can either use or .
First question: is a BT diversity for any function satisfying and ? And maybe we also want to be increasing.
I’m asking not only because that’s a common generalization, and not only with the idea that is the linear approximation to any such function , but also because of stuff about “Tsallis” vs Rényi entropies. Maybe I’ll elaborate on that if the answer to the first question is yes :-)
Second, vaguer, question: do you have a clear sense of whether we should view or as the primary player here?