Rank-based linkage I: triplet comparisons and oriented simplicial complexes

R. W. R. Darling; Will Grilliette; Adam Logan

doi:10.46298/compositionality-8-2

R. W. R. Darling ; Will Grilliette ; Adam Logan - Rank-based linkage I: triplet comparisons and oriented simplicial complexes

compositionality:14123 - Compositionality, March 20, 2026, Volume 8 (2026) - https://doi.org/10.46298/compositionality-8-2

Rank-based linkage I: triplet comparisons and oriented simplicial complexesArticle

Authors: R. W. R. Darling ; Will Grilliette ; Adam Logan

Rank-based linkage is a new tool for summarizing a collection $S$ of objects according to their relationships. These objects are not mapped to vectors, and ``similarity'' between objects need be neither numerical nor symmetrical. All an object needs to do is rank nearby objects by similarity to itself, using a Comparator which is transitive, but need not be consistent with any metric on the whole set. Call this a ranking system on $S$. Rank-based linkage is applied to the $K$-nearest neighbor digraph derived from a ranking system. Computations occur on a 2-dimensional abstract oriented simplicial complex whose faces are among the points, edges, and triangles of the line graph of the undirected $K$-nearest neighbor graph on $S$. In $|S| K^2$ steps it builds an edge-weighted linkage graph $(S, \mathcal{L}, σ)$ where $σ(\{x, y\})$ is called the in-sway between objects $x$ and $y$. Take $\mathcal{L}_t$ to be the links whose in-sway is at least $t$, and partition $S$ into components of the graph $(S, \mathcal{L}_t)$, for varying $t$. Rank-based linkage is a functor from a category of ``out-ordered'' digraphs to a category of partitioned sets, with the practical consequence that augmenting the set of objects in a rank-respectful way gives a fresh clustering which does not ``rip apart'' the previous one. The same holds for single linkage clustering in the metric space context, but not for typical optimization-based methods. Orientation sheaves play in a fundamental role and ensure that partially overlapping data sets can be ``glued'' together. Open combinatorial problems are presented in the last section.

39 pages, 13 figures

https://doi.org/10.46298/compositionality-8-2

Source: arXiv.org:2302.02200

Volume: Volume 8 (2026)

Published on: March 20, 2026

Imported on: April 26, 2023

Keywords: Combinatorics, Statistics Theory, 62H30 (Primary) 05C20, 05E45, 05C76 (Secondary), G.4

Licence: arXiv.org - Non-exclusive license to distribute

Classifications

Mathematics Subject Classification 2020¹

Sources:

[1] zbMATH Open.

Bibliographic References

Share and export

Consultation statistics

This page has been seen 181 times.

This article's PDF has been downloaded 100 times.