Offset Filtration

From HandWiki

The offset filtration (also called the "union-of-balls"[1] or "union-of-disks"[2] filtration) is a growing sequence of metric balls used to detect the size and scale of topological features of a data set. The offset filtration commonly arises in persistent homology and the field of topological data analysis. Utilizing a union of balls to approximate the shape of geometric objects was first suggested by Frosini in 1992 in the context of submanifolds of Euclidean space.[3] The construction was independently explored by Robins in 1998, and expanded to considering the collection of offsets indexed over a series of increasing scale parameters (i.e., a growing sequence of balls), in order to observe the stability of topological features with respect to attractors.[4] Homological persistence as introduced in these papers by Frosini and Robins was subsequently formalized by Edelsbrunner et al. in their seminal 2002 paper Topological Persistence and Simplification.[5] Since then, the offset filtration has become a primary example in the study of computational topology and data analysis.

Definition

Let [math]\displaystyle{ X }[/math] be a finite set in a metric space [math]\displaystyle{ (M,d) }[/math], and for any [math]\displaystyle{ x\in X }[/math] let [math]\displaystyle{ B(x,\varepsilon) = \{y\in X \mid d(x,y) \leq \varepsilon \} }[/math] be the closed ball of radius [math]\displaystyle{ \varepsilon }[/math] centered at [math]\displaystyle{ x }[/math]. Then the union [math]\displaystyle{ X^{(\varepsilon)}:=\bigcup_{x\in X} B(x,\varepsilon) }[/math] is known as the offset of [math]\displaystyle{ X }[/math] with respect to the parameter [math]\displaystyle{ \varepsilon }[/math] (or simply the [math]\displaystyle{ \varepsilon }[/math]-offset of [math]\displaystyle{ X }[/math]).

The offset filtration at six scale parameters on a point cloud sampled from two circles of different sizes.

By considering the collection of offsets over all [math]\displaystyle{ \varepsilon \in [0,\infty) }[/math] we get a family of spaces [math]\displaystyle{ \mathcal O(X) := \{ X^{(\varepsilon)} \mid \varepsilon \in [0,\infty)\} }[/math] where [math]\displaystyle{ X^{(\varepsilon)}\subseteq X^{(\varepsilon^\prime)} }[/math] whenever [math]\displaystyle{ \varepsilon \leq \varepsilon^\prime }[/math]. So [math]\displaystyle{ \mathcal O(X) }[/math] is a family of nested topological spaces indexed over [math]\displaystyle{ \varepsilon }[/math], which defines a filtration known as the offset filtration on [math]\displaystyle{ X }[/math].[6]

Note that it is also possible to view the offset filtration as a functor [math]\displaystyle{ \mathcal O(X) : [0, \infty) \to \mathbf{Top} }[/math] from the poset category of non-negative real numbers to the category of topological spaces and continuous maps.[7][8] There are some advantages to the categorical viewpoint, as explored by Bubenik and others.[9]

Properties

A standard application of the nerve theorem shows that the union of balls has the same homotopy type as its nerve, since closed balls are convex and the intersection of convex sets is convex.[10] The nerve of the union of balls is also known as the Čech complex,[11] which is a subcomplex of the Vietoris-Rips complex.[12] Therefore the offset filtration is weakly equivalent to the Čech filtration (defined as the nerve of each offset across all scale parameters), so their homology groups are isomorphic.[13]

Although the Vietoris-Rips filtration is not identical to the Čech filtration in general, it is an approximation in a sense. In particular, for a set [math]\displaystyle{ X \subset \mathbb R^d }[/math] we have a chain of inclusions [math]\displaystyle{ \operatorname{Rips}_\varepsilon(X) \subset \operatorname{Cech}_{\varepsilon^\prime}(X) \subset \operatorname{Rips}_{\varepsilon^\prime}(X) }[/math] between the Rips and Čech complexes on [math]\displaystyle{ X }[/math] whenever [math]\displaystyle{ \varepsilon^\prime / \varepsilon \geq \sqrt{2d/d+1} }[/math].[14] In general metric spaces, we have that [math]\displaystyle{ \operatorname{Cech}_\varepsilon(X) \subset \operatorname{Rips}_{2\varepsilon}(X) \subset \operatorname{Cech}_{2\varepsilon}(X) }[/math] for all [math]\displaystyle{ \varepsilon \gt 0 }[/math], implying that the Rips and Cech filtrations are 2-interleaved with respect to the interleaving distance as introduced by Chazal et al. in 2009.[15][16]

It is a well-known result of Niyogi, Smale, and Weinberger that given a sufficiently dense random point cloud sample of a smooth submanifold in Euclidean space, the union of balls of a certain radius recovers the homology of the object via a deformation retraction of the Čech complex.[17]

The offset filtration is also known to be stable with respect to perturbations of the underlying data set. This follows from the fact that the offset filtration can be viewed as a sublevel-set filtration with respect to the distance function of the metric space. The stability of sublevel-set filtrations can be stated as follows: Given any two real-valued functions [math]\displaystyle{ \gamma, \kappa }[/math] on a topological space [math]\displaystyle{ T }[/math] such that for all [math]\displaystyle{ i\geq 0 }[/math], the [math]\displaystyle{ i\text{th} }[/math]-dimensional homology modules on the sublevel-set filtrations with respect to [math]\displaystyle{ \gamma, \kappa }[/math] are point-wise finite dimensional, we have [math]\displaystyle{ d_B (\mathcal B_i (\gamma), \mathcal B_i (\kappa)) \leq d_\infty (\gamma, \kappa) }[/math] where [math]\displaystyle{ d_B(-) }[/math] and [math]\displaystyle{ d_\infty(-) }[/math] denote the bottleneck and sup-norm distances, respectively, and [math]\displaystyle{ \mathcal B_i (-) }[/math] denotes the [math]\displaystyle{ i\text{th} }[/math]-dimensional persistent homology barcode.[18] While first stated in 2005, this sublevel stability result also follows directly from an algebraic stability property sometimes known as the "Isometry Theorem,"[9] which was proved in one direction in 2009,[16] and the other direction in 2011.[19][20]

A multiparameter extension of the offset filtration defined by considering points covered by multiple balls is given by the multicover bifiltration, and has also been an object of interest in persistent homology and computational geometry.[21][22]

References

  1. Adams, Henry; Moy, Michael (2021). "Topology Applied to Machine Learning: From Global to Local". Frontiers in Artificial Intelligence 4: 2. doi:10.3389/frai.2021.668302. ISSN 2624-8212. PMID 34056580. 
  2. Edelsbrunner, Herbert (2014). A short course in computational geometry and topology. Cham. pp. 35. ISBN 978-3-319-05957-0. OCLC 879343648. https://www.worldcat.org/oclc/879343648. 
  3. Frosini, Patrizio (1992-02-01). Casasent, David P.. ed. "Measuring shapes by size functions". Intelligent Robots and Computer Vision X: Algorithms and Techniques (Boston, MA) 1607: 122–133. doi:10.1117/12.57059. Bibcode1992SPIE.1607..122F. http://proceedings.spiedigitallibrary.org/proceeding.aspx?articleid=980889. 
  4. Robins, Vanessa (1999-01-01). "Towards computing homology from approximations". Topology Proceedings 24: 503–532. http://topology.nipissingu.ca/tp/reprints/v24/tp24222.pdf. 
  5. Edelsbrunner; Letscher; Zomorodian (2002). "Topological Persistence and Simplification" (in en). Discrete & Computational Geometry 28 (4): 511–533. doi:10.1007/s00454-002-2885-2. ISSN 0179-5376. http://link.springer.com/10.1007/s00454-002-2885-2. 
  6. Halperin, Dan; Kerber, Michael; Shaharabani, Doron (2015), Bansal, Nikhil; Finocchi, Irene, eds., "The Offset Filtration of Convex Objects", Algorithms - ESA 2015 (Berlin, Heidelberg: Springer Berlin Heidelberg) 9294: pp. 705–716, doi:10.1007/978-3-662-48350-3_59, ISBN 978-3-662-48349-7, http://link.springer.com/10.1007/978-3-662-48350-3_59, retrieved 2023-02-25 
  7. Bauer, Ulrich; Kerber, Michael; Roll, Fabian; Rolle, Alexander (2023-02-16). "A Unified View on the Functorial Nerve Theorem and its Variations". p. 8. arXiv:2203.03571 [math.AT].
  8. Blumberg, Andrew J.; Lesnick, Michael (2022-10-17). "Stability of 2-Parameter Persistent Homology" (in en). Foundations of Computational Mathematics. doi:10.1007/s10208-022-09576-6. ISSN 1615-3375. https://link.springer.com/10.1007/s10208-022-09576-6. 
  9. 9.0 9.1 Bubenik, Peter; Scott, Jonathan A. (2014). "Categorification of Persistent Homology" (in en). Discrete & Computational Geometry 51 (3): 600–627. doi:10.1007/s00454-014-9573-x. ISSN 0179-5376. http://link.springer.com/10.1007/s00454-014-9573-x. 
  10. Edelsbrunner, Herbert (1993). "The union of balls and its dual shape" (in en). Proceedings of the Ninth Annual Symposium on Computational Geometry - SCG '93 (San Diego, California, United States: ACM Press): 218–231. doi:10.1145/160985.161139. ISBN 978-0-89791-582-3. http://portal.acm.org/citation.cfm?doid=160985.161139. 
  11. Kim, Jisu; Shin, Jaehyeok; Chazal, Frédéric; Rinaldo, Alessandro; Wasserman, Larry (2020-05-12). "Homotopy Reconstruction via the Cech Complex and the Vietoris-Rips Complex". arXiv:1903.06955 [math.AT].
  12. Edelsbrunner, Herbert (2010). Computational topology : an introduction. J. Harer. Providence, R.I.: American Mathematical Society. pp. 61. ISBN 978-0-8218-4925-5. OCLC 427757156. https://www.worldcat.org/oclc/427757156. 
  13. Chazal, Frédéric; Michel, Bertrand (2021). "An Introduction to Topological Data Analysis: Fundamental and Practical Aspects for Data Scientists". Frontiers in Artificial Intelligence 4: 667963. doi:10.3389/frai.2021.667963. ISSN 2624-8212. PMID 34661095. 
  14. de Silva, Vin; Ghrist, Robert (2007-04-25). "Coverage in sensor networks via persistent homology" (in en). Algebraic & Geometric Topology 7 (1): 339–358. doi:10.2140/agt.2007.7.339. ISSN 1472-2739. http://www.msp.org/agt/2007/7-1/p16.xhtml. 
  15. Anai, Hirokazu; Chazal, Frédéric; Glisse, Marc; Ike, Yuichi; Inakoshi, Hiroya; Tinarrage, Raphaël; Umeda, Yuhei (2020-05-26). "DTM-based Filtrations". arXiv:1811.04757 [cs.CG].
  16. 16.0 16.1 Chazal, Frédéric; Cohen-Steiner, David; Glisse, Marc; Guibas, Leonidas J.; Oudot, Steve Y. (2009-06-08). "Proximity of persistence modules and their diagrams" (in en). Proceedings of the Twenty-fifth Annual Symposium on Computational Geometry (Aarhus Denmark: ACM): 237–246. doi:10.1145/1542362.1542407. ISBN 978-1-60558-501-7. https://dl.acm.org/doi/10.1145/1542362.1542407. 
  17. Niyogi, Partha; Smale, Stephen; Weinberger, Shmuel (2008). "Finding the Homology of Submanifolds with High Confidence from Random Samples" (in en). Discrete & Computational Geometry 39 (1–3): 419–441. doi:10.1007/s00454-008-9053-2. ISSN 0179-5376. http://link.springer.com/10.1007/s00454-008-9053-2. 
  18. Cohen-Steiner, David; Edelsbrunner, Herbert; Harer, John (2007). "Stability of Persistence Diagrams" (in en). Discrete & Computational Geometry 37 (1): 103–120. doi:10.1007/s00454-006-1276-5. ISSN 0179-5376. http://link.springer.com/10.1007/s00454-006-1276-5. 
  19. Lesnick, Michael (2015). "The Theory of the Interleaving Distance on Multidimensional Persistence Modules" (in en). Foundations of Computational Mathematics 15 (3): 613–650. doi:10.1007/s10208-015-9255-y. ISSN 1615-3375. http://link.springer.com/10.1007/s10208-015-9255-y. 
  20. Lesnick, Michael (2023). "Lecture notes for AMAT 840: Multiparameter Persistence". University at Albany, SUNY. https://www.albany.edu/~ML644186/840_2022/Math840_Notes_22.pdf. 
  21. Corbet, René; Kerber, Michael; Lesnick, Michael; Osang, Georg (2023-02-20). "Computing the Multicover Bifiltration" (in en). Discrete & Computational Geometry. doi:10.1007/s00454-022-00476-8. ISSN 0179-5376. https://link.springer.com/10.1007/s00454-022-00476-8. 
  22. Edelsbrunner, Herbert; Osang, Georg (2021). "The Multi-Cover Persistence of Euclidean Balls" (in en). Discrete & Computational Geometry 65 (4): 1296–1313. doi:10.1007/s00454-021-00281-9. ISSN 0179-5376. PMID 34720303.