A Granular Grassmannian Clustering Framework via the Schubert Variety of Best Fit
By: Karim Salta, Michael Kirby, Chris Peterson
In many classification and clustering tasks, it is useful to compute a geometric representative for a dataset or a cluster, such as a mean or median. When datasets are represented by subspaces, these representatives become points on the Grassmann or flag manifold, with distances induced by their geometry, often via principal angles. We introduce a subspace clustering algorithm that replaces subspace means with a trainable prototype defined as a Schubert Variety of Best Fit (SVBF) - a subspace that comes as close as possible to intersecting each cluster member in at least one fixed direction. Integrated in the Linde-Buzo-Grey (LBG) pipeline, this SVBF-LBG scheme yields improved cluster purity on synthetic, image, spectral, and video action data, while retaining the mathematical structure required for downstream analysis.
Similar Papers
GradientSpace: Unsupervised Data Clustering for Improved Instruction Tuning
Machine Learning (CS)
Teaches AI to learn many skills better.
Hyperbolic Gaussian Blurring Mean Shift: A Statistical Mode-Seeking Framework for Clustering in Curved Spaces
Machine Learning (CS)
Finds hidden groups in complex, branching data.
Barycentric subspace analysis of network-valued data
Differential Geometry
Helps understand complex networks by finding patterns.