Higher-order, generically complete, continuous, and polynomial-time isometry invariants of periodic sets
By: Daniel E Widdowson, Vitaliy A Kurlin
Potential Business Impact:
Finds new crystals by spotting fakes.
Periodic point sets model all solid crystalline materials (crystals) whose atoms can be considered zero-sized points with or without atomic types. This paper addresses the fundamental problem of checking whether claimed crystals are genuinely novel rather than noisy perturbations of known materials obtained by unrealistic atomic replacements. Such near-duplicates have skewed the ground truth because past comparisons relied on unstable cells and symmetries. The proposed Lipschitz continuity under noise is a new essential requirement for machine learning on any data objects that have ambiguous representations and live in continuous spaces. For periodic point sets under isometry (any distance-preserving transformation), we designed invariants that distinguish all known counter-examples to the completeness of past descriptors and detect thousands of (near-)duplicates in large high-profile databases of crystals within two days on a modest desktop computer.
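The two properties named in the abstract, invariance under isometry and Lipschitz continuity under noise, can be illustrated on a toy case. The sketch below is not the paper's invariant: it assumes a finite (non-periodic) 2D point set and uses the sorted list of pairwise distances, a classical descriptor that is known to be incomplete. All function names are hypothetical and chosen for illustration only. It checks that the descriptor is unchanged by a rotation plus translation, and that moving every point by at most eps changes each sorted distance by at most 2 * eps.

```python
import math
import random


def sorted_pairwise_distances(points):
    """Toy descriptor: all pairwise Euclidean distances, sorted ascending."""
    n = len(points)
    return sorted(
        math.dist(points[i], points[j])
        for i in range(n)
        for j in range(i + 1, n)
    )


def rotate_translate(points, angle, shift):
    """Apply a rigid motion (rotation by `angle`, then translation by `shift`)."""
    c, s = math.cos(angle), math.sin(angle)
    return [(c * x - s * y + shift[0], s * x + c * y + shift[1]) for x, y in points]


def perturb(points, eps, rng):
    """Move every point by a random vector of length at most eps."""
    noisy = []
    for x, y in points:
        radius = eps * rng.random()
        theta = 2 * math.pi * rng.random()
        noisy.append((x + radius * math.cos(theta), y + radius * math.sin(theta)))
    return noisy


def max_abs_diff(a, b):
    """L-infinity distance between two equal-length lists of numbers."""
    return max(abs(u - v) for u, v in zip(a, b))


rng = random.Random(0)
points = [(rng.random(), rng.random()) for _ in range(8)]
base = sorted_pairwise_distances(points)

# Isometry invariance: a rigid motion leaves the descriptor unchanged
# (up to floating-point rounding).
moved = rotate_translate(points, angle=0.7, shift=(3.0, -1.5))
invariance_error = max_abs_diff(base, sorted_pairwise_distances(moved))

# Lipschitz continuity: each point moves by at most eps, so each pairwise
# distance changes by at most 2 * eps, and sorting preserves that bound.
eps = 0.01
noisy = perturb(points, eps, rng)
noise_change = max_abs_diff(base, sorted_pairwise_distances(noisy))

print(f"invariance error: {invariance_error:.2e}")
print(f"change under noise: {noise_change:.4f} (bound {2 * eps})")
```

For periodic sets the situation is harder, since the same crystal admits infinitely many unit cells and a small perturbation can change the minimal cell discontinuously. This is why descriptors based on cells and symmetries are described above as unstable.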
Similar Papers
Higher-order, generically complete, continuous, and polynomial-time isometry invariants of periodic sets
Computational Geometry
Finds fake crystals in science databases.
Geometric Data Science
Metric Geometry
Organizes all possible crystal shapes perfectly.
Continuous Uniqueness and Novelty Metrics for Generative Modeling of Inorganic Crystals
Machine Learning (CS)
Finds new materials faster and better.