Higher-order, generically complete, continuous, and polynomial-time isometry invariants of periodic sets
By: Daniel E Widdowson, Vitaliy A Kurlin
Potential Business Impact:
Finds fake crystals in science databases.
Periodic point sets model all solid crystalline materials (crystals) whose atoms can be considered zero-sized points with or without atomic types. This paper addresses the fundamental problem of checking whether claimed crystals are novel, not noisy perturbations of known materials obtained by unrealistic atomic replacements. Such near-duplicates have already skewed ground truth because past comparisons relied on discontinuous cells and symmetries. The proposed Lipschitz continuity under noise is a new essential requirement for machine learning on any data objects that have ambiguous representations and live in continuous spaces. For periodic point sets under isometry (any distance-preserving transformation), we designed invariants that distinguish all known counter-examples to the completeness of past descriptors and confirm thousands of (near-)duplicates in the world's largest databases of inorganic crystals within hours on a desktop computer.
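To make the two requirements in the abstract concrete (invariance under isometry and Lipschitz continuity under noise), here is a minimal toy sketch in Python. It is not the paper's invariant. It assumes a small finite point cloud instead of a periodic point set, and it uses the sorted vector of pairwise distances as a stand-in descriptor. The function name `sorted_pairwise_distances`, the cloud size, and the noise bound `eps` are all chosen only for illustration.

```python
import numpy as np

def sorted_pairwise_distances(points):
    """Toy descriptor (an assumption, not the paper's invariant):
    all pairwise Euclidean distances of a finite cloud, sorted ascending."""
    pts = np.asarray(points, dtype=float)
    i, j = np.triu_indices(len(pts), k=1)  # index pairs with i < j
    return np.sort(np.linalg.norm(pts[i] - pts[j], axis=1))

rng = np.random.default_rng(0)
cloud = rng.random((6, 3))  # 6 hypothetical points in 3D

# A random isometry: an orthogonal matrix (from a QR factorisation) plus a shift.
q, _ = np.linalg.qr(rng.normal(size=(3, 3)))
moved = cloud @ q.T + rng.normal(size=3)

# Noise: every point is displaced by at most eps in a random direction.
eps = 1e-3
directions = rng.normal(size=cloud.shape)
directions /= np.linalg.norm(directions, axis=1, keepdims=True)
noisy = cloud + eps * rng.random((6, 1)) * directions

base = sorted_pairwise_distances(cloud)
isometry_gap = np.max(np.abs(base - sorted_pairwise_distances(moved)))
noise_gap = np.max(np.abs(base - sorted_pairwise_distances(noisy)))

print(f"max change under isometry: {isometry_gap:.2e}")
print(f"max change under noise:    {noise_gap:.2e} (bound 2*eps = {2 * eps:.2e})")
```

In this toy setting the descriptor is unchanged by the isometry up to floating-point error. Moving each point by at most `eps` changes every pairwise distance by at most `2 * eps`, and sorting does not increase the largest coordinate-wise difference, so the descriptor is Lipschitz with constant 2 in the max norm. Descriptors built on a choice of unit cell or symmetry group lack this kind of bound, which is the discontinuity the abstract refers to.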
Similar Papers
Higher-order, generically complete, continuous, and polynomial-time isometry invariants of periodic sets
Computational Geometry
Finds new crystals by spotting fakes.
Geometric Data Science
Metric Geometry
Organizes all possible crystal shapes perfectly.
Continuous Uniqueness and Novelty Metrics for Generative Modeling of Inorganic Crystals
Machine Learning (CS)
Finds new materials faster and better.