Cardinality augmented loss functions
By: Miguel O'Malley
Potential Business Impact:
Helps computers learn better from uneven data.
Class imbalance is a common and pernicious issue for the training of neural networks. Often, an imbalanced majority class can dominate training to skew classifier performance towards the majority outcome. To address this problem we introduce cardinality augmented loss functions, derived from cardinality-like invariants in modern mathematics literature such as magnitude and the spread. These invariants enrich the concept of cardinality by evaluating the `effective diversity' of a metric space, and as such represent a natural solution to overly homogeneous training data. In this work, we establish a methodology for applying cardinality augmented loss functions in the training of neural networks and report results on both artificially imbalanced datasets as well as a real-world imbalanced material science dataset. We observe significant performance improvement among minority classes, as well as improvement in overall performance metrics.
Similar Papers
Intrinsic Dimensionality as a Model-Free Measure of Class Imbalance
Machine Learning (CS)
Measures data problems better for smarter computers.
Balancing the Scales: A Theoretical and Algorithmic Framework for Learning from Imbalanced Data
Machine Learning (CS)
Teaches computers to learn from unfair data.
Imbalanced Classification through the Lens of Spurious Correlations
Machine Learning (CS)
Fixes computer learning when data is uneven.