Symmetry and Generalisation in Neural Approximations of Renormalisation Transformations
By: Cassidy Ashworth, Pietro Liò, Francesco Caso
Potential Business Impact:
Shows when building symmetry into neural networks helps or hurts them in learning physics transformations.
Deep learning models have proven enormously successful at using multiple layers of representation to learn relevant features of structured data. Encoding physical symmetries into these models can improve performance on difficult tasks, and recent work has motivated the principle of parameter symmetry breaking and restoration as a unifying mechanism underlying their hierarchical learning dynamics. We evaluate the role of parameter symmetry and network expressivity in the generalisation behaviour of neural networks when learning a real-space renormalisation group (RG) transformation, using the central limit theorem (CLT) as a test case map. We consider simple multilayer perceptrons (MLPs) and graph neural networks (GNNs), and vary weight symmetries and activation functions across architectures. Our results reveal a competition between symmetry constraints and expressivity, with overly complex or overconstrained models generalising poorly. We analytically demonstrate this poor generalisation behaviour for certain constrained MLP architectures by recasting the CLT as a cumulant recursion relation and making use of an established framework to propagate cumulants through MLPs. We also empirically validate an extension of this framework from MLPs to GNNs, elucidating the internal information processing performed by these more complex models. These findings offer new insight into the learning dynamics of symmetric networks and their limitations in modelling structured physical transformations.
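To make the phrase "recasting the CLT as a cumulant recursion relation" concrete, here is a minimal sketch of one standard way to write that recursion. It is not taken from the paper: it assumes a coarse-graining step that sums b independent copies of a variable and rescales by sqrt(b), and the function names (`rg_step_cumulants`, `rg_flow`) and the choice of starting distribution are illustrative. Because cumulants add over independent sums and scale as the n-th power under rescaling, each step maps the n-th cumulant to b^(1 - n/2) times itself, so the variance is a fixed point and all higher cumulants decay towards the Gaussian values.

```python
import numpy as np


def rg_step_cumulants(kappa, b=2):
    """Apply one step X' = (X_1 + ... + X_b) / sqrt(b) to a list of cumulants.

    kappa[n - 1] holds the n-th cumulant of X. For independent copies,
    cumulants are additive (factor b) and homogeneous of degree n under
    rescaling (factor b**(-n/2)), giving kappa_n' = b**(1 - n/2) * kappa_n.
    """
    kappa = np.asarray(kappa, dtype=float)
    orders = np.arange(1, len(kappa) + 1)
    return b ** (1.0 - orders / 2.0) * kappa


def rg_flow(kappa, steps, b=2):
    """Iterate the recursion and return an array of shape (steps + 1, n_cumulants)."""
    flow = [np.asarray(kappa, dtype=float)]
    for _ in range(steps):
        flow.append(rg_step_cumulants(flow[-1], b))
    return np.stack(flow)


# Illustrative starting point (an assumption, not from the paper):
# a unit-rate exponential shifted to zero mean, whose cumulants are
# kappa_1 = 0 and kappa_n = (n - 1)! for n >= 2.
kappa0 = [0.0, 1.0, 2.0, 6.0]
flow = rg_flow(kappa0, steps=10)

for step in (0, 1, 5, 10):
    print(step, flow[step])
```

In this sketch the second cumulant stays at 1 at every step, while the third and fourth shrink by factors of 2^(-1/2) and 2^(-1) per step respectively. A non-zero mean would instead grow by sqrt(b) per step, which is why the variable is centred first.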
Similar Papers
Symmetry-Aware Graph Metanetwork Autoencoders: Model Merging through Parameter Canonicalization
Machine Learning (CS)
Makes different AI models work together easily.
Symmetry in Neural Network Parameter Spaces
Machine Learning (CS)
Finds hidden patterns in computer brains.
A Tale of Two Symmetries: Exploring the Loss Landscape of Equivariant Models
Machine Learning (CS)
Makes smart computers learn better by fixing their rules.