Score: 3

Accelerating Machine Learning Systems via Category Theory: Applications to Spherical Attention for Gene Regulatory Networks

Published: May 14, 2025 | arXiv ID: 2505.09326v1

By: Vincent Abbott , Kotaro Kamiya , Gerard Glowacki and more

BigTech Affiliations: Massachusetts Institute of Technology

Potential Business Impact:

AI learns to build itself faster and better.

Business Areas:
Machine Learning Artificial Intelligence, Data and Analytics, Software

How do we enable artificial intelligence models to improve themselves? This is central to exponentially improving generalized artificial intelligence models, which can improve their own architecture to handle new problem domains in an efficient manner that leverages the latest hardware. However, current automated compilation methods are poor, and efficient algorithms require years of human development. In this paper, we use neural circuit diagrams, based in category theory, to prove a general theorem related to deep learning algorithms, guide the development of a novel attention algorithm catered to the domain of gene regulatory networks, and produce a corresponding efficient kernel. The algorithm we propose, spherical attention, shows that neural circuit diagrams enable a principled and systematic method for reasoning about deep learning architectures and providing high-performance code. By replacing SoftMax with an $L^2$ norm as suggested by diagrams, it overcomes the special function unit bottleneck of standard attention while retaining the streaming property essential to high-performance. Our diagrammatically derived \textit{FlashSign} kernel achieves comparable performance to the state-of-the-art, fine-tuned FlashAttention algorithm on an A100, and $3.6\times$ the performance of PyTorch. Overall, this investigation shows neural circuit diagrams' suitability as a high-level framework for the automated development of efficient, novel artificial intelligence architectures.

Country of Origin
πŸ‡―πŸ‡΅ πŸ‡ΊπŸ‡Έ United States, Japan

Page Count
14 pages

Category
Mathematics:
Category Theory