Score: 0

Physics Steering: Causal Control of Cross-Domain Concepts in a Physics Foundation Model

Published: November 25, 2025 | arXiv ID: 2511.20798v1

By: Rio Alexa Fear , Payel Mukhopadhyay , Michael McCabe and more

Potential Business Impact:

Controls physics simulations by changing AI's thoughts.

Business Areas:

Embedded Systems Hardware, Science and Engineering, Software

Recent advances in mechanistic interpretability have revealed that large language models (LLMs) develop internal representations corresponding not only to concrete entities but also distinct, human-understandable abstract concepts and behaviour. Moreover, these hidden features can be directly manipulated to steer model behaviour. However, it remains an open question whether this phenomenon is unique to models trained on inherently structured data (ie. language, images) or if it is a general property of foundation models. In this work, we investigate the internal representations of a large physics-focused foundation model. Inspired by recent work identifying single directions in activation space for complex behaviours in LLMs, we extract activation vectors from the model during forward passes over simulation datasets for different physical regimes. We then compute "delta" representations between the two regimes. These delta tensors act as concept directions in activation space, encoding specific physical features. By injecting these concept directions back into the model during inference, we can steer its predictions, demonstrating causal control over physical behaviours, such as inducing or removing some particular physical feature from a simulation. These results suggest that scientific foundation models learn generalised representations of physical principles. They do not merely rely on superficial correlations and patterns in the simulations. Our findings open new avenues for understanding and controlling scientific foundation models and has implications for AI-enabled scientific discovery.

Uncovering Emergent Physics Representations Learned In-Context by Large Language Models

Computation and Language

Computers learn physics concepts from examples.

17 Aug 2025 0

88%

Interpretable Physics Reasoning and Performance Taxonomy in Vision-Language Models

Machine Learning (CS)

Tests if computers understand how things move.

10 Sep 2025 1

88%

Towards a Physics Foundation Model

Machine Learning (CS)

Simulates many physics problems with one program.

17 Sep 2025 2

View PDF Login to Bookmark

Country of Origin

🇬🇧 United Kingdom

Page Count

16 pages

Physics Steering: Causal Control of Cross-Domain Concepts in a Physics Foundation Model

Controls physics simulations by changing AI's thoughts.

Technical Abstract

Uncovering Emergent Physics Representations Learned In-Context by Large Language Models

Interpretable Physics Reasoning and Performance Taxonomy in Vision-Language Models

Towards a Physics Foundation Model