"Energon": Unveiling Transformers from GPU Power and Thermal Side-Channels
By: Arunava Chaudhuri, Shubhi Shukla, Sarani Bhattacharya and more
Potential Business Impact:
Steals secrets from AI by watching computer heat.
Transformers have become the backbone of many Machine Learning (ML) applications, including language translation, summarization, and computer vision. As these models are increasingly deployed in shared Graphics Processing Unit (GPU) environments via Machine Learning as a Service (MLaaS), concerns about their security grow. In particular, the risk of side-channel attacks that reveal architectural details without physical access remains underexplored, despite the high value of the proprietary models they target. To the best of our knowledge, this work is the first to investigate GPU power and thermal fluctuations as side-channels and to exploit them to extract information from pre-trained transformer models. The proposed analysis shows how these side-channels can be exploited with user-level privileges to reveal critical architectural details, such as encoder/decoder layers and attention heads, for both language and vision transformers. We demonstrate the practical impact by evaluating multiple publicly available pre-trained language and vision transformers. Through extensive experimental evaluations, we show that the attack model achieves an average accuracy of over 89% for model family identification and 100% for hyperparameter classification, in both single-process and noisy multi-process scenarios. Moreover, by leveraging the extracted architectural information, we demonstrate highly effective black-box transfer adversarial attacks with an average success rate exceeding 93%, underscoring the security risks posed by GPU side-channel leakage in deployed transformer models.
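To make the "user-privilege" observation concrete, the sketch below shows one way an unprivileged process could record a GPU power and temperature trace. It is an illustration only, not the authors' measurement pipeline: the abstract does not say how the signals are sampled, so the use of `nvidia-smi`, the sampling interval, and the helper names `parse_sample` and `collect_trace` are all assumptions made for this example.

```python
import subprocess
import time
from typing import Callable, List, Tuple

# Assumption: telemetry is read through the stock nvidia-smi CLI, which
# ordinary users can normally run. The paper's actual sampling method
# is not described in the abstract.
QUERY = [
    "nvidia-smi",
    "--query-gpu=power.draw,temperature.gpu",
    "--format=csv,noheader,nounits",
]


def parse_sample(line: str) -> Tuple[float, float]:
    """Parse one 'power, temperature' CSV line into (watts, celsius).

    Raises ValueError if a field is not numeric (some GPUs report
    '[N/A]' for power draw).
    """
    power, temp = (field.strip() for field in line.split(","))
    return float(power), float(temp)


def collect_trace(
    n_samples: int,
    interval_s: float = 0.1,
    run: Callable[..., str] = subprocess.check_output,
) -> List[Tuple[float, float]]:
    """Poll the GPU n_samples times and return (watts, celsius) pairs.

    Only the first output line (the first GPU listed) is used.
    `run` is injectable so the function can be exercised without a GPU.
    """
    trace = []
    for _ in range(n_samples):
        out = run(QUERY, text=True)
        trace.append(parse_sample(out.splitlines()[0]))
        time.sleep(interval_s)
    return trace
```

On a machine with an NVIDIA driver installed, `collect_trace(100)` would return 100 samples taken roughly 0.1 s apart (plus the time each `nvidia-smi` call takes). A trace like this is only the raw input; the classification of model family and hyperparameters reported in the abstract would require a separate model that this sketch does not attempt.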
Similar Papers
MoEcho: Exploiting Side-Channel Attacks to Compromise User Privacy in Mixture-of-Experts LLMs
Cryptography and Security
AI can spy on your private data.
Thermal-Aware 3D Design for Side-Channel Information Leakage
Cryptography and Security
Hides secret computer information from spies.
TrackCore-F: Deploying Transformer-Based Subatomic Particle Tracking on FPGAs
High Energy Physics - Experiment
Makes AI models run faster on special chips.