LoopLLM: Transferable Energy-Latency Attacks in LLMs via Repetitive Generation
By: Xingyu Li, Xiaolei Liu, Cheng Liu, and more
Potential Business Impact:
Makes AI models get stuck and waste power.
As large language models (LLMs) scale, inference consumes substantial computational resources, exposing them to energy-latency attacks in which crafted prompts drive up energy consumption and latency. Existing attack methods aim to prolong output by delaying the generation of termination symbols. However, as the output grows longer, controlling the termination symbols through the input becomes increasingly difficult, making these methods less effective. We therefore propose LoopLLM, an energy-latency attack framework built on the observation that repetitive generation can trigger low-entropy decoding loops, reliably compelling LLMs to generate until they reach their output limits. LoopLLM introduces (1) a repetition-inducing prompt optimization that exploits autoregressive vulnerabilities to induce repetitive generation, and (2) a token-aligned ensemble optimization that aggregates gradients across models to improve cross-model transferability. Extensive experiments on 12 open-source and 2 commercial LLMs show that LoopLLM significantly outperforms existing methods, reaching over 90% of the maximum output length versus about 20% for baselines, and improving transferability to DeepSeek-V3 and Gemini 2.5 Flash by around 40%.
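The key observation, that a repetitive context collapses the next-token distribution into a low-entropy loop which crowds out the termination token, can be illustrated with a small sketch. The snippet below is not the paper's implementation; it assumes GPT-2 via Hugging Face transformers as a stand-in model and a hand-written repetitive prompt rather than LoopLLM's optimized repetition-inducing prompts, purely to show the entropy drop that makes such loops self-sustaining.

```python
# Toy illustration (not the LoopLLM code): measure how next-token entropy
# collapses when the context is repetitive. Assumptions: GPT-2 stands in for
# the larger LLMs studied in the paper; the prompts are illustrative only.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

def next_token_entropy(text: str) -> float:
    """Entropy (in nats) of the model's next-token distribution for `text`."""
    ids = tokenizer(text, return_tensors="pt").input_ids
    with torch.no_grad():
        logits = model(ids).logits[0, -1]          # logits at the last position
    probs = torch.softmax(logits, dim=-1)
    return float(-(probs * probs.clamp_min(1e-12).log()).sum())

normal_prompt = "The history of the Roman Empire begins with"
looping_prompt = "go go go go go go go go go go go go go go go go"

print(f"normal context  entropy: {next_token_entropy(normal_prompt):.2f} nats")
print(f"looping context entropy: {next_token_entropy(looping_prompt):.2f} nats")
# A near-zero entropy means one loop token dominates the distribution, so the
# termination token is effectively never sampled and decoding runs to the cap.
```

If the looping prompt's entropy comes out far below the open-ended one, that mirrors the low-entropy decoding loop the abstract describes: once the loop starts, the termination symbol is crowded out and generation continues to the output limit.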
Similar Papers
LingoLoop Attack: Trapping MLLMs via Linguistic Context and State Entrapment into Endless Loops
Computation and Language
Makes AI models get stuck and repeat words.
Breaking the Loop: Detecting and Mitigating Denial-of-Service Vulnerabilities in Large Language Models
Cryptography and Security
Stops AI from repeating itself, making it faster.
Keep the Lights On, Keep the Lengths in Check: Plug-In Adversarial Detection for Time-Series LLMs in Energy Forecasting
Cryptography and Security
Protects power grids from fake data.