Adaptive AI Agent Placement and Migration in Edge Intelligence Systems
By: Xingdan Wang , Jiayi He , Zhiqing Tang and more
Potential Business Impact:
Lets AI agents work faster on phones.
The rise of LLMs such as ChatGPT and Claude fuels the need for AI agents capable of real-time task handling. However, migrating data-intensive, multi-modal edge workloads to cloud data centers, traditionally used for agent deployment, introduces significant latency. Deploying AI agents at the edge improves efficiency and reduces latency. However, edge environments present challenges due to limited and heterogeneous resources. Maintaining QoS for mobile users necessitates agent migration, which is complicated by the complexity of AI agents coordinating LLMs, task planning, memory, and external tools. This paper presents the first systematic deployment and management solution for LLM-based AI agents in dynamic edge environments. We propose a novel adaptive framework for AI agent placement and migration in edge intelligence systems. Our approach models resource constraints and latency/cost, leveraging ant colony algorithms and LLM-based optimization for efficient decision-making. It autonomously places agents to optimize resource utilization and QoS and enables lightweight agent migration by transferring only essential state. Implemented on a distributed system using AgentScope and validated across globally distributed edge servers, our solution significantly reduces deployment latency and migration costs.
Similar Papers
Agentic AI Reasoning for Mobile Edge General Intelligence: Fundamentals, Approaches, and Directions
Artificial Intelligence
Makes smart AI work on phones without internet.
Edge Large AI Models: Collaborative Deployment and IoT Applications
Information Theory
Smart devices work together for faster AI.
Edge Large AI Models: Revolutionizing 6G Networks
Networking and Internet Architecture
Smart phones will do many complex tasks.