With the rapid proliferation of large language models and vision-language models, AI agents have evolved from isolated, task-specific systems into autonomous, interactive entities capable of perceiving, reasoning, and acting without human intervention. As these agents proliferate across virtual and physical environments, from virtual assistants to embodied robots, the need for a unified, agent-centric infrastructure becomes paramount. In this survey, we introduce the Internet of Agents (IoA) as a foundational framework that enables seamless interconnection, dynamic discovery, and collaborative orchestration among heterogeneous agents at scale. We begin by presenting a general IoA architecture, highlighting its hierarchical organization, distinguishing features relative to the traditional Internet, and emerging applications. Next, we analyze the key operational enablers of IoA, including capability notification and discovery, adaptive communication protocols, dynamic task matching, consensus and conflict-resolution mechanisms, and incentive models. Finally, we identify open research directions toward building resilient and trustworthy IoA ecosystems.
Internet of Agents: Fundamentals, Applications, and Challenges
- Paper ID: 2505.07176
- Title: Internet of Agents: Fundamentals, Applications, and Challenges
- Authors: Yuntao Wang, Shaolong Guo, Yanghe Pan, Zhou Su, Fahao Chen, Tom H. Luan, Peng Li, Jiawen Kang, Dusit Niyato
- Classification: cs.MA (Multi-Agent Systems), cs.AI (Artificial Intelligence)
- Publication Date: January 2025 (Accepted by IEEE TCCN in Oct. 2025)
- Paper Link: https://arxiv.org/abs/2505.07176
With the rapid proliferation of large language models and vision-language models, AI agents have evolved from isolated, task-specific systems into autonomous interactive entities capable of perception, reasoning, and action without human intervention. As these agents are deployed extensively in virtual and physical environments, from virtual assistants to embodied robots, establishing a unified, agent-centric infrastructure has become critical. This survey introduces the Internet of Agents (IoA) as a foundational framework enabling seamless interconnection, dynamic discovery, and collaborative orchestration among large-scale heterogeneous agents. The paper first proposes a universal IoA architecture, highlighting its hierarchical organization, distinguishing characteristics relative to traditional Internet, and emerging applications. It then analyzes key operational enabling technologies for IoA, including capability notification and discovery, adaptive communication protocols, dynamic task matching, consensus and conflict resolution mechanisms, and incentive models. Finally, it identifies open research directions for building resilient and trustworthy IoA ecosystems.
This research addresses the fundamental challenge of constructing a unified infrastructure to support interconnection, collaboration, and orchestration of large-scale heterogeneous AI agents. Specifically, it encompasses:
- Interconnectivity Challenges: Existing multi-agent systems primarily operate through simulation on single devices, whereas practical IoA deployment requires spanning billions of geographically distributed agents, each with unique computational, network, sensing, and energy profiles.
- Agent-Native Interfaces: Current computer-using agents (such as OpenAI's Operator) rely on mimicking human GUI operations to control browsers and applications, incurring substantial screen-scraping overhead.
- Autonomous Collaboration: IoA comprises physical and virtual agents operating in highly dynamic environments, requiring agents to self-organize and self-negotiate, forming low-cost and efficient collaborative networks.
With rapid advancement in AI agents, Gartner predicts that by 2028, at least 15% of daily tasks will be autonomously executed by AI agents, and 33% of enterprise applications will integrate agent-driven intelligence. Agents are becoming "new citizens" of digital and physical spaces, reshaping economic structures and human social interactions.
Existing research primarily focuses on multi-agent systems (MAS), facing three major challenges:
- Ecosystem Isolation: Existing frameworks confine agents to their own environments, limiting integration of third-party agents
- Single-Device Simulation: Most MAS systems are limited to single-device simulation, contrasting sharply with real-world scenarios where agents operate across multiple devices and geographic locations
- Rigid Communication and Coordination: Existing agent protocols and state transitions are largely hard-coded, failing to capture the dynamic, task-specific nature of actual collaboration
- Universal IoA Architecture: Provides a comprehensive overview of IoA architecture, outlining its hierarchical framework, distinguishing characteristics from traditional Internet, key features, and emerging applications
- Key Enabling Technologies for IoA: Analyzes critical enabling technologies supporting IoA, including capability notification and discovery, dynamic task orchestration and matching, adaptive communication protocols, consensus and conflict resolution mechanisms, economic models, and regulatory models
- Identification of Open Challenges: Identifies unresolved problems in the IoA domain, pointing toward future research directions that promote widespread adoption and sustainable development
IoA employs a four-layer architectural design:
- Infrastructure Layer: Integrates critical resources including AI models, data, knowledge, computing, and communication. Encompasses foundation models (such as GPT-4o, DeepSeek-R1), multimodal data infrastructure, and communication technologies (5G URLLC, mesh networks, etc.)
- Agent Management Layer: Manages agent identity, capability, discovery, and lifecycle within IoA. Supports decentralized identifiers (DID), capability semantic modeling, and publish-subscribe mechanisms
- Agent Coordination Layer: Manages distributed task execution and agent collaboration. Supports task decomposition, process orchestration, adaptive communication protocols, and dynamic agent-task matching
- Application Layer: Provides standardized interfaces and semantic alignment, supporting cross-modal, semantic, and knowledge-level integration
Each IoA agent comprises four core functional modules:
- Planning Module: Large model-based reasoning, task decomposition, and decision-making, incorporating techniques such as Chain-of-Thought (CoT) and Tree-of-Thought (ToT)
- Memory Module: Maintains short-term context and long-term knowledge, supporting continuous learning and personalization
- Interaction Module: Implements communication and collaboration across human, agent, and environmental interfaces
- Action Module: Achieves plan execution through physical actuators and software tool interfaces
- Capability Assessment: Combines self-reported declarations with system verification to construct agent capability profiles
- Capability Notification: Employs proactive notification, event-triggered, and periodic synchronization strategies
- Capability Retrieval: Evolves from traditional search to semantic retrieval, knowledge-enhanced retrieval, and agent-augmented retrieval
- Interaction Patterns: Multiple collaborative modes including aggregation, reflection, debate, and tool use
- Communication Topology: Network structures including chain, star, tree, and graph configurations
- Task Decomposition: Rule-based and learning-based decomposition methods
- Task Assignment: Routing-based and self-organizing assignment strategies
Compares five representative protocols:
- MCP (Model Context Protocol): Client-server architecture proposed by Anthropic
- A2A (Agent-to-Agent): P2P interaction protocol proposed by Google
- ANP (Agent Network Protocol): Fully decentralized P2P architecture
- AGNTCY: Hybrid architecture promoting standardized collaboration
- Agora: Research project balancing efficiency, generality, and portability
The paper analyzes several representative IoA implementations:
- Chen et al.'s IoA Prototype: Adopts hierarchical client-server architecture supporting heterogeneous virtual agent collaboration
- DAWN System: Hierarchical architecture supporting globally distributed IoA collaboration
- Siemens Industrial Co-Pilot: Autonomous agents applied across entire production lifecycle
- Midea Intelligent Agent Factory: Deploys 14 agents across 38 production scenarios
The paper details five representative application scenarios:
- Smart Homes: P2P communication within home agent subnets
- Healthcare: Robot coordination within medical facilities
- Intelligent Factories: Coordination between on-site and external agents
- Smart Grids: Energy management through IoA subnets and external agents
- Smart Cities: Cross-domain agent networks
The paper identifies six key characteristics of IoA:
- Autonomous Intelligence: Agents possess high autonomy, proactively publishing capabilities and independently initiating collaboration
- High Dynamicity: Supports on-demand agent creation, migration, and termination, as well as real-time workflow reconfiguration
- High Heterogeneity: Spans a broad spectrum from microcontroller nodes to GPU cluster-driven agents
- Large-Scale Scalability: Supports scaling from small temporary agent teams to billions of cross-domain agents
- Semantic-Aware Communication: Context-aware semantic understanding and reasoning capabilities based on large models
- Task-Driven Cooperation: Prioritizes task-oriented networks where agents dynamically adjust capabilities to match task requirements
The paper systematically compares IoA with traditional Internet and IoT:
| Dimension | Traditional Internet | IoT | IoA |
|---|
| Core Objective | Host and information connectivity | Device and information connectivity | Agent and knowledge connectivity |
| Architecture | Centralized (client-server) | Decentralized (edge-edge-cloud) | Hybrid (P2P + broker) |
| Interaction Mode | Passive (request-response) | Event-driven | Proactive (goal-oriented) |
| Communication Level | Bit-level transmission | Bit-level + lightweight protocols | Semantic-level exchange |
| Autonomy Source | Human control | Rule-based device logic | Large model-driven agent intelligence |
- IoA Standardization and Interoperability Framework: Develop unified standardization frameworks ensuring cross-platform and cross-domain interoperability
- Secure Adaptive Agent Communication Protocols: Address the trilemma of generality-efficiency-portability while embedding rigorous security guarantees
- Decentralized Autonomous Agent Ecosystems: Develop autonomous mechanisms requiring no central authority, analogous to biological ecosystems
- Agent-Based Economic Systems: Integrate adaptive incentive mechanisms combining short-term capacity-aware economic rewards with long-term reputation tracking
- Privacy-Preserving Agent Interaction: Integrate context-aware and task-driven privacy-by-design, allowing agents to dynamically negotiate privacy protection levels based on task criticality
- Cyber-Physical Security IoA: Integrate cyber-physical defenses, such as correlating network traffic with phasor measurement unit readings to detect false data injection in power grid agents
- Ethical Interoperable IoA: Ensure ethical compliance requiring transparent moral reasoning and adaptive semantic alignment
IoA represents the next generation of infrastructure for autonomous interconnected intelligent systems, providing a comprehensive framework for large-scale collaboration among heterogeneous agents. Through hierarchical architecture, dynamic discovery mechanisms, adaptive communication protocols, and economic incentive models, IoA can support diverse application scenarios ranging from smart homes to smart cities.
- Scalability Challenges: Real-time coordination and consensus mechanisms in large-scale IoA systems still face latency and throughput limitations
- Security and Privacy Issues: Cross-domain agent collaboration introduces new attack surfaces and privacy leakage risks
- Insufficient Standardization: Lack of unified agent description language and lightweight communication protocols
- Economic Model Complexity: Developing interoperable and fair economic frameworks across heterogeneous IoA platforms remains challenging
The paper indicates that IoA maturation requires continued innovation in network architecture, interoperability standards, and security paradigms, particularly in addressing large-scale deployment, cross-domain collaboration, and ethical compliance.
- Comprehensiveness: Provides the first systematic survey of the IoA domain, covering architecture design, key technologies, application scenarios, and future directions
- Forward-Looking: Accurately captures technological trends in the evolution from isolated agents to interconnected agent ecosystems
- Practicality: Combines academic research with industrial practice, providing concrete prototype cases and application scenarios
- Clear Structure: Hierarchical analytical framework facilitates understanding of complex IoA ecosystems
- Lack of Quantitative Analysis: As a survey paper, it lacks specific performance evaluations and experimental validation
- Limited Technical Details: Descriptions of certain key technologies (such as consensus mechanisms, privacy protection) are relatively high-level
- Insufficient Challenge Analysis: Analysis of technical implementation difficulties and solutions could be more concrete
- Domain Contribution: Provides important theoretical framework and research foundation for the emerging IoA field
- Practical Value: Offers architectural guidance and technical roadmap for industry development of IoA systems
- Research Direction: The identified seven future research directions provide clear research agenda for academia
This framework applies to scenarios requiring large-scale agent collaboration, including smart cities, Industry 4.0, autonomous vehicle fleets, distributed AI inference, and other domains. It is particularly suitable for complex application scenarios requiring cross-organizational and cross-domain collaboration.
The paper cites 149 related references, covering important works from multiple relevant fields including multi-agent systems, large language models, communication protocols, and blockchain technology, providing solid theoretical foundation for IoA research.
Summary: The IoA concept and framework proposed in this paper possess significant theoretical and practical value, providing comprehensive guidance for constructing next-generation agent interconnection infrastructure. Although as a survey paper it has limitations in technical details and experimental validation, its forward-looking perspective and systematic approach make it an important reference in the field.