2025-11-25T00:01:17.449679

Internet of Agents: Fundamentals, Applications, and Challenges

Wang, Guo, Pan et al.

With the rapid proliferation of large language models and vision-language models, AI agents have evolved from isolated, task-specific systems into autonomous, interactive entities capable of perceiving, reasoning, and acting without human intervention. As these agents proliferate across virtual and physical environments, from virtual assistants to embodied robots, the need for a unified, agent-centric infrastructure becomes paramount. In this survey, we introduce the Internet of Agents (IoA) as a foundational framework that enables seamless interconnection, dynamic discovery, and collaborative orchestration among heterogeneous agents at scale. We begin by presenting a general IoA architecture, highlighting its hierarchical organization, distinguishing features relative to the traditional Internet, and emerging applications. Next, we analyze the key operational enablers of IoA, including capability notification and discovery, adaptive communication protocols, dynamic task matching, consensus and conflict-resolution mechanisms, and incentive models. Finally, we identify open research directions toward building resilient and trustworthy IoA ecosystems.

academic

Internet of Agents: Fundamentals, Applications, and Challenges

Basic Information

Paper ID: 2505.07176
Title: Internet of Agents: Fundamentals, Applications, and Challenges
Authors: Yuntao Wang, Shaolong Guo, Yanghe Pan, Zhou Su, Fahao Chen, Tom H. Luan, Peng Li, Jiawen Kang, Dusit Niyato
Classification: cs.MA (Multi-Agent Systems), cs.AI (Artificial Intelligence)
Publication Date: January 2025 (Accepted by IEEE TCCN in Oct. 2025)
Paper Link: https://arxiv.org/abs/2505.07176

Abstract

With the rapid proliferation of large language models and vision-language models, AI agents have evolved from isolated, task-specific systems into autonomous interactive entities capable of perception, reasoning, and action without human intervention. As these agents are deployed extensively in virtual and physical environments, from virtual assistants to embodied robots, establishing a unified, agent-centric infrastructure has become critical. This survey introduces the Internet of Agents (IoA) as a foundational framework enabling seamless interconnection, dynamic discovery, and collaborative orchestration among large-scale heterogeneous agents. The paper first proposes a universal IoA architecture, highlighting its hierarchical organization, distinguishing characteristics relative to traditional Internet, and emerging applications. It then analyzes key operational enabling technologies for IoA, including capability notification and discovery, adaptive communication protocols, dynamic task matching, consensus and conflict resolution mechanisms, and incentive models. Finally, it identifies open research directions for building resilient and trustworthy IoA ecosystems.

Research Background and Motivation

Core Problems

This research addresses the fundamental challenge of constructing a unified infrastructure to support interconnection, collaboration, and orchestration of large-scale heterogeneous AI agents. Specifically, it encompasses:

Interconnectivity Challenges: Existing multi-agent systems primarily operate through simulation on single devices, whereas practical IoA deployment requires spanning billions of geographically distributed agents, each with unique computational, network, sensing, and energy profiles.
Agent-Native Interfaces: Current computer-using agents (such as OpenAI's Operator) rely on mimicking human GUI operations to control browsers and applications, incurring substantial screen-scraping overhead.
Autonomous Collaboration: IoA comprises physical and virtual agents operating in highly dynamic environments, requiring agents to self-organize and self-negotiate, forming low-cost and efficient collaborative networks.

Problem Significance

With rapid advancement in AI agents, Gartner predicts that by 2028, at least 15% of daily tasks will be autonomously executed by AI agents, and 33% of enterprise applications will integrate agent-driven intelligence. Agents are becoming "new citizens" of digital and physical spaces, reshaping economic structures and human social interactions.

Limitations of Existing Approaches

Existing research primarily focuses on multi-agent systems (MAS), facing three major challenges:

Ecosystem Isolation: Existing frameworks confine agents to their own environments, limiting integration of third-party agents
Single-Device Simulation: Most MAS systems are limited to single-device simulation, contrasting sharply with real-world scenarios where agents operate across multiple devices and geographic locations
Rigid Communication and Coordination: Existing agent protocols and state transitions are largely hard-coded, failing to capture the dynamic, task-specific nature of actual collaboration

Core Contributions

Universal IoA Architecture: Provides a comprehensive overview of IoA architecture, outlining its hierarchical framework, distinguishing characteristics from traditional Internet, key features, and emerging applications
Key Enabling Technologies for IoA: Analyzes critical enabling technologies supporting IoA, including capability notification and discovery, dynamic task orchestration and matching, adaptive communication protocols, consensus and conflict resolution mechanisms, economic models, and regulatory models
Identification of Open Challenges: Identifies unresolved problems in the IoA domain, pointing toward future research directions that promote widespread adoption and sustainable development

Methodology Details

IoA Architecture Design

Hierarchical Architecture

IoA employs a four-layer architectural design:

Infrastructure Layer: Integrates critical resources including AI models, data, knowledge, computing, and communication. Encompasses foundation models (such as GPT-4o, DeepSeek-R1), multimodal data infrastructure, and communication technologies (5G URLLC, mesh networks, etc.)
Agent Management Layer: Manages agent identity, capability, discovery, and lifecycle within IoA. Supports decentralized identifiers (DID), capability semantic modeling, and publish-subscribe mechanisms
Agent Coordination Layer: Manages distributed task execution and agent collaboration. Supports task decomposition, process orchestration, adaptive communication protocols, and dynamic agent-task matching
Application Layer: Provides standardized interfaces and semantic alignment, supporting cross-modal, semantic, and knowledge-level integration

Agent Functional Modules

Each IoA agent comprises four core functional modules:

Planning Module: Large model-based reasoning, task decomposition, and decision-making, incorporating techniques such as Chain-of-Thought (CoT) and Tree-of-Thought (ToT)
Memory Module: Maintains short-term context and long-term knowledge, supporting continuous learning and personalization
Interaction Module: Implements communication and collaboration across human, agent, and environmental interfaces
Action Module: Achieves plan execution through physical actuators and software tool interfaces

Key Technical Innovations

Capability Notification and Discovery

Capability Assessment: Combines self-reported declarations with system verification to construct agent capability profiles
Capability Notification: Employs proactive notification, event-triggered, and periodic synchronization strategies
Capability Retrieval: Evolves from traditional search to semantic retrieval, knowledge-enhanced retrieval, and agent-augmented retrieval

Interaction Structure and Task Orchestration

Interaction Patterns: Multiple collaborative modes including aggregation, reflection, debate, and tool use
Communication Topology: Network structures including chain, star, tree, and graph configurations
Task Decomposition: Rule-based and learning-based decomposition methods
Task Assignment: Routing-based and self-organizing assignment strategies

Communication Protocols

Compares five representative protocols:

MCP (Model Context Protocol): Client-server architecture proposed by Anthropic
A2A (Agent-to-Agent): P2P interaction protocol proposed by Google
ANP (Agent Network Protocol): Fully decentralized P2P architecture
AGNTCY: Hybrid architecture promoting standardized collaboration
Agora: Research project balancing efficiency, generality, and portability

Experimental Setup and Application Scenarios

Practical IoA Prototypes

The paper analyzes several representative IoA implementations:

Chen et al.'s IoA Prototype: Adopts hierarchical client-server architecture supporting heterogeneous virtual agent collaboration
DAWN System: Hierarchical architecture supporting globally distributed IoA collaboration
Siemens Industrial Co-Pilot: Autonomous agents applied across entire production lifecycle
Midea Intelligent Agent Factory: Deploys 14 agents across 38 production scenarios

Application Scenarios

The paper details five representative application scenarios:

Smart Homes: P2P communication within home agent subnets
Healthcare: Robot coordination within medical facilities
Intelligent Factories: Coordination between on-site and external agents
Smart Grids: Energy management through IoA subnets and external agents
Smart Cities: Cross-domain agent networks

Experimental Results and Analysis

Key Characteristics of IoA

The paper identifies six key characteristics of IoA:

Autonomous Intelligence: Agents possess high autonomy, proactively publishing capabilities and independently initiating collaboration
High Dynamicity: Supports on-demand agent creation, migration, and termination, as well as real-time workflow reconfiguration
High Heterogeneity: Spans a broad spectrum from microcontroller nodes to GPU cluster-driven agents
Large-Scale Scalability: Supports scaling from small temporary agent teams to billions of cross-domain agents
Semantic-Aware Communication: Context-aware semantic understanding and reasoning capabilities based on large models
Task-Driven Cooperation: Prioritizes task-oriented networks where agents dynamically adjust capabilities to match task requirements

Comparison with Traditional Systems

The paper systematically compares IoA with traditional Internet and IoT:

Dimension	Traditional Internet	IoT	IoA
Core Objective	Host and information connectivity	Device and information connectivity	Agent and knowledge connectivity
Architecture	Centralized (client-server)	Decentralized (edge-edge-cloud)	Hybrid (P2P + broker)
Interaction Mode	Passive (request-response)	Event-driven	Proactive (goal-oriented)
Communication Level	Bit-level transmission	Bit-level + lightweight protocols	Semantic-level exchange
Autonomy Source	Human control	Rule-based device logic	Large model-driven agent intelligence

Future Research Directions

Seven Key Research Directions

IoA Standardization and Interoperability Framework: Develop unified standardization frameworks ensuring cross-platform and cross-domain interoperability
Secure Adaptive Agent Communication Protocols: Address the trilemma of generality-efficiency-portability while embedding rigorous security guarantees
Decentralized Autonomous Agent Ecosystems: Develop autonomous mechanisms requiring no central authority, analogous to biological ecosystems
Agent-Based Economic Systems: Integrate adaptive incentive mechanisms combining short-term capacity-aware economic rewards with long-term reputation tracking
Privacy-Preserving Agent Interaction: Integrate context-aware and task-driven privacy-by-design, allowing agents to dynamically negotiate privacy protection levels based on task criticality
Cyber-Physical Security IoA: Integrate cyber-physical defenses, such as correlating network traffic with phasor measurement unit readings to detect false data injection in power grid agents
Ethical Interoperable IoA: Ensure ethical compliance requiring transparent moral reasoning and adaptive semantic alignment

Conclusions and Discussion

Main Conclusions

IoA represents the next generation of infrastructure for autonomous interconnected intelligent systems, providing a comprehensive framework for large-scale collaboration among heterogeneous agents. Through hierarchical architecture, dynamic discovery mechanisms, adaptive communication protocols, and economic incentive models, IoA can support diverse application scenarios ranging from smart homes to smart cities.

Limitations

Scalability Challenges: Real-time coordination and consensus mechanisms in large-scale IoA systems still face latency and throughput limitations
Security and Privacy Issues: Cross-domain agent collaboration introduces new attack surfaces and privacy leakage risks
Insufficient Standardization: Lack of unified agent description language and lightweight communication protocols
Economic Model Complexity: Developing interoperable and fair economic frameworks across heterogeneous IoA platforms remains challenging

Future Directions

The paper indicates that IoA maturation requires continued innovation in network architecture, interoperability standards, and security paradigms, particularly in addressing large-scale deployment, cross-domain collaboration, and ethical compliance.

In-Depth Evaluation

Strengths

Comprehensiveness: Provides the first systematic survey of the IoA domain, covering architecture design, key technologies, application scenarios, and future directions
Forward-Looking: Accurately captures technological trends in the evolution from isolated agents to interconnected agent ecosystems
Practicality: Combines academic research with industrial practice, providing concrete prototype cases and application scenarios
Clear Structure: Hierarchical analytical framework facilitates understanding of complex IoA ecosystems

Limitations

Lack of Quantitative Analysis: As a survey paper, it lacks specific performance evaluations and experimental validation
Limited Technical Details: Descriptions of certain key technologies (such as consensus mechanisms, privacy protection) are relatively high-level
Insufficient Challenge Analysis: Analysis of technical implementation difficulties and solutions could be more concrete

Impact

Domain Contribution: Provides important theoretical framework and research foundation for the emerging IoA field
Practical Value: Offers architectural guidance and technical roadmap for industry development of IoA systems
Research Direction: The identified seven future research directions provide clear research agenda for academia

Applicable Scenarios

This framework applies to scenarios requiring large-scale agent collaboration, including smart cities, Industry 4.0, autonomous vehicle fleets, distributed AI inference, and other domains. It is particularly suitable for complex application scenarios requiring cross-organizational and cross-domain collaboration.

References

The paper cites 149 related references, covering important works from multiple relevant fields including multi-agent systems, large language models, communication protocols, and blockchain technology, providing solid theoretical foundation for IoA research.

Summary: The IoA concept and framework proposed in this paper possess significant theoretical and practical value, providing comprehensive guidance for constructing next-generation agent interconnection infrastructure. Although as a survey paper it has limitations in technical details and experimental validation, its forward-looking perspective and systematic approach make it an important reference in the field.