Architecture Overview

Agent Zero is built on a flexible and modular architecture designed for extensibility and customization. This section outlines the key components and the interactions between them.

System Architecture

This simplified diagram illustrates the hierarchical relationship between agents and their interaction with tools, extensions, instruments, prompts, memory and knowledge base.

The user or Agent 0 is at the top of the hierarchy, delegating tasks to subordinate agents, which can further delegate to other agents. Each agent can utilize tools and access the shared assets (prompts, memory, knowledge, extensions and instruments) to perform its tasks.

Runtime Architecture

Agent Zero's runtime architecture is built around Docker containers:

Host System (your machine):
- Requires only Docker and a web browser
- Runs Docker Desktop or Docker Engine
- Handles container orchestration
Runtime Container:
- Houses the complete Agent Zero framework
- Manages the Web UI and API endpoints
- Handles all core functionalities including code execution
- Provides a standardized environment across all platforms

This architecture ensures:

Consistent environment across platforms
Simplified deployment and updates
Enhanced security through containerization
Reduced dependency requirements on host systems
Flexible deployment options for advanced users

Note

The legacy approach of running Agent Zero directly on the host system (using Python, Conda, etc.) is still possible but requires Remote Function Calling (RFC) configuration through the Settings page. See Full Binaries Installation for detailed instructions.

Implementation Details

Directory Structure

Directory	Description
`/docker`	Docker-related files for runtime container
`/docs`	Documentation files and guides
`/instruments`	Custom scripts and tools for runtime environment
`/knowledge`	Knowledge base storage
`/logs`	HTML CLI-style chat logs
`/memory`	Persistent agent memory storage
`/prompts`	System and tool prompts
`/python`	Core Python codebase:
`/api`	API endpoints and interfaces
`/extensions`	Modular extensions
`/helpers`	Utility functions
`/tools`	Tool implementations
`/tmp`	Temporary runtime data
`/webui`	Web interface components:
`/css`	Stylesheets
`/js`	JavaScript modules
`/public`	Static assets
`/work_dir`	Working directory

Key Files

File	Description
`.env`	Environment configuration
`agent.py`	Core agent implementation
`example.env`	Configuration template
`initialize.py`	Framework initialization
`models.py`	Model providers and configs
`preload.py`	Pre-initialization routines
`prepare.py`	Environment preparation
`requirements.txt`	Python dependencies
`run_cli.py`	CLI launcher
`run_ui.py`	Web UI launcher

Note

When using the Docker runtime container, these directories are mounted within the /a0 volume for data persistence until the container is restarted or deleted.

Core Components

Agent Zero's architecture revolves around the following key components:

1. Agents

The core actors within the framework. Agents receive instructions, reason, make decisions, and utilize tools to achieve their objectives. Agents operate within a hierarchical structure, with superior agents delegating tasks to subordinate agents.

Agent Hierarchy and Communication

Agent Zero employs a hierarchical agent structure, where a top-level agent (often the user) can delegate tasks to subordinate agents. This hierarchy allows for the efficient breakdown of complex tasks into smaller, more manageable sub-tasks.

Communication flows between agents through messages, which are structured according to the prompt templates. These messages typically include:

Argument	Description
`Thoughts:`	The agent's Chain of Thought and planning process
`Tool name:`	The specific tool used by the agent
`Responses or queries:`	Results, feedback or queries from tools or other agents

Interaction Flow

A typical interaction flow within Agent Zero might look like this:

The user provides an instruction to Agent 0
Agent 0 initializes VectorDB and access memory
Agent 0 analyzes the instruction and formulates a plan using thoughts argument, possibly involving the use of tools or the creation of sub-agents
If necessary, Agent 0 delegates sub-tasks to subordinate agents
Agents use tools to perform actions, both providing arguments and responses or queries
Agents communicate results and feedback back up the hierarchy
Agent 0 provides the final response to the user

2. Tools

Tools are functionalities that agents can leverage. These can include anything from web search and code execution to interacting with APIs or controlling external software. Agent Zero provides a mechanism for defining and integrating both built-in and custom tools.

Built-in Tools

Agent Zero comes with a set of built-in tools designed to help agents perform tasks efficiently:

Tool	Function
behavior_adjustment	Agent Zero use this tool to change its behavior according to a prior request from the user.
call_subordinate	Allows agents to delegate tasks to subordinate agents
code_execution_tool	Allows agents to execute Python, Node.js, and Shell code in the terminal
input	Allows agents to use the keyboard to interact with an active shell
knowledge_tool	Enables agents to retrieve information from memory, knowledge base or online external sources
response_tool	Allows agents to output a response
memory_tool	Enables agents to save, load, delete and forget information from memory
webpage_content_tool	Enables agents to fetch and analyze the text content of webpages

Knowledge Tool

The knowledge_tool uses SearXNG to search the web and retrieve information. It can also search the local knowledge base and memory for relevant information. The tool returns a summary of the information, which can be used by the agent to make decisions or answer questions.

SearXNG Integration

Agent Zero has integrated SearXNG as its primary search tool, replacing the previous knowledge tools (Perplexity and DuckDuckGo). This integration enhances the agent's ability to retrieve information while ensuring user privacy and customization.

Privacy-Focused Search SearXNG is an open-source metasearch engine that allows users to search multiple sources without tracking their queries. This integration ensures that user data remains private and secure while accessing a wide range of information.
Enhanced Search Capabilities The integration provides access to various types of content, including images, videos, and news articles, allowing users to gather comprehensive information on any topic.
Fallback Mechanism In cases where SearXNG might not return satisfactory results, Agent Zero can be configured to fall back on other sources or methods, ensuring that users always have access to information.

Note

The Knowledge Tool is designed to work seamlessly with both online searches through SearXNG and local knowledge base queries, providing a comprehensive information retrieval system.

Custom Tools

Users can create custom tools to extend Agent Zero's capabilities. Custom tools can be integrated into the framework by defining a tool specification, which includes the tool's prompt to be placed in /prompts/$FOLDERNAME/agent.system.tool.$TOOLNAME.md, as detailed below.

Create agent.system.tool.$TOOL_NAME.md in prompts/$SUBDIR
Add reference in agent.system.tools.md
If needed, implement tool class in python/tools using Tool base class
Follow existing patterns for consistency

Note

Tools are always present in system prompt, so you should keep them to minimum. To save yourself some tokens, use the Instruments module to call custom scripts or functions.

3. Memory System

The memory system is a critical component of Agent Zero, enabling the agent to learn and adapt from past interactions. It operates on a hybrid model where part of the memory is managed automatically by the framework while users can also manually input and extract information.

Memory Structure

The memory is categorized into four distinct areas:

Storage and retrieval of user-provided information (e.g., names, API keys)
Fragments: Contains pieces of information from previous conversations, updated automatically
Solutions: Stores successful solutions from past interactions for future reference
Metadata: Each memory entry includes metadata (IDs, timestamps), enabling efficient filtering and searching based on specific criteria

Messages History and Summarization

Agent Zero employs a sophisticated message history and summarization system to maintain context effectively while optimizing memory usage. This system dynamically manages the information flow, ensuring relevant details are readily available while efficiently handling the constraints of context windows.

Context Extraction: The system identifies key information from previous messages that are vital for ongoing discussions. This process mirrors how humans recall important memories, allowing less critical details to fade.
Summarization Process: Using natural language processing through the utility model, Agent Zero condenses the extracted information into concise summaries. By summarizing past interactions, Agent Zero can quickly recall important facts about the whole chat, leading to more appropriate responses.
Contextual Relevance: The summarized context is prioritized based on its relevance to the current topic, ensuring users receive the most pertinent information.

Implementation Details:

Message Summaries: Individual messages are summarized using a structured format that captures key information while reducing token usage.
Dynamic Compression: The system employs an intelligent compression strategy:
- Recent messages remain in their original form for immediate context.
- Older messages are gradually compressed into more concise summaries.
- Multiple compression levels allow for efficient context window usage.
- Original messages are preserved separately from summaries.
Context Window Optimization:
- Acts as a near-infinite short-term memory for single conversations.
- Dynamically adjusts compression ratios based on available space and settings.
Bulk and Topic Summarization:
- Groups related messages into thematic chunks for better organization.
- Generates concise summaries of multiple messages while preserving key context.
- Enables efficient navigation of long conversation histories.
- Maintains semantic connections between related topics.

By dynamically adjusting context windows and summarizing past interactions, Agent Zero enhances both efficiency and user experience. This innovation not only reflects the framework's commitment to being dynamic and user-centric, but also draws inspiration from human cognitive processes, making AI interactions more relatable and effective. Just as humans forget trivial details, Agent Zero intelligently condenses information to enhance communication.

Note

To maximize the effectiveness of context summarization, users should provide clear and specific instructions during interactions. This helps Agent Zero understand which details are most important to retain.

4. Prompts

The prompts directory contains various Markdown files that control agent behavior and communication. The most important file is agent.system.main.md, which acts as a central hub, referencing other prompt files.

Core Prompt Files

Prompt File	Description
agent.system.main.role.md	Defines the agent's overall role and capabilities
agent.system.main.communication.md	Specifies how the agent should communicate
agent.system.main.solving.md	Describes the agent's approach to tasks
agent.system.main.tips.md	Provides additional tips or guidance
agent.system.main.behaviour.md	Controls dynamic behavior adjustments and rules
agent.system.main.environment.md	Defines the runtime environment context
agent.system.tools.md	Organizes and calls the individual tool prompt files
agent.system.tool.*.md	Individual tool prompt files

Prompt Organization

Default Prompts: Located in prompts/default/, serve as the base configuration
Custom Prompts: Can be placed in custom subdirectories (e.g., prompts/my-custom/)
Behavior Files: Stored in memory as behaviour.md, containing dynamic rules
Tool Prompts: Organized in tool-specific files for modularity

Custom Prompts

Create directory in prompts/ (e.g., my-custom-prompts)
Copy and modify needed files from prompts/default/
Agent Zero will merge your custom files with the default ones
Select your custom prompts in the Settings page (Agent Config section)

Dynamic Behavior System

Behavior Adjustment:
- Agents can modify their behavior in real-time based on user instructions
- Behavior changes are automatically integrated into the system prompt
- Behavioral rules are merged intelligently, avoiding duplicates and conflicts
Behavior Management Components:
- behaviour_adjustment.py: Core tool for updating agent behavior
- _20_behaviour_prompt.py: Extension that injects behavior rules into system prompt
- Custom rules stored in the agent's memory directory as behaviour.md
Behavior Update Process:
1. User requests behavior changes (e.g., "respond in UK English")
2. System identifies behavioral instructions in conversation
3. New rules are merged with existing ruleset
4. Updated behavior is immediately applied

Integration with System Prompt:
- Behavior rules are injected at the start of the system prompt
- Rules are formatted in a structured markdown format
- Changes are applied without disrupting other components
- Maintains separation between core functionality and behavioral rules

Note

You can customize any of these files. Agent Zero will use the files in your custom prompts_subdir if they exist, otherwise, it will fall back to the files in prompts/default.

Tip

The behavior system allows for dynamic adjustments without modifying the base prompt files. Changes made through behavior rules persist across sessions while maintaining the core functionality.

5. Knowledge

Knowledge refers to the user-provided information and data that agents can leverage:

Custom Knowledge: Add files to /knowledge/custom/main directory manually or through the "Import Knowledge" button in the UI
- Supported formats: .txt, .pdf, .csv, .html, .json, .md
- Automatically imported and indexed
- Expandable format support
Knowledge Base:
- Can include PDFs, databases, books, documentation
- /docs folder automatically added
- Used for answering questions and decision-making
- Supports RAG-augmented tasks

6. Instruments

Instruments provide a way to add custom functionalities to Agent Zero without adding to the token count of the system prompt:

Stored in long-term memory of Agent Zero
Unlimited number of instruments available
Recalled when needed by the agent
Can modify agent behavior by introducing new procedures
Function calls or scripts to integrate with other systems
Scripts are run inside the Docker Container

Adding Instruments

Create folder in instruments/custom (no spaces in name)
Add .md description file for the interface
Add .sh script (or other executable) for implementation
The agent will automatically detect and use the instrument

7. Extensions

Extensions are a powerful feature of Agent Zero, designed to keep the main codebase clean and organized while allowing for greater flexibility and modularity.

Structure

Extensions can be found in python/extensions directory:

Folder Organization: Extensions are stored in designated subfolders corresponding to different aspects of the agent's message loop
Execution Order: Files are executed in alphabetical order for predictable behavior
Naming Convention: Files start with numbers to control execution order
Modularity: Each extension focuses on a specific functionality

Types

Message Loop Prompts: Handle system messages and history construction
Memory Management: Handle recall and solution memorization
System Integration: Manage interaction with external systems

Adding Extensions

Create Python file in appropriate python/extensions subfolder
Follow naming convention for execution order (start with number)
Implement functionality following existing patterns
Ensure compatibility with main system
Test thoroughly before deployment

Note

Consider contributing valuable custom components to the main repository. See Contributing for more information.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

architecture.md

architecture.md

Architecture Overview

System Architecture

Runtime Architecture

Implementation Details

Directory Structure

Key Files

Core Components

1. Agents

Agent Hierarchy and Communication

Interaction Flow

2. Tools

Built-in Tools

Knowledge Tool

SearXNG Integration

Custom Tools

3. Memory System

Memory Structure

Messages History and Summarization

4. Prompts

Core Prompt Files

Prompt Organization

Custom Prompts

Dynamic Behavior System

5. Knowledge

6. Instruments

Adding Instruments

7. Extensions

Structure

Types

Adding Extensions

Files

architecture.md

Latest commit

History

architecture.md

File metadata and controls

Architecture Overview

System Architecture

Runtime Architecture

Implementation Details

Directory Structure

Key Files

Core Components

1. Agents

Agent Hierarchy and Communication

Interaction Flow

2. Tools

Built-in Tools

Knowledge Tool

SearXNG Integration

Custom Tools

3. Memory System

Memory Structure

Messages History and Summarization

4. Prompts

Core Prompt Files

Prompt Organization

Custom Prompts

Dynamic Behavior System

5. Knowledge

6. Instruments

Adding Instruments

7. Extensions

Structure

Types

Adding Extensions