deer-flow/backend/CLAUDE.md
2026-01-19 16:17:31 +08:00

CLAUDE.md

This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.

Project Overview

DeerFlow is a LangGraph-based AI agent backend that provides a "super agent" with sandbox execution capabilities. The agent can execute code, browse the web, and manage files in isolated sandbox environments.

Commands

# Install dependencies
make install

# Run development server (LangGraph Studio)
make dev

# Lint
make lint

# Format code
make format

Architecture

Configuration System

The app uses a YAML-based configuration system loaded from config.yaml.

Setup: Copy config.example.yaml to config.yaml in the project root directory and customize for your environment.

# From project root (deer-flow/)
cp config.example.yaml config.yaml

Configuration priority:

  1. Explicit config_path argument
  2. DEER_FLOW_CONFIG_PATH environment variable
  3. config.yaml in current directory (backend/)
  4. config.yaml in parent directory (project root - recommended location)

Config values starting with $ are resolved as environment variables (e.g., $OPENAI_API_KEY).
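
The lookup order and the $ convention above can be sketched as follows. This is a minimal illustration, not the project's loader: find_config_path and resolve_env_values are hypothetical names, and raising KeyError for an unset variable is an assumption about how a missing value might be handled.

```python
import os
from pathlib import Path
from typing import Any, Optional


def find_config_path(config_path: Optional[str] = None) -> Path:
    """Pick a config file using the four-step priority described above."""
    # 1. An explicit argument always wins.
    if config_path is not None:
        return Path(config_path)
    # 2. Then the DEER_FLOW_CONFIG_PATH environment variable.
    env_path = os.environ.get("DEER_FLOW_CONFIG_PATH")
    if env_path:
        return Path(env_path)
    # 3. and 4. Then config.yaml in the current directory, then in its parent.
    for candidate in (Path.cwd() / "config.yaml", Path.cwd().parent / "config.yaml"):
        if candidate.exists():
            return candidate
    raise FileNotFoundError("config.yaml not found in the current or parent directory")


def resolve_env_values(value: Any) -> Any:
    """Recursively replace "$NAME" strings with the value of os.environ["NAME"]."""
    if isinstance(value, str) and value.startswith("$"):
        # Assumption: an unset variable is an error (KeyError) rather than an empty string.
        return os.environ[value[1:]]
    if isinstance(value, dict):
        return {key: resolve_env_values(item) for key, item in value.items()}
    if isinstance(value, list):
        return [resolve_env_values(item) for item in value]
    return value
```

With OPENAI_API_KEY set in the environment, resolve_env_values({"api_key": "$OPENAI_API_KEY"}) would return a dict holding the actual key, while values that do not start with $ pass through unchanged.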

Core Components

Agent Graph (src/agents/)

  • lead_agent is the main entry point registered in langgraph.json
  • Uses ThreadState, which extends AgentState with sandbox state
  • The agent is created via create_agent() with a model, tools, middleware, and a system prompt

Sandbox System (src/sandbox/)

  • The abstract Sandbox base class defines the interface: execute_command, read_file, write_file, list_dir
  • SandboxProvider manages the sandbox lifecycle: acquire, get, release
  • SandboxMiddleware automatically acquires a sandbox when the agent starts and injects it into state
  • LocalSandboxProvider is a singleton implementation for local execution
  • Sandbox tools (bash, ls, read_file, write_file, str_replace) extract the sandbox from the tool runtime
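
To make the interface above concrete, here is a rough sketch of how the pieces might fit together. Only the class and method names come from the list above; the method signatures, the LocalSandbox class, and the fixed "local" sandbox id are assumptions made for illustration and may differ from the code in src/sandbox/.

```python
import subprocess
from abc import ABC, abstractmethod
from pathlib import Path
from typing import Optional


class Sandbox(ABC):
    """Interface named in the list above; the signatures are assumptions."""

    @abstractmethod
    def execute_command(self, command: str) -> str: ...

    @abstractmethod
    def read_file(self, path: str) -> str: ...

    @abstractmethod
    def write_file(self, path: str, content: str) -> None: ...

    @abstractmethod
    def list_dir(self, path: str) -> list[str]: ...


class LocalSandbox(Sandbox):
    """Hypothetical implementation that runs directly on the host, with no isolation."""

    def execute_command(self, command: str) -> str:
        result = subprocess.run(command, shell=True, capture_output=True, text=True)
        return result.stdout + result.stderr

    def read_file(self, path: str) -> str:
        return Path(path).read_text(encoding="utf-8")

    def write_file(self, path: str, content: str) -> None:
        target = Path(path)
        target.parent.mkdir(parents=True, exist_ok=True)
        target.write_text(content, encoding="utf-8")

    def list_dir(self, path: str) -> list[str]:
        return sorted(entry.name for entry in Path(path).iterdir())


class LocalSandboxProvider:
    """Hands out one shared LocalSandbox under a fixed id (hypothetical scheme)."""

    _SANDBOX_ID = "local"

    def __init__(self) -> None:
        self._sandbox: Optional[LocalSandbox] = None

    def acquire(self) -> str:
        # Create the sandbox lazily and reuse it on every later call.
        if self._sandbox is None:
            self._sandbox = LocalSandbox()
        return self._SANDBOX_ID

    def get(self, sandbox_id: str) -> Optional[Sandbox]:
        return self._sandbox if sandbox_id == self._SANDBOX_ID else None

    def release(self, sandbox_id: str) -> None:
        # The shared local sandbox holds no resources, so there is nothing to free.
        pass
```

In this sketch a middleware would call acquire() once when the agent starts, store the returned id in state, and each tool would call get() with that id to reach the sandbox.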

Model Factory (src/models/)

  • create_chat_model() instantiates an LLM from config using reflection
  • Supports thinking_enabled flag with per-model when_thinking_enabled overrides
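
One plausible reading of the when_thinking_enabled override is a shallow merge over the model's base settings. The sketch below assumes exactly that; build_model_kwargs, the reserved key set, and the sample settings are hypothetical and are not taken from src/models/.

```python
def build_model_kwargs(model_config: dict, thinking_enabled: bool = False) -> dict:
    """Return constructor kwargs, applying thinking overrides only when requested."""
    # Assumption: these keys describe the entry itself and are not passed to the model.
    reserved = {"name", "use", "when_thinking_enabled"}
    kwargs = {key: value for key, value in model_config.items() if key not in reserved}
    if thinking_enabled:
        # Assumption: overrides replace base settings key by key (shallow merge).
        kwargs.update(model_config.get("when_thinking_enabled") or {})
    return kwargs


# Hypothetical model entry, shaped like one item of models[] in config.yaml.
example_model = {
    "name": "example-model",
    "use": "some.package:ChatModelClass",
    "temperature": 0.7,
    "when_thinking_enabled": {"temperature": 1.0},
}
```

Calling build_model_kwargs(example_model) leaves temperature at its base value, while passing thinking_enabled=True swaps in the override.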

Tool System (src/tools/)

  • Tools are defined in config with a use path (e.g., src.sandbox.tools:bash_tool)
  • get_available_tools() resolves tool paths via reflection
  • Community tools in src/community/: Jina AI (web fetch), Tavily (web search)

Reflection System (src/reflection/)

  • resolve_variable() imports a module and returns a variable from it, given a path of the form module:variable
  • resolve_class() imports a class and validates it against a base class
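
The two helpers above can be approximated with importlib from the standard library. This is a sketch under assumptions: the function names match the list, but the error types and messages are guesses, and the real code in src/reflection/ may behave differently.

```python
from importlib import import_module
from typing import Any


def resolve_variable(path: str) -> Any:
    """Import the module named before the colon and return the attribute after it."""
    module_name, _, variable_name = path.partition(":")
    if not module_name or not variable_name:
        raise ValueError(f"expected 'module:variable', got {path!r}")
    module = import_module(module_name)
    return getattr(module, variable_name)


def resolve_class(path: str, base_class: type) -> type:
    """Resolve a class by path and check that it derives from base_class."""
    resolved = resolve_variable(path)
    if not isinstance(resolved, type) or not issubclass(resolved, base_class):
        raise TypeError(f"{path!r} is not a subclass of {base_class.__name__}")
    return resolved
```

For example, resolve_variable("math:pi") returns the float stored in math.pi, and resolve_class("collections:OrderedDict", dict) returns the OrderedDict class because it subclasses dict.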

Skills System (src/skills/)

  • Skills provide specialized workflows for specific tasks (e.g., PDF processing, frontend design)
  • Located in deer-flow/skills/{public,custom} directory structure
  • Each skill has a SKILL.md file with YAML front matter (name, description, license)
  • Skills are automatically discovered and loaded at runtime
  • load_skills() scans directories and parses SKILL.md files
  • Skills are injected into the agent's system prompt along with their paths
  • A path mapping system gives the agent the same skill paths in both local and Docker sandboxes:
    • Local sandbox: /mnt/skills maps to /path/to/deer-flow/skills
    • Docker sandbox: Automatically mounted as a volume
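
Discovery and path mapping might look roughly like the sketch below. It assumes one directory per skill under public/ and custom/, and it uses a deliberately simple key: value parser instead of a full YAML library; both choices, and the shape of the returned dicts, are assumptions rather than the behavior of the project's load_skills().

```python
from pathlib import Path


def parse_front_matter(text: str) -> dict[str, str]:
    """Read simple 'key: value' pairs between the opening and closing '---' lines."""
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}
    metadata: dict[str, str] = {}
    for line in lines[1:]:
        if line.strip() == "---":
            return metadata
        key, separator, value = line.partition(":")
        if separator:
            metadata[key.strip()] = value.strip()
    return {}  # No closing delimiter, so treat the file as having no front matter.


def load_skills(skills_root: Path, container_path: str = "/mnt/skills") -> list[dict[str, str]]:
    """Scan public/ and custom/ for SKILL.md files and map them to sandbox paths."""
    skills = []
    for category in ("public", "custom"):
        category_dir = skills_root / category
        if not category_dir.is_dir():
            continue
        # Assumption: each skill lives in its own directory directly under the category.
        for skill_file in sorted(category_dir.glob("*/SKILL.md")):
            metadata = parse_front_matter(skill_file.read_text(encoding="utf-8"))
            if "name" not in metadata:
                continue  # Skip files without usable front matter.
            relative = skill_file.relative_to(skills_root).as_posix()
            skills.append({**metadata, "path": f"{container_path}/{relative}"})
    return skills
```

The path stored for each skill is the one the agent sees inside the sandbox, which is why it is built from container_path rather than from the host directory.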

Middleware System

  • Custom middlewares live in src/agents/middlewares/: title generation, thread data, clarification, etc.
  • SummarizationMiddleware from LangChain automatically condenses conversation history when token limits are approached
  • Summarization is configured in config.yaml under the summarization key with trigger/keep thresholds
  • Middlewares are registered in src/agents/lead_agent/agent.py in the following execution order:
    1. ThreadDataMiddleware - Initializes thread context
    2. SandboxMiddleware - Manages sandbox lifecycle
    3. SummarizationMiddleware - Reduces context when limits are approached (if enabled)
    4. TitleMiddleware - Generates conversation titles
    5. ClarificationMiddleware - Handles clarification requests (must be last)

Config Schema

Models, tools, sandbox providers, skills, and middleware settings are configured in config.yaml:

  • models[]: LLM configurations with use class path
  • tools[]: Tool configurations with use variable path and group
  • sandbox.use: Sandbox provider class path
  • skills.path: Host path to skills directory (optional, default: ../skills)
  • skills.container_path: Container mount path (default: /mnt/skills)
  • title: Automatic thread title generation configuration
  • summarization: Automatic conversation summarization configuration

Code Style

  • Uses ruff for linting and formatting
  • Line length: 240 characters
  • Python 3.12+ with type hints
  • Double quotes, space indentation