mirror of https://gitee.com/wanwujie/deer-flow synced 2026-04-14 18:54:46 +08:00

Files

hetaoBackend 1171598b2f feat: add MCP (Model Context Protocol) support

Add comprehensive MCP integration using langchain-mcp-adapters to enable
pluggable external tools from MCP servers.

Features:
- MCP server configuration via mcp_config.json
- Automatic lazy initialization for seamless use in both FastAPI and LangGraph Studio
- Support for multiple MCP servers (filesystem, postgres, github, brave-search, etc.)
- Environment variable resolution in configuration
- Tool caching mechanism for optimal performance
- Complete documentation and setup guide

Implementation:
- Add src/mcp module with client, tools, and cache components
- Integrate MCP config loading in AppConfig
- Update tool system to include MCP tools automatically
- Add eager initialization in FastAPI lifespan handler
- Add lazy initialization fallback for LangGraph Studio

Dependencies:
- Add langchain-mcp-adapters>=0.1.0

Documentation:
- Add MCP_SETUP.md with comprehensive setup guide
- Update CLAUDE.md with MCP system architecture
- Update config.example.yaml with MCP configuration notes
- Update README.md with MCP setup instructions

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

2026-01-19 18:58:13 +08:00

6.4 KiB

Raw Blame History

CLAUDE.md

This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.

Project Overview

DeerFlow is a LangGraph-based AI agent backend that provides a "super agent" with sandbox execution capabilities. The agent can execute code, browse the web, and manage files in isolated sandbox environments.

Commands

# Install dependencies
make install

# Run development server (LangGraph Studio)
make dev

# Lint
make lint

# Format code
make format

Architecture

Configuration System

The app uses a YAML-based configuration system loaded from config.yaml.

Setup: Copy config.example.yaml to config.yaml in the project root directory and customize for your environment.

# From project root (deer-flow/)
cp config.example.yaml config.yaml

Configuration priority:

Explicit config_path argument
DEER_FLOW_CONFIG_PATH environment variable
config.yaml in current directory (backend/)
config.yaml in parent directory (project root - recommended location)

Config values starting with $ are resolved as environment variables (e.g., $OPENAI_API_KEY).

Core Components

Agent Graph (src/agents/)

lead_agent is the main entry point registered in langgraph.json
Uses ThreadState which extends AgentState with sandbox state
Agent is created via create_agent() with model, tools, middleware, and system prompt

Sandbox System (src/sandbox/)

Abstract Sandbox base class defines interface: execute_command, read_file, write_file, list_dir
SandboxProvider manages sandbox lifecycle: acquire, get, release
SandboxMiddleware automatically acquires sandbox on agent start and injects into state
LocalSandboxProvider is a singleton implementation for local execution
Sandbox tools (bash, ls, read_file, write_file, str_replace) extract sandbox from tool runtime

Model Factory (src/models/)

create_chat_model() instantiates LLM from config using reflection
Supports thinking_enabled flag with per-model when_thinking_enabled overrides

Tool System (src/tools/)

Tools defined in config with use path (e.g., src.sandbox.tools:bash_tool)
get_available_tools() resolves tool paths via reflection
Community tools in src/community/: Jina AI (web fetch), Tavily (web search)
Supports MCP (Model Context Protocol) for pluggable external tools

MCP System (src/mcp/)

Integrates with MCP servers to provide pluggable external tools using langchain-mcp-adapters
Configuration in mcp_config.json in project root (separate from config.yaml)
Uses MultiServerMCPClient from langchain-mcp-adapters for multi-server management
Automatic initialization: Tools are loaded on first use with lazy initialization
Supports both eager loading (FastAPI startup) and lazy loading (LangGraph Studio)
initialize_mcp_tools() can be called in FastAPI lifespan handler for eager loading
get_cached_mcp_tools() automatically initializes tools if not already loaded
Works seamlessly in both FastAPI server and LangGraph Studio environments
Each server can be enabled/disabled independently via enabled flag
Supports environment variable resolution (e.g., $GITHUB_TOKEN)
Configuration priority:
1. Explicit config_path argument
2. DEER_FLOW_MCP_CONFIG_PATH environment variable
3. mcp_config.json in current directory (backend/)
4. mcp_config.json in parent directory (project root - recommended location)
Popular MCP servers: filesystem, postgres, github, brave-search, puppeteer
See mcp_config.example.json for configuration examples
Built on top of langchain-ai/langchain-mcp-adapters for seamless integration

Reflection System (src/reflection/)

resolve_variable() imports module and returns variable (e.g., module:variable)
resolve_class() imports and validates class against base class

Skills System (src/skills/)

Skills provide specialized workflows for specific tasks (e.g., PDF processing, frontend design)
Located in deer-flow/skills/{public,custom} directory structure
Each skill has a SKILL.md file with YAML front matter (name, description, license)
Skills are automatically discovered and loaded at runtime
load_skills() scans directories and parses SKILL.md files
Skills are injected into agent's system prompt with paths
Path mapping system allows seamless access in both local and Docker sandbox:
- Local sandbox: /mnt/skills → /path/to/deer-flow/skills
- Docker sandbox: Automatically mounted as volume

Middleware System

Custom middlewares in src/agents/middlewares/: Title generation, thread data, clarification, etc.
SummarizationMiddleware from LangChain automatically condenses conversation history when token limits are approached
Configured in config.yaml under summarization key with trigger/keep thresholds
Middlewares are registered in src/agents/lead_agent/agent.py with execution order:
1. ThreadDataMiddleware - Initializes thread context
2. SandboxMiddleware - Manages sandbox lifecycle
3. SummarizationMiddleware - Reduces context when limits are approached (if enabled)
4. TitleMiddleware - Generates conversation titles
5. ClarificationMiddleware - Handles clarification requests (must be last)

Config Schema

Models, tools, sandbox providers, skills, and middleware settings are configured in config.yaml:

models[]: LLM configurations with use class path
tools[]: Tool configurations with use variable path and group
sandbox.use: Sandbox provider class path
skills.path: Host path to skills directory (optional, default: ../skills)
skills.container_path: Container mount path (default: /mnt/skills)
title: Automatic thread title generation configuration
summarization: Automatic conversation summarization configuration

MCP servers are configured separately in mcp_config.json:

mcpServers: Map of server name to configuration
- enabled: Whether the server is enabled (boolean)
- command: Command to execute to start the server (e.g., "npx", "python")
- args: Arguments to pass to the command (array)
- env: Environment variables (object with $VAR support)
- description: Human-readable description

Code Style

Uses ruff for linting and formatting
Line length: 240 characters
Python 3.12+ with type hints
Double quotes, space indentation

6.4 KiB Raw Blame History