deer-flow

mirror of https://gitee.com/wanwujie/deer-flow synced 2026-04-04 06:32:13 +08:00

Author	SHA1	Message	Date
Xun	3adb4e90cb	fix: improve JSON repair handling for markdown code blocks (#841) * fix: improve JSON repair handling for markdown code blocks * unified import path * compress_crawl_udf * fix * reverse	2026-01-30 08:47:23 +08:00
Xun	c0849af37e	feat(context): decrease token in web_search AIMessage (#827) This PR addresses token limit issues when web_search is enabled with include_raw_content by implementing a two-pronged approach: changing the default behavior to exclude raw content and adding compression logic for when raw content is included.	2026-01-23 08:31:48 +08:00
Xun	0e64c52975	refactor: Refactors the retriever function to use async/await (#821) * refactor: Refactors the retriever function to use async/await	2026-01-20 19:56:26 +08:00
Willem Jiang	2a97170b6c	feat: add Serper search engine support (#762) * feat: add Serper search engine support * docs: update configuration guide and env example for Serper * test: add test case for Serper with missing API key	2025-12-15 23:04:26 +08:00
infoquest-byteplus	7ec9e45702	feat: support infoquest (#708) * support infoquest * support html checker * support html checker * change line break format * change line break format * change line break format * change line break format * change line break format * change line break format * change line break format * change line break format * Fix several critical issues in the codebase - Resolve crawler panic by improving error handling - Fix plan validation to prevent invalid configurations - Correct InfoQuest crawler JSON conversion logic * add test for infoquest * add test for infoquest * Add InfoQuest introduction to the README * add test for infoquest * fix readme for infoquest * fix readme for infoquest * resolve the conflict * resolve the conflict * resolve the conflict * Fix formatting of INFOQUEST in SearchEngine enum * Apply suggestions from code review Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Willem Jiang <143703838+willem-bd@users.noreply.github.com> Co-authored-by: Willem Jiang <willem.jiang@gmail.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-12-02 08:16:35 +08:00
Willem Jiang	170c4eb33c	Upgrade langchain version to 1.x (#720) * fix: revert the part of patch of issue-710 to extract the content from the plan * Upgrade the ddgs for the new compatible version * Upgraded langchain to 1.1.0 updated langchain related package to the new compatible version * Update pyproject.toml Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-11-28 22:09:13 +08:00
Willem Jiang	bec97f02ae	fix: the crawling error when encountering PDF URLs (#707) * fix: the crawling error when encountering PDF URLs * Added the unit test for the new feature of crawl tool * fix: address the code review problems * fix: address the code review problems	2025-11-25 09:24:52 +08:00
jimmyuconn1982	2001a7c223	Fix: clarification bugs - max rounds, locale passing, and over-clarification (#647) Fixes: Max rounds bug, locale passing bug, over-clarification issue * resolve Copilot spelling comments --------- Co-authored-by: Willem Jiang <willem.jiang@gmail.com>	2025-10-24 16:43:39 +08:00
Willem Jiang	052490b116	fix: resolve issue #467 - message content validation and Tavily search error handling (#645 ) * fix: resolve issue #467 - message content validation and Tavily search error handling This commit implements a comprehensive fix for issue #467 where the application crashed with 'Field required: input.messages.3.content' error when generating reports. ## Root Cause Analysis The issue had multiple interconnected causes: 1. Tavily tool returned mixed types (lists/error strings) instead of consistent JSON 2. background_investigation_node didn't handle error cases properly, returning None 3. Missing message content validation before LLM calls 4. Insufficient error diagnostics for content-related errors ## Changes Made ### Part 1: Fix Tavily Search Tool (tavily_search_results_with_images.py) - Modified _run() and _arun() methods to return JSON strings instead of mixed types - Error responses now return JSON: {"error": repr(e)} - Successful responses return JSON string: json.dumps(cleaned_results) - Ensures tool results always have valid string content for ToolMessages ### Part 2: Fix background_investigation_node Error Handling (graph/nodes.py) - Initialize background_investigation_results to empty list instead of None - Added proper JSON parsing for string responses from Tavily tool - Handle error responses with explicit error logging - Always return valid JSON (empty list if error) instead of None ### Part 3: Add Message Content Validation (utils/context_manager.py) - New validate_message_content() function validates all messages before LLM calls - Ensures all messages have content attribute and valid string content - Converts complex types (lists, dicts) to JSON strings - Provides graceful fallback for messages with issues ### Part 4: Enhanced Error Diagnostics (_execute_agent_step in graph/nodes.py) - Call message validation before agent invocation - Add detailed logging for content-related errors - Log message types, content types, and lengths 
when validation fails - Helps with future debugging of similar issues ## Testing - All unit tests pass (395 tests) - Python syntax verified for all modified files - No breaking changes to existing functionality * test: update tests for issue #467 fixes Update test expectations to match the new implementation: - Tavily search tool now returns JSON strings instead of mixed types - background_investigation_node returns empty list [] for errors instead of None - All tests updated to verify the new behavior - All 391 tests pass successfully * Update src/graph/nodes.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-10-23 22:08:14 +08:00
Willem Jiang	9ece3fd9c3	fix: support additional Tavily search parameters via configuration to fix #548 (#643 ) * fix: support additional Tavily search parameters via configuration to fix #548 - Add include_answer, search_depth, include_raw_content, include_images, include_image_descriptions to SEARCH_ENGINE config - Update get_web_search_tool() to load these parameters from configuration with sensible defaults - Parameters are now properly passed to TavilySearchWithImages during initialization - This fixes 'got an unexpected keyword argument' errors when using web_search tool - Update tests to verify new parameters are correctly set * test: add comprehensive unit tests for web search configuration loading - Add test for custom configuration values (include_answer, search_depth, etc.) - Add test for empty configuration (all defaults) - Add test for image_descriptions logic when include_images is false - Add test for partial configuration - Add test for missing config file - Add test for multiple domains in include/exclude lists All 7 new tests pass and provide comprehensive coverage of configuration loading and parameter handling for Tavily search tool initialization. 
* test: verify all Tavily configuration parameters are optional Add 8 comprehensive tests to verify that all Tavily engine configuration parameters are truly optional: - test_tavily_with_no_search_engine_section: SEARCH_ENGINE section missing - test_tavily_with_completely_empty_config: Entire config missing - test_tavily_with_only_include_answer_param: Single param, rest default - test_tavily_with_only_search_depth_param: Single param, rest default - test_tavily_with_only_include_domains_param: Domain param, rest default - test_tavily_with_explicit_false_boolean_values: False values work correctly - test_tavily_with_empty_domain_lists: Empty lists handled correctly - test_tavily_all_parameters_optional_mix: Multiple missing params work These tests verify: - Tool creation never fails regardless of missing configuration - All parameters have sensible defaults - Boolean parameters can be explicitly set to False - Any combination of optional parameters works - Domain lists can be empty or omitted All 15 Tavily configuration tests pass successfully.	2025-10-22 22:56:02 +08:00
jimmyuconn1982	003f081a7b	fix: Refine clarification workflow state handling (#641 ) * fix: support local models by making thought field optional in Plan model - Make thought field optional in Plan model to fix Pydantic validation errors with local models - Add Ollama configuration example to conf.yaml.example - Update documentation to include local model support - Improve planner prompt with better JSON format requirements Fixes local model integration issues where models like qwen3:14b would fail due to missing thought field in JSON output. * feat: Add intelligent clarification feature for research queries - Add multi-turn clarification process to refine vague research questions - Implement three-dimension clarification standard (Tech/App, Focus, Scope) - Add clarification state management in coordinator node - Update coordinator prompt with detailed clarification guidelines - Add UI settings to enable/disable clarification feature (disabled by default) - Update workflow to handle clarification rounds recursively - Add comprehensive test coverage for clarification functionality - Update documentation with clarification feature usage guide Key components: - src/graph/nodes.py: Core clarification logic and state management - src/prompts/coordinator.md: Detailed clarification guidelines - src/workflow.py: Recursive clarification handling - web/: UI settings integration - tests/: Comprehensive test coverage - docs/: Updated configuration guide * fix: Improve clarification conversation continuity - Add comprehensive conversation history to clarification context - Include previous exchanges summary in system messages - Add explicit guidelines for continuing rounds in coordinator prompt - Prevent LLM from starting new topics during clarification - Ensure topic continuity across clarification rounds Fixes issue where LLM would restart clarification instead of building upon previous exchanges. 
* fix: Add conversation history to clarification context * fix: resolve clarification feature message to planner, prompt, test issues - Optimize coordinator.md prompt template for better clarification flow - Simplify final message sent to planner after clarification - Fix API key assertion issues in test_search.py * fix: Add configurable max_clarification_rounds and comprehensive tests - Add max_clarification_rounds parameter for external configuration - Add comprehensive test cases for clarification feature in test_app.py - Fixes issues found during interactive mode testing where: - Recursive call failed due to missing initial_state parameter - Clarification exited prematurely at max rounds - Incorrect logging of max rounds reached * Move clarification tests to test_nodes.py and add max_clarification_rounds to zh.json * fix: add max_clarification_rounds parameter passing from frontend to backend - Add max_clarification_rounds parameter in store.ts sendMessage function - Add max_clarification_rounds type definition in chat.ts - Ensure frontend settings page clarification rounds are correctly passed to backend * fix: refine clarification workflow state handling and coverage - Add clarification history reconstruction - Fix clarified topic accumulation - Add clarified_research_topic state field - Preserve clarification state in recursive calls - Add comprehensive test coverage * refactor: optimize coordinator logic and type annotations - Simplify handoff topic logic in coordinator_node - Update type annotations from Tuple to tuple - Improve code readability and maintainability --------- Co-authored-by: Willem Jiang <willem.jiang@gmail.com>	2025-10-22 22:49:07 +08:00
Willem Jiang	d30c4d00d3	fix: convert crawl_tool dict return to JSON string for type consistency (#636) Keep fixing #631 This pull request updates the crawl_tool function to return its results as a JSON string instead of a dictionary, and adjusts the unit tests accordingly to handle the new return type. The changes ensure consistent serialization of output and proper validation in tests.	2025-10-21 10:00:33 +08:00
Willem Jiang	e2ff765460	fix: correct image result format for OpenAI compatibility to fix #632 (#634) - Change image result type from 'image' to 'image_url' to match OpenAI API expectations - Wrap image URL in dict structure: {"url": "..."} instead of plain string - Update SearchResultPostProcessor to handle dict-based image_url during duplicate removal - Update tests to validate new image format This fixes the 400 error: Invalid value: 'image'. Supported values are: 'text', 'image_url'... Co-authored-by: Willem Jiang <143703838+willem-bd@users.noreply.github.com>	2025-10-20 23:14:09 +08:00
jimmyuconn1982	2510cc61de	feat: Add intelligent clarification feature in coordinate step for research queries (#613 ) * fix: support local models by making thought field optional in Plan model - Make thought field optional in Plan model to fix Pydantic validation errors with local models - Add Ollama configuration example to conf.yaml.example - Update documentation to include local model support - Improve planner prompt with better JSON format requirements Fixes local model integration issues where models like qwen3:14b would fail due to missing thought field in JSON output. * feat: Add intelligent clarification feature for research queries - Add multi-turn clarification process to refine vague research questions - Implement three-dimension clarification standard (Tech/App, Focus, Scope) - Add clarification state management in coordinator node - Update coordinator prompt with detailed clarification guidelines - Add UI settings to enable/disable clarification feature (disabled by default) - Update workflow to handle clarification rounds recursively - Add comprehensive test coverage for clarification functionality - Update documentation with clarification feature usage guide Key components: - src/graph/nodes.py: Core clarification logic and state management - src/prompts/coordinator.md: Detailed clarification guidelines - src/workflow.py: Recursive clarification handling - web/: UI settings integration - tests/: Comprehensive test coverage - docs/: Updated configuration guide * fix: Improve clarification conversation continuity - Add comprehensive conversation history to clarification context - Include previous exchanges summary in system messages - Add explicit guidelines for continuing rounds in coordinator prompt - Prevent LLM from starting new topics during clarification - Ensure topic continuity across clarification rounds Fixes issue where LLM would restart clarification instead of building upon previous exchanges. 
* fix: Add conversation history to clarification context * fix: resolve clarification feature message to planner, prompt, test issues - Optimize coordinator.md prompt template for better clarification flow - Simplify final message sent to planner after clarification - Fix API key assertion issues in test_search.py * fix: Add configurable max_clarification_rounds and comprehensive tests - Add max_clarification_rounds parameter for external configuration - Add comprehensive test cases for clarification feature in test_app.py - Fixes issues found during interactive mode testing where: - Recursive call failed due to missing initial_state parameter - Clarification exited prematurely at max rounds - Incorrect logging of max rounds reached * Move clarification tests to test_nodes.py and add max_clarification_rounds to zh.json	2025-10-14 13:35:57 +08:00
Fancy-hjyp	5f4eb38fdb	feat: add context compress (#590) * feat:Add context compress * feat: Add unit test * feat: add unit test for context manager * feat: add postprocessor param && code format * feat: add configuration guide * fix: fix the configuration_guide * fix: fix the unit test * fix: fix the default value * feat: add test and log for context_manager	2025-09-27 21:42:22 +08:00
HagonChan	c214999606	feat: add strategic_investment report style (#595) * add strategic_investment mode * make format * make lint * fix: repair lint-frontend	2025-09-24 09:50:36 +08:00
Gordon	1c27e0f2ae	feat: add support for searx/searxng (#253) * add searx/searxng support * nit * Fix indentation in search.py for readability * Clean up imports in search.py Removed unused imports from search.py --------- Co-authored-by: Willem Jiang <willem.jiang@gmail.com>	2025-09-22 18:54:30 +08:00
Fancy-hjyp	6bb0b95579	feat:support config tavily search results (#591) * feat:support config tavily search results * feat: support config tavily search results * feat: update the default value of include_images * fix: fix the test --------- Co-authored-by: Willem Jiang <willem.jiang@gmail.com>	2025-09-22 18:26:50 +08:00
jimma	eec8e4dd60	refactor(logging): add explicit error log message (#576) Co-authored-by: Willem Jiang <willem.jiang@gmail.com>	2025-09-12 22:09:08 +08:00
CHANGXUBO	db6c1bf7cb	fix: update TavilySearchWithImages to inherit from TavilySearchResults (#522)	2025-08-21 09:52:12 +08:00
zgjja	3b4e993531	feat: 1. replace black with ruff for formatting and sort import (#489) 2. use tavily from `langchain-tavily` rather than the older one from `langchain-community` Co-authored-by: Willem Jiang <willem.jiang@gmail.com>	2025-08-17 22:57:23 +08:00
Willem Jiang	9e691ecf20	fix: added configuration of python_repl (#503) * fix: added configuration of python_repl * fix the lint and unit test errors * fix the lint and unit test errors * fix:the lint check errors	2025-08-06 14:27:03 +08:00
HansleCho	bedf7d4af2	Feat: Add Wikipedia search engine (#478) * feat: add Wikipedia search engine * wikipedia * make format	2025-07-29 13:58:08 +08:00
DanielWalnut	6d8853b7c7	refine the research prompt (#460)	2025-07-22 14:49:04 +08:00
Willem Jiang	3c46201ff0	fix: fix the lint check errors of the main branch (#403)	2025-07-12 14:43:25 +08:00
yihong	2363b21447	fix: some lint fix using tools (#98) * fix: some lint fix using tools Signed-off-by: yihong0618 <zouzou0208@gmail.com> * fix: md lint Signed-off-by: yihong0618 <zouzou0208@gmail.com> * fix: some lint fix using tools Signed-off-by: yihong0618 <zouzou0208@gmail.com> * fix: address comments Signed-off-by: yihong0618 <zouzou0208@gmail.com> * fix: tests Signed-off-by: yihong0618 <zouzou0208@gmail.com> --------- Signed-off-by: yihong0618 <zouzou0208@gmail.com> Co-authored-by: Willem Jiang <willem.jiang@gmail.com>	2025-07-12 13:59:02 +08:00
HagonChan	dfd4712d9f	feat: add Domain Control Features for Tavily Search Engine (#401) * feat: add Domain Control Features for Tavily Search Engine * fixed * chore: update config.md	2025-07-12 08:53:51 +08:00
Willem Jiang	dcdd7288ed	test: add unit tests of the app (#305) * test: add unit tests in server * test: add unit tests of app.py in server * test: reformat the codes * test: add more tests to cover the exception part * test: add more tests on the server app part * fix: don't show the detail exception to the client * test: try to fix the CI test * fix: keep the TTS API call without exposure information * Fixed the unit test errors * Fixed the lint error	2025-06-18 14:13:05 +08:00
Willem Jiang	4c2fe2e7f5	test: add more unit tests of tools (#315) * test: add more test on test_tts.py * test: add unit test of search and retriever in tools * test: remove the main code of search.py * test: add the tavily_search unit test * reformat the codes * test: add unit tests of tools * Added the pytest-asyncio dependency * added the license header of test_tavily_search_api_wrapper.py	2025-06-12 20:43:32 +08:00
Xintao Wang	cda3870add	fix: enable proxy support in aiohttp by adding trust_env=True (#289)	2025-06-07 15:30:13 +08:00
Willem Jiang	45568ca95b	fix:added sanitizing check on the log message (#272) * fix:added sanitizing check on the log message * fix: reformat the codes	2025-06-03 11:50:54 +08:00
Willem Jiang	db3e74629f	fix: added permissions setting in the workflow (#273) * fix: added permissions setting in the workflow * fix: reformat the code of src/tools/retriever.py	2025-06-03 11:48:51 +08:00
JeffJiang	4ddd659d8d	feat: rag retrieving tool call result display (#263) * feat: local search tool call result display * chore: add file copyright * fix: miss edit plan interrupt feedback * feat: disable pasting html into input box	2025-05-29 19:52:34 +08:00
JeffJiang	462752b462	feat: RAG Integration (#238) * feat: add rag provider and retriever * feat: retriever tool * feat: add retriever tool to the researcher node * feat: add rag http apis * feat: new message input supports resource mentions * feat: new message input component support resource mentions * refactor: need_web_search to need_search * chore: RAG integration docs * chore: change example api host * fix: user message color in dark mode * fix: mentions style * feat: add local_search_tool to researcher prompt * chore: research prompt * fix: ragflow page size and reporter with * docs: ragflow integration and add acknowledgment projects * chore: format	2025-05-28 14:13:46 +08:00
DanielWalnut	8bbcdbe4de	feat: config max_search_results for search engine (#192) * feat: implement UI * feat: config max_search_results for search engine via api --------- Co-authored-by: Henry Li <henry1943@163.com>	2025-05-18 13:23:52 +08:00
laundry	3d5e579ebd	fix: fix start error when search engine is not tavily and env TAVILY_API_KEY not exist (#133) Change-Id: I58e865a11e89acaa3c0b884578cd995d0e9b5422	2025-05-14 14:45:36 +08:00
Zhao Longjie	9266201fe5	fix: background investigator node support more search engine (#75) Change-Id: I030a2b9218dfbda2dd2383b7a73266dd7de589c7	2025-05-12 20:15:47 +08:00
Li Xin	6ffe46e39b	feat: support images in the search results	2025-04-19 09:57:02 +08:00
He Tao	a6ab97c970	feat: integrate volcengine tts functionality	2025-04-18 15:28:31 +08:00
He Tao	6937abcd91	chore: add license headers	2025-04-17 11:34:42 +08:00
He Tao	76fd04df22	chore: change the project name	2025-04-17 11:17:03 +08:00
He Tao	a55c357d7f	feat: include raw content for tavily search	2025-04-12 12:01:30 +08:00
He Tao	3a342a62ba	feat: support arxiv & brave search	2025-04-11 15:37:55 +08:00
He Tao	1195612c47	feat: support duckduckgo search engine	2025-04-10 11:45:04 +08:00
He Tao	03798ded08	feat: lite deep researcher implementation	2025-04-09 20:32:16 +08:00

45 Commits
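
Note on commit 052490b116: its message describes a convention in which the search tool always returns a JSON string (`{"error": repr(e)}` on failure, `json.dumps(...)` of the results on success) and the caller falls back to an empty list instead of `None`. The following is a minimal sketch of that convention only. It is not the repository's code, and the names `run_search_as_json` and `parse_search_output` are hypothetical.

```python
import json
from typing import Any, Callable


def run_search_as_json(
    search: Callable[[str], list[dict[str, Any]]], query: str
) -> str:
    """Run a search callable and always return a JSON string (hypothetical helper)."""
    try:
        results = search(query)
    except Exception as e:
        # Failure path: a JSON object with an "error" key, as the commit message describes.
        return json.dumps({"error": repr(e)})
    # Success path: the result list serialized to a JSON string.
    return json.dumps(results, ensure_ascii=False)


def parse_search_output(raw: str) -> list[dict[str, Any]]:
    """Parse the tool output, returning an empty list (never None) on any problem."""
    try:
        data = json.loads(raw)
    except json.JSONDecodeError:
        return []
    if isinstance(data, dict) and "error" in data:
        return []
    return data if isinstance(data, list) else []
```

Because both paths yield a string, a caller can wrap the output in a tool message without checking its type first, which is the consistency the commit message is aiming for.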