deer-flow

mirror of https://gitee.com/wanwujie/deer-flow synced 2026-04-09 00:34:46 +08:00

Author	SHA1	Message	Date
Willem Jiang	8d9d767051	feat(eval): add report quality evaluation module and UI integration (#776 ) * feat(eval): add report quality evaluation module Addresses issue #773 - How to evaluate generated report quality objectively. This module provides two evaluation approaches: 1. Automated metrics (no LLM required): - Citation count and source diversity - Word count compliance per report style - Section structure validation - Image inclusion tracking 2. LLM-as-Judge evaluation: - Factual accuracy scoring - Completeness assessment - Coherence evaluation - Relevance and citation quality checks The combined evaluator provides a final score (1-10) and letter grade (A+ to F). Files added: - src/eval/__init__.py - src/eval/metrics.py - src/eval/llm_judge.py - src/eval/evaluator.py - tests/unit/eval/test_metrics.py - tests/unit/eval/test_evaluator.py * feat(eval): integrate report evaluation with web UI This commit adds the web UI integration for the evaluation module: Backend: - Add EvaluateReportRequest/Response models in src/server/eval_request.py - Add /api/report/evaluate endpoint to src/server/app.py Frontend: - Add evaluateReport API function in web/src/core/api/evaluate.ts - Create EvaluationDialog component with grade badge, metrics display, and optional LLM deep evaluation - Add evaluation button (graduation cap icon) to research-block.tsx toolbar - Add i18n translations for English and Chinese The evaluation UI allows users to: 1. View quick metrics-only evaluation (instant) 2. Optionally run deep LLM-based evaluation for detailed analysis 3. See grade (A+ to F), score (1-10), and metric breakdown * feat(eval): improve evaluation reliability and add LLM judge tests - Extract MAX_REPORT_LENGTH constant in llm_judge.py for maintainability - Add comprehensive unit tests for LLMJudge class (parse_response, calculate_weighted_score, evaluate with mocked LLM) - Pass reportStyle prop to EvaluationDialog for accurate evaluation criteria - Add researchQueries store map to reliably associate queries with research - Add getResearchQuery helper to retrieve query by researchId - Remove unused imports in test_metrics.py * fix(eval): use resolveServiceURL for evaluate API endpoint The evaluateReport function was using a relative URL '/api/report/evaluate' which sent requests to the Next.js server instead of the FastAPI backend. Changed to use resolveServiceURL() consistent with other API functions. * fix: improve type accuracy and React hooks in evaluation components - Fix get_word_count_target return type from Optional[Dict] to Dict since it always returns a value via default fallback - Fix useEffect dependency issue in EvaluationDialog using useRef to prevent unwanted re-evaluations - Add aria-label to GradeBadge for screen reader accessibility	2025-12-25 21:55:48 +08:00
Willem Jiang	04296cdf5a	feat: add resource upload support for RAG (#768 ) * feat: add resource upload support for RAG - Backend: Added ingest_file method to Retriever and MilvusRetriever - Backend: Added /api/rag/upload endpoint - Frontend: Added RAGTab in settings for uploading resources - Frontend: Updated translations and settings registration * Apply suggestions from code review Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Apply suggestions from code review * Apply suggestions from code review of src/rag/milvus.py --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-12-19 09:55:34 +08:00
Jiahe Wu	3e8f2ce3ad	feat(web): add enable_web_search frontend UI (#681 ) (#766 ) Co-authored-by: Willem Jiang <willem.jiang@gmail.com>	2025-12-17 23:36:32 +08:00
Jiahe Wu	a4f64abd1f	feat(web): add multi-format report export (Markdown, HTML, PDF, Word,… (#756 ) * feat(web): add multi-format report export (Markdown, HTML, PDF, Word, Image) * fix: correct import order for docx (lint error) * fix(web): address Copilot review comments for multi-format export - Add i18n support for dropdown menu items (en/zh) - Add DOMPurify for HTML sanitization (XSS protection) - Fix async handling for canvas.toBlob with Promise wrapper - Add toast notifications for export errors - Fix Tooltip + DropdownMenuTrigger nesting (accessibility) - Ensure container cleanup in finally block * fix(web): enhance markdown parsing for PDF and Word export - Add list support (bullet and numbered) for PDF export - Add parseInlineMarkdown helper for Word export to handle bold, italic, code, links - Add list support for Word export (bullet and numbered) - Address Copilot review comments from PR #756 * fix(web): address PR review feedback for multi-format export - Extract PDF formatting magic numbers into PDF_CONSTANTS - Add Tooltip wrapper for download dropdown button - Reduce triggerDownload cleanup timeout from 1000ms to 100ms - Use marked.Lexer.lexInline for robust markdown parsing - Add console.warn for image export cleanup errors - Add numbering config for Word document ordered lists - Fix CSS class typo: px-5pb-20 -> px-5 pb-20 - Remove unreachable dead code in parseInlineMarkdown --------- Co-authored-by: Willem Jiang <willem.jiang@gmail.com>	2025-12-16 09:06:24 +08:00
Qiyuan Jiao	a38c8584d7	feat: add edit and refresh functionality for MCP servers in settings tab (#680 ) * feat: add edit and refresh functionality for MCP servers in settings tab * feat: fix lint error and enhance MCP server dialog with validation and error handling * fix: add missing newline at the end of en.json file * feat: only refreshing specific servers * feat: add validation messages for MCP server configuration and improve server update logic	2025-11-06 10:38:45 +08:00
jimmyuconn1982	2510cc61de	feat: Add intelligent clarification feature in coordinate step for research queries (#613 ) * fix: support local models by making thought field optional in Plan model - Make thought field optional in Plan model to fix Pydantic validation errors with local models - Add Ollama configuration example to conf.yaml.example - Update documentation to include local model support - Improve planner prompt with better JSON format requirements Fixes local model integration issues where models like qwen3:14b would fail due to missing thought field in JSON output. * feat: Add intelligent clarification feature for research queries - Add multi-turn clarification process to refine vague research questions - Implement three-dimension clarification standard (Tech/App, Focus, Scope) - Add clarification state management in coordinator node - Update coordinator prompt with detailed clarification guidelines - Add UI settings to enable/disable clarification feature (disabled by default) - Update workflow to handle clarification rounds recursively - Add comprehensive test coverage for clarification functionality - Update documentation with clarification feature usage guide Key components: - src/graph/nodes.py: Core clarification logic and state management - src/prompts/coordinator.md: Detailed clarification guidelines - src/workflow.py: Recursive clarification handling - web/: UI settings integration - tests/: Comprehensive test coverage - docs/: Updated configuration guide * fix: Improve clarification conversation continuity - Add comprehensive conversation history to clarification context - Include previous exchanges summary in system messages - Add explicit guidelines for continuing rounds in coordinator prompt - Prevent LLM from starting new topics during clarification - Ensure topic continuity across clarification rounds Fixes issue where LLM would restart clarification instead of building upon previous exchanges. * fix: Add conversation history to clarification context * fix: resolve clarification feature message to planer, prompt, test issues - Optimize coordinator.md prompt template for better clarification flow - Simplify final message sent to planner after clarification - Fix API key assertion issues in test_search.py * fix: Add configurable max_clarification_rounds and comprehensive tests - Add max_clarification_rounds parameter for external configuration - Add comprehensive test cases for clarification feature in test_app.py - Fixes issues found during interactive mode testing where: - Recursive call failed due to missing initial_state parameter - Clarification exited prematurely at max rounds - Incorrect logging of max rounds reached * Move clarification tests to test_nodes.py and add max_clarification_rounds to zh.json	2025-10-14 13:35:57 +08:00
HagonChan	c214999606	feat: add strategic_investment report style (#595 ) * add strategic_investment mode * make format * make lint * fix: repair lint-frontend	2025-09-24 09:50:36 +08:00
orifake	8bdc6bfa2d	fix: missing i18n message (#410 ) Co-authored-by: Willem Jiang <willem.jiang@gmail.com>	2025-07-14 18:56:17 +08:00
HagonChan	afbcdd68d8	fix: add missing translation for chat.page (#409 ) * fix: Error: MISSING_MESSAGE: Could not resolve chat.page` in messages for locale 'en' Fixed a `MISSING_MESSAGE` error that was occurring on the chat page due to missing translation keys for `chat.page` in the internationalization messages. * Update en.json	2025-07-14 18:54:01 +08:00
Willem Jiang	70b86d8464	feat: add the Chinese i8n support on the setting table (#404 ) * feat: Added i8n to the mcp table * feat: Added i8n to the about table	2025-07-12 21:28:08 +08:00
johnny0120	e1187d7d02	feat: add i18n support and add Chinese (#372 ) * feat: add i18n support and add Chinese * fix: resolve conflicts * Update en.json with cancle settings * Update zh.json with settngs cancle --------- Co-authored-by: johnny0120 <15564476+johnny0120@users.noreply.github.com> Co-authored-by: Willem Jiang <willem.jiang@gmail.com> Co-authored-by: Willem Jiang <143703838+willem-bd@users.noreply.github.com>	2025-07-12 15:18:28 +08:00

11 Commits