backend/src/models/factory.py

import logging

from langchain.chat_models import BaseChatModel

from src.config import get_app_config, get_tracing_config, is_tracing_enabled
from src.reflection import resolve_class

logger = logging.getLogger(__name__)


def create_chat_model(name: str | None = None, thinking_enabled: bool = False, **kwargs) -> BaseChatModel:
    """Create a chat model instance from the config.

    Args:
        name: The name of the model to create. If None, the first model in the config will be used.

    Returns:
        A chat model instance.
    """
    config = get_app_config()
    if name is None:
        name = config.models[0].name
    model_config = config.get_model_config(name)
    if model_config is None:
        raise ValueError(f"Model {name} not found in config") from None
    model_class = resolve_class(model_config.use, BaseChatModel)
    model_settings_from_config = model_config.model_dump(
        exclude_none=True,
        exclude={
            "use",
            "name",
            "display_name",
            "description",
            "supports_thinking",
            "supports_reasoning_effort",
            "when_thinking_enabled",
            "thinking",
            "supports_vision",
        },
    )
    # Compute effective when_thinking_enabled by merging in the `thinking` shortcut field.
    # The `thinking` shortcut is equivalent to setting when_thinking_enabled["thinking"].
    has_thinking_settings = (model_config.when_thinking_enabled is not None) or (model_config.thinking is not None)
    effective_wte: dict = dict(model_config.when_thinking_enabled) if model_config.when_thinking_enabled else {}
    if model_config.thinking is not None:
        merged_thinking = {**(effective_wte.get("thinking") or {}), **model_config.thinking}
        effective_wte = {**effective_wte, "thinking": merged_thinking}
    if thinking_enabled and has_thinking_settings:
        if not model_config.supports_thinking:
            raise ValueError(f"Model {name} does not support thinking. Set `supports_thinking` to true in the `config.yaml` to enable thinking.") from None
        if effective_wte:
            model_settings_from_config.update(effective_wte)
    if not thinking_enabled and has_thinking_settings:
        if effective_wte.get("extra_body", {}).get("thinking", {}).get("type"):
            # OpenAI-compatible gateway: thinking is nested under extra_body
            kwargs.update({"extra_body": {"thinking": {"type": "disabled"}}})
            kwargs.update({"reasoning_effort": "minimal"})
        elif effective_wte.get("thinking", {}).get("type"):
            # Native langchain_anthropic: thinking is a direct constructor parameter
            kwargs.update({"thinking": {"type": "disabled"}})
    if not model_config.supports_reasoning_effort:
        kwargs.update({"reasoning_effort": None})

    model_instance = model_class(**kwargs, **model_settings_from_config)

    if is_tracing_enabled():
        try:
            from langchain_core.tracers.langchain import LangChainTracer

            tracing_config = get_tracing_config()
            tracer = LangChainTracer(
                project_name=tracing_config.project,
            )
            existing_callbacks = model_instance.callbacks or []
            model_instance.callbacks = [*existing_callbacks, tracer]
            logger.debug(f"LangSmith tracing attached to model '{name}' (project='{tracing_config.project}')")
        except Exception as e:
            logger.warning(f"Failed to attach LangSmith tracing to model '{name}': {e}")
    return model_instance
feat: add LangSmith tracing integration (#878) * feat: add LangSmith tracing integration Add optional LangSmith tracing support that can be enabled via environment variables (LANGSMITH_TRACING, LANGSMITH_API_KEY, LANGSMITH_PROJECT, LANGSMITH_ENDPOINT). When enabled, a LangChainTracer callback is attached to chat models and run metadata is injected for trace tagging. Co-Authored-By: Claude <noreply@anthropic.com> * Update backend/src/config/tracing_config.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update backend/src/agents/lead_agent/agent.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update backend/src/agents/lead_agent/agent.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update backend/src/models/factory.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Add threading lock to ensure thread-safe access to tracing configuration --------- Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: Willem Jiang <willem.jiang@gmail.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> 2026-02-21 16:41:34 +08:00			`import logging`
feat(subagents): make subagent timeout configurable via config.yaml (#897) * feat(subagents): make subagent timeout configurable via config.yaml - Add SubagentsAppConfig supporting global and per-agent timeout_seconds - Load subagents config section in AppConfig.from_file() - Registry now applies config.yaml overrides without mutating builtin defaults - Polling safety-net in task_tool is now dynamic (execution timeout + 60s buffer) - Document subagents section in config.example.yaml - Add make test command and enforce TDD policy in CLAUDE.md - Add 38 unit tests covering config validation, timeout resolution, registry override behavior, and polling timeout formula Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * feat(subagents): add logging for subagent timeout config and execution - Log loaded timeout config (global default + per-agent overrides) on startup - Log debug message in registry when config.yaml overrides a builtin timeout - Include timeout in executor's async execution start log - Log effective timeout and polling limit when a task is dispatched - Fix UnboundLocalError: move max_poll_count assignment before logger.info Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * ci(backend): add lint step and run all unit tests via Makefile Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix lint --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> 2026-02-25 08:39:29 +08:00
feat: add model modules 2026-01-14 07:16:45 +08:00			`from langchain.chat_models import BaseChatModel`

feat: add LangSmith tracing integration (#878) * feat: add LangSmith tracing integration Add optional LangSmith tracing support that can be enabled via environment variables (LANGSMITH_TRACING, LANGSMITH_API_KEY, LANGSMITH_PROJECT, LANGSMITH_ENDPOINT). When enabled, a LangChainTracer callback is attached to chat models and run metadata is injected for trace tagging. Co-Authored-By: Claude <noreply@anthropic.com> * Update backend/src/config/tracing_config.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update backend/src/agents/lead_agent/agent.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update backend/src/agents/lead_agent/agent.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update backend/src/models/factory.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Add threading lock to ensure thread-safe access to tracing configuration --------- Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: Willem Jiang <willem.jiang@gmail.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> 2026-02-21 16:41:34 +08:00			`from src.config import get_app_config, get_tracing_config, is_tracing_enabled`
feat: add model modules 2026-01-14 07:16:45 +08:00			`from src.reflection import resolve_class`

feat: add LangSmith tracing integration (#878) * feat: add LangSmith tracing integration Add optional LangSmith tracing support that can be enabled via environment variables (LANGSMITH_TRACING, LANGSMITH_API_KEY, LANGSMITH_PROJECT, LANGSMITH_ENDPOINT). When enabled, a LangChainTracer callback is attached to chat models and run metadata is injected for trace tagging. Co-Authored-By: Claude <noreply@anthropic.com> * Update backend/src/config/tracing_config.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update backend/src/agents/lead_agent/agent.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update backend/src/agents/lead_agent/agent.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update backend/src/models/factory.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Add threading lock to ensure thread-safe access to tracing configuration --------- Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: Willem Jiang <willem.jiang@gmail.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> 2026-02-21 16:41:34 +08:00			`logger = logging.getLogger(__name__)`
feat: add model modules 2026-01-14 07:16:45 +08:00
feat(subagents): make subagent timeout configurable via config.yaml (#897) * feat(subagents): make subagent timeout configurable via config.yaml - Add SubagentsAppConfig supporting global and per-agent timeout_seconds - Load subagents config section in AppConfig.from_file() - Registry now applies config.yaml overrides without mutating builtin defaults - Polling safety-net in task_tool is now dynamic (execution timeout + 60s buffer) - Document subagents section in config.example.yaml - Add make test command and enforce TDD policy in CLAUDE.md - Add 38 unit tests covering config validation, timeout resolution, registry override behavior, and polling timeout formula Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * feat(subagents): add logging for subagent timeout config and execution - Log loaded timeout config (global default + per-agent overrides) on startup - Log debug message in registry when config.yaml overrides a builtin timeout - Include timeout in executor's async execution start log - Log effective timeout and polling limit when a task is dispatched - Fix UnboundLocalError: move max_poll_count assignment before logger.info Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * ci(backend): add lint step and run all unit tests via Makefile Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix lint --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> 2026-02-25 08:39:29 +08:00
style: format 2026-01-14 09:21:19 +08:00			`def create_chat_model(name: str \| None = None, thinking_enabled: bool = False, **kwargs) -> BaseChatModel:`
feat: add model modules 2026-01-14 07:16:45 +08:00			`"""Create a chat model instance from the config.`

			`Args:`
			`name: The name of the model to create. If None, the first model in the config will be used.`

			`Returns:`
			`A chat model instance.`
			`"""`
			`config = get_app_config()`
			`if name is None:`
			`name = config.models[0].name`
			`model_config = config.get_model_config(name)`
			`if model_config is None:`
			`raise ValueError(f"Model {name} not found in config") from None`
			`model_class = resolve_class(model_config.use, BaseChatModel)`
			`model_settings_from_config = model_config.model_dump(`
			`exclude_none=True,`
			`exclude={`
			`"use",`
			`"name",`
			`"display_name",`
			`"description",`
			`"supports_thinking",`
feat: add reasoning_effort configuration support for Doubao/GPT-5 models (#947) * feat: Add reasoning effort configuration support * Add `reasoning_effort` parameter to model config and agent initialization * Support reasoning effort levels (minimal/low/medium/high) for Doubao/GPT-5 models * Add UI controls in input box for reasoning effort selection * Update doubao-seed-1.8 example config with reasoning effort support Fixes & Cleanup: * Ensure UTF-8 encoding for file operations * Remove unused imports * fix: set reasoning_effort to None for unsupported models * fix: unit test error * Update frontend/src/components/workspace/input-box.tsx Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Co-authored-by: Willem Jiang <willem.jiang@gmail.com> 2026-03-02 20:49:41 +08:00			`"supports_reasoning_effort",`
feat: add model modules 2026-01-14 07:16:45 +08:00			`"when_thinking_enabled",`
feat: add thinking settings to compatible anthropic api (#1017) 2026-03-08 20:18:21 +08:00			`"thinking",`
feat: add view_image tool and optimize web fetch tools Add image viewing capability for vision-enabled models with ViewImageMiddleware and view_image_tool. Limit web_fetch tool output to 4096 characters to prevent excessive content. Update model config to support vision capability flag. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> 2026-01-29 13:44:04 +08:00			`"supports_vision",`
feat: add model modules 2026-01-14 07:16:45 +08:00			`},`
			`)`
feat: add thinking settings to compatible anthropic api (#1017) 2026-03-08 20:18:21 +08:00			# Compute effective when_thinking_enabled by merging in the `thinking` shortcut field.
			# The `thinking` shortcut is equivalent to setting when_thinking_enabled["thinking"].
			`has_thinking_settings = (model_config.when_thinking_enabled is not None) or (model_config.thinking is not None)`
			`effective_wte: dict = dict(model_config.when_thinking_enabled) if model_config.when_thinking_enabled else {}`
			`if model_config.thinking is not None:`
			`merged_thinking = {(effective_wte.get("thinking") or {}), model_config.thinking}`
			`effective_wte = {**effective_wte, "thinking": merged_thinking}`
			`if thinking_enabled and has_thinking_settings:`
feat: add model modules 2026-01-14 07:16:45 +08:00			`if not model_config.supports_thinking:`
style: format 2026-01-14 09:21:19 +08:00			raise ValueError(f"Model {name} does not support thinking. Set `supports_thinking` to true in the `config.yaml` to enable thinking.") from None
feat: add thinking settings to compatible anthropic api (#1017) 2026-03-08 20:18:21 +08:00			`if effective_wte:`
			`model_settings_from_config.update(effective_wte)`
			`if not thinking_enabled and has_thinking_settings:`
			`if effective_wte.get("extra_body", {}).get("thinking", {}).get("type"):`
			`# OpenAI-compatible gateway: thinking is nested under extra_body`
			`kwargs.update({"extra_body": {"thinking": {"type": "disabled"}}})`
			`kwargs.update({"reasoning_effort": "minimal"})`
			`elif effective_wte.get("thinking", {}).get("type"):`
			`# Native langchain_anthropic: thinking is a direct constructor parameter`
			`kwargs.update({"thinking": {"type": "disabled"}})`
feat: add reasoning_effort configuration support for Doubao/GPT-5 models (#947) * feat: Add reasoning effort configuration support * Add `reasoning_effort` parameter to model config and agent initialization * Support reasoning effort levels (minimal/low/medium/high) for Doubao/GPT-5 models * Add UI controls in input box for reasoning effort selection * Update doubao-seed-1.8 example config with reasoning effort support Fixes & Cleanup: * Ensure UTF-8 encoding for file operations * Remove unused imports * fix: set reasoning_effort to None for unsupported models * fix: unit test error * Update frontend/src/components/workspace/input-box.tsx Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Co-authored-by: Willem Jiang <willem.jiang@gmail.com> 2026-03-02 20:49:41 +08:00			`if not model_config.supports_reasoning_effort:`
			`kwargs.update({"reasoning_effort": None})`
feat: add thinking settings to compatible anthropic api (#1017) 2026-03-08 20:18:21 +08:00
feat: add model modules 2026-01-14 07:16:45 +08:00			`model_instance = model_class(kwargs, model_settings_from_config)`
feat: add LangSmith tracing integration (#878) * feat: add LangSmith tracing integration Add optional LangSmith tracing support that can be enabled via environment variables (LANGSMITH_TRACING, LANGSMITH_API_KEY, LANGSMITH_PROJECT, LANGSMITH_ENDPOINT). When enabled, a LangChainTracer callback is attached to chat models and run metadata is injected for trace tagging. Co-Authored-By: Claude <noreply@anthropic.com> * Update backend/src/config/tracing_config.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update backend/src/agents/lead_agent/agent.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update backend/src/agents/lead_agent/agent.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update backend/src/models/factory.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Add threading lock to ensure thread-safe access to tracing configuration --------- Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: Willem Jiang <willem.jiang@gmail.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> 2026-02-21 16:41:34 +08:00
			`if is_tracing_enabled():`
			`try:`
			`from langchain_core.tracers.langchain import LangChainTracer`

			`tracing_config = get_tracing_config()`
			`tracer = LangChainTracer(`
			`project_name=tracing_config.project,`
			`)`
			`existing_callbacks = model_instance.callbacks or []`
			`model_instance.callbacks = [*existing_callbacks, tracer]`
feat(subagents): make subagent timeout configurable via config.yaml (#897) * feat(subagents): make subagent timeout configurable via config.yaml - Add SubagentsAppConfig supporting global and per-agent timeout_seconds - Load subagents config section in AppConfig.from_file() - Registry now applies config.yaml overrides without mutating builtin defaults - Polling safety-net in task_tool is now dynamic (execution timeout + 60s buffer) - Document subagents section in config.example.yaml - Add make test command and enforce TDD policy in CLAUDE.md - Add 38 unit tests covering config validation, timeout resolution, registry override behavior, and polling timeout formula Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * feat(subagents): add logging for subagent timeout config and execution - Log loaded timeout config (global default + per-agent overrides) on startup - Log debug message in registry when config.yaml overrides a builtin timeout - Include timeout in executor's async execution start log - Log effective timeout and polling limit when a task is dispatched - Fix UnboundLocalError: move max_poll_count assignment before logger.info Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * ci(backend): add lint step and run all unit tests via Makefile Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix lint --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> 2026-02-25 08:39:29 +08:00			`logger.debug(f"LangSmith tracing attached to model '{name}' (project='{tracing_config.project}')")`
feat: add LangSmith tracing integration (#878) * feat: add LangSmith tracing integration Add optional LangSmith tracing support that can be enabled via environment variables (LANGSMITH_TRACING, LANGSMITH_API_KEY, LANGSMITH_PROJECT, LANGSMITH_ENDPOINT). When enabled, a LangChainTracer callback is attached to chat models and run metadata is injected for trace tagging. Co-Authored-By: Claude <noreply@anthropic.com> * Update backend/src/config/tracing_config.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update backend/src/agents/lead_agent/agent.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update backend/src/agents/lead_agent/agent.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update backend/src/models/factory.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Add threading lock to ensure thread-safe access to tracing configuration --------- Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: Willem Jiang <willem.jiang@gmail.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> 2026-02-21 16:41:34 +08:00			`except Exception as e:`
			`logger.warning(f"Failed to attach LangSmith tracing to model '{name}': {e}")`
feat: add model modules 2026-01-14 07:16:45 +08:00			`return model_instance`