sub2api

mirror of https://gitee.com/wanwujie/sub2api synced 2026-04-03 06:52:13 +08:00

Author	SHA1	Message	Date
alfadb	7d26b81075	fix: address review - add missing whitespace patterns and narrow error matching	2026-03-18 14:31:57 +08:00
alfadb	b8ada63ac3	fix: strip empty text blocks in retry filter and fix error pattern matching Empty text blocks ({"type":"text","text":""}) cause Anthropic upstream to return 400: "text content blocks must be non-empty". This was not caught by the existing error detection pattern in isThinkingBlockSignatureError, nor handled by FilterThinkingBlocksForRetry. - Add empty text block stripping to FilterThinkingBlocksForRetry - Fix isThinkingBlockSignatureError to match new Anthropic error format - Add fast-path byte patterns to avoid unnecessary JSON parsing Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-18 14:20:00 +08:00
shaw	a14babdc73	fix: 兼容 Claude Code v2.1.78+ 新 JSON 格式 metadata.user_id Claude Code v2.1.78 起将 metadata.user_id 从拼接字符串改为 JSON：旧: user_{hex}_account_{uuid}_session_{uuid} 新: {"device_id":"...","account_uuid":"...","session_id":"..."} 新增集中解析/格式化模块 metadata_userid.go： - ParseMetadataUserID: 自动识别两种格式，提取 DeviceID/AccountUUID/SessionID - FormatMetadataUserID: 根据 UA 版本输出对应格式（>= 2.1.78 输出 JSON） - ExtractCLIVersion: 从 UA 提取版本号，消除与 ClaudeCodeValidator.ExtractVersion 的重复修改消费者统一使用新模块： - claude_code_validator: 用 ParseMetadataUserID 替代只匹配旧格式的 userIDPattern - identity_service: RewriteUserID/WithMasking 增加 fingerprintUA 参数，解析用 ParseMetadataUserID，输出用 FormatMetadataUserID（版本感知） - gateway_service: GenerateSessionHash 用 ParseMetadataUserID 提取 session_id， buildOAuthMetadataUserID 用 FormatMetadataUserID 输出版本匹配格式，两处 RewriteUserIDWithMasking 调用传入 fp.UserAgent - account_test_service: generateSessionString 改用 FormatMetadataUserID，自动跟随 DefaultHeaders UA 版本删除三个旧正则: userIDPattern, userIDRegex, sessionIDRegex 统一 hex 匹配为 [a-fA-F0-9]，修复旧 userIDRegex 只匹配小写的不一致	2026-03-18 11:08:58 +08:00
Ethan0x0000	1b79b0f3ff	feat: add InboundEndpoint/UpstreamEndpoint fields to non-OpenAI usage records Extend RecordUsageInput and RecordUsageLongContextInput structs with InboundEndpoint and UpstreamEndpoint so that Claude, Gemini, and Sora handlers can record endpoint info alongside OpenAI handlers. Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-opencode) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-03-15 22:13:22 +08:00
erio	cfe72159d0	feat(ops): add ignore insufficient balance errors toggle and extract error constants - Add 5th error filter switch IgnoreInsufficientBalanceErrors to suppress upstream insufficient balance / insufficient_quota errors from ops log - Extract hardcoded error strings into package-level constants for shouldSkipOpsErrorLog, normalizeOpsErrorType, classifyOpsPhase, and classifyOpsIsBusinessLimited - Define ErrNoAvailableAccounts sentinel error and replace all errors.New("no available accounts") call sites - Update tests to use require.ErrorIs with the sentinel error	2026-03-15 17:26:18 +08:00
YanzheL	1bff2292a6	fix: extract and log Claude output_config.effort in usage records Claude's output_config.effort parameter (low/medium/high/max) was not being extracted from requests or logged in the reasoning_effort column of usage logs. Only the OpenAI path populated this field. Changes: - Extract output_config.effort in ParseGatewayRequest - Add ReasoningEffort field to ForwardResult - Populate reasoning_effort in both RecordUsage and RecordUsageWithLongContext - Guard against overwriting service-set effort values in handler - Update stale comments that described reasoning_effort as OpenAI-only - Add unit tests for extraction, normalization, and persistence	2026-03-15 12:55:37 +08:00
InCerry	8f0ea7a02d	Merge branch 'main' into fix/enc_coot	2026-03-14 18:46:33 +08:00
SsageParuders	4644af2ccc	refactor: merge bedrock-apikey into bedrock with auth_mode credential Consolidate two separate channel types (bedrock + bedrock-apikey) into a single "AWS Bedrock" channel. Authentication mode is now distinguished by credentials.auth_mode ("sigv4" \| "apikey") instead of separate types. Backend: - Remove AccountTypeBedrockAPIKey constant - IsBedrock() simplified; IsBedrockAPIKey() checks auth_mode - Add IsAPIKeyOrBedrock() helper to eliminate repeated type checks - Extend pool mode, quota scheduling, and billing to bedrock - Add RetryableOnSameAccount to handleBedrockUpstreamErrors - Add "bedrock" scope to Beta Policy for independent control Frontend: - Merge two buttons into one "AWS Bedrock" with auth mode radio - Badge displays "Anthropic \| AWS" - Pool mode and quota limit UI available for bedrock - Quota display in account list (usage bars, capacity badges, reset) - Remove all bedrock-apikey type references	2026-03-14 17:13:30 +08:00
InCerry	e4a4dfd038	Merge remote-tracking branch 'origin/main' into fix/enc_coot # Conflicts: # backend/internal/service/openai_gateway_service.go	2026-03-14 13:04:24 +08:00
InCerry	2666422b99	fix: handle invalid encrypted content error and retry logic.	2026-03-14 11:42:42 +08:00
Wesley Liddick	e6d59216d4	Merge pull request #975 from Ylarod/aws-bedrock sub2api: add bedrock support	2026-03-14 10:52:24 +08:00
Ylarod	e90ec847b6	fix lint	2026-03-13 19:15:27 +08:00
Ylarod	11f7b83522	sub2api: add bedrock support	2026-03-13 17:00:16 +08:00
ius	6a685727d0	fix: harden usage billing idempotency and backpressure	2026-03-12 18:38:09 +08:00
ius	8d4d3b03bb	fix: remove unused gateway usage helpers	2026-03-12 17:08:57 +08:00
ius	b764d3b8f6	Merge remote-tracking branch 'origin/main' into feat/billing-ledger-decouple-usage-log-20260312	2026-03-12 16:53:28 +08:00
ius	611fd884bd	feat: decouple billing correctness from usage log batching	2026-03-12 16:53:18 +08:00
amberwarden	6e90ec6111	fix: 为 Anthropic Messages API 流式转发添加下游 keepalive ping Anthropic Messages API 的流式转发路径（gateway_service.go）在上游长时间无数据时（如 Opus extended thinking 阶段）不会向下游发送任何内容，导致 Cloudflare Tunnel 等代理因连接空闲而断开。复用已有的 StreamKeepaliveInterval 配置（默认 10 秒），在 select 循环中添加 keepalive 分支，定时发送 Anthropic 原生格式的 ping 事件保活，与 OpenAI 兼容路径的实现模式保持一致。 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-11 18:43:03 +08:00
shaw	00a0a12138	feat: Anthropic平台可配置 anthropic-beta 策略	2026-03-10 11:20:10 +08:00
shaw	a461538d58	fix: 修复gpt->claude转换无法命中codex缓存问题	2026-03-09 15:08:37 +08:00
shaw	ebe6f418f3	fix: gpt->claude格式转换对齐effort映射和fast	2026-03-09 11:42:35 +08:00
Wesley Liddick	6cb8980404	Merge pull request #807 from touwaeriol/fix/openai-passthrough-v2 fix(openai): remove misplaced passthrough check from isModelSupportedByAccount	2026-03-09 09:06:35 +08:00
erio	91ef085d7d	fix: increase SSE scanner max line size from 40MB to 500MB 4K image base64 data can exceed 40MB limit, causing "bufio.Scanner: token too long" errors. Scanner is adaptive (starts at 64KB, grows as needed), so increasing the cap has no impact on normal responses.	2026-03-09 08:56:54 +08:00
Wesley Liddick	97aaa24733	Merge pull request #858 from james-6-23/fix/pool-mode-03bf3485 支持 API Key 上游池模式的同账号重试次数配置与自定义错误策略	2026-03-09 08:48:53 +08:00
Wesley Liddick	01180b316f	Merge pull request #841 from touwaeriol/feature/account-periodic-quota feat(account): 为 API Key 账号新增日/周周期性配额限制	2026-03-08 20:34:15 +08:00
kyx236	e643fc382c	feat: 支持 API Key 上游池模式同账号重试次数配置与自定义错误策略	2026-03-08 14:12:17 +08:00
shaw	a3791104f9	feat: 支持后台设置是否启用整流开关	2026-03-07 21:55:38 +08:00
erio	1ee17383f8	feat(account): add daily/weekly periodic quota limits for API Key accounts Extend the existing total quota limit with daily and weekly periodic dimensions. Each dimension is independently configurable and uses lazy reset — when the period expires, usage is automatically reset to zero on the next increment. Any dimension exceeding its limit will pause the account from scheduling. Backend: - Add GetQuotaDailyLimit/Used, GetQuotaWeeklyLimit/Used, HasAnyQuotaLimit - Rewrite IncrementQuotaUsed with atomic CTE SQL for 3-dimension update - Rewrite ResetQuotaUsed to clear all dimensions and period timestamps - Update postUsageBilling to use HasAnyQuotaLimit() - Preserve daily/weekly used values on account edit Frontend: - Refactor QuotaLimitCard from single v-model to 3-dimension props - Add QuotaBadge component for compact D/W/$ display - Update AccountCapacityCell with per-dimension badges - Update Create/Edit modals with daily/weekly quota fields - Update AccountActionMenu hasQuotaLimit to check all dimensions - Add i18n strings for daily/weekly/total quota labels Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-07 19:06:59 +08:00
Wesley Liddick	afbe8bf001	Merge pull request #809 from alfadb/feature/openai-messages feat(openai): 添加 /v1/messages 端点和 API 兼容层	2026-03-06 20:16:06 +08:00
Wesley Liddick	005d0c5f53	Merge pull request #815 from mt21625457/pr/openai-user-group-rate-upstream fix(openai): 统一专属倍率计费链路并补齐回归测试	2026-03-06 17:33:09 +08:00
yangjianbo	a18bbb5f2f	fix(openai): 统一专属倍率计费链路并补齐回归测试抽取共享的用户分组专属倍率解析器，统一缓存、singleflight 与回退逻辑。\n\n让 OpenAI 独立计费链路复用专属倍率解析，修复 usage 记录与实际扣费未命中用户专属倍率的问题。\n\n补齐 OpenAI 计费与解析器单元测试，并修复全量回归中暴露的 lint 阻塞项。\n\nCo-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-06 16:47:51 +08:00
erio	c28f691f32	fix(openai): remove misplaced passthrough check from isModelSupportedByAccount isModelSupportedByAccount 不被 OpenAI 调度路径调用， OpenAI /responses 和 /chat/completions 走的是 openai_account_scheduler.go，透传短路已在 PR #806 的第二个 commit 中正确添加到该文件。此处的检查是多余的死代码，因为 OpenAI 账号不会走到 isModelSupportedByAccount 的这个分支。	2026-03-06 14:32:08 +08:00
alfadb	ff1f114989	feat(openai): add /v1/messages endpoint and API compatibility layer Add Anthropic Messages API support for OpenAI platform groups, enabling clients using Claude-style /v1/messages format to access OpenAI accounts through automatic protocol conversion. - Add apicompat package with type definitions and bidirectional converters (Anthropic ↔ Chat, Chat ↔ Responses, Anthropic ↔ Responses) - Implement /v1/messages endpoint for OpenAI gateway with streaming support - Add model mapping UI for OpenAI OAuth accounts (whitelist + mapping modes) - Support prompt caching fields and codex OAuth transforms - Fix tool call ID conversion for Responses API (fc_ prefix) - Ensure function_call_output has non-empty output field Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-06 14:29:22 +08:00
erio	79ae15d5e8	fix: OpenAI passthrough accounts bypass model mapping check 透传模式账号仅替换认证，应允许所有模型通过。之前调度阶段的 isModelSupportedByAccount 不感知透传模式，导致 model_mapping 中未配置的新模型（如 gpt-5.4）被拒绝返回 503。	2026-03-06 14:01:47 +08:00
Wesley Liddick	63a8c76946	Merge pull request #798 from touwaeriol/feature/account-load-factor feat: add account load_factor for scheduling load calculation	2026-03-06 09:42:10 +08:00
erio	0d6c1c7790	feat: add independent load_factor field for scheduling load calculation	2026-03-06 05:07:10 +08:00
erio	02dea7b09b	refactor: unify post-usage billing logic and fix account quota calculation - Extract postUsageBilling() to consolidate billing logic across GatewayService.RecordUsage, RecordUsageWithLongContext, and OpenAIGatewayService.RecordUsage, eliminating ~120 lines of duplicated code - Fix account quota to use TotalCost × accountRateMultiplier (was using raw TotalCost, inconsistent with account cost stats) - Fix RecordUsageWithLongContext API Key quota only updating in balance mode (now updates regardless of billing type) - Fix WebSocket client disconnect detection on Windows by adding "an established connection was aborted" to known disconnect errors	2026-03-06 00:54:17 +08:00
erio	05527b13db	feat: add quota limit for API key accounts - Add configurable spending limit (quota_limit) for apikey-type accounts - Atomic quota accumulation via PostgreSQL JSONB operations on TotalCost - Scheduler filters out over-quota accounts with outbox-triggered snapshot refresh - Display quota usage ($used / $limit) in account capacity column - Add "Reset Quota" action in account menu to reset usage to zero - Editing account settings preserves quota_used (no accidental reset) - Covers all 3 billing paths: Anthropic, Gemini, OpenAI RecordUsage chore: bump version to 0.1.90.4	2026-03-06 00:35:09 +08:00
shaw	9d70c38504	fix: 修复claude apikey账号请求时未携带beta=true 查询参数的bug	2026-03-05 15:01:04 +08:00
shaw	aeb464f3ca	feat: 模型映射应用 /v1/messages/count_tokens端点	2026-03-05 14:49:28 +08:00
Wesley Liddick	43c203333e	Merge pull request #733 from DaydreamCoding/fix/group-isolation fix(gateway): 分组隔离 — 禁止未分组账号被跨组调度	2026-03-03 15:10:30 +08:00
shaw	a80ec5d8bb	feat: apikey支持5h/1d/7d速率控制	2026-03-03 15:01:10 +08:00
QTom	530a16291c	fix(gateway): 分组隔离 — 禁止未分组账号被跨组调度当 API Key 无分组时，调度仅从未分组账号池中选取。修复 isAccountInGroup 在 groupID==nil 时的逻辑，同时补全 scheduler_snapshot_service 和 gemini_compat_service 中的 SimpleMode 保护，确保分组隔离在所有调度路径生效。新增 ListSchedulableUngroupedByPlatform/s 方法，使用 Ent 的 Not(HasAccountGroups()) 谓词实现未分组账号隔离。新增 17 个单元和端到端隔离测试，覆盖所有分支和边界条件。	2026-03-03 13:20:58 +08:00
QTom	a9285b8a94	feat(gateway): 双模式用户消息队列 — 串行队列 + 软性限速新增 UMQ (User Message Queue) 双模式支持: - serialize: 账号级分布式串行锁 + RPM 自适应延迟（严格限流） - throttle: 仅 RPM 自适应前置延迟，不阻塞并发（软性限速）后端: - config: 新增 Mode 字段，保留 Enabled 向后兼容 - service: 新增 UserMessageQueueService（Lua 锁/延迟算法/清理 worker） - repository: 新增 UserMsgQueueCache（Redis Lua acquire/release/force-release） - handler: 新增 UserMsgQueueHelper（SSE ping + 等待循环 + throttle） - gateway: 按 mode 分支集成 serialize/throttle 逻辑 - lint: 修复 gofmt rewrite rules、errcheck 类型断言、staticcheck QF1012 前端: - 三态选择器 UI（关闭/软性限速/串行队列）替代 toggle 开关 - BulkEdit 支持 null 语义（不修改） - i18n 中英文文案通过 6 轮专家评审（42 次 review）、golangci-lint、单元测试、集成测试。	2026-03-03 01:05:11 +08:00
QTom	2491e9b5ad	fix: round-3 review fixes for RPM limiting - Add sanitizeExtraBaseRPM to BulkUpdate handler (was missing) - Add WindowCost scheduling checks to legacy non-sticky selection paths (4 sites), matching existing sticky + load-aware coverage - Export ParseExtraInt from service package, remove duplicate parseExtraIntForValidation from admin handler	2026-02-28 20:38:06 +08:00
QTom	e63c83955a	fix: address deep code review issues for RPM limiting - Move IncrementRPM after Forward success to prevent phantom RPM consumption during account switch retries - Add base_rpm input sanitization (clamp to 0-10000) in Create/Update - Add WindowCost scheduling checks to legacy path sticky sessions (4 check sites + 4 prefetch sites), fixing pre-existing gap - Clean up rpm_strategy/rpm_sticky_buffer when disabling RPM in BulkEditModal (JSONB merge cannot delete keys, use empty values) - Add json.Number test cases to TestGetBaseRPM/TestGetRPMStickyBuffer - Document TOCTOU race as accepted soft-limit design trade-off	2026-02-28 20:38:06 +08:00
QTom	ff9683b0fc	fix: move RPM prefetch before routing segment in legacy/mixed paths Ensures isAccountSchedulableForRPM calls within the routing segment hit the prefetch cache instead of querying Redis individually.	2026-02-28 20:37:37 +08:00
QTom	607237571f	fix: address code review issues for RPM limiting feature - Use TxPipeline (MULTI/EXEC) instead of Pipeline for atomic INCR+EXPIRE - Filter negative values in GetBaseRPM(), update test expectation - Add RPM batch query (GetRPMBatch) to account List API - Add warn logs for RPM increment failures in gateway handler - Reset enableRpmLimit on BulkEditAccountModal close - Use union type 'tiered' \| 'sticky_exempt' for rpmStrategy refs - Add design decision comments for rdb.Time() RTT trade-off	2026-02-28 20:37:37 +08:00
QTom	f648b8e026	feat: increment RPM counter before request forwarding	2026-02-28 20:37:10 +08:00
QTom	678c3ae132	feat: integrate RPM scheduling checks into account selection flow	2026-02-28 20:37:10 +08:00

1 2 3 4 5 ...

281 Commits