sub2api

mirror of https://gitee.com/wanwujie/sub2api synced 2026-04-09 09:34:46 +08:00

Author	SHA1	Message	Date
erio	b3fe0506fb	Merge branch 'release/custom-0.1.95' into release/custom-0.1.96	2026-03-12 18:12:47 +08:00
haruka	25cb5e7505	fix 第一次 400，第二次触发切账号信号	2026-03-12 11:30:53 +08:00
haruka	f44927b9f8	add test for fix #935	2026-03-12 11:04:14 +08:00
erio	4caa3c2701	merge: integrate upstream v0.1.95 with our customizations Merge main (custom features) into release/custom-0.1.95 (upstream v0.1.95). New upstream features: group subscription binding, multi-dimension quota (daily/weekly/total), allow_messages_dispatch, default_mapped_model, recover state API. Our customizations: simulate_claude_max_enabled, usage status detection, 403 validation handling.	2026-03-11 03:23:44 +08:00
shaw	a3791104f9	feat: 支持后台设置是否启用整流开关	2026-03-07 21:55:38 +08:00
Elysia	65a106792a	fix issue #791	2026-03-06 20:37:09 +08:00
erio	a6026e7ac4	Merge tag 'v0.1.90' into merge/upstream-v0.1.90 注册邮箱域名白名单策略上线，后台大数据场景性能大幅优化。 - 注册邮箱域名白名单：支持管理员配置允许注册的邮箱域名策略 - Keys 页面表单筛选：用户 /keys 页面支持按条件筛选 API Key - Settings 页面分 Tab 拆分：管理后台设置页面按功能模块分 Tab 展示 - 后台大数据场景加载性能优化：仪表盘/用户/账号/Ops 页面大数据集加载显著提速 - Usage 大表分页优化：默认避免全量 COUNT(*)，大幅降低分页查询耗时 - 消除重复的 normalizeAccountIDList，补充新增组件的单元测试 - 清理无用文件和过时文档，精简项目结构 - EmailVerifyView 硬编码英文字符串替换为 i18n 调用 - 修复 Anthropic 平台无限流重置时间的 429 误标记账号限流问题 - 修复自定义菜单页面管理员视角菜单不生效问题 - 修复 Ops 错误详情弹窗未展示真实上游 payload 的问题 - 修复充值/订阅菜单 icon 显示问题 # Conflicts: # .gitignore # backend/cmd/server/VERSION # backend/ent/group.go # backend/ent/runtime/runtime.go # backend/ent/schema/group.go # backend/go.sum # backend/internal/handler/admin/account_handler.go # backend/internal/handler/admin/dashboard_handler.go # backend/internal/pkg/usagestats/usage_log_types.go # backend/internal/repository/group_repo.go # backend/internal/repository/usage_log_repo.go # backend/internal/server/middleware/security_headers.go # backend/internal/server/router.go # backend/internal/service/account_usage_service.go # backend/internal/service/admin_service_bulk_update_test.go # backend/internal/service/dashboard_service.go # backend/internal/service/gateway_service.go # frontend/src/api/admin/dashboard.ts # frontend/src/components/account/BulkEditAccountModal.vue # frontend/src/components/charts/GroupDistributionChart.vue # frontend/src/components/layout/AppSidebar.vue # frontend/src/i18n/locales/en.ts # frontend/src/i18n/locales/zh.ts # frontend/src/views/admin/GroupsView.vue # frontend/src/views/admin/SettingsView.vue # frontend/src/views/admin/UsageView.vue # frontend/src/views/user/PurchaseSubscriptionView.vue	2026-03-04 19:58:38 +08:00
yangjianbo	bb664d9bbf	feat(sync): full code sync from release	2026-02-28 15:01:20 +08:00
erio	81d896bf78	fix: sync Antigravity ForwardResult.Usage with client response simulation Apply Claude Max cache billing to usage before returning ForwardResult in Antigravity Forward, ensuring RecordUsage gets the same simulated usage that clients see. Restore apply+fallback in RecordUsage for consistency across GatewayService and Antigravity paths.	2026-02-27 20:42:53 +08:00
erio	741eae59bb	refactor: decouple claude max cache simulation from RecordUsage Extract setupClaudeMaxStreamingHook and applyClaudeMaxNonStreamingRewrite facade functions to helpers file. RecordUsage now uses detect-only (no mutation), client response rewriting handled at Forward layer.	2026-02-27 19:59:36 +08:00
erio	a6f9f9f968	feat: replace gemini-3-pro-image with gemini-3.1-flash-image - Add migration 060 to update model_mapping for all antigravity accounts - Remove gemini-3-pro-image and gemini-3-pro-image-preview mappings - Add gemini-3.1-flash-image and gemini-3.1-flash-image-preview mappings - Update frontend usage window to show GImage for new model - Update isImageGenerationModel to support new model	2026-02-27 09:52:50 +08:00
erio	756b09b6b8	feat: replace gemini-3-pro-image with gemini-3.1-flash-image - Add migration 060 to update model_mapping for all antigravity accounts - Remove gemini-3-pro-image and gemini-3-pro-image-preview mappings - Add gemini-3.1-flash-image and gemini-3.1-flash-image-preview mappings - Update frontend usage window to show GImage for new model - Update isImageGenerationModel to support new model	2026-02-27 09:30:44 +08:00
erio	4573868c08	fix(antigravity): bill with mapped model and use final model key for rate limiting - Use mapped model (billingModel) instead of original request model for billing - Use resolveFinalAntigravityModelKey for 429 rate limit model key, ensuring rate limit records match the actual upstream model - Add regression tests for both fixes	2026-02-24 18:08:19 +08:00
erio	0dacdf480b	fix: distinguish client disconnection from upstream retry failure Before this change, when a client disconnected mid-request, the error message was "Upstream request failed after retries", which is misleading and pollutes error logs. Now we check context.Err() to return a more accurate "Client disconnected" message for both Claude and Gemini forward paths.	2026-02-24 16:45:08 +08:00
yangjianbo	41d0383fb7	merge(test): 合并 main 并解决前端筛选器冲突	2026-02-15 22:04:06 +08:00
程序猿MT	1cf51b14f7	Merge branch 'Wei-Shaw:main' into main	2026-02-15 20:49:14 +08:00
shaw	a817cafe3d	feat: 区分 Anthropic 5m/1h 缓存创建 token 的差异化计费 Anthropic API 的 cache_creation 对象区分了 ephemeral_5m 和 ephemeral_1h 两种缓存创建 token，1h 单价远高于 5m（如 claude-3-5-haiku: 5m=$1/MTok, 1h=$6/MTok）。此前系统统一按 5m 单价计费，导致计费偏低。后端： - pricing_service: 加载 LiteLLM 的 cache_creation_input_token_cost_above_1hr - billing_service: GetModelPricing 启用分类计费（安全守卫 1h>5m）， CalculateCost 按 5m/1h 分别计费，无明细时回退到 5m 单价 - gateway_service: parseSSEUsage/handleNonStreamingResponse 用 gjson 提取嵌套 cache_creation 对象的 ephemeral_5m/1h_input_tokens - antigravity_gateway_service: extractSSEUsage/extractClaudeUsage 同步提取 - usage_log: 修复 GORM column tag 确保写入正确的数据库列 - 新增迁移 054: 删除 GORM 自动生成的重复列前端： - 使用记录 tooltip 展示 5m/1h 缓存创建明细（带彩色 badge 区分） - 表格单元格缓存写入数值旁显示 1h 标识	2026-02-14 18:15:35 +08:00
yangjianbo	abf5de69fb	Merge branch 'main' into test	2026-02-12 23:43:47 +08:00
程序猿MT	174d7c774d	Merge branch 'Wei-Shaw:main' into main	2026-02-12 23:12:41 +08:00
yangjianbo	584cfc3db2	chore(logging): 完成后端日志审计与结构化迁移 - 将高密度服务与处理器日志迁移到新日志系统（LegacyPrintf/结构化日志） - 增加 stdlog bridge 与兼容测试，保留旧日志捕获能力 - 将 OpenAI 断流告警改为结构化 Warn 并改造对应测试为 sink 捕获 - 补齐后端相关文件 logger 引用并通过全量 go test	2026-02-12 19:01:09 +08:00
程序猿MT	8da5fac69e	Merge branch 'Wei-Shaw:main' into main	2026-02-11 18:39:52 +08:00
SilentFlower	19cca11e00	[UPDATE] 增强 Claude Thinking 模式支持与 Opus 4.6 动态预算适配 ✨ feat(antigravity): 支持 thinking adaptive 类型并适配 Opus 4.6 动态预算 🧪 test(gateway): 增加 thinking 模式解析与签名块过滤的边界用例测试	2026-02-11 10:31:16 +08:00
Edric Li	2a1067c82b	Merge remote-tracking branch 'upstream/main'	2026-02-10 21:52:33 +08:00
Edric Li	a54b81cf74	perf: 错误处理性能优化 - MatchRule 延迟/限制 body ToLower，先用 statusCode 短路，只在需要关键词匹配时转换且限制 8KB - 预计算规则的小写关键词/平台和 error code set，消除运行时重复 ToLower 和线性扫描 - MODEL_CAPACITY_EXHAUSTED 全局去重，避免并发请求重复重试同一模型 - 503 重试 body 读取限制从 2MB 降至 8KB - time.After 替换为 time.NewTimer，防止 context 取消时 timer 泄漏	2026-02-10 21:40:31 +08:00
Edric Li	2d4236f76e	fix: 修复错误透传规则 skip_monitoring 未生效的问题 - ops_error_logger: status < 400 分支增加 OpsSkipPassthroughKey 检查 - ops_upstream_context: 新增 checkSkipMonitoringForUpstreamEvent，中间重试/故障转移事件也能触发跳过标记 - gateway_handler/openai_gateway_handler/gemini_v1beta_handler: handleFailoverExhausted 匹配规则后设置 OpsSkipPassthroughKey - antigravity_gateway_service: writeMappedClaudeError 增加 applyErrorPassthroughRule 调用	2026-02-10 20:56:01 +08:00
yangjianbo	3b0910f664	Merge branch 'main' into test-sora	2026-02-10 18:01:17 +08:00
程序猿MT	1dd3158c7e	Merge branch 'Wei-Shaw:main' into main	2026-02-10 13:55:51 +08:00
song	1f647b120a	feat(antigravity): 转发与测试支持daily/prod单URL切换	2026-02-10 13:51:29 +08:00
Edric Li	7d0a30fa8f	merge: sync upstream main (antigravity single-account 503 retry) 合并上游新增的 Antigravity 单账号 503 退避重试机制，解决与本地 MODEL_CAPACITY_EXHAUSTED 逻辑的冲突，两者共存。	2026-02-10 12:00:21 +08:00
shaw	5dd83d3cf2	fix: 移除特定system以适配新版cc客户端缓存失效的bug	2026-02-10 10:28:34 +08:00
Wesley Liddick	14e1aac9b5	Merge pull request #533 from GuangYiDing/feat/antigravity-single-account-503-retry feat: Antigravity 单账号分组 503 退避重试机制	2026-02-10 09:59:48 +08:00
yangjianbo	58912d4ac5	perf(backend): 使用 gjson/sjson 优化热路径 JSON 处理将 API 网关热路径中的 json.Unmarshal+json.Marshal 替换为 gjson 零拷贝查询和 sjson 精准写入： - unwrapV1InternalResponse 性能提升 22x（4009ns→182ns），内存分配减少 28.5x - unwrapGeminiResponse、extractGeminiUsage、estimateGeminiCountTokens、ParseGeminiRateLimitResetTime 改为接收 []byte 使用 gjson 提取 - ParseGatewayRequest 的 model/stream/metadata/thinking/max_tokens 改用 gjson 类型安全提取 - Handler 层（sora/openai）改用 gjson 提取字段、sjson 注入/修改字段，移除 map[string]any 中间变量 - Sora Client 响应解析改用 gjson ForEach 遍历，减少内存分配 - 新增约 100 个单元测试用例，所有改动函数覆盖率 >85% Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-10 08:59:30 +08:00
Edric Li	6114f69cca	feat: MODEL_CAPACITY_EXHAUSTED 使用固定1s间隔重试60次，不切换账号 MODEL_CAPACITY_EXHAUSTED (503) 表示模型容量不足，所有账号共享同一容量池，切换账号无意义。改为固定1s间隔重试最多60次，重试耗尽后直接返回上游错误。 - 新增 antigravityModelCapacityRetryMaxAttempts=60 和 antigravityModelCapacityRetryWait=1s - shouldTriggerAntigravitySmartRetry 新增 isModelCapacityExhausted 返回值 - handleSmartRetry 对 MODEL_CAPACITY_EXHAUSTED 使用独立重试策略 - handleModelRateLimit 对 MODEL_CAPACITY_EXHAUSTED 仅标记 Handled，不设限流 - 重试耗尽后不设置模型限流、不清除粘性会话、不切换账号	2026-02-10 02:03:06 +08:00
Edric Li	d6c2921f2b	feat: same-account retry before failover for transient errors For retryable transient errors (Google 400 "invalid project resource name" and empty stream responses), retry on the same account up to 2 times (with 500ms delay) before switching to another account. - Add RetryableOnSameAccount field to UpstreamFailoverError - Add same-account retry loop in both Gemini and Claude/OpenAI handler paths - Move temp-unschedule from service layer to handler layer (only after all same-account retries exhausted) - Reduce temp-unschedule cooldown from 30 minutes to 1 minute	2026-02-10 00:53:54 +08:00
Edric Li	61c73287dc	feat: failover and temp-unschedule on empty stream response - Empty stream responses now return UpstreamFailoverError instead of plain 502, triggering automatic account switching (up to 10 retries) - Add tempUnscheduleEmptyResponse: accounts returning empty responses are temp-unscheduled for 30 minutes - Apply to both Claude and Gemini non-streaming paths - Align googleConfigErrorCooldown from 60m to 30m for consistency	2026-02-09 23:25:30 +08:00
Edric Li	89905ec43d	feat: failover and temp-unschedule on Google "Invalid project resource name" 400 Google 后端间歇性返回 400 "Invalid project resource name" 错误，此前该错误直接透传给客户端且不触发账号切换，导致请求失败。 - 在 Antigravity 和 Gemini 两个平台的所有转发路径中，精确匹配该错误消息后触发 failover 自动换号重试 - 命中后将账号临时封禁 1 小时，避免反复调度到同一故障账号 - 提取共享函数 isGoogleProjectConfigError / tempUnscheduleGoogleConfigError 消除跨 Service 的代码重复	2026-02-09 22:48:32 +08:00
yangjianbo	16131c3d3f	Merge branch 'main' of https://github.com/mt21625457/aicodex2api	2026-02-09 20:26:03 +08:00
erio	6892e84ad2	fix: skip rate limiting when custom error codes don't match upstream status Add ShouldHandleErrorCode guard at the entry of handleGeminiUpstreamError and AntigravityGatewayService.handleUpstreamError so that accounts with custom error codes (e.g. [599]) are not rate-limited when the upstream returns a non-matching status (e.g. 429).	2026-02-09 19:55:05 +08:00
erio	73f455745c	feat: ErrorPolicySkipped returns 500 instead of upstream status code When custom error codes are enabled and the upstream error code is NOT in the configured list, return HTTP 500 to the client instead of transparently forwarding the original status code. Also adds integration test TestCustomErrorCode599 verifying that 429, 500, 503, 401, 403 all return 500 without triggering SetRateLimited or SetError.	2026-02-09 19:54:54 +08:00
Rose Ding	f6cfab9901	feat: 添加 Antigravity 单账号 503 退避重试机制当分组内只有一个可用账号且上游返回 503 (MODEL_CAPACITY_EXHAUSTED) 时，不再设置模型限流+切换账号（因为切换回来还是同一个账号），而是在 Service 层原地等待+重试，避免双重等待问题。主要变更： - Handler 层：检测单账号 503 场景，清除排除列表并设置 SingleAccountRetry 标记 - Service 层：新增 handleSingleAccountRetryInPlace 原地重试逻辑 - Service 层：预检查跳过单账号模式下的限流检查 - 新增 ctxkey.SingleAccountRetry 上下文标记	2026-02-09 14:26:01 +08:00
erio	fc095bf054	refactor: replace scope-level rate limiting with model-level rate limiting Merge functional changes from develop branch: - Remove AntigravityQuotaScope system (claude/gemini_text/gemini_image) - Replace with per-model rate limiting using resolveAntigravityModelKey - Remove model load statistics (IncrModelCallCount/GetModelLoadBatch) - Simplify account selection to unified priority→load→LRU algorithm - Remove SetAntigravityQuotaScopeLimit from AccountRepository - Clean up scope-related UI indicators and API fields	2026-02-09 08:19:01 +08:00
erio	9236936a55	feat: route AccountTypeUpstream to ForwardUpstream in Forward() entry Without this routing guard, ForwardUpstream is never called because Forward() always proceeds with the standard OAuth/cookie flow.	2026-02-09 07:27:10 +08:00
erio	125152460f	fix: use upstream retryDelay for rate limit duration instead of fixed default - In handleSmartRetry, use the actual upstream retryDelay to set model rate limit duration instead of always using the 30s default - Return info.RetryDelay from shouldTriggerAntigravitySmartRetry when shouldRateLimitModel=true, so callers know the actual delay - Extract getDefaultRateLimitDuration() and resolveResetTime() helpers to reduce duplication in handleUpstreamError 429 handling - Improve debug logging with upstream_retry_delay and response body	2026-02-09 07:11:29 +08:00
erio	6d90fb0bc3	feat: detect client disconnect during streaming and continue draining upstream for billing	2026-02-09 07:06:26 +08:00
erio	2f1182e8a9	feat: unified error policy for Antigravity + enable custom error codes for Gemini accounts	2026-02-09 06:54:42 +08:00
Wesley Liddick	2185a3b674	Merge pull request #517 from touwaeriol/fix/upstream-baseurl refactor(upstream): replace upstream account type with apikey + auto-append base_url	2026-02-08 14:03:12 +08:00
Wesley Liddick	9e3c306a5b	Merge pull request #513 from touwaeriol/pr/antigravity-full-v2 feat(antigravity): comprehensive enhancements — rate limiting, scheduling & smart retry	2026-02-08 14:01:17 +08:00
erio	69816f8691	fix: remove unused upstreamHopByHopHeaders variable to pass golangci-lint	2026-02-08 13:30:39 +08:00
erio	fb58560d15	refactor(upstream): replace upstream account type with apikey, auto-append /antigravity Upstream accounts now use the standard APIKey type instead of a dedicated upstream type. GetBaseURL() and new GetGeminiBaseURL() automatically append /antigravity for Antigravity platform APIKey accounts, eliminating the need for separate upstream forwarding methods. - Remove ForwardUpstream, ForwardUpstreamGemini, testUpstreamConnection - Remove upstream branch guards in Forward/ForwardGemini/TestConnection - Add migration 052 to convert existing upstream accounts to apikey - Update frontend CreateAccountModal to create apikey type - Add unit tests for GetBaseURL and GetGeminiBaseURL	2026-02-08 13:06:25 +08:00
erio	6ab77f5eb5	fix(upstream): passthrough response body directly instead of parsing SSE ForwardUpstream/ForwardUpstreamGemini should pipe the upstream response directly to the client (headers + body), not parse it as SSE stream.	2026-02-08 08:49:43 +08:00

1 2 3 4

169 Commits