CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-02-19 04:40:52 +08:00

Author	SHA1	Message	Date
Luis Pater	1b2f907671	feat(executor): update system instruction handling for Claude and Gemini-3-Pro models	2026-01-08 12:42:26 +08:00
Luis Pater	bda04eed8a	feat(executor): add model-specific support for "gemini-3-pro" in execution and payload handling	2026-01-08 12:27:03 +08:00
Luis Pater	67985d8226	feat(executor): enhance Antigravity payload with user role and dynamic system instructions	2026-01-08 10:55:25 +08:00
Luis Pater	f4ba1ab910	fix(executor): remove unused `tokenRefreshTimeout` constant and pass zero timeout to HTTP client	2026-01-07 18:16:49 +08:00
LTbinglingfeng	5e5d8142f9	fix(auth): error when antigravity refresh token missing during refresh	2026-01-07 01:09:50 +08:00
LTbinglingfeng	b01619b441	fix(management): refresh antigravity token for api-call $TOKEN$	2026-01-07 00:14:02 +08:00
zhiqing0205	aa8526edc0	fix(codex): use unicode title casing for plan	2026-01-06 10:24:02 +08:00
zhiqing0205	ac3ca0ad8e	feat(codex): include plan type in auth filename	2026-01-06 02:25:56 +08:00
MohammadErfan Jabbari	fe6043aec7	fix(antigravity): preserve finish_reason tool_calls across streaming chunks When streaming responses with tool calls, the finish_reason was being overwritten. The upstream sends functionCall in chunk 1, then finishReason: STOP in chunk 2. The old code would set finish_reason from every chunk, causing "tool_calls" to be overwritten by "stop". This broke clients like Claude Code that rely on finish_reason to detect when tool calls are complete. Changes: - Add SawToolCall bool to track tool calls across entire stream - Add UpstreamFinishReason to cache the finish reason - Only emit finish_reason on final chunk (has both finishReason + usage) - Priority: tool_calls > max_tokens > stop Includes 5 unit tests covering: - Tool calls not overwritten by subsequent STOP - Normal text gets "stop" - MAX_TOKENS without tool calls gets "max_tokens" - Tool calls take priority over MAX_TOKENS - Intermediate chunks have no finish_reason Fixes streaming tool call detection for Claude Code + Gemini models. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2026-01-05 18:45:25 +01:00
maoring24	00280b6fe8	feat(claude): add native request cloaking for non-claude-code clients integrate claude-cloak functionality to disguise api requests: - add CloakConfig with mode (auto/always/never) and strict-mode options - generate fake user_id in claude code format (user_[hex]_account__session_[uuid]) - inject claude code system prompt (configurable strict mode) - obfuscate sensitive words with zero-width characters - auto-detect claude code clients via user-agent 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-05 20:32:51 +08:00
Luis Pater	8f8dfd081b	Merge pull request #850 from can1357/main feat(translator): add developer role support for Gemini translators	2026-01-05 11:27:24 +08:00
hkfires	05444cf32d	fix(gemini): abort default injection on existing thinking keys	2026-01-05 10:24:30 +08:00
Luis Pater	8edbda57cf	feat(translator): add `thoughtSignature` to node parts for Gemini and Antigravity requests Enhanced node structure by including `thoughtSignature` for inline data parts in Gemini OpenAI, Gemini CLI, and Antigravity request handlers to improve traceability of thought processes.	2026-01-05 09:25:17 +08:00
CodeIgnitor	52760a4eaa	fix(auth): use backend project ID for free tier Gemini CLI OAuth users Fixes issue where free tier users cannot access Gemini 3 preview models due to frontend/backend project ID mapping. ## Problem Google's Gemini API uses a frontend/backend project mapping system for free tier users: - Frontend projects (e.g., gen-lang-client-) are user-visible - Backend projects (e.g., mystical-victor-) host actual API access - Only backend projects have access to preview models (gemini-3-) Previously, CLIProxyAPI ignored the backend project ID returned by Google's onboarding API and kept using the frontend ID, preventing access to preview models. ## Solution ### CLI (internal/cmd/login.go) - Detect free tier users (gen-lang-client- projects or FREE/LEGACY tier) - Show interactive prompt allowing users to choose frontend or backend - Default to backend (recommended for preview model access) - Pro users: maintain original behavior (keep frontend ID) ### Web UI (internal/api/handlers/management/auth_files.go) - Detect free tier users using same logic - Automatically use backend project ID (recommended choice) - Pro users: maintain original behavior (keep frontend ID) ### Deduplication (internal/cmd/login.go) - Add deduplication when user selects ALL projects - Prevents redundant API calls when multiple frontend projects map to same backend - Skips duplicate project IDs in activation loop ## Impact - Free tier users: Can now access gemini-3-pro-preview and gemini-3-flash-preview models - Pro users: No change in behavior (backward compatible) - Only affects Gemini CLI OAuth (not antigravity or API key auth) ## Testing - Tested with free tier account selecting single project - Tested with free tier account selecting ALL projects - Verified deduplication prevents redundant onboarding calls - Confirmed pro user behavior unchanged	2026-01-05 02:41:24 +05:00
Shun Kakinoki	bc32096e9c	fix: prevent race condition in objectstore auth sync Remove os.RemoveAll() call in syncAuthFromBucket() that was causing a race condition with the file watcher. Problem: 1. syncAuthFromBucket() wipes local auth directory with RemoveAll 2. File watcher detects deletions and propagates them to remote store 3. syncAuthFromBucket() then pulls from remote, but files are now gone Solution: Use incremental sync instead of delete-then-pull. Just ensure the directory exists and overwrite files as they're downloaded. This prevents the watcher from seeing spurious delete events.	2026-01-05 00:10:59 +09:00
Supra4E8C	cd22c849e2	feat(management): 更新OAuth模型映射的清理逻辑以增强数据安全性	2026-01-04 17:57:34 +08:00
Supra4E8C	f0e73efda2	feat(management): add vertex api key and oauth model mappings endpoints	2026-01-04 17:32:00 +08:00
Supra4E8C	3156109c71	feat(management): 支持管理接口调整日志大小/强制前缀/路由策略	2026-01-04 12:21:49 +08:00
can1357	6762e081f3	feat(translator): add developer role support for Gemini translators Treat OpenAI's "developer" role the same as "system" role in request translation for gemini, gemini-cli, and antigravity backends.	2026-01-03 21:01:01 +01:00
Luis Pater	7815ee338d	fix(translator): adjust `message_delta` emission boundary in Claude-to-OpenAI conversion Fixed incorrect boundary logic for `message_delta` emission, ensuring proper handling of usage updates and `emitMessageStopIfNeeded` within the response loop.	2026-01-04 01:36:51 +08:00
Luis Pater	44b6c872e2	feat(config): add support for `Fork` in OAuth model mappings with alias handling Implemented `Fork` flag in `ModelNameMapping` to allow aliases as additional models while preserving the original model ID. Updated the `applyOAuthModelMappings` logic, added tests for `Fork` behavior, and updated documentation and examples accordingly.	2026-01-04 01:18:29 +08:00
Luis Pater	7a77b23f2d	feat(executor): add token refresh timeout and improve context handling during refresh Introduced `tokenRefreshTimeout` constant for token refresh operations and enhanced context propagation for `refreshToken` by embedding roundtrip information if available. Adjusted `refreshAuth` to ensure default context initialization and handle cancellation errors appropriately.	2026-01-04 00:26:08 +08:00
Luis Pater	ebec293497	feat(api): integrate `TokenStore` for improved auth entry management Replaced file-based auth entry counting with `TokenStore`-backed implementation, enhancing flexibility and context-aware token management. Updated related logic to reflect this change.	2026-01-03 04:53:47 +08:00
Luis Pater	e02ceecd35	feat(registry): introduce `ModelRegistryHook` for monitoring model registrations and unregistrations Added support for external hooks to observe model registry events using the `ModelRegistryHook` interface. Implemented thread-safe, non-blocking execution of hooks with panic recovery. Comprehensive tests added to verify hook behavior during registration, unregistration, blocking, and panic scenarios.	2026-01-02 23:18:40 +08:00
hkfires	fdf5720217	fix(gemini): remove default thinking for gemini 3 models	2026-01-02 10:55:59 +08:00
Luis Pater	2a663d5cba	feat(executor): enhance payload translation with original request context Refactored `applyPayloadConfig` to `applyPayloadConfigWithRoot`, adding support for default rule validation against the original payload when available. Updated all executors to use `applyPayloadConfigWithRoot` and incorporate an optional original request payload for translations.	2026-01-02 00:03:26 +08:00
hkfires	3902fd7501	fix(iflow): remove thinking field from request body in thinking config handler	2026-01-01 19:40:28 +08:00
hkfires	4fc3d5e935	refactor(iflow): simplify thinking config handling for GLM and MiniMax models	2026-01-01 19:31:08 +08:00
hkfires	2d2f4572a7	fix(translator): remove unnecessary whitespace trimming in reasoning text collection	2026-01-01 12:39:09 +08:00
hkfires	8f4c46f38d	fix(translator): emit tool_result messages before user content in Claude-to-OpenAI conversion	2026-01-01 11:11:43 +08:00
hkfires	b6ba51bc2a	feat(translator): add thinking block and tool result handling for Claude-to-OpenAI conversion	2026-01-01 09:41:25 +08:00
Luis Pater	6a66d32d37	Merge pull request #803 from HsnSaboor/fix-invalid-function-names-sanitization-v2 feat(translator): resolve invalid function name errors by sanitizing Claude tool names	2026-01-01 01:15:50 +08:00
Luis Pater	8d15723195	feat(registry): add `GetAvailableModelsByProvider` method for retrieving models by provider	2025-12-31 23:37:46 +08:00
hkfires	8bf3305b2b	fix(thinking): fallback to upstream model for thinking support when alias not in registry	2025-12-31 18:07:13 +08:00
hkfires	d00e3ea973	feat(thinking): add numeric budget to thinkingLevel conversion fallback	2025-12-31 17:14:47 +08:00
hkfires	89db4e9481	fix(thinking): use model alias for thinking config resolution in mapped models	2025-12-31 17:09:22 +08:00
hkfires	e332419081	feat(registry): add thinking support for gemini-2.5-computer-use-preview model	2025-12-31 17:09:22 +08:00
Luis Pater	e998b1229a	feat(updater): add fallback URL and logic for missing management asset	2025-12-31 11:51:20 +08:00
Saboor Hassan	47b9503112	chore: revert changes to internal/translator to comply with path guard This commit reverts all modifications within internal/translator. A separate issue will be created for the maintenance team to integrate SanitizeFunctionName into the translators. Co-authored-by: factory-droid[bot] <138933559+factory-droid[bot]@users.noreply.github.com>	2025-12-31 02:19:26 +05:00
Saboor Hassan	3b9253c2be	fix(translator): resolve invalid function name errors by sanitizing Claude tool names This commit centralizes tool name sanitization in SanitizeFunctionName, applying character compliance, starting character rules, and length limits. It also fixes a regression in gemini_schema tests and preserves MCP-specific shortening logic while ensuring compliance. Co-authored-by: factory-droid[bot] <138933559+factory-droid[bot]@users.noreply.github.com>	2025-12-31 02:14:46 +05:00
Saboor Hassan	d241359153	fix(translator): address PR feedback for tool name sanitization - Pre-compile sanitization regex for better performance. - Optimize SanitizeFunctionName for conciseness and correctness. - Handle 64-char edge cases by truncating before prepending underscore. - Fix bug in Antigravity translator (incorrect join index). - Refactor Gemini translators to avoid redundant sanitization calls. - Add comprehensive unit tests including 64-char edge cases. Co-authored-by: factory-droid[bot] <138933559+factory-droid[bot]@users.noreply.github.com>	2025-12-31 01:54:41 +05:00
Saboor Hassan	f4d4249ba5	feat(translator): sanitize tool/function names for upstream provider compatibility Implemented SanitizeFunctionName utility to ensure Claude tool names meet Gemini/Upstream strict naming conventions (alphanumeric, starts with letter/underscore, max 64 chars). Applied sanitization to tool definitions and usage in all relevant translators. Co-authored-by: factory-droid[bot] <138933559+factory-droid[bot]@users.noreply.github.com>	2025-12-31 01:41:07 +05:00
hkfires	e0381a6ae0	refactor(watcher): extract model summary functions to dedicated file	2025-12-30 22:39:12 +08:00
hkfires	2c01b2ef64	feat(watcher): add Gemini models and OAuth model mappings change detection	2025-12-30 22:39:12 +08:00
Chén Mù	e947266743	Merge pull request #795 from router-for-me/modelmappings refactor(executor): resolve upstream model at conductor level before execution	2025-12-30 05:31:19 -08:00
Luis Pater	c6b0e85b54	Fixed: #790 fix(gemini): include full text in response output events	2025-12-30 20:44:13 +08:00
hkfires	26efbed05c	refactor(executor): remove redundant upstream model parameter from translateRequest	2025-12-30 20:20:42 +08:00
hkfires	96340bf136	refactor(executor): resolve upstream model at conductor level before execution	2025-12-30 19:31:54 +08:00
hkfires	b055e00c1a	fix(executor): use upstream model for thinking config and payload translation	2025-12-30 17:49:44 +08:00
sususu	414db44c00	fix(antigravity): parse retry-after delay from 429 response body When receiving HTTP 429 (Too Many Requests) responses, parse the retry delay from the response body using parseRetryDelay and populate the statusErr.retryAfter field. This allows upstream callers to respect the server's requested retry timing. Applied to all error paths in Execute, executeClaudeNonStream, ExecuteStream, CountTokens, and refreshToken functions.	2025-12-30 16:07:32 +08:00

... 4 5 6 7 8 ...

1181 Commits