CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-02-03 04:50:52 +08:00

Author	SHA1	Message	Date
hkfires	ed8b0f25ee	fix(thinking): use LookupModelInfo for model data	2026-01-15 13:06:41 +08:00
hkfires	72f2125668	fix(executor): properly handle thinking application errors	2026-01-15 13:06:39 +08:00
hkfires	0b06d637e7	refactor: improve thinking logic	2026-01-15 13:06:39 +08:00
Luis Pater	e8e3bc8616	feat(executor): add HttpRequest support across executors for better http request handling	2026-01-10 16:25:25 +08:00
Ben Vargas	e785bfcd12	Use unprefixed Claude request for translation Keep the upstream payload prefixed for OAuth while passing the unprefixed request body into response translators. This avoids proxy_ leaking into OpenAI Responses echoed tool metadata while preserving the Claude OAuth workaround.	2026-01-09 00:54:35 -07:00
Ben Vargas	dcac3407ab	Fix Claude OAuth tool name mapping Prefix tool names with proxy_ for Claude OAuth requests and strip the prefix from streaming and non-streaming responses to restore client-facing names. Updates the Claude executor to: - add prefixing for tools, tool_choice, and tool_use messages when using OAuth tokens - strip the prefix from tool_use events in SSE and non-streaming payloads - add focused unit tests for prefix/strip helpers	2026-01-09 00:10:38 -07:00
Luis Pater	2a663d5cba	feat(executor): enhance payload translation with original request context Refactored `applyPayloadConfig` to `applyPayloadConfigWithRoot`, adding support for default rule validation against the original payload when available. Updated all executors to use `applyPayloadConfigWithRoot` and incorporate an optional original request payload for translations.	2026-01-02 00:03:26 +08:00
hkfires	96340bf136	refactor(executor): resolve upstream model at conductor level before execution	2025-12-30 19:31:54 +08:00
hkfires	b055e00c1a	fix(executor): use upstream model for thinking config and payload translation	2025-12-30 17:49:44 +08:00
Ben Vargas	aca2ef6359	Fix: disable thinking when tool_choice forces tool use Anthropic API does not allow extended thinking when tool_choice is set to "any" or a specific tool. This was causing 400 errors when using features like Amp's /handoff command which forces tool_choice. Added disableThinkingIfToolChoiceForced() that removes thinking config when incompatible tool_choice is detected, applied to both streaming and non-streaming paths. Fixes router-for-me/CLIProxyAPI#630	2025-12-27 16:31:37 -07:00
Luis Pater	6d1e20e940	fix(claude_executor): update header logic for API key handling Refined header assignment to use `x-api-key` for Anthropic API requests, ensuring correct authorization behavior based on request attributes and URL validation.	2025-12-23 22:30:25 +08:00
Luis Pater	a74ee3f319	Merge pull request #481 from sususu98/fix/increase-buffer-size fix: increase buffer size for stream scanners to 50MB across multiple executors	2025-12-11 21:20:54 +08:00
hkfires	6285459c08	fix(runtime): unify claude thinking config resolution	2025-12-11 17:20:44 +08:00
Luis Pater	423ce97665	feat(util): implement dynamic thinking suffix normalization and refactor budget resolution logic - Added support for parsing and normalizing dynamic thinking model suffixes. - Centralized budget resolution across executors and payload helpers. - Retired legacy Gemini-specific thinking handlers in favor of unified logic. - Updated executors to use metadata-based thinking configuration. - Added `ResolveOriginalModel` utility for resolving normalized upstream models using request metadata. - Updated executors (Gemini, Codex, iFlow, OpenAI, Qwen) to incorporate upstream model resolution and substitute model values in payloads and request URLs. - Ensured fallbacks handle cases with missing or malformed metadata to derive models robustly. - Refactored upstream model resolution to dynamically incorporate metadata for selecting and normalizing models. - Improved handling of thinking configurations and model overrides in executors. - Removed hardcoded thinking model entries and migrated logic to metadata-based resolution. - Updated payload mutations to always include the resolved model.	2025-12-11 03:10:50 +08:00
sususu	76c563d161	fix(executor): increase buffer size for stream scanners to 50MB across multiple executors	2025-12-10 23:20:04 +08:00
nestharus	e73cdf5cff	fix(claude): ensure max_tokens exceeds thinking budget for thinking models Fixes an issue where Claude thinking models would return 400 errors when the thinking.budget_tokens was greater than or equal to max_tokens. Changes: - Add MaxCompletionTokens: 128000 to all Claude thinking model definitions - Add ensureMaxTokensForThinking() function in claude_executor.go that: - Checks if thinking is enabled with a budget_tokens value - Looks up the model's MaxCompletionTokens from the registry - Ensures max_tokens is set to at least the model's MaxCompletionTokens - Falls back to budget_tokens + 4000 buffer if registry lookup fails This ensures Anthropic API constraint (max_tokens > thinking.budget_tokens) is always satisfied when using extended thinking features. Fixes: #339 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-26 22:31:05 -08:00
Luis Pater	a4a26d978e	Fixed: #339 feat(handlers, executor): add Gemini 3 Pro Preview support and refine Claude system instructions - Added support for the new "Gemini 3 Pro Preview" action in Gemini handlers, including detailed metadata and configuration. - Removed redundant `cache_control` field from Claude system instructions for cleaner payload structure.	2025-11-26 11:42:57 +08:00
Luis Pater	ed9f6e897e	Fixed: #337 fix(executor): replace redundant commented code with `checkSystemInstructions` helper - Replaced commented-out `sjson.SetRawBytes` lines with the new `checkSystemInstructions` function. - Centralized system instruction handling for better code clarity and reuse. - Ensured consistent logic for managing `system` field across Claude executor flows.	2025-11-26 08:27:48 +08:00
Luis Pater	2e5681ea32	Merge branch 'dev' into feat/claude-thinking-and-beta-headers	2025-11-26 02:16:40 +08:00
Luis Pater	52c17f03a5	fix(executor): comment out redundant code for setting Claude system instructions - Commented out multiple instances of `sjson.SetRawBytes` for setting `system` key to Claude instructions as they are redundant. - Code cleanup to improve clarity and maintainability without affecting functionality.	2025-11-26 02:06:16 +08:00
nestharus	d0e694d4ed	feat(claude): add thinking model variants and beta headers support - Add Claude thinking model definitions (sonnet-4-5-thinking, opus-4-5-thinking variants) - Add Thinking support for antigravity models with -thinking suffix - Add injectThinkingConfig() for automatic thinking budget based on model suffix - Add resolveUpstreamModel() mappings for thinking variants to actual Claude models - Add extractAndRemoveBetas() to convert betas array to anthropic-beta header - Update applyClaudeHeaders() to merge custom betas from request body Closes #324	2025-11-25 03:33:05 -08:00
Luis Pater	db2d22c978	fix(runtime): simplify scanner buffer allocation in executor implementations	2025-11-18 10:59:49 +08:00
Luis Pater	fcd98f4f9b	feat(runtime): add payload configuration support for executors Introduce `PayloadConfig` in the configuration to define default and override rules for modifying payload parameters. Implement `applyPayloadConfig` and `applyPayloadConfigWithRoot` to apply these rules across various executors, ensuring consistent parameter handling for different models and protocols. Update all relevant executors to utilize this functionality.	2025-11-13 23:27:40 +08:00
hkfires	cfb9cb8951	feat(config): support HTTP headers across providers	2025-11-08 20:52:05 +08:00
Luis Pater	38cfbac8f0	fix(executor): adjust `Anthropic-Beta` header handling for consistent API requests	2025-11-03 20:49:01 +08:00
Luis Pater	5be4d22b9b	fix(executor): ensure consistent header application in Claude API requests	2025-11-03 17:57:20 +08:00
hkfires	a517290726	refactor(executor): summarize API error bodies of html in debug logs	2025-10-31 06:58:38 +08:00
Luis Pater	9d42e4b239	feat(runtime): add User-Agent headers to codex and claude executors - Standardized User-Agent strings for Codex and Claude executors to improve request tracing and compatibility. - Updated header insertion logic in both executors for consistency.	2025-10-29 12:57:37 +08:00
Luis Pater	847c2502a5	Fixed: #172 feat(runtime): add Brotli and Zstd compression support, improve response handling - Implemented Brotli and Zstd decompression handling in `FileRequestLogger` and executor logic for enhanced compatibility. - Added `decodeResponseBody` utility for streamlined multi-encoding support (Gzip, Deflate, Brotli, Zstd). - Improved resource cleanup with composite readers for proper closure under all conditions. - Updated dependencies in `go.mod` and `go.sum` to include Brotli and Zstd libraries.	2025-10-28 08:39:03 +08:00
Luis Pater	c7196ba7dc	feat(claude): add model alias mapping and improve key normalization - Introduced model alias mapping for Claude configurations, enabling upstream and client-facing model name associations. - Added `computeClaudeModelsHash` to generate a consistent hash for model aliases. - Implemented `normalizeClaudeKey` function to standardize input API key configuration, including models. - Enhanced executor to resolve model aliases to upstream names dynamically. - Updated documentation and configuration examples to reflect new model alias support.	2025-10-28 00:14:19 +08:00
Luis Pater	20985d1a10	Refactor executor error handling and usage reporting - Updated the Execute methods in various executors (GeminiCLIExecutor, GeminiExecutor, IFlowExecutor, OpenAICompatExecutor, QwenExecutor) to return a response and error as named return values for improved clarity. - Enhanced error handling by deferring failure tracking in usage reporters, ensuring that failures are reported correctly. - Improved response body handling by ensuring proper closure and error logging for HTTP responses across all executors. - Added failure tracking and reporting in the usage reporter to capture unsuccessful requests. - Updated the usage logging structure to include a 'Failed' field for better tracking of request outcomes. - Adjusted the logic in the RequestStatistics and Record methods to accommodate the new failure tracking mechanism.	2025-10-21 11:22:24 +08:00
Luis Pater	3dd0844b98	Enhance logging for API requests and responses across executors - Added detailed logging of upstream request metadata including URL, method, headers, and body for Codex, Gemini, IFlow, OpenAI Compat, and Qwen executors. - Implemented error logging for API response failures to capture errors during HTTP requests. - Introduced structured logging for authentication details (AuthID, AuthLabel, AuthType, AuthValue) to improve traceability. - Updated response logging to include status codes and headers for better debugging. - Ensured that all executors consistently log API interactions to facilitate monitoring and troubleshooting.	2025-10-17 04:12:38 +08:00
Adamcf	15981aa412	fix: add Claude→Claude passthrough to prevent SSE event fragmentation When from==to (Claude→Claude scenario), directly forward SSE stream line-by-line without invoking TranslateStream. This preserves the multi-line SSE event structure (event:/data:/blank) and prevents JSON parsing errors caused by event fragmentation. Resolves: JSON parsing error when using Claude Code streaming responses fix: correct SSE event formatting in Handler layer Remove duplicate newline additions (\n\n) that were breaking SSE event format. The Executor layer already provides properly formatted SSE chunks with correct line endings, so the Handler should forward them as-is without modification. Changes: - Remove redundant \n\n addition after each chunk - Add len(chunk) > 0 check before writing - Format error messages as proper SSE events (event: error\ndata: {...}\n\n) - Add chunkIdx counter for future debugging needs This fixes JSON parsing errors caused by malformed SSE event streams. fix: update comments for clarity in SSE event forwarding	2025-10-15 22:13:44 +08:00
Luis Pater	bbdd68a8b4	feat(registry/runtime): add Gemini 2.5 model and increase buffer sizes - Added new "Gemini 2.5 Flash Image Preview" model definition, with enhanced image generation capabilities. - Increased scanner buffer size to 20,971,520 bytes across executors and translators to handle larger payloads.	2025-10-06 04:44:45 +08:00
Luis Pater	de796ac1c2	feat(runtime): introduce `newProxyAwareHTTPClient` for enhanced proxy handling - Added `newProxyAwareHTTPClient` to centralize proxy configuration with priority on `auth.ProxyURL` and `cfg.ProxyURL`. - Integrated enhanced proxy support across executors for HTTP, HTTPS, and SOCKS5 protocols. - Refactored redundant HTTP client initialization to use `newProxyAwareHTTPClient` for consistent behavior.	2025-09-30 09:04:15 +08:00
Luis Pater	352a67857b	refactor(runtime): move `Anthropic-Beta` header setting to `applyClaudeHeaders` for better header management	2025-09-29 20:51:36 +08:00
Luis Pater	f5dc380b63	rebuild branch	2025-09-25 10:32:48 +08:00
Luis Pater	3f69254f43	remove all	2025-09-25 10:31:02 +08:00
Luis Pater	a2c5fdaf66	refactor(executor): remove ClientAdapter and legacy fallback logic - Deleted `ClientAdapter` implementation and associated fallback methods. - Removed legacy executor logic from `codex`, `claude`, `gemini`, and `qwen` executors. - Simplified `handlers` by eliminating `UnwrapError` handling and related dependencies. - Cleaned up `model_registry` by removing logic associated with suspended clients. - Updated `.gitignore` to ignore `.serena/` directory.	2025-09-24 21:09:36 +08:00
Luis Pater	3dd5095792	feat(translators): add token counting support for Claude and Gemini responses - Implemented `TokenCount` transform method across translators to calculate token usage. - Integrated token counting logic into executor pipelines for Claude, Gemini, and CLI translators. - Added corresponding API endpoints and handlers (`/messages/count_tokens`) for token usage retrieval. - Enhanced translation registry to support `TokenCount` functionality alongside existing response types.	2025-09-24 11:59:38 +08:00
Luis Pater	3ade03f3b3	feat(usage): implement usage tracking infrastructure across executors - Added `LoggerPlugin` to log usage metrics for observability. - Introduced a new `Manager` to handle usage record queuing and plugin registration. - Integrated new usage reporter and detailed metrics parsing into executors, covering providers like OpenAI, Codex, Claude, and Gemini. - Improved token usage breakdown across streaming and non-streaming responses.	2025-09-24 03:49:09 +08:00
Luis Pater	5090d9853b	feat(translators): improve system instruction extraction and input handling for OpenAI and Claude responses - Enhanced support for extracting system instructions from input arrays. - Improved input message role and type determination logic for consistent message processing. - Refined instruction handling logic across translator types for better compatibility.	2025-09-24 00:20:49 +08:00
Luis Pater	d41ff2076f	feat(translators): improve system instruction extraction and input handling for OpenAI and Claude responses - Enhanced support for extracting system instructions from input arrays. - Improved input message role and type determination logic for consistent message processing. - Refined instruction handling logic across translator types for better compatibility.	2025-09-23 23:12:34 +08:00
Luis Pater	11b0efc38f	feat(claude-executor): add ZSTD decoding support for Claude executor responses - Integrated ZSTD decompression via `github.com/klauspost/compress` for responses with "zstd" content-encoding. - Added helper `hasZSTDEcoding` to detect ZSTD-encoded responses. - Updated response handling logic to initialize and use a ZSTD decoder when necessary. refactor(api-handlers): split streaming and non-streaming response handling - Introduced `handleNonStreamingResponse` for processing non-streaming requests in `ClaudeCodeAPIHandler`. - Improved code clarity by separating streaming and non-streaming logic. fix(service): remove redundant token refresh interval assignment logic in `cliproxy` service.	2025-09-23 12:44:44 +08:00
Luis Pater	ac59023abb	feat(executor): add `CountTokens` support across all executors - Introduced `CountTokens` method to Codex, Claude, Gemini, Qwen, OpenAI-compatible, and other executors. - Implemented `ExecuteCount` in `AuthManager` for token counting via provider round-robin. - Updated handlers to leverage `ExecuteCountWithAuthManager` for streamlined token counting. - Added fallback and error handling logic for token counting requests.	2025-09-23 02:27:51 +08:00
Luis Pater	d32fc0400e	refactor(headers): centralize header logic using `EnsureHeader` utility - Introduced `EnsureHeader` in `internal/misc/header_utils.go` to streamline header setting across executors. - Updated Codex, Claude, and Gemini executors to utilize `EnsureHeader` for consistent header application. - Incorporated Gin context headers (if available) into request header manipulation for better integration.	2025-09-23 02:01:57 +08:00
Luis Pater	7ea88358f0	refactor(executor): centralize header application logic for executors - Replaced repetitive header setting logic with helper methods (`applyCodexHeaders`, `applyClaudeHeaders`, `applyQwenHeaders`) in Codex, Claude, and Qwen executors. - Ensured consistent headers in HTTP requests across all executors. - Introduced UUID and additional structured headers for better traceability (e.g., session IDs, metadata).	2025-09-23 01:20:10 +08:00
Luis Pater	c6b391304d	chore(executor): add debug logging for API request errors - Added detailed debug logs in all executors (Codex, Claude, Gemini, Qwen, OpenAI-compatible) to capture HTTP status and response body for failed API requests.	2025-09-22 23:37:53 +08:00
Luis Pater	2e836cee88	feat(auth): standardize `last_refresh` metadata handling across executors - Added `last_refresh` timestamp to metadata for Codex, Claude, Qwen, and Gemini executors. - Implemented `extractLastRefreshTimestamp` utility for parsing diverse timestamp formats in management handlers. - Ensured consistent update and preservation of `last_refresh` in file-based auth handling.	2025-09-22 23:23:31 +08:00
Luis Pater	837ae1b1b3	chore(logging): add debug logs for executor `Refresh` methods - Introduced `logrus` for structured debugging across all executors. - Added debug log messages in `Refresh` methods for better traceability. - Updated `Manager` to log additional details during refresh checks.	2025-09-22 20:03:31 +08:00

1 2

53 Commits