CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-02-03 04:50:52 +08:00

Author	SHA1	Message	Date
hkfires	e370f86f63	fix(gemini-executor): uppercase responseModalities	2025-10-26 21:26:15 +08:00
hkfires	7f266aa19e	fix(aistudio): ensure colon-spaced JSON in responses	2025-10-26 20:21:45 +08:00
hkfires	f3f31274e8	refactor(wsrelay): rename RoundTrip to NonStream	2025-10-26 20:01:46 +08:00
hkfires	7061cd6058	fix(gemini): map responseModalities to uppercase IMAGE/TEXT	2025-10-26 19:35:22 +08:00
hkfires	7459c2c81a	fix(aistudio): remove generationConfig and tools when action is countTokens	2025-10-26 16:28:20 +08:00
hkfires	ea6065f1b1	fix(aistudio): strip usage metadata from non-final stream chunks	2025-10-26 07:46:04 +08:00
hkfires	c32e013605	feat(aistudio): track Gemini usage and improve stream errors	2025-10-26 07:46:04 +08:00
hkfires	3839d93ba0	feat: add websocket routing and executor unregister API - Introduce Server.AttachWebsocketRoute(path, handler) to mount websocket upgrade handlers on the Gin engine. - Track registered WS paths via wsRoutes with wsRouteMu to prevent duplicate registrations; initialize in NewServer and import sync. - Add Manager.UnregisterExecutor(provider) for clean executor lifecycle management. - Add github.com/gorilla/websocket v1.5.3 dependency and update go.sum. Motivation: enable services to expose WS endpoints through the core server and allow removing auth executors dynamically while avoiding duplicate route setup. No breaking changes.	2025-10-26 07:46:03 +08:00
Luis Pater	a552a45b81	Fixed: #140 #133 #80 feat(translator): add token counting functionality for Gemini, Claude, and CLI - Introduced `TokenCount` handling across various Codex translators (Gemini, Claude, CLI) with respective implementations. - Added utility methods for token counting and formatting responses. - Integrated `tiktoken-go/tokenizer` library for tokenization. - Updated CodexExecutor with token counting logic to support multiple models including GPT-5 variants. - Refined go.mod and go.sum to include new dependencies. feat(runtime): add token counting functionality across executors - Implemented token counting in OpenAICompatExecutor, QwenExecutor, and IFlowExecutor. - Added utilities for token counting and response formatting using `tiktoken-go/tokenizer`. - Integrated token counting into translators for Gemini, Claude, and Gemini CLI. - Enhanced multiple model support, including GPT-5 variants, for token counting. docs: update environment variable instructions for multi-model support - Added details for setting `ANTHROPIC_DEFAULT_OPUS_MODEL`, `ANTHROPIC_DEFAULT_SONNET_MODEL`, and `ANTHROPIC_DEFAULT_HAIKU_MODEL` for version 2.x.x. - Clarified usage of `ANTHROPIC_MODEL` and `ANTHROPIC_SMALL_FAST_MODEL` for version 1.x.x. - Expanded examples for setting environment variables across different models including Gemini, GPT-5, Claude, and Qwen3.	2025-10-26 05:39:15 +08:00
Luis Pater	e783923464	feat(executor): add debug logs for rate-limiting retries in Gemini CLI executor	2025-10-23 10:39:21 +08:00
Luis Pater	e6d7677373	docs: add GPT-5 Codex guidelines for internal usage - Added comprehensive instructions for Codex CLI harness, sandboxing, approvals, and editing constraints to `internal/misc/codex_instructions/`. - Clarified `approval_policy` configurations and scenarios requiring escalated permissions. - Provided detailed style and structure guidelines for presenting results in the Codex CLI.	2025-10-23 09:14:56 +08:00
Luis Pater	20985d1a10	Refactor executor error handling and usage reporting - Updated the Execute methods in various executors (GeminiCLIExecutor, GeminiExecutor, IFlowExecutor, OpenAICompatExecutor, QwenExecutor) to return a response and error as named return values for improved clarity. - Enhanced error handling by deferring failure tracking in usage reporters, ensuring that failures are reported correctly. - Improved response body handling by ensuring proper closure and error logging for HTTP responses across all executors. - Added failure tracking and reporting in the usage reporter to capture unsuccessful requests. - Updated the usage logging structure to include a 'Failed' field for better tracking of request outcomes. - Adjusted the logic in the RequestStatistics and Record methods to accommodate the new failure tracking mechanism.	2025-10-21 11:22:24 +08:00
Luis Pater	eadccb229f	Fixed: #148 feat(executor): add initial cache_helpers.go file	2025-10-20 10:17:29 +08:00
hkfires	4504ba5329	feat(iflow): add masked token logs; increase refresh lead to 24h	2025-10-19 10:56:29 +08:00
hkfires	9f45806106	feat(logging): centralize sensitive header masking	2025-10-18 17:16:00 +08:00
Luis Pater	3dd0844b98	Enhance logging for API requests and responses across executors - Added detailed logging of upstream request metadata including URL, method, headers, and body for Codex, Gemini, IFlow, OpenAI Compat, and Qwen executors. - Implemented error logging for API response failures to capture errors during HTTP requests. - Introduced structured logging for authentication details (AuthID, AuthLabel, AuthType, AuthValue) to improve traceability. - Updated response logging to include status codes and headers for better debugging. - Ensured that all executors consistently log API interactions to facilitate monitoring and troubleshooting.	2025-10-17 04:12:38 +08:00
Luis Pater	ade279d1f2	Feature: #103 feat(gemini): add Gemini thinking configuration support and metadata normalization - Introduced logic to parse and apply `thinkingBudget` and `include_thoughts` configurations from metadata. - Enhanced request handling to include normalized Gemini model metadata, preserving the original model identifier. - Updated Gemini and Gemini-CLI executors to apply thinking configuration based on metadata overrides. - Refactored handlers to support metadata extraction and cloning during request preparation.	2025-10-16 11:31:18 +08:00
Adamcf	15981aa412	fix: add Claude→Claude passthrough to prevent SSE event fragmentation When from==to (Claude→Claude scenario), directly forward SSE stream line-by-line without invoking TranslateStream. This preserves the multi-line SSE event structure (event:/data:/blank) and prevents JSON parsing errors caused by event fragmentation. Resolves: JSON parsing error when using Claude Code streaming responses fix: correct SSE event formatting in Handler layer Remove duplicate newline additions (\n\n) that were breaking SSE event format. The Executor layer already provides properly formatted SSE chunks with correct line endings, so the Handler should forward them as-is without modification. Changes: - Remove redundant \n\n addition after each chunk - Add len(chunk) > 0 check before writing - Format error messages as proper SSE events (event: error\ndata: {...}\n\n) - Add chunkIdx counter for future debugging needs This fixes JSON parsing errors caused by malformed SSE event streams. fix: update comments for clarity in SSE event forwarding	2025-10-15 22:13:44 +08:00
Luis Pater	32a8102d71	feat(usage): add support for tracking request source in usage records - Introduced `Source` field to usage-related structs for better origin tracking. - Updated `newUsageReporter` to resolve and populate the `Source` attribute. - Implemented `resolveUsageSource` to determine source from auth metadata or API key.	2025-10-14 02:11:43 +08:00
hkfires	b895018ff5	refactor(provider): remove Gemini Web cookie-based provider	2025-10-11 12:53:03 +08:00
Luis Pater	20787cd107	feat(registry, executor, util): add support for `gemini-2.5-flash-image-preview` and improve aspect ratio handling - Introduced `gemini-2.5-flash-image-preview` model to the registry with updated definitions. - Enhanced Gemini CLI and API executors to handle image aspect ratio adjustments for the new model. - Added utility function to create base64 white image placeholders based on aspect ratio configurations.	2025-10-10 01:49:58 +08:00
Luis Pater	b2cdbbdd47	feat(registry, executor): add support for `glm-4.6` model and enhance Gemini CLI token handling - Added `glm-4.6` model to registry and documentation. - Updated Gemini CLI executor to pass configuration to `prepareGeminiCLITokenSource` for improved token management.	2025-10-09 20:57:18 +08:00
Luis Pater	d45ebff66b	feat(registry, executor): add support for `gemini-2.5-flash-image` model - Introduced `gemini-2.5-flash-image` model with updated definitions in registry. - Enhanced model marker detection in Gemini CLI executor to include support for the new model.	2025-10-09 10:06:10 +08:00
hkfires	9bb7df7af7	feat(gemini-web): Enable config hot-reload and fix Gem selection	2025-10-07 20:23:33 +08:00
hkfires	c62ecc2442	fix(gemini): Disable thinking config for incompatible models	2025-10-06 16:32:03 +08:00
Luis Pater	bbdd68a8b4	feat(registry/runtime): add Gemini 2.5 model and increase buffer sizes - Added new "Gemini 2.5 Flash Image Preview" model definition, with enhanced image generation capabilities. - Increased scanner buffer size to 20,971,520 bytes across executors and translators to handle larger payloads.	2025-10-06 04:44:45 +08:00
hkfires	c8029b7166	feat(iflow): Add User-Agent header to API requests	2025-10-05 18:50:35 +08:00
hkfires	b839e351c4	feat: Add support for iFlow provider	2025-10-05 15:51:09 +08:00
Luis Pater	2eef6875e9	feat(auth): improve OpenAI compatibility normalization and API key handling - Refined trimming and normalization logic for `baseURL` and `apiKey` attributes. - Updated `Authorization` header logic to omit empty API keys. - Enhanced compatibility processing by handling empty `api-key-entries`. - Improved legacy format fallback and added safeguards for empty credentials across executor paths.	2025-10-03 02:38:30 +08:00
Luis Pater	12c09f1a46	feat(runtime): remove `previous_response_id` from Codex executor request body - Implemented logic to delete `previous_response_id` property from the request body in Codex executor. - Applied changes consistently across relevant Codex executor paths.	2025-10-02 12:00:06 +08:00
hkfires	8858e07d8b	feat(gemini-web): Add support for custom auth labels	2025-09-30 12:21:51 +08:00
hkfires	82187bffba	feat(gemini-web): Add conversation affinity selector	2025-09-30 12:21:51 +08:00
Luis Pater	832268cae7	refactor(proxy): improve SOCKS5 proxy authentication handling - Added nil check for proxy user credentials to prevent potential nil pointer dereference. - Enhanced authentication logic for SOCKS5 proxies in `proxy_helpers.go` and `proxy.go`.	2025-09-30 11:23:39 +08:00
Luis Pater	de796ac1c2	feat(runtime): introduce `newProxyAwareHTTPClient` for enhanced proxy handling - Added `newProxyAwareHTTPClient` to centralize proxy configuration with priority on `auth.ProxyURL` and `cfg.ProxyURL`. - Integrated enhanced proxy support across executors for HTTP, HTTPS, and SOCKS5 protocols. - Refactored redundant HTTP client initialization to use `newProxyAwareHTTPClient` for consistent behavior.	2025-09-30 09:04:15 +08:00
Luis Pater	352a67857b	refactor(runtime): move `Anthropic-Beta` header setting to `applyClaudeHeaders` for better header management	2025-09-29 20:51:36 +08:00
hkfires	562a49a194	feat(provider/gemini-web): Prioritize explicit label for account identification	2025-09-27 10:56:15 +08:00
Luis Pater	57c9ba49f4	refactor(config): migrate to `SDKConfig` and streamline proxy handling - Replaced `config.Config` with `config.SDKConfig` across components for simpler configuration management. - Updated proxy setup functions and handlers to align with `SDKConfig` improvements. - Reorganized handler imports to match new SDK structure.	2025-09-27 04:50:23 +08:00
Luis Pater	514add4b85	refactor(executor): remove redundant handling of "reasoning.effort" in gpt-5 and gpt-5-codex models	2025-09-26 18:13:28 +08:00
hkfires	2175a10932	feat(gemini-web): Introduce stable account label for identification	2025-09-25 10:59:20 +08:00
Luis Pater	f5dc380b63	rebuild branch	2025-09-25 10:32:48 +08:00
Luis Pater	3f69254f43	remove all	2025-09-25 10:31:02 +08:00
Luis Pater	0db0b03db9	chore(docs): add and refine package-level comments across modules - Added detailed package-level comments to improve documentation coverage. - Clarified parameter descriptions, return types, and functionality of exported methods across packages. - Enhanced overall code readability and API documentation consistency.	2025-09-25 00:14:17 +08:00
Luis Pater	48bbd9e214	fix(gemini): handle "[DONE]" chunk, trim "data:" prefix, and remove session_id from requests - Adjusted stream handling to skip "[DONE]" chunks. - Ensured "data:" prefix is trimmed for non-prefixed input in translation. - Removed `session_id` from request bodies before processing.	2025-09-24 23:34:46 +08:00
hkfires	d4f5ec2492	Removed the cookie snapshot feature.	2025-09-24 22:12:29 +08:00
hkfires	e9707c2f9e	refactor(gemini-web): Move provider logic to its own package The Gemini Web API client logic has been relocated from `internal/client/gemini-web` to a new, more specific `internal/provider/gemini-web` package. This refactoring improves code organization and modularity by better isolating provider-specific implementations. As a result of this move, the `GeminiWebState` struct and its methods have been exported (capitalized) to make them accessible from the executor. All call sites have been updated to use the new package path and the exported identifiers.	2025-09-24 22:12:29 +08:00
Luis Pater	a2c5fdaf66	refactor(executor): remove ClientAdapter and legacy fallback logic - Deleted `ClientAdapter` implementation and associated fallback methods. - Removed legacy executor logic from `codex`, `claude`, `gemini`, and `qwen` executors. - Simplified `handlers` by eliminating `UnwrapError` handling and related dependencies. - Cleaned up `model_registry` by removing logic associated with suspended clients. - Updated `.gitignore` to ignore `.serena/` directory.	2025-09-24 21:09:36 +08:00
hkfires	b86ed46845	fix(codex): Remove reasoning.effort for default gpt-5-codex model	2025-09-24 13:17:19 +08:00
Luis Pater	3dd5095792	feat(translators): add token counting support for Claude and Gemini responses - Implemented `TokenCount` transform method across translators to calculate token usage. - Integrated token counting logic into executor pipelines for Claude, Gemini, and CLI translators. - Added corresponding API endpoints and handlers (`/messages/count_tokens`) for token usage retrieval. - Enhanced translation registry to support `TokenCount` functionality alongside existing response types.	2025-09-24 11:59:38 +08:00
Luis Pater	3ade03f3b3	feat(usage): implement usage tracking infrastructure across executors - Added `LoggerPlugin` to log usage metrics for observability. - Introduced a new `Manager` to handle usage record queuing and plugin registration. - Integrated new usage reporter and detailed metrics parsing into executors, covering providers like OpenAI, Codex, Claude, and Gemini. - Improved token usage breakdown across streaming and non-streaming responses.	2025-09-24 03:49:09 +08:00
Luis Pater	5090d9853b	feat(translators): improve system instruction extraction and input handling for OpenAI and Claude responses - Enhanced support for extracting system instructions from input arrays. - Improved input message role and type determination logic for consistent message processing. - Refined instruction handling logic across translator types for better compatibility.	2025-09-24 00:20:49 +08:00

1 2

71 Commits