CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-02-02 20:40:52 +08:00

Author	SHA1	Message	Date
Luis Pater	b84ccc6e7a	feat: add unit tests for routing strategies and implement dynamic selector updates Added comprehensive tests for `FillFirstSelector` and `RoundRobinSelector` to ensure proper behavior, including deterministic, cyclical, and concurrent scenarios. Introduced dynamic routing strategy updates in `service.go`, normalizing strategies and seamlessly switching between `fill-first` and `round-robin`. Updated `Manager` to support selector changes via the new `SetSelector` method.	2025-12-22 22:52:23 +08:00
gwizz	5bf89dd757	fix: keep streaming defaults legacy-safe	2025-12-23 00:53:18 +11:00
gwizz	4442574e53	fix: stop streaming loop on context cancel	2025-12-23 00:37:55 +11:00
gwizz	c020fa60d0	fix: keep round-robin as default routing	2025-12-22 23:39:41 +11:00
gwizz	b078be4613	feat: add fill-first routing strategy	2025-12-22 23:38:10 +11:00
gwizz	71a6dffbb6	fix: improve streaming bootstrap and forwarding	2025-12-22 23:34:23 +11:00
moxi	830fd8eac2	Fix responses-format handling for chat completions	2025-12-22 13:54:02 +08:00
Supra4E8C	cd0c94f48a	fix(sdk/auth): prevent OAuth manual prompt goroutine leak. Use timer-based manual prompt per provider and remove oauth_callback helper.	2025-12-21 07:06:28 +08:00
Supra4E8C	93414f1baa	feat(auth): CLI OAuth supports pasting callback URLs to complete login - Added callback URL resolution and terminal prompt logic - Codex/Claude/iFlow/Antigravity/Gemini login supports callback URL or local callback completion - Update Gemini login option signature and manager call - CLI default prompt function is compatible with null input to continue waiting	2025-12-20 18:25:55 +08:00
hkfires	c84ff42bcd	fix(amp): add /docs routes to proxy	2025-12-20 10:15:25 +08:00
Luis Pater	8a5db02165	Fixed: #607 refactor(config): re-export internal configuration types for SDK consumers	2025-12-20 04:49:02 +08:00
BigUncle	39597267ae	fix(auth): prevent token refresh loop by ignoring timestamp fields Add metadataEqualIgnoringTimestamps() function to compare metadata JSON without timestamp/expired/expires_in/last_refresh/access_token fields. This prevents unnecessary file writes when only these fields change during refresh, breaking the fsnotify event → Watcher callback → refresh loop. Key insight: Google OAuth returns a new access_token on each refresh, which was causing file writes and triggering the refresh loop. Fixes antigravity channel excessive log generation issue. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-12-18 21:37:05 +08:00
Luis Pater	670685139a	fix(api): update route patterns to support wildcards for Gemini actions Normalize action handling by accommodating wildcard patterns in route definitions for Gemini endpoints. Adjust `request.Action` parsing logic to correctly process routes with prefixed actions.	2025-12-17 01:17:02 +08:00
Luis Pater	52b6306388	feat(config): add support for model prefixes and prefix normalization Refactor model management to include an optional `prefix` field for model credentials, enabling better namespace handling. Update affected configuration files, APIs, and handlers to support prefix normalization and routing. Remove unused OpenAI compatibility provider logic to simplify processing.	2025-12-17 01:07:26 +08:00
hkfires	3bc489254b	fix(api): prevent double logging for error responses The WriteErrorResponse function now caches the error response body in the gin context. The deferred request logger checks for this cached response. If an error response is found, it bypasses the standard response logging. This prevents scenarios where an error is logged twice or an empty payload log overwrites the original, more detailed error log.	2025-12-15 16:36:01 +08:00
hkfires	4c07ea41c3	feat(api): return structured JSON error responses The API error handling is updated to return a structured JSON payload instead of a plain text message. This provides more context and allows clients to programmatically handle different error types. The new error response has the following structure: { "error": { "message": "...", "type": "..." } } The `type` field is determined by the HTTP status code, such as `authentication_error`, `rate_limit_error`, or `server_error`. If the underlying error message from an upstream service is already a valid JSON string, it will be preserved and returned directly. BREAKING CHANGE: API error responses are now in a structured JSON format instead of plain text. Clients expecting plain text error messages will need to be updated to parse the new JSON body.	2025-12-15 16:19:52 +08:00
hkfires	f26da24a2f	feat(auth): add proxy information to debug logs	2025-12-15 13:14:55 +08:00
Luis Pater	b6ad243e9e	Merge pull request #498 from teeverc/fix/claude-streaming-flush fix(claude): flush Claude SSE chunks immediately	2025-12-13 23:58:34 +08:00
hkfires	e7cedbee6e	fix(auth): prevent duplicate iflow BXAuth tokens	2025-12-12 19:57:19 +08:00
teeverc	5ab3032335	Update sdk/api/handlers/claude/code_handlers.go thank you gemini Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2025-12-12 00:26:01 -08:00
teeverc	1215c635a0	fix: flush Claude SSE chunks immediately to match OpenAI behavior - Write each SSE chunk directly to c.Writer and flush immediately - Remove buffered writer and ticker-based flushing that caused delayed output - Add 500ms timeout case for consistency with OpenAI/Gemini handlers - Clean up unused bufio import This fixes the 'not streaming' issue where small responses were held in the buffer until timeout/threshold was reached. Amp-Thread-ID: https://ampcode.com/threads/T-019b1186-164e-740c-96ab-856f64ee6bee Co-authored-by: Amp <amp@ampcode.com>	2025-12-12 00:14:19 -08:00
Luis Pater	6e2306a5f2	refactor(handlers): improve request logging and payload handling	2025-12-12 08:52:52 +08:00
hkfires	88bdd25f06	fix(amp): set status on claude stream errors	2025-12-11 20:12:06 +08:00
Luis Pater	423ce97665	feat(util): implement dynamic thinking suffix normalization and refactor budget resolution logic - Added support for parsing and normalizing dynamic thinking model suffixes. - Centralized budget resolution across executors and payload helpers. - Retired legacy Gemini-specific thinking handlers in favor of unified logic. - Updated executors to use metadata-based thinking configuration. - Added `ResolveOriginalModel` utility for resolving normalized upstream models using request metadata. - Updated executors (Gemini, Codex, iFlow, OpenAI, Qwen) to incorporate upstream model resolution and substitute model values in payloads and request URLs. - Ensured fallbacks handle cases with missing or malformed metadata to derive models robustly. - Refactored upstream model resolution to dynamically incorporate metadata for selecting and normalizing models. - Improved handling of thinking configurations and model overrides in executors. - Removed hardcoded thinking model entries and migrated logic to metadata-based resolution. - Updated payload mutations to always include the resolved model.	2025-12-11 03:10:50 +08:00
hkfires	347769b3e3	fix(openai-compat): use model id for auth model display	2025-12-09 18:09:14 +08:00
hkfires	da23ddb061	fix(gemini): normalize model listing output	2025-12-09 17:34:15 +08:00
vuonglv(Andy)	5c3a013cd1	feat(config): add configurable host binding for server (#454) * feat(config): add configurable host binding for server	2025-12-08 23:16:39 +08:00
Luis Pater	0ebabf5152	feat(antigravity): add FetchAntigravityProjectID function and integrate project ID retrieval	2025-12-06 01:32:12 +08:00
Luis Pater	c44c46dd80	Fixed: #421 feat(antigravity): implement project ID retrieval and integration in payload processing	2025-12-06 00:40:55 +08:00
Luis Pater	0fd2abbc3b	refactor(cliproxy, config): remove vertex-compat flow, streamline Vertex API key handling - Removed `vertex-compat` executor and related configuration. - Consolidated Vertex compatibility checks into `vertex` handling with `apikey`-based model resolution. - Streamlined model generation logic for Vertex API key entries.	2025-12-02 09:18:24 +08:00
Aero	0ebb654019	feat: Add support for VertexAI compatible service (#375) feat: consolidate Vertex AI compatibility with API key support in Gemini	2025-12-02 08:14:22 +08:00
Luis Pater	a748e93fd9	fix(executor, auth): ensure index assignment consistency for auth objects - Updated `usage_helpers.go` to call `EnsureIndex()` for proper index assignment in reporter initialization. - Adjusted `auth/manager.go` to assign auth indices inside a locked section when they are unassigned, ensuring thread safety and consistency.	2025-11-30 16:56:29 +08:00
hkfires	022aa81be1	feat(cliproxy): support wildcard exclusions for models	2025-11-30 08:02:00 +08:00
hkfires	c43f0ea7b1	refactor(config): rename model blacklist fields to excluded models	2025-11-29 21:23:47 +08:00
hkfires	6a191358af	fix(auth): fix runtime auth reload on oauth blacklist change	2025-11-29 20:30:11 +08:00
hkfires	5983e3ec87	feat(auth): add oauth provider model blacklist	2025-11-28 10:37:10 +08:00
hkfires	f8cebb9343	feat(config): add per-key model blacklist for providers	2025-11-27 21:57:07 +08:00
hkfires	6c17dbc4da	style(amp): tidy whitespace in proxy module and tests	2025-11-26 18:57:26 +08:00
Luis Pater	a4a26d978e	Fixed: #339 feat(handlers, executor): add Gemini 3 Pro Preview support and refine Claude system instructions - Added support for the new "Gemini 3 Pro Preview" action in Gemini handlers, including detailed metadata and configuration. - Removed redundant `cache_control` field from Claude system instructions for cleaner payload structure.	2025-11-26 11:42:57 +08:00
Luis Pater	506f1117dd	fix(handlers): refactor API response capture to append data safely - Introduced `appendAPIResponse` helper to preserve and append data to existing API responses. - Ensured newline inclusion when appending, if necessary. - Improved `nil` and data type checks for response handling. - Updated middleware to skip request logging for `GET` requests.	2025-11-25 11:37:02 +08:00
Luis Pater	bb9955e461	fix(auth): resolve index reassignment issue during auth management - Fixed improper handling of `indexAssigned` and `Index` during auth reassignment. - Ensured `EnsureIndex` is invoked after validating existing auth entries.	2025-11-24 10:10:09 +08:00
Luis Pater	7063a176f4	#293 feat(retry): add configurable retry logic with cooldown support - Introduced `max-retry-interval` configuration for cooldown durations between retries. - Added `SetRetryConfig` in `Manager` to handle retry attempts and cooldown intervals. - Enhanced provider execution logic to include retry attempts, cooldown management, and dynamic wait periods. - Updated API endpoints and YAML configuration to support `max-retry-interval`.	2025-11-24 09:55:15 +08:00
Luis Pater	327cc7039e	refactor(auth): use customizable HTTP client for Antigravity requests - Replaced `http.DefaultClient` with a configurable `http.Client` instance for Antigravity OAuth flow methods. - Updated `exchangeAntigravityCode` and `fetchAntigravityUserInfo` to accept `httpClient` as a parameter. - Added `util.SetProxy` usage to initialize the `httpClient` with proxy support.	2025-11-21 20:54:56 +08:00
hkfires	27faf718a3	fix(auth): use fixed antigravity callback port 51121	2025-11-21 13:56:33 +08:00
Luis Pater	2d84d2fb6a	feat(auth, executor, cmd): add Antigravity provider integration - Implemented OAuth login flow for the Antigravity provider in `auth/antigravity.go`. - Added `AntigravityExecutor` for handling requests and streaming via Antigravity APIs. - Created `antigravity_login.go` command for triggering Antigravity authentication. - Introduced OpenAI-to-Antigravity translation logic in `translator/antigravity/openai/chat-completions`. refactor(translator, executor): update Gemini CLI response translation and add Antigravity payload customization - Renamed Gemini CLI translation methods to align with response handling (`ConvertGeminiCliResponseToGemini` and `ConvertGeminiCliResponseToGeminiNonStream`). - Updated `init.go` to reflect these method changes. - Introduced `geminiToAntigravity` function to embed metadata (`model`, `userAgent`, `project`, etc.) into Antigravity payloads. - Added random project, request, and session ID generators for enhanced tracking. - Streamlined `buildRequest` to use `geminiToAntigravity` transformation before request execution.	2025-11-21 12:43:16 +08:00
Luis Pater	db81331ae8	refactor(middleware): extract request logging logic and optimize condition checks - Added `shouldLogRequest` helper to simplify path-based request logging logic. - Updated middleware to skip management endpoints for improved security. - Introduced an explicit `nil` logger check for minimal overhead. - Updated dependencies in `go.mod`. feat(auth): add handling for 404 response with retry logic - Introduced support for 404 `not_found` status with a 12-hour backoff period. - Updated `manager.go` to align state and status messages for 404 scenarios. refactor(translator): comment out debug logging in Gemini responses request	2025-11-20 23:20:40 +08:00
Luis Pater	9ff38dd785	Merge branch 'dev' into feat-amp-cli-module	2025-11-20 20:26:47 +08:00
Luis Pater	371324c090	feat(registry): expand Gemini model definitions and support Vertex AI	2025-11-20 18:16:26 +08:00
Luis Pater	d50b0f7524	refactor(executor): simplify Gemini CLI execution and remove internal retry logic - Removed nested retry handling for 429 rate limit errors. - Simplified request/response handling by cleaning redundant retry-related code. - Eliminated `parseRetryDelay` function and max retry configuration logic.	2025-11-20 17:49:37 +08:00
Ben Vargas	70ee4e0aa0	chore: remove unused httpx sdk package	2025-11-19 21:17:52 -07:00
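
Commit 4c07ea41c3 above describes the structured error payload `{ "error": { "message": "...", "type": "..." } }`, with `type` derived from the HTTP status code and upstream messages that are already valid JSON passed through unchanged. The following is a minimal sketch of that behaviour, not the repository's code: `errorTypeFor` and `buildErrorBody` are hypothetical names, and the specific status codes mapped to each label, as well as the fallback label, are assumptions.

```go
package main

import (
	"encoding/json"
	"fmt"
	"net/http"
)

// errorBody mirrors the shape quoted in the commit message:
// { "error": { "message": "...", "type": "..." } }
type errorBody struct {
	Error errorDetail `json:"error"`
}

type errorDetail struct {
	Message string `json:"message"`
	Type    string `json:"type"`
}

// errorTypeFor picks a type label from the HTTP status code.
// The three named labels come from the commit message; which status
// codes map to them and the fallback label are assumptions.
func errorTypeFor(status int) string {
	switch {
	case status == http.StatusUnauthorized:
		return "authentication_error"
	case status == http.StatusTooManyRequests:
		return "rate_limit_error"
	case status >= http.StatusInternalServerError:
		return "server_error"
	default:
		return "invalid_request_error" // hypothetical fallback
	}
}

// buildErrorBody (hypothetical helper) wraps msg in the structured payload,
// or returns it untouched when it is already valid JSON.
func buildErrorBody(status int, msg string) []byte {
	if json.Valid([]byte(msg)) {
		return []byte(msg)
	}
	body, err := json.Marshal(errorBody{
		Error: errorDetail{Message: msg, Type: errorTypeFor(status)},
	})
	if err != nil {
		// Marshalling two plain strings should not fail; fall back defensively.
		return []byte(`{"error":{"message":"internal error","type":"server_error"}}`)
	}
	return body
}

func main() {
	// A plain-text message gets wrapped in the structured payload.
	fmt.Println(string(buildErrorBody(http.StatusUnauthorized, "missing API key")))
	// An upstream message that is already JSON is preserved as-is.
	fmt.Println(string(buildErrorBody(http.StatusBadGateway, `{"error":{"message":"upstream down"}}`)))
}
```

Clients that previously matched on plain-text error bodies would, under this sketch, decode the body as JSON and branch on `error.type` instead.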

196 Commits
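
Commits b078be4613, c020fa60d0 and b84ccc6e7a in the list above introduce a `fill-first` routing strategy next to the default `round-robin`, plus a `SetSelector` method on `Manager` for switching at runtime. The sketch below illustrates the idea only. It reuses the type names from the commit messages, but every signature, the `Selector` interface, `normalizeStrategy`, `selectorFor`, and the reading of "fill-first" as "always take the first available candidate" are assumptions rather than the project's actual API.

```go
package main

import (
	"fmt"
	"strings"
	"sync"
)

// Selector chooses one credential ID from the available candidates.
// The interface shape is an assumption made for this sketch.
type Selector interface {
	Pick(candidates []string) (string, bool)
}

// RoundRobinSelector cycles through candidates; the mutex keeps the
// cursor consistent under concurrent calls.
type RoundRobinSelector struct {
	mu     sync.Mutex
	cursor int
}

func (s *RoundRobinSelector) Pick(candidates []string) (string, bool) {
	if len(candidates) == 0 {
		return "", false
	}
	s.mu.Lock()
	defer s.mu.Unlock()
	choice := candidates[s.cursor%len(candidates)]
	s.cursor = (s.cursor + 1) % len(candidates)
	return choice, true
}

// FillFirstSelector always returns the first candidate, so one credential
// is used until the caller stops listing it (assumed semantics).
type FillFirstSelector struct{}

func (FillFirstSelector) Pick(candidates []string) (string, bool) {
	if len(candidates) == 0 {
		return "", false
	}
	return candidates[0], true
}

// normalizeStrategy trims and lowercases the configured value; anything
// unrecognised falls back to round-robin, which the log says stays the default.
func normalizeStrategy(raw string) string {
	if strings.ToLower(strings.TrimSpace(raw)) == "fill-first" {
		return "fill-first"
	}
	return "round-robin"
}

// selectorFor builds a fresh selector for a (possibly unnormalized) strategy name.
func selectorFor(strategy string) Selector {
	if normalizeStrategy(strategy) == "fill-first" {
		return FillFirstSelector{}
	}
	return &RoundRobinSelector{}
}

// Manager holds the active selector and allows swapping it at runtime.
type Manager struct {
	mu       sync.RWMutex
	selector Selector
}

func (m *Manager) SetSelector(s Selector) {
	m.mu.Lock()
	defer m.mu.Unlock()
	m.selector = s
}

func (m *Manager) Pick(candidates []string) (string, bool) {
	m.mu.RLock()
	s := m.selector
	m.mu.RUnlock()
	if s == nil {
		return "", false
	}
	return s.Pick(candidates)
}

func main() {
	creds := []string{"a", "b", "c"}
	m := &Manager{}
	for _, strategy := range []string{"round-robin", " Fill-First "} {
		m.SetSelector(selectorFor(strategy))
		for i := 0; i < 4; i++ {
			id, _ := m.Pick(creds)
			fmt.Println(normalizeStrategy(strategy), id)
		}
	}
}
```

Holding the selector behind a read-write lock lets in-flight picks finish against the old strategy while a configuration reload installs the new one.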