CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-02-02 20:40:52 +08:00

Author	SHA1	Message	Date
hkfires	8ce22b8403	fix(sse): preserve usage metadata for stop chunks	2025-11-22 12:50:23 +08:00
Luis Pater	d1cdedc4d1	Merge pull request #303 from router-for-me/image feat(translator): support image size and googleSearch tools	2025-11-22 11:20:58 +08:00
Luis Pater	d291eb9489	Fixed: #302 feat(executor): enhance WebSocket error handling and metadata logging - Added handling for stream closure before start with appropriate error recording. - Improved metadata logging for non-OK HTTP status codes in WebSocket responses. - Consolidated event processing logic with `processEvent` for better error handling and payload management. - Refactored stream initialization to include the first event handling for smoother execution flow.	2025-11-22 11:18:13 +08:00
hkfires	dc8d3201e1	feat(translator): support image size and googleSearch tools	2025-11-22 10:36:52 +08:00
Luis Pater	7757210af6	feat(auth): implement Antigravity OAuth authentication flow - Added new endpoint `/antigravity-auth-url` to initiate Antigravity authentication. - Implemented `RequestAntigravityToken` to manage the OAuth flow, including token exchange and user info retrieval. - Introduced `.oauth-antigravity` temporary file handling for state and code management. - Added `sanitizeAntigravityFileName` utility for safe token file names based on user email. - Registered `/antigravity/callback` endpoint for OAuth redirects.	2025-11-22 01:45:06 +08:00
Luis Pater	c1031e2d3f	feat(translator): add Antigravity translation logic - Introduced request and response translation functions to enable compatibility between OpenAI Chat Completions API and Antigravity. - Registered translation utilities for both streaming and non-streaming scenarios. - Added support for reasoning content, tool calls, and metadata handling. - Established request normalization and embedding for Antigravity-compatible payloads. - Added new fields to `Params` struct for better tracking of finish reasons, usage metadata, and tool usage. - Refactored handling of response transitions, final events, and state-driven logic in `ConvertAntigravityResponseToClaude`. - Introduced `appendFinalEvents` and `resolveStopReason` helper functions for cleaner separation of concerns. - Added `TotalTokenCount` field to `Params` struct for enhanced token tracking. - Updated token count calculations to fallback on `TotalTokenCount` when specific counts are missing. - Introduced `hasNonZeroUsageMetadata` function to validate presence of token data in `usage_metadata`.	2025-11-21 23:40:59 +08:00
hkfires	abc2465b29	fix(gemini-cli): ignore thoughtSignature and empty parts	2025-11-21 17:12:56 +08:00
hkfires	4ba5b43d82	feat(executor): share SSE usage filtering across streams	2025-11-21 16:51:05 +08:00
Luis Pater	2d84d2fb6a	feat(auth, executor, cmd): add Antigravity provider integration - Implemented OAuth login flow for the Antigravity provider in `auth/antigravity.go`. - Added `AntigravityExecutor` for handling requests and streaming via Antigravity APIs. - Created `antigravity_login.go` command for triggering Antigravity authentication. - Introduced OpenAI-to-Antigravity translation logic in `translator/antigravity/openai/chat-completions`. refactor(translator, executor): update Gemini CLI response translation and add Antigravity payload customization - Renamed Gemini CLI translation methods to align with response handling (`ConvertGeminiCliResponseToGemini` and `ConvertGeminiCliResponseToGeminiNonStream`). - Updated `init.go` to reflect these method changes. - Introduced `geminiToAntigravity` function to embed metadata (`model`, `userAgent`, `project`, etc.) into Antigravity payloads. - Added random project, request, and session ID generators for enhanced tracking. - Streamlined `buildRequest` to use `geminiToAntigravity` transformation before request execution.	2025-11-21 12:43:16 +08:00
Luis Pater	cbcfeb92cc	Fixed: #291 feat(executor): add thinking level to budget conversion utility - Introduced `ConvertThinkingLevelToBudget` to map thinking level ("high"/"low") to corresponding budget values. - Applied the utility in `aistudio_executor.go` before stripping unsupported configs. - Updated dependencies to include `tidwall/gjson` for JSON parsing.	2025-11-21 00:48:12 +08:00
Luis Pater	db81331ae8	refactor(middleware): extract request logging logic and optimize condition checks - Added `shouldLogRequest` helper to simplify path-based request logging logic. - Updated middleware to skip management endpoints for improved security. - Introduced an explicit `nil` logger check for minimal overhead. - Updated dependencies in `go.mod`. feat(auth): add handling for 404 response with retry logic - Introduced support for 404 `not_found` status with a 12-hour backoff period. - Updated `manager.go` to align state and status messages for 404 scenarios. refactor(translator): comment out debug logging in Gemini responses request	2025-11-20 23:20:40 +08:00
Luis Pater	9ff38dd785	Merge branch 'dev' into feat-amp-cli-module	2025-11-20 20:26:47 +08:00
Luis Pater	98596c0a3f	refactor(translator): remove `service_tier` from Codex OpenAI request payload	2025-11-20 20:12:06 +08:00
hkfires	3f4f8b3b2d	feat(iflow): add cookie-based authentication endpoint	2025-11-20 18:23:43 +08:00
Luis Pater	371324c090	feat(registry): expand Gemini model definitions and support Vertex AI	2025-11-20 18:16:26 +08:00
Luis Pater	d50b0f7524	refactor(executor): simplify Gemini CLI execution and remove internal retry logic - Removed nested retry handling for 429 rate limit errors. - Simplified request/response handling by cleaning redundant retry-related code. - Eliminated `parseRetryDelay` function and max retry configuration logic.	2025-11-20 17:49:37 +08:00
Ben Vargas	a6cb16bb48	security: fix localhost middleware header spoofing vulnerability Fix critical security vulnerability in amp-restrict-management-to-localhost feature where attackers could bypass localhost restriction by spoofing X-Forwarded-For headers. Changes: - Use RemoteAddr (actual TCP connection) instead of ClientIP() in localhostOnlyMiddleware to prevent header spoofing attacks - Add comprehensive test coverage for spoofing prevention (6 test cases) - Update documentation with reverse proxy deployment guidance and limitations of the RemoteAddr approach The fix prevents attacks like: curl -H "X-Forwarded-For: 127.0.0.1" https://server/api/user Trade-off: Users behind reverse proxies will need to disable the feature and use alternative security measures (firewall rules, proxy ACLs). Addresses security review feedback from PR #287.	2025-11-19 22:09:04 -07:00
Luis Pater	0586da9c2b	refactor(registry): move Gemini 3 Pro Preview model definition to base set	2025-11-20 10:51:16 +08:00
Ben Vargas	3d8d02bfc3	Fix amp v1beta1 routing and gemini retry config	2025-11-19 19:11:35 -07:00
Ben Vargas	7ae00320dc	fix(amp): enable OAuth fallback for Gemini v1beta1 routes AMP CLI sends Gemini requests to non-standard paths that were being directly proxied to ampcode.com without checking for local OAuth. This fix adds: - GeminiBridge handler to transform AMP CLI paths to standard format - Enhanced model extraction from AMP's /publishers/google/models/* paths - FallbackHandler wrapper to check for local OAuth before proxying Flow: - If user has local Google OAuth → use it (free tier) - If no local OAuth → fallback to ampcode.com (charges credits) Fixes issue where gemini-3-pro-preview requests always charged AMP credits even when user had valid Google Cloud OAuth configured.	2025-11-19 18:23:17 -07:00
Ben Vargas	72d82268e5	fix(amp): filter context-1m beta header for local OAuth providers Amp CLI sends 'context-1m-2025-08-07' in Anthropic-Beta header which requires a special 1M context window subscription. After upstream rebase to v6.3.7 (commit `38cfbac`), CLIProxyAPI now respects client-provided Anthropic-Beta headers instead of always using defaults. When users configure local OAuth providers (Claude, etc), requests bypass the ampcode.com proxy and use their own API subscriptions. These personal subscriptions typically don't include the 1M context beta feature, causing 'long context beta not available' errors. Changes: - Add filterBetaFeatures() helper to strip specific beta features - Filter context-1m-2025-08-07 in fallback handler when using local providers - Preserve full headers when proxying to ampcode.com (paid users get all features) - Add 7 test cases covering all edge cases This fix is isolated to the Amp module and only affects the local provider path. Users proxying through ampcode.com are unaffected and receive full 1M context support as part of their paid service.	2025-11-19 18:23:17 -07:00
Ben Vargas	8193392bfe	Add AMP fallback proxy and shared Gemini normalization - add fallback handler that forwards Amp provider requests to ampcode.com when the provider isn’t configured locally - wrap AMP provider routes with the fallback so requests always have a handler - share Gemini thinking model normalization helper between core handlers and AMP fallback	2025-11-19 18:23:17 -07:00
Ben Vargas	9ad0f3f91e	feat: Add Amp CLI integration with comprehensive documentation Add full Amp CLI support to enable routing AI model requests through the proxy while maintaining Amp-specific features like thread management, user info, and telemetry. Includes complete documentation and pull bot configuration. Features: - Modular architecture with RouteModule interface for clean integration - Reverse proxy for Amp management routes (thread/user/meta/ads/telemetry) - Provider-specific route aliases (/api/provider/{provider}/*) - Secret management with precedence: config > env > file - 5-minute secret caching to reduce file I/O - Automatic gzip decompression for responses - Proper connection cleanup to prevent leaks - Localhost-only restriction for management routes (configurable) - CORS protection for management endpoints Documentation: - Complete setup guide (USING_WITH_FACTORY_AND_AMP.md) - OAuth setup for OpenAI (ChatGPT Plus/Pro) and Anthropic (Claude Pro/Max) - Factory CLI config examples with all model variants - Amp CLI/IDE configuration examples - tmux setup for remote server deployment - Screenshots and diagrams Configuration: - Pull bot disabled for this repo (manual rebase workflow) - Config fields: AmpUpstreamURL, AmpUpstreamAPIKey, AmpRestrictManagementToLocalhost - Compatible with upstream DisableCooling and other features Technical details: - internal/api/modules/amp/: Complete Amp routing module - sdk/api/httpx/: HTTP utilities for gzip/transport - 94.6% test coverage with 34 comprehensive test cases - Clean integration minimizes merge conflict risk Security: - Management routes restricted to localhost by default - Configurable via amp-restrict-management-to-localhost - Prevents drive-by browser attacks on user data This provides a production-ready foundation for Amp CLI integration while maintaining clean separation from upstream code for easy rebasing. Amp-Thread-ID: https://ampcode.com/threads/T-9e2befc5-f969-41c6-890c-5b779d58cf18	2025-11-19 18:23:17 -07:00
Ben Vargas	0ff094b87f	fix(executor): prevent streaming on failed response when no fallback Fix critical bug where ExecuteStream would create a streaming channel from a failed (non-2xx) response after exhausting all retries with no fallback models available. When retries were exhausted on the last model, the code would break from the inner loop but fall through to streaming channel creation (line 401), immediately returning at line 461. This made the error handling code at lines 464-471 unreachable, causing clients to receive an empty/closed stream instead of a proper error response. Solution: Check if httpResp is non-2xx before creating the streaming channel. If failed, continue the outer loop to reach error handling. Identified by: codex-bot review Ref: https://github.com/router-for-me/CLIProxyAPI/pull/280#pullrequestreview-3484560423	2025-11-19 13:14:40 -07:00
Ben Vargas	ed23472d94	fix(executor): prevent streaming from 429 response when fallback available Fix critical bug where ExecuteStream would create a streaming channel using a 429 error response instead of continuing to the next fallback model after exhausting retries. When 429 retries were exhausted and a fallback model was available, the inner retry loop would break but immediately fall through to the streaming channel creation, attempting to stream from the failed 429 response instead of trying the next model. Solution: Add shouldContinueToNextModel flag to explicitly skip the streaming logic and continue the outer model loop when appropriate. Identified by: codex-bot review Ref: https://github.com/router-for-me/CLIProxyAPI/pull/280#pullrequestreview-3484479106	2025-11-19 13:05:38 -07:00
Ben Vargas	ede4471b84	feat(translator): add default thinkingConfig for gemini-3-pro-preview Match official Gemini CLI behavior by always sending default thinkingConfig when client doesn't specify reasoning parameters. - Set thinkingBudget=-1 (dynamic) for gemini-3-pro-preview - Set include_thoughts=true to return thinking process - Apply to both /v1/chat/completions and /v1/responses endpoints - See: ai-gemini-cli/packages/core/src/config/defaultModelConfigs.ts	2025-11-19 12:47:39 -07:00
Ben Vargas	6a3de3a89c	feat(executor): add intelligent retry logic for 429 rate limits Implement Google RetryInfo.retryDelay support for handling 429 rate limit errors. Retries same model up to 3 times using exact delays from Google's API before trying fallback models. - Add parseRetryDelay() to extract Google's retry guidance - Implement inner retry loop in Execute() and ExecuteStream() - Context-aware waiting with cancellation support - Cap delays at 60s maximum for safety	2025-11-19 12:47:39 -07:00
Ben Vargas	782bba0bc4	feat(registry): enable gemini-3-pro-preview for gemini-cli provider Add gemini-3-pro-preview model to GetGeminiCLIModels() to make it available for OAuth-based Gemini CLI users, matching the model already available in AI Studio provider. Model spec: - ID: gemini-3-pro-preview - Version: 3.0 - Input: 1M tokens - Output: 64K tokens - Thinking: 128-32K tokens (dynamic)	2025-11-19 12:47:39 -07:00
Luis Pater	bf116b68f8	feat(registry): add GPT-5.1 Codex Max model definitions and support - Introduced `gpt-5.1-codex-max` variants to model definitions (`low`, `medium`, `high`, `xhigh`). - Updated executor logic to map effort levels for Codex Max models. - Added `lastCodexMaxPrompt` processing for `gpt-5.1-codex-max` prompts. - Defined instructions for `gpt-5.1-codex-max` in a new file: `codex_instructions/gpt-5.1-codex-max_prompt.md`.	2025-11-20 03:12:22 +08:00
Luis Pater	cc3cf09c00	feat(auth): add AuthIndex for diagnostics and ensure usage recording	2025-11-19 22:02:40 +08:00
hkfires	b285b07986	fix(iflow): adjust auth filename email sanitization	2025-11-19 19:50:06 +08:00
hkfires	8a33f3ef69	fix: detect HTML error bodies without text/html content type	2025-11-19 14:45:33 +08:00
Luis Pater	7a8e00fcea	fix(translator): handle missing parameters in Gemini tool schema gracefully	2025-11-19 13:19:46 +08:00
Luis Pater	89771216a1	feat(translator): add ThoughtSignature handling in Gemini request transformations	2025-11-19 11:34:13 +08:00
Luis Pater	14ddfd4b79	Merge pull request #270 from router-for-me/iflow feat(auth): add iFlow cookie-based authentication support	2025-11-19 01:54:34 +08:00
Luis Pater	567227f35f	Merge pull request #268 from router-for-me/tools fix: use underscore suffix in short name mapping	2025-11-19 01:43:41 +08:00
Luis Pater	17016ae6a5	feat(registry): add Gemini 3 Pro Preview model definition	2025-11-18 23:48:21 +08:00
Luis Pater	01b7b60901	feat(registry): add Gemini 3 Pro Preview model definition	2025-11-18 23:46:58 +08:00
hkfires	b52a5cc066	feat(auth): add iFlow cookie-based authentication support	2025-11-18 22:35:35 +08:00
hkfires	1ba057112a	fix: use underscore suffix in short name mapping Replace the "~<n>" suffix with "_<n>" when generating unique short names in codex translators (Claude, Gemini, OpenAI chat). This avoids using a special character in identifiers, improving compatibility with downstream APIs while preserving length constraints.	2025-11-18 16:59:25 +08:00
Luis Pater	23a7633e6d	fix(registry): update Thinking parameters and replace Gemini-3 Preview with Gemini-2.5 Flash Lite	2025-11-18 11:51:52 +08:00
Luis Pater	e5e985978d	Fixed: #263 fix(translator): remove input_examples from tool schema in Gemini-Claude requests	2025-11-18 11:27:48 +08:00
Luis Pater	db2d22c978	fix(runtime): simplify scanner buffer allocation in executor implementations	2025-11-18 10:59:49 +08:00
Luis Pater	1c815c58a6	fix(translator): simplify string handling in Gemini responses	2025-11-16 19:02:27 +08:00
Luis Pater	4eab141410	feat(translator): add support for reasoning/thinking content blocks in OpenAI-Claude and Gemini responses	2025-11-16 17:37:39 +08:00
Luis Pater	5937b8e429	Fixed: #260 fix(translator): handle simple string input conversion in Gemini responses	2025-11-16 13:30:11 +08:00
Luis Pater	9875565339	fix(claude translator): ensure default token counts when usage data is missing	2025-11-16 13:18:21 +08:00
Luis Pater	faa483b57d	Merge pull request #257 from lollipopkit/main fix(claude translator): guard tool schema properties	2025-11-16 12:19:38 +08:00
Luis Pater	f0711be302	fix(auth): prevent access to removed credentials lingering in memory Add logic to avoid exposing credentials that have been removed from disk but still persist in memory. Ensure `runtimeOnly` checks and proper handling of disabled or removed authentication states.	2025-11-16 12:12:24 +08:00
Luis Pater	1d0f0301b4	refactor(api/config): centralize legacy OpenAI compatibility key migration Introduce `migrateLegacyOpenAICompatibilityKeys` to streamline and reuse the normalization of OpenAI compatibility entries. Remove redundant loops and enhance maintainability for compatibility key handling. Add cleanup for legacy `api-keys` in YAML configuration during persistence.	2025-11-16 11:39:35 +08:00

... 3 4 5 6 7 ...

709 Commits