Fix a critical bug where ExecuteStream created a streaming channel
from a 429 error response instead of continuing to the next fallback
model after exhausting retries.
When 429 retries were exhausted and a fallback model was available,
the inner retry loop would break but immediately fall through to the
streaming channel creation, attempting to stream from the failed 429
response instead of trying the next model.
Solution: Add a shouldContinueToNextModel flag that explicitly skips the
streaming logic and continues the outer model loop when appropriate
(sketched below).
Identified by: codex-bot review
Ref: https://github.com/router-for-me/CLIProxyAPI/pull/280#pullrequestreview-3484479106
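A minimal Go sketch of the fixed control flow. The names `tryStream`, `retriesPerModel`, and the loop shape are illustrative assumptions, not the actual CLIProxyAPI code:

```go
package main

import (
	"errors"
	"fmt"
)

var errRateLimited = errors.New("429 rate limited")

// tryStream stands in for one streaming attempt against a model.
// In this demo every attempt is rate limited.
func tryStream(model string, attempt int) error {
	return errRateLimited
}

func executeStream(models []string, retriesPerModel int) error {
	for _, model := range models {
		shouldContinueToNextModel := false
		var lastErr error
		for attempt := 0; attempt < retriesPerModel; attempt++ {
			lastErr = tryStream(model, attempt)
			if lastErr == nil {
				break
			}
			if errors.Is(lastErr, errRateLimited) && attempt == retriesPerModel-1 {
				// Retries exhausted: skip the streaming logic below and try
				// the next fallback model instead of falling through with
				// the failed 429 response.
				shouldContinueToNextModel = true
			}
		}
		if shouldContinueToNextModel {
			continue
		}
		if lastErr != nil {
			return lastErr
		}
		fmt.Println("streaming from", model) // create the streaming channel here
		return nil
	}
	return errors.New("all models exhausted")
}

func main() {
	fmt.Println(executeStream([]string{"gemini-3-pro-preview", "gemini-2.5-pro"}, 3))
}
```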
Match the official Gemini CLI behavior by always sending a default
thinkingConfig when the client doesn't specify reasoning parameters
(see the sketch after this list).
- Set thinkingBudget=-1 (dynamic) for gemini-3-pro-preview
- Set include_thoughts=true to return thinking process
- Apply to both /v1/chat/completions and /v1/responses endpoints
- See: ai-gemini-cli/packages/core/src/config/defaultModelConfigs.ts
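A hedged sketch of the default injection on a generic JSON payload map. The values come from the commit text, but the helper and the exact payload shape are assumptions:

```go
package main

import (
	"encoding/json"
	"fmt"
)

// applyDefaultThinkingConfig mirrors the described behavior: when the client
// did not specify any reasoning parameters, send the Gemini CLI defaults.
// The payload shape here is an assumption for illustration.
func applyDefaultThinkingConfig(payload map[string]any) {
	if _, ok := payload["thinkingConfig"]; ok {
		return // client supplied its own reasoning parameters
	}
	payload["thinkingConfig"] = map[string]any{
		"thinkingBudget":   -1,   // -1 = dynamic budget for gemini-3-pro-preview
		"include_thoughts": true, // return the thinking process to the client
	}
}

func main() {
	payload := map[string]any{"model": "gemini-3-pro-preview"}
	applyDefaultThinkingConfig(payload)
	b, _ := json.MarshalIndent(payload, "", "  ")
	fmt.Println(string(b))
}
```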
Implement support for Google's RetryInfo.retryDelay when handling 429
rate-limit errors. Retry the same model up to 3 times using the exact
delays from Google's API before trying fallback models (see the sketch
after this list).
- Add parseRetryDelay() to extract Google's retry guidance
- Implement inner retry loop in Execute() and ExecuteStream()
- Context-aware waiting with cancellation support
- Cap delays at 60s maximum for safety
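A sketch of the retry-delay handling, assuming the 429 body carries a `google.rpc.RetryInfo` detail with a `retryDelay` duration string; the real `parseRetryDelay` may differ:

```go
package main

import (
	"context"
	"encoding/json"
	"fmt"
	"time"
)

const maxRetryDelay = 60 * time.Second

// parseRetryDelay extracts RetryInfo's retryDelay (e.g. "14s") from a 429
// error body. The JSON shape below is a simplified assumption.
func parseRetryDelay(body []byte) (time.Duration, bool) {
	var resp struct {
		Error struct {
			Details []struct {
				Type       string `json:"@type"`
				RetryDelay string `json:"retryDelay"`
			} `json:"details"`
		} `json:"error"`
	}
	if err := json.Unmarshal(body, &resp); err != nil {
		return 0, false
	}
	for _, d := range resp.Error.Details {
		if d.Type == "type.googleapis.com/google.rpc.RetryInfo" && d.RetryDelay != "" {
			if delay, err := time.ParseDuration(d.RetryDelay); err == nil {
				if delay > maxRetryDelay {
					delay = maxRetryDelay // cap at 60s for safety
				}
				return delay, true
			}
		}
	}
	return 0, false
}

// waitRetry sleeps for the delay but aborts early if ctx is cancelled.
func waitRetry(ctx context.Context, delay time.Duration) error {
	select {
	case <-time.After(delay):
		return nil
	case <-ctx.Done():
		return ctx.Err()
	}
}

func main() {
	body := []byte(`{"error":{"details":[{"@type":"type.googleapis.com/google.rpc.RetryInfo","retryDelay":"14s"}]}}`)
	if delay, ok := parseRetryDelay(body); ok {
		fmt.Println("retrying after", delay)
		_ = waitRetry(context.Background(), time.Millisecond) // shortened for the demo
	}
}
```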
Add the gemini-3-pro-preview model to GetGeminiCLIModels() to make it
available for OAuth-based Gemini CLI users, matching the model
already available in the AI Studio provider.
Model spec (see the sketch after this list):
- ID: gemini-3-pro-preview
- Version: 3.0
- Input: 1M tokens
- Output: 64K tokens
- Thinking: 128-32K tokens (dynamic)
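For illustration, the spec expressed as a registry entry; `ModelSpec` and its field names are hypothetical stand-ins for the actual registry type:

```go
package main

import "fmt"

// ModelSpec is a hypothetical stand-in for the registry's model entry type.
type ModelSpec struct {
	ID                string
	Version           string
	InputTokenLimit   int
	OutputTokenLimit  int
	MinThinkingTokens int
	MaxThinkingTokens int
	DynamicThinking   bool
}

func main() {
	spec := ModelSpec{
		ID:                "gemini-3-pro-preview",
		Version:           "3.0",
		InputTokenLimit:   1_048_576, // 1M tokens
		OutputTokenLimit:  65_536,    // 64K tokens
		MinThinkingTokens: 128,
		MaxThinkingTokens: 32_768, // dynamic between 128 and 32K
		DynamicThinking:   true,
	}
	fmt.Printf("%+v\n", spec)
}
```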
- Introduced `gpt-5.1-codex-max` variants to model definitions (`low`, `medium`, `high`, `xhigh`).
- Updated executor logic to map effort levels for Codex Max models (see the sketch after this list).
- Added `lastCodexMaxPrompt` processing for `gpt-5.1-codex-max` prompts.
- Defined instructions for `gpt-5.1-codex-max` in a new file: `codex_instructions/gpt-5.1-codex-max_prompt.md`.
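A small sketch of the suffix-to-effort mapping; the helper name and the no-suffix default are assumptions:

```go
package main

import (
	"fmt"
	"strings"
)

// reasoningEffortForCodexMax maps a gpt-5.1-codex-max variant name to the
// reasoning effort sent upstream. The variant names come from the commit;
// the helper itself is illustrative.
func reasoningEffortForCodexMax(model string) (base, effort string) {
	for _, level := range []string{"low", "medium", "high", "xhigh"} {
		if strings.HasSuffix(model, "-"+level) {
			return strings.TrimSuffix(model, "-"+level), level
		}
	}
	return model, "medium" // assumed default when no suffix is present
}

func main() {
	for _, m := range []string{"gpt-5.1-codex-max-low", "gpt-5.1-codex-max-xhigh", "gpt-5.1-codex-max"} {
		base, effort := reasoningEffortForCodexMax(m)
		fmt.Println(m, "->", base, "effort:", effort)
	}
}
```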
Replace the "~<n>" suffix with "_<n>" when generating unique short names in codex translators (Claude, Gemini, OpenAI chat).
This avoids using a special character in identifiers, improving compatibility with downstream APIs while preserving length constraints.
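A sketch of the renamed suffixing, assuming a simple per-collision counter and a length cap; the real translators track seen names per request:

```go
package main

import (
	"fmt"
	"strconv"
)

// uniqueShortName appends "_<n>" (previously "~<n>") to deduplicate a short
// name, truncating the base so the result stays within maxLen.
func uniqueShortName(base string, n, maxLen int) string {
	suffix := "_" + strconv.Itoa(n)
	if len(base)+len(suffix) > maxLen {
		base = base[:maxLen-len(suffix)]
	}
	return base + suffix
}

func main() {
	fmt.Println(uniqueShortName("get_weather_forecast_for_location", 2, 24))
}
```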
Add logic to avoid exposing credentials that have been removed from disk but still persist in memory. Ensure `runtimeOnly` checks and proper handling of disabled or removed authentication states.
Introduce `migrateLegacyOpenAICompatibilityKeys` to streamline and reuse the normalization of OpenAI compatibility entries. Remove redundant loops and enhance maintainability for compatibility key handling. Add cleanup for legacy `api-keys` in YAML configuration during persistence.
Extract reasoning effort mapping into a reusable function `setReasoningEffortByAlias` to reduce redundancy and improve maintainability. Introduce support for the "gpt-5.1-none" variant in the registry and runtime executor.
Stop advertising and mapping the unsupported gpt-5.1-minimal variant in the model registry and Codex executor. Align bare gpt-5.1 requests to use medium reasoning effort, matching Codex CLI, while preserving minimal for gpt-5.
Expand executor logic to handle GPT-5.1 Codex family and its variants, including reasoning effort configurations for minimal, low, medium, and high levels. Ensure proper mapping of models to payload parameters.
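A combined sketch of the alias handling described in the last three entries; the table contents and the fallback are assumptions, not the registry's actual mapping:

```go
package main

import "fmt"

// setReasoningEffortByAlias resolves a model alias such as "gpt-5.1-none" or
// "gpt-5.1-codex-high" to its base model plus reasoning effort.
func setReasoningEffortByAlias(alias string) (model, effort string) {
	aliases := map[string][2]string{
		"gpt-5.1-none":       {"gpt-5.1", "none"},
		"gpt-5.1-low":        {"gpt-5.1", "low"},
		"gpt-5.1-high":       {"gpt-5.1", "high"},
		"gpt-5.1":            {"gpt-5.1", "medium"}, // bare gpt-5.1 -> medium, like Codex CLI
		"gpt-5-minimal":      {"gpt-5", "minimal"},  // minimal preserved for gpt-5 only
		"gpt-5.1-codex-high": {"gpt-5.1-codex", "high"},
	}
	if m, ok := aliases[alias]; ok {
		return m[0], m[1]
	}
	return alias, "medium"
}

func main() {
	for _, a := range []string{"gpt-5.1", "gpt-5.1-none", "gpt-5.1-codex-high"} {
		m, e := setReasoningEffortByAlias(a)
		fmt.Println(a, "->", m, "effort:", e)
	}
}
```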
Introduce `PayloadConfig` in the configuration to define default and override rules for modifying payload parameters. Implement `applyPayloadConfig` and `applyPayloadConfigWithRoot` to apply these rules across various executors, ensuring consistent parameter handling for different models and protocols. Update all relevant executors to utilize this functionality.
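A sketch of how such a config could be applied, assuming "default" rules only fill missing keys while "override" rules always win; all names here are illustrative:

```go
package main

import "fmt"

// PayloadRule and PayloadConfig are assumed shapes based on the commit text.
type PayloadRule struct {
	Models []string       // model names the rule applies to
	Params map[string]any // payload parameters to set
}

type PayloadConfig struct {
	Default  []PayloadRule
	Override []PayloadRule
}

func matches(rule PayloadRule, model string) bool {
	for _, m := range rule.Models {
		if m == model {
			return true
		}
	}
	return false
}

// applyPayloadConfig applies defaults (only for missing keys) and then
// overrides (unconditionally) to a request payload.
func applyPayloadConfig(cfg PayloadConfig, model string, payload map[string]any) {
	for _, rule := range cfg.Default {
		if !matches(rule, model) {
			continue
		}
		for k, v := range rule.Params {
			if _, ok := payload[k]; !ok {
				payload[k] = v
			}
		}
	}
	for _, rule := range cfg.Override {
		if !matches(rule, model) {
			continue
		}
		for k, v := range rule.Params {
			payload[k] = v
		}
	}
}

func main() {
	cfg := PayloadConfig{
		Default:  []PayloadRule{{Models: []string{"gpt-5.1"}, Params: map[string]any{"temperature": 1.0}}},
		Override: []PayloadRule{{Models: []string{"gpt-5.1"}, Params: map[string]any{"reasoning_effort": "medium"}}},
	}
	payload := map[string]any{"model": "gpt-5.1", "temperature": 0.2}
	applyPayloadConfig(cfg, "gpt-5.1", payload)
	fmt.Println(payload) // temperature kept at 0.2; reasoning_effort forced to medium
}
```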
feat(runtime): add support for GPT-5.1 models and variants
Introduce GPT-5.1 model family, including minimal, low, medium, high, Codex, and Codex Mini variants. Update tokenization and reasoning effort handling to accommodate new models in executor and registry.
Refine `handleEvent` to support additional file system operations (Rename, Remove) for config and auth JSON files. Improve client update/removal logic with atomic file replacement handling and incremental processing for auth changes.
Introduce a new `buildinfo` package to store version, commit, and build date metadata. Update HTTP handlers to include build metadata in response headers and modify initialization to set `buildinfo` values during runtime.
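A minimal sketch of the pattern with hypothetical header names; real builds would inject the values via `-ldflags` at link time:

```go
package main

import (
	"fmt"
	"net/http"
	"net/http/httptest"
)

// Build metadata; the real code keeps these in a dedicated buildinfo package
// and sets them during initialization, e.g.
//   go build -ldflags "-X main.Version=v1.2.3 -X main.Commit=abc1234"
var (
	Version   = "dev"
	Commit    = "unknown"
	BuildDate = "unknown"
)

// withBuildHeaders attaches build metadata to every response.
func withBuildHeaders(next http.Handler) http.Handler {
	return http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		w.Header().Set("X-Build-Version", Version)
		w.Header().Set("X-Build-Commit", Commit)
		w.Header().Set("X-Build-Date", BuildDate)
		next.ServeHTTP(w, r)
	})
}

func main() {
	h := withBuildHeaders(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		fmt.Fprintln(w, "ok")
	}))
	rec := httptest.NewRecorder()
	h.ServeHTTP(rec, httptest.NewRequest("GET", "/", nil))
	fmt.Println(rec.Header().Get("X-Build-Version"))
}
```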
Introduce support for multi-project Gemini CLI logins, including shared and virtual credential management. Enhance runtime, metadata handling, and token updates for better project granularity and consistency across virtual and shared credentials. Extend onboarding to allow activating all available projects.
Extend `resolveUsageSource` to support Vertex projects by extracting and normalizing `project_id` or `project` from the metadata for accurate source resolution.
Extend `vertexAccessToken` to support proxy-aware HTTP clients and update calls accordingly for better configurability. Add `deleteTokenRecord` to handle token cleanup, improving management of authentication files.
- Add a 'created' field to the model registry for tracking model creation time
- Implement GetFirstAvailableModel() to return the newest available model by creation timestamp
- Add a ResolveAutoModel() utility function to resolve the "auto" model name to an actual available model
- Update the request handler to resolve "auto" before processing requests
- Ensures automatic model selection when "auto" is specified as the model name
This enables dynamic model selection based on availability and creation time, improving the user experience when no specific model is requested (see the sketch below).
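A sketch of the selection logic; the types and function bodies are illustrative, not the registry's actual code:

```go
package main

import (
	"fmt"
	"sort"
)

// Model is a hypothetical registry entry carrying the new 'created' field.
type Model struct {
	ID        string
	Created   int64 // unix timestamp of model creation
	Available bool
}

// GetFirstAvailableModel returns the available model with the newest
// creation timestamp, mirroring the described selection rule.
func GetFirstAvailableModel(models []Model) (Model, bool) {
	sort.Slice(models, func(i, j int) bool { return models[i].Created > models[j].Created })
	for _, m := range models {
		if m.Available {
			return m, true
		}
	}
	return Model{}, false
}

// ResolveAutoModel maps the literal name "auto" to a concrete model ID.
func ResolveAutoModel(requested string, models []Model) string {
	if requested != "auto" {
		return requested
	}
	if m, ok := GetFirstAvailableModel(models); ok {
		return m.ID
	}
	return requested // no available model; leave "auto" for the handler to reject
}

func main() {
	models := []Model{
		{ID: "gemini-2.5-pro", Created: 1717000000, Available: true},
		{ID: "gemini-3-pro-preview", Created: 1731000000, Available: true},
	}
	fmt.Println(ResolveAutoModel("auto", models)) // gemini-3-pro-preview
}
```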
feat(management): add auth ID normalization and file-based ID resolution
Introduce `authIDForPath` to standardize ID generation from file paths, improving consistency in authentication handling. Update `registerAuthFromFile` and `disableAuth` to utilize normalized IDs, incorporating relative path resolution and file name extraction where applicable.
Enhance error management for file operations and clean up temporary files. Add `NormalizeCommentIndentation` function to ensure YAML comments maintain consistent formatting.
Introduce an endpoint for importing Vertex service account JSON keys and storing them as authentication records. Add handlers for managing WebSocket authentication configuration.
Introduce `scheduleConfigReload` with debounce functionality for config reloads, ensuring efficient handling of frequent changes. Add `stopConfigReloadTimer` to stop pending timers during watcher shutdown (see the sketch below).
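A sketch of the debounce pattern, assuming a 500ms window; the method names match the commit but the implementation is illustrative:

```go
package main

import (
	"fmt"
	"sync"
	"time"
)

// reloadDebouncer coalesces bursts of file-watcher events into one reload.
type reloadDebouncer struct {
	mu    sync.Mutex
	timer *time.Timer
}

func (d *reloadDebouncer) scheduleConfigReload(reload func()) {
	d.mu.Lock()
	defer d.mu.Unlock()
	if d.timer != nil {
		d.timer.Stop() // restart the window on every new event
	}
	d.timer = time.AfterFunc(500*time.Millisecond, reload)
}

// stopConfigReloadTimer cancels any pending reload during watcher shutdown.
func (d *reloadDebouncer) stopConfigReloadTimer() {
	d.mu.Lock()
	defer d.mu.Unlock()
	if d.timer != nil {
		d.timer.Stop()
		d.timer = nil
	}
}

func main() {
	var d reloadDebouncer
	for i := 0; i < 5; i++ { // five rapid events -> a single reload
		d.scheduleConfigReload(func() { fmt.Println("config reloaded") })
	}
	time.Sleep(time.Second)
	d.stopConfigReloadTimer()
}
```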
Introduce Vertex AI Gemini integration with support for service account-based authentication, credential storage, and import functionality. Add a new executor for Vertex AI requests, including execution and streaming paths, and integrate it into the core manager. Enhance the CLI with a `--vertex-import` flag for importing service account keys.