CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-02-03 04:50:52 +08:00

Author	SHA1	Message	Date
Luis Pater	653439698e	Fixed: #606 fix: unify response field naming across translators Standardize `text` to `delta` and add missing `output` field in all response payloads for consistency across OpenAI, Claude, and Gemini translators. v6.6.37	2025-12-21 03:13:58 +08:00
Luis Pater	89254cfc97	Merge pull request #638 from neokotora/feat/add-gemini-3-flash fix: add gemini-3-flash-preview model definition in GetGeminiModels v6.6.36	2025-12-20 23:36:57 +08:00
Luis Pater	6bd9a034f7	Merge pull request #602 from ben-vargas/fix-antigravity-propertynames fix: remove propertyNames from JSON schema for Gemini compatibility	2025-12-20 23:32:51 +08:00
Luis Pater	26fc65b051	Merge pull request #633 from ben-vargas/fix-antigravity-applypayloadconfig feat(antigravity): add payload config support to Antigravity executor	2025-12-20 23:30:47 +08:00
Luis Pater	ed5ec5b55c	feat(amp): enhance model mapping and Gemini thinking configuration This commit introduces several improvements to the AMP (Advanced Model Proxy) module: - Model Mapping Logic: The `FallbackHandler` now uses a more robust approach for model mapping. It includes the extraction and preservation of dynamic "thinking suffixes" (e.g., `(xhigh)`) during mapping, ensuring that these configurations are correctly applied to the mapped model. A new `resolveMappedModel` function centralizes this logic for cleaner code. - ModelMapper Verification: The `ModelMapper` in `model_mapping.go` now verifies that the target model of a mapping has available providers after normalizing it. This prevents mappings to non-existent or unresolvable models. - Gemini Thinking Configuration Cleanup: In `gemini_thinking.go`, unnecessary `generationConfig.thinkingConfig.include_thoughts` and `generationConfig.thinkingConfig.thinkingBudget` fields are now deleted from the request body when applying Gemini thinking levels. This prevents potential conflicts or redundant configurations. - Testing: A new test case `TestModelMapper_MapModel_TargetWithThinkingSuffix` has been added to `model_mapping_test.go` to specifically cover the preservation of thinking suffixes during model mapping.	2025-12-20 22:19:35 +08:00
sheauhuu	df777650ac	feat: add gemini-3-flash-preview model definition in GetGeminiModels	2025-12-20 20:05:20 +08:00
Luis Pater	10f8c795ac	Merge pull request #634 from router-for-me/amp fix(amp): add /docs routes to proxy v6.6.35	2025-12-20 17:08:07 +08:00
Luis Pater	3e4858a624	feat(config): add log file size limit configuration #535 This commit introduces a new configuration option `logs-max-total-size-mb` that allows users to set a maximum total size (in MB) for log files in the logs directory. When this limit is exceeded, the oldest log files will be automatically deleted to stay within the specified size. Setting this value to 0 (the default) disables this feature. This change enhances log management by preventing excessive disk space usage.	2025-12-20 15:52:59 +08:00
Ben Vargas	1231dc9cda	feat(antigravity): add payload config support to Antigravity executor Add applyPayloadConfig calls to all Antigravity executor paths (Execute, executeClaudeNonStream, ExecuteStream) to enable config.yaml payload overrides for Antigravity/Gemini-Claude models. This allows users to configure thinking budget and other parameters via payload.override in config.yaml for models like gemini-claude-opus-4-5*.	2025-12-19 22:30:44 -07:00
hkfires	c84ff42bcd	fix(amp): add /docs routes to proxy	2025-12-20 10:15:25 +08:00
Luis Pater	8a5db02165	Fixed: #607 refactor(config): re-export internal configuration types for SDK consumers v6.6.34	2025-12-20 04:49:02 +08:00
Luis Pater	d7afb6eb0c	fix(gemini): improve reasoning effort conversion for Gemini 3 models Refactors the reasoning effort conversion logic for Gemini models. The update specifically addresses how `reasoning_effort` is translated into Gemini 3 specific thinking configurations (`thinkingLevel`, `includeThoughts`) and ensures that numeric budgets are not incorrectly applied to level-based models. Changes include: - Differentiating conversion logic for Gemini 3 models versus other models. - Handling `none`, `auto`, and validated thinking levels for Gemini 3. - Maintaining existing conversion for models not using discrete thinking levels. v6.6.33	2025-12-20 03:11:28 +08:00
Luis Pater	bbd1fe890a	Merge pull request #598 from BigUncle/fix/token-refresh-loop fix(auth): prevent token refresh loop by ignoring timestamp fields v6.6.32	2025-12-19 23:59:40 +08:00
Luis Pater	f607231efa	Merge pull request #627 from router-for-me/gemini fix(gemini): add optional skip for gemini3 thinking conversion v6.6.31	2025-12-19 22:20:51 +08:00
hkfires	2039062845	fix(gemini): add optional skip for gemini3 thinking conversion	2025-12-19 22:07:43 +08:00
Luis Pater	99478d13a8	Merge pull request #623 from router-for-me/remote-OAuth Remote OAuth v6.6.30	2025-12-19 18:29:09 +08:00
Luis Pater	69d3a80fc3	Merge pull request #618 from router-for-me/amp fix(amp): add management auth skipper	2025-12-19 17:37:51 +08:00
Luis Pater	9e268ad103	Merge pull request #619 from router-for-me/gemini fix(util): disable default thinking for gemini 3 flash	2025-12-19 17:36:52 +08:00
hkfires	9d9b9e7a0d	fix(amp): add management auth skipper	2025-12-19 13:57:47 +08:00
hkfires	13aa82f3f3	fix(util): disable default thinking for gemini 3 flash	2025-12-19 13:11:15 +08:00
Luis Pater	05e55d7dc5	feat(codex): update gpt-5.2 codex prompt instructions The prompt for the gpt-5.2 codex model has been updated with more comprehensive instructions. This includes detailed guidelines on general usage, editing constraints, the plan tool, sandboxing configurations, handling special user requests, frontend task considerations, and final message presentation. The updates aim to improve the model's understanding and execution of complex coding tasks by providing clearer directives and constraints. v6.6.29	2025-12-19 12:38:28 +08:00
Supra4E8C	1b358c931c	fix: restore get-auth-status ok fallback and document it	2025-12-19 12:15:22 +08:00
Luis Pater	ca09db21ff	feat(codex): add gpt-5.2 codex prompt handling This change introduces specific logic to load and use instructions for the 'gpt-5.2-codex' model variant by recognizing the 'gpt-5.2-codex_prompt.md' filename. This ensures the correct prompts are used when the '5.2-codex' model is identified, complementing the recent addition of its definition. v6.6.28	2025-12-19 11:39:51 +08:00
Chén Mù	718ff7a73f	Merge pull request #609 from router-for-me/codex feat(registry): add gpt 5.2 codex model definition v6.6.27	2025-12-19 09:54:34 +08:00
hkfires	fa70b220e9	feat(registry): add gpt 5.2 codex model definition	2025-12-19 09:53:03 +08:00
Ben Vargas	1b8cb7b77b	fix: remove propertyNames from JSON schema for Gemini compatibility Gemini API does not support the JSON Schema `propertyNames` keyword, causing 400 errors when Claude tool schemas containing this field are proxied through the Antigravity provider. Add `propertyNames` to the list of unsupported keywords removed by CleanJSONSchemaForGemini(), alongside existing removals like $ref, definitions, and additionalProperties.	2025-12-18 12:50:51 -07:00
Luis Pater	774f1fbc17	Merge pull request #586 from router-for-me/chore chore: ignore gemini metadata files	2025-12-19 01:00:30 +08:00
Supra4E8C	cfa8ddb59f	feat(oauth): add remote OAuth callback support with session management Introduce a centralized OAuth session store with TTL-based expiration to replace the previous simple map-based status tracking. Add a new /api/oauth/callback endpoint that allows remote clients to relay OAuth callback data back to the CLI proxy, enabling OAuth flows when the callback cannot reach the local machine directly. - Add oauth_sessions.go with thread-safe session store and validation - Add oauth_callback.go with POST handler for remote callback relay - Refactor auth_files.go to use new session management APIs - Register new callback route in server.go	2025-12-19 00:38:29 +08:00
BigUncle	39597267ae	fix(auth): prevent token refresh loop by ignoring timestamp fields Add metadataEqualIgnoringTimestamps() function to compare metadata JSON without timestamp/expired/expires_in/last_refresh/access_token fields. This prevents unnecessary file writes when only these fields change during refresh, breaking the fsnotify event → Watcher callback → refresh loop. Key insight: Google OAuth returns a new access_token on each refresh, which was causing file writes and triggering the refresh loop. Fixes antigravity channel excessive log generation issue. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-12-18 21:37:05 +08:00
hkfires	393e38f2c0	chore: ignore gemini metadata files	2025-12-18 13:18:15 +08:00
Luis Pater	d1220de02d	chore(docs): remove legacy documentation and unused PR workflow file v6.6.26	2025-12-18 08:21:58 +08:00
Luis Pater	13eb5268de	Merge pull request #582 from ben-vargas/fix-gemini-3-thinking-level feat: use thinkingLevel for Gemini 3 models per Google documentation	2025-12-18 07:19:37 +08:00
Ben Vargas	88798816f2	fix: require dot in gemini25Pattern regex for precise matching	2025-12-17 16:09:50 -07:00
Ben Vargas	598f0af19b	fix: apply thinkingLevel from model suffix metadata for Gemini 3 The previous commit added thinkingLevel support but didn't apply it when the reasoning effort came from model name suffix (e.g., model(minimal)). This was because ResolveThinkingConfigFromMetadata returns nil for level-based models, bypassing the metadata application. Changes: - Add ApplyGemini3ThinkingLevelFromMetadata for standard Gemini API - Add ApplyGemini3ThinkingLevelFromMetadataCLI for CLI API format - Update gemini_cli_executor to apply Gemini 3 thinkingLevel from metadata - Update antigravity_executor to apply Gemini 3 thinkingLevel from metadata - Update aistudio_executor to apply Gemini 3 thinkingLevel from metadata - Add comprehensive test coverage for Gemini 3 thinkingLevel functions	2025-12-17 16:08:38 -07:00
Ben Vargas	a33f5d31fc	feat: use thinkingLevel for Gemini 3 models per Google documentation Per Google's official documentation, Gemini 3 models should use thinkingLevel (string) instead of thinkingBudget (number) for optimal performance. From Google's Gemini Thinking docs: > Use the thinkingLevel parameter with Gemini 3 models. While > thinkingBudget is accepted for backwards compatibility, using > it with Gemini 3 Pro may result in suboptimal performance. Changes: - Add model family detection functions (IsGemini3Model, IsGemini25Model, IsGemini3ProModel, IsGemini3FlashModel) - Add ApplyGeminiThinkingLevel and ApplyGeminiCLIThinkingLevel functions for applying thinkingLevel config - Add ValidateGemini3ThinkingLevel for model-specific level validation - Add ThinkingBudgetToGemini3Level for backward compatibility conversion - Update NormalizeGeminiThinkingBudget to convert budget to level for Gemini 3 models - Update ApplyDefaultThinkingIfNeeded to not set a default level for Gemini 3 (lets API use its dynamic default "high") - Update ConvertThinkingLevelToBudget to preserve thinkingLevel for Gemini 3 models - Add Levels field to all Gemini 3 model definitions: - Gemini 3 Pro: ["low", "high"] - Gemini 3 Flash: ["minimal", "low", "medium", "high"] Backward compatibility: - Gemini 2.5 models continue to use thinkingBudget as before - If thinkingBudget is provided for Gemini 3, it's converted to the appropriate thinkingLevel - Existing configurations continue to work	2025-12-17 15:28:20 -07:00
Luis Pater	506699fba1	ci(workflows): update pr-test-build workflow	2025-12-18 03:28:23 +08:00
Luis Pater	68a27772b3	feat(antigravity): enable token counting via API with resilient routing Introduces the capability to count tokens for Antigravity-backed requests. This implementation leverages the `countTokens` endpoint of the Antigravity API, replacing the prior unsupported stub. Key aspects of this update include: - API Integration: Direct integration with the Antigravity `countTokens` API, including necessary request payload translation and authentication. - Resilient Infrastructure: A fallback mechanism has been established, allowing the system to attempt connections across multiple Antigravity base URLs to ensure request success even in the event of temporary service interruptions. - Model Aliasing: Added mappings for `gemini-3-flash` and `gemini-3-flash-preview` to ensure compatibility with the latest model variants. - Robust Error Handling: Comprehensive error handling and logging are in place to manage failures during API interactions. v6.6.25	2025-12-18 03:12:46 +08:00
Ben Vargas	de87fb622b	docs: add redirect info and disable Pull app auto-sync	2025-12-17 12:06:39 -07:00
Luis Pater	f27672f6cf	feat(antigravity): add Gemini 3 Flash Preview model definition with enhanced capabilities v6.6.24	2025-12-18 01:02:19 +08:00
Luis Pater	28420c14e4	Merge pull request #580 from router-for-me/chore chore: ignore agent and bmad artifacts	2025-12-18 00:46:25 +08:00
Luis Pater	0bd221ff41	refactor(antigravity): optimize response handling in Claude model with JSON manipulation v6.6.23	2025-12-17 23:57:41 +08:00
Luis Pater	5fda6f8ef3	feat(antigravity): implement non-streaming execution for Claude model requests	2025-12-17 23:17:11 +08:00
hkfires	9b956f6338	chore: ignore agent and bmad artifacts	2025-12-17 23:15:15 +08:00
Luis Pater	09923f654c	feat(antigravity): add streaming support for Claude model requests	2025-12-17 22:16:57 +08:00
Luis Pater	ae7b972649	Merge pull request #577 from router-for-me/refactor-watcher-phase3 Refactor-watcher-phase3	2025-12-17 17:53:04 +08:00
Luis Pater	47885e3710	test(gemini): add test cases and improve compatibility for complex schema cases in CleanJSONSchemaForGemini function	2025-12-17 17:38:53 +08:00
Luis Pater	4b9a260b37	Merge pull request #575 from soilSpoon/feature/antigravity-gemini-compat feature: Improves Antigravity(gemini-claude) JSON schema compatibility	2025-12-17 16:53:06 +08:00
Luis Pater	2c743c8f0b	Merge pull request #572 from router-for-me/watcher refactor(watcher): extract auth synthesizer to synthesizer package v6.6.22	2025-12-17 16:39:59 +08:00
Luis Pater	9f2c278ee6	refactor(translator): replace client.Content structs with JSON-based content generation for more efficient handling of Claude requests	2025-12-17 16:39:32 +08:00
이대희	aea337cfe2	feature: Improves schema flattening and tool use handling Updates schema flattening logic to handle multiple non-null types, providing a more descriptive "Accepts" hint. Removes redundant tracking of the current tool name in `Params` as it's no longer needed for streaming limits, simplifying the structure.	2025-12-17 17:30:23 +09:00

1 2 3 4 5 ...

1059 Commits