CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-02-02 20:40:52 +08:00

Author	SHA1	Message	Date
hkfires	fa70b220e9	feat(registry): add gpt 5.2 codex model definition	2025-12-19 09:53:03 +08:00
Luis Pater	13eb5268de	Merge pull request #582 from ben-vargas/fix-gemini-3-thinking-level feat: use thinkingLevel for Gemini 3 models per Google documentation	2025-12-18 07:19:37 +08:00
Ben Vargas	88798816f2	fix: require dot in gemini25Pattern regex for precise matching	2025-12-17 16:09:50 -07:00
Ben Vargas	598f0af19b	fix: apply thinkingLevel from model suffix metadata for Gemini 3 The previous commit added thinkingLevel support but didn't apply it when the reasoning effort came from model name suffix (e.g., model(minimal)). This was because ResolveThinkingConfigFromMetadata returns nil for level-based models, bypassing the metadata application. Changes: - Add ApplyGemini3ThinkingLevelFromMetadata for standard Gemini API - Add ApplyGemini3ThinkingLevelFromMetadataCLI for CLI API format - Update gemini_cli_executor to apply Gemini 3 thinkingLevel from metadata - Update antigravity_executor to apply Gemini 3 thinkingLevel from metadata - Update aistudio_executor to apply Gemini 3 thinkingLevel from metadata - Add comprehensive test coverage for Gemini 3 thinkingLevel functions	2025-12-17 16:08:38 -07:00
Ben Vargas	a33f5d31fc	feat: use thinkingLevel for Gemini 3 models per Google documentation Per Google's official documentation, Gemini 3 models should use thinkingLevel (string) instead of thinkingBudget (number) for optimal performance. From Google's Gemini Thinking docs: > Use the thinkingLevel parameter with Gemini 3 models. While > thinkingBudget is accepted for backwards compatibility, using > it with Gemini 3 Pro may result in suboptimal performance. Changes: - Add model family detection functions (IsGemini3Model, IsGemini25Model, IsGemini3ProModel, IsGemini3FlashModel) - Add ApplyGeminiThinkingLevel and ApplyGeminiCLIThinkingLevel functions for applying thinkingLevel config - Add ValidateGemini3ThinkingLevel for model-specific level validation - Add ThinkingBudgetToGemini3Level for backward compatibility conversion - Update NormalizeGeminiThinkingBudget to convert budget to level for Gemini 3 models - Update ApplyDefaultThinkingIfNeeded to not set a default level for Gemini 3 (lets API use its dynamic default "high") - Update ConvertThinkingLevelToBudget to preserve thinkingLevel for Gemini 3 models - Add Levels field to all Gemini 3 model definitions: - Gemini 3 Pro: ["low", "high"] - Gemini 3 Flash: ["minimal", "low", "medium", "high"] Backward compatibility: - Gemini 2.5 models continue to use thinkingBudget as before - If thinkingBudget is provided for Gemini 3, it's converted to the appropriate thinkingLevel - Existing configurations continue to work	2025-12-17 15:28:20 -07:00
Luis Pater	68a27772b3	feat(antigravity): enable token counting via API with resilient routing Introduces the capability to count tokens for Antigravity-backed requests. This implementation leverages the `countTokens` endpoint of the Antigravity API, replacing the prior unsupported stub. Key aspects of this update include: - API Integration: Direct integration with the Antigravity `countTokens` API, including necessary request payload translation and authentication. - Resilient Infrastructure: A fallback mechanism has been established, allowing the system to attempt connections across multiple Antigravity base URLs to ensure request success even in the event of temporary service interruptions. - Model Aliasing: Added mappings for `gemini-3-flash` and `gemini-3-flash-preview` to ensure compatibility with the latest model variants. - Robust Error Handling: Comprehensive error handling and logging are in place to manage failures during API interactions.	2025-12-18 03:12:46 +08:00
Luis Pater	f27672f6cf	feat(antigravity): add Gemini 3 Flash Preview model definition with enhanced capabilities	2025-12-18 01:02:19 +08:00
Luis Pater	0bd221ff41	refactor(antigravity): optimize response handling in Claude model with JSON manipulation	2025-12-17 23:57:41 +08:00
Luis Pater	5fda6f8ef3	feat(antigravity): implement non-streaming execution for Claude model requests	2025-12-17 23:17:11 +08:00
Luis Pater	09923f654c	feat(antigravity): add streaming support for Claude model requests	2025-12-17 22:16:57 +08:00
Luis Pater	ae7b972649	Merge pull request #577 from router-for-me/refactor-watcher-phase3 Refactor-watcher-phase3	2025-12-17 17:53:04 +08:00
Luis Pater	47885e3710	test(gemini): add test cases and improve compatibility for complex schema cases in CleanJSONSchemaForGemini function	2025-12-17 17:38:53 +08:00
Luis Pater	4b9a260b37	Merge pull request #575 from soilSpoon/feature/antigravity-gemini-compat feature: Improves Antigravity(gemini-claude) JSON schema compatibility	2025-12-17 16:53:06 +08:00
Luis Pater	2c743c8f0b	Merge pull request #572 from router-for-me/watcher refactor(watcher): extract auth synthesizer to synthesizer package	2025-12-17 16:39:59 +08:00
Luis Pater	9f2c278ee6	refactor(translator): replace client.Content structs with JSON-based content generation for more efficient handling of Claude requests	2025-12-17 16:39:32 +08:00
이대희	aea337cfe2	feature: Improves schema flattening and tool use handling Updates schema flattening logic to handle multiple non-null types, providing a more descriptive "Accepts" hint. Removes redundant tracking of the current tool name in `Params` as it's no longer needed for streaming limits, simplifying the structure.	2025-12-17 17:30:23 +09:00
hkfires	811f8f8b4f	test(watcher): add comprehensive unit tests for watcher edge cases Add extensive test coverage for watcher module including: - Auth file handling for empty and missing files - Persist async error paths and nil receiver handling - Dispatch loop context cancellation scenarios - Event processing for errors and channel closures - Handle event cases: unrelated files, config changes, auth writes, remove debouncing, atomic replace detection - Normalize auth path and debounce cleanup logic - Runtime auth dispatch and refresh state - Config reload with mirrored auth dir and OAuth provider filtering - Start failure when auth dir is missing - Auth equality comparison ignoring temporal fields - Reload clients filtering without full rescan	2025-12-17 16:29:11 +08:00
이대희	27734a23b1	Update internal/util/translator.go Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2025-12-17 17:15:11 +09:00
이대희	1b8e538a77	feature: Improves Gemini JSON schema compatibility Enhances compatibility with the Gemini API by implementing a schema cleaning process. This includes: - Centralizing schema cleaning logic for Gemini in a dedicated utility function. - Converting unsupported schema keywords to hints within the description field. - Flattening complex schema structures like `anyOf`, `oneOf`, and type arrays to simplify the schema. - Handling streaming responses with empty tool names, which can occur in subsequent chunks after the initial tool use.	2025-12-17 17:10:53 +09:00
hkfires	41c2385aca	refactor(watcher): split watcher.go into focused modules - Create dispatcher.go for auth update queue management - Create events.go for fsnotify event handling - Create config_reload.go for hot-reload logic - Create clients.go for client lifecycle management - Simplify watcher.go to core coordinator (~150 lines) - Maintain 100% API backward compatibility - All tests passing with 72%+ coverage	2025-12-17 15:53:28 +08:00
hkfires	d605985f45	refactor(watcher): extract auth synthesis logic into separate synthesizer package	2025-12-17 15:00:43 +08:00
hkfires	d52b28b147	fix(config): use correct formatting function for prefix change details	2025-12-17 15:00:43 +08:00
Luis Pater	7481c0eaa0	Revert "Fix invalid thinking signature when proxying Claude via Antigravity"	2025-12-17 14:53:52 +08:00
Luis Pater	ffdfad8482	Fixed: #551 fix(translator): standardize content node handling across translators for assistant and tool calls	2025-12-17 13:16:07 +08:00
Luis Pater	6586f08584	fix(translator): correct funcName extraction and ensure proper handling of function response data in Antigravity Claude requests	2025-12-17 03:57:35 +08:00
Luis Pater	f49e887fe6	Merge pull request #570 from fuguiKz/fix/antigravity-thinking-signature Fix invalid thinking signature when proxying Claude via Antigravity	2025-12-17 03:04:41 +08:00
Luis Pater	084558f200	test(config): add unit tests for model prefix changes in config diff	2025-12-17 02:31:16 +08:00
kz	b602eae215	Fix antigravity Claude thinking signature handling	2025-12-17 02:28:58 +08:00
Luis Pater	d02bf9c243	feat(diff): add support for model prefix changes in config diff logic Enhance the configuration diff logic to include detection and reporting of `prefix` changes for all model types. Update related struct naming for consistency across the watcher module.	2025-12-17 02:05:03 +08:00
Luis Pater	26a5f67df2	Merge branch 'dev' into watcher	2025-12-17 01:48:11 +08:00
Luis Pater	600fd42a83	Merge pull request #564 from router-for-me/think feat(thinking): unify budget/effort conversion logic and add iFlow thinking support	2025-12-17 01:21:24 +08:00
Luis Pater	670685139a	fix(api): update route patterns to support wildcards for Gemini actions Normalize action handling by accommodating wildcard patterns in route definitions for Gemini endpoints. Adjust `request.Action` parsing logic to correctly process routes with prefixed actions.	2025-12-17 01:17:02 +08:00
Luis Pater	52b6306388	feat(config): add support for model prefixes and prefix normalization Refactor model management to include an optional `prefix` field for model credentials, enabling better namespace handling. Update affected configuration files, APIs, and handlers to support prefix normalization and routing. Remove unused OpenAI compatibility provider logic to simplify processing.	2025-12-17 01:07:26 +08:00
hkfires	521ec6f1b8	fix(watcher): simplify vertex apikey idKind to exclude base suffix	2025-12-16 22:55:38 +08:00
hkfires	b0c5d9640a	refactor(diff): improve security and stability of config change detection Introduce formatProxyURL helper to sanitize proxy addresses before logging, stripping credentials and path components while preserving host information. Rework model hash computation to sort and deduplicate name/alias pairs with case normalization, ensuring consistent output regardless of input ordering. Add signature-based identification for anonymous OpenAI-compatible provider entries to maintain stable keys across configuration reloads. Replace direct stdout prints with structured logger calls for file change notifications.	2025-12-16 22:39:19 +08:00
hkfires	ef8e94e992	refactor(watcher): extract config diff helpers Break out config diffing, hashing, and OpenAI compatibility utilities into a dedicated diff package, update watcher to consume them, and add comprehensive tests for diff logic and watcher behavior.	2025-12-16 21:45:33 +08:00
hkfires	28a428ae2f	fix(thinking): align budget effort mapping across translators Unify thinking budget-to-effort conversion in a shared helper, handle disabled/default thinking cases in translators, adjust zero-budget mapping, and drop the old OpenAI-specific helper with updated tests.	2025-12-16 18:34:43 +08:00
hkfires	b326ec3641	feat(iflow): add thinking support for iFlow models	2025-12-16 18:34:43 +08:00
Thong Van	f4007f53ba	fix(translator): emit message_start on first chunk regardless of role field Some OpenAI-compatible providers (like GitHub Copilot) may send tool_calls in the first streaming chunk without including the role field. The previous implementation only emitted message_start when the first chunk contained role="assistant", causing Anthropic protocol violations when tool calls arrived first. This fix ensures message_start is always emitted on the very first chunk, preventing 'content_block_start before message_start' errors in clients that strictly validate Anthropic SSE event ordering.	2025-12-16 13:01:09 +07:00
Luis Pater	5a812a1e93	feat(remote-management): add support for custom GitHub repository for panel updates Introduce `panel-github-repository` in the configuration to allow specifying a custom repository for management panel assets. Update dependency versions and enhance asset URL resolution logic to support overrides.	2025-12-16 13:09:26 +08:00
Luis Pater	88b101ebf5	Merge pull request #549 from router-for-me/log Improve Request Logging Efficiency and Standardize Error Responses	2025-12-15 20:43:12 +08:00
Luis Pater	d9a65745df	fix(translator): handle empty item type and string content in OpenAI response parser	2025-12-15 20:35:52 +08:00
hkfires	97ab623d42	fix(api): prevent double logging for streaming responses	2025-12-15 18:00:32 +08:00
hkfires	14aa6cc7e8	fix(api): ensure all response writes are captured for logging The response writer wrapper has been refactored to more reliably capture response bodies for logging, fixing several edge cases. - Implements `WriteString` to capture writes from `io.StringWriter`, which were previously missed by the `Write` method override. - A new `shouldBufferResponseBody` helper centralizes the logic to ensure the body is buffered only when logging is active or for errors when `logOnErrorOnly` is enabled. - Streaming detection is now more robust. It correctly handles non-streaming error responses (e.g., `application/json`) that are generated for a request that was intended to be streaming. BREAKING CHANGE: The public methods `Status()`, `Size()`, and `Written()` have been removed from the `ResponseWriterWrapper` as they are no longer required by the new implementation.	2025-12-15 17:45:16 +08:00
hkfires	8f1dd69e72	feat(amp): require API key authentication for management routes All Amp management endpoints (e.g., /api/user, /threads) are now protected by the standard API key authentication middleware. This ensures that all management operations require a valid API key, significantly improving security. As a result of this change: - The `restrict-management-to-localhost` setting now defaults to `false`. API key authentication provides a stronger and more flexible security control than IP-based restrictions, improving usability in containerized environments. - The reverse proxy logic now strips the client's `Authorization` header after authenticating the initial request. It then injects the configured `upstream-api-key` for the request to the upstream Amp service. BREAKING CHANGE: Amp management endpoints now require a valid API key for authentication. Requests without a valid API key in the `Authorization` header will be rejected with a 401 Unauthorized error.	2025-12-15 13:24:53 +08:00
hkfires	09c339953d	fix(openai): forward reasoning.effort value Drop the hardcoded effort mapping in request conversion so unknown values are preserved instead of being coerced to `auto	2025-12-15 09:16:15 +08:00
hkfires	367a05bdf6	refactor(thinking): export thinking helpers Expose thinking/effort normalization helpers from the executor package so conversion tests use production code and stay aligned with runtime validation behavior.	2025-12-15 09:16:15 +08:00
hkfires	d20b71deb9	fix(thinking): normalize effort mapping Route OpenAI reasoning effort through ThinkingEffortToBudget for Claude translators, preserve "minimal" when translating OpenAI Responses, and treat blank/unknown efforts as no-ops for Gemini thinking configs. Also map budget -1 to "auto" and expand cross-protocol thinking tests.	2025-12-15 09:16:15 +08:00
hkfires	712ce9f781	fix(thinking): drop unsupported none effort When budget 0 maps to "none" for models that use thinking levels but don't support that effort level, strip thinking fields instead of setting an invalid reasoning_effort value. Tests now expect removal for this edge case.	2025-12-15 09:16:14 +08:00
hkfires	716aa71f6e	fix(thinking): centralize reasoning_effort mapping Move OpenAI `reasoning_effort` -> Gemini `thinkingConfig` budget logic into shared helpers used by Gemini, Gemini CLI, and antigravity translators. Normalize Claude thinking handling by preferring positive budgets, applying budget token normalization, and gating by model support. Always convert Gemini `thinkingBudget` back to OpenAI `reasoning_effort` to support allowCompat models, and update tests for normalization behavior.	2025-12-15 09:16:14 +08:00

1 2 3 4 5 ...

761 Commits