CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-02-03 04:50:52 +08:00

Author	SHA1	Message	Date
Evan Nguyen	24e8e20b59	Merge branch 'main' into fix/antigravity-prompt-caching	2025-12-21 19:43:24 +07:00
Evan Nguyen	a87f09bad2	feat(antigravity): add session ID generation and mutex for random source	2025-12-21 17:50:41 +07:00
Luis Pater	dbcbe48ead	Merge pull request #641 from router-for-me/url-OAuth-add-ter OAuth and management	2025-12-21 17:25:24 +08:00
Luis Pater	63908869f6	Merge pull request #611 from soilSpoon/feature/antigravity feat(antigravity): Improve Claude model compatibility	2025-12-21 16:27:29 +08:00
Luis Pater	f6d625114c	feat(logging): revamp request logger to support streaming and temporary file spooling This update enhances the `FileRequestLogger` by introducing support for spooling large request and response bodies to temporary files, reducing memory consumption. It adds atomic requestLogID generation for sequential log naming and new methods for non-streaming/streaming log assembly. Also includes better error handling during logging and temp file cleanups.	2025-12-21 16:17:48 +08:00
이대희	7dc40ba6d4	Improve tool-call parsing, schema sanitization, and hint injection Improve parsing of tool call inputs and Antigravity compatibility to avoid invalid thinking/tool_use errors. - Parse tool call inputs robustly by accepting both object and JSON-string formats and only produce a functionCall part when valid args exist, reducing spurious or malformed parts. - Preserve the skip_thought_signature_validator approach for calls without a valid thinking signature but stop toggling/tracking a separate "disable thinking" flag; this prevents unnecessary removal of thinkingConfig. - Sanitize tool input schemas before attaching them to the Antigravity request to improve compatibility. - Append the interleaved-thinking hint as a new parts entry instead of overwriting/setting text directly, preserving structure. - Remove unused tracking logic and related comments to simplify flow. These changes reduce errors related to missing/invalid thinking signatures, improve schema compatibility, and make hint injection safer and more consistent.	2025-12-21 17:16:40 +09:00
이대희	4070c9de81	Remove interleaved-thinking header from requests Removes the addition of the "anthropic-beta: interleaved-thinking-2025-05-14" header for Claude thinking models when building HTTP requests. This prevents sending an experimental/feature flag header that is no longer required and avoids potential compatibility or routing issues with downstream services. Keeps request headers simpler and more standard.	2025-12-21 15:29:36 +09:00
이대희	1e9e4a86a2	Improve thinking/tool signature handling for Claude and Gemini requests Prefer cached signatures and avoid injecting dummy thinking blocks; instead remove unsigned thinking blocks and add a skip sentinel for tool calls without a valid signature. Generate stable session IDs from the first user message, apply schema cleaning only for Claude models, and reorder thinking parts so thinking appears first. For Gemini, remove thinking blocks and attach a skip sentinel to function calls. Simplify response handling by passing raw function args through (remove special Bash conversion). Update and add tests to reflect the new behavior. These changes prevent rejected dummy signatures, improve compatibility with Antigravity’s signature validation, provide more stable session IDs for conversation grouping, and make request/response translation more robust.	2025-12-21 15:15:50 +09:00
hkfires	3fc410a253	fix(amp): add /settings routes to proxy	2025-12-21 12:51:35 +08:00
Supra4E8C	781bc1521b	fix(oauth): prevent stale session timeouts after login - stop callback forwarders by instance to avoid cross-session shutdowns - clear pending sessions for a provider after successful auth	2025-12-21 10:48:40 +08:00
Supra4E8C	05d201ece8	fix(gemini): gate callback prompt on project_id	2025-12-21 07:21:12 +08:00
Luis Pater	453e744abf	Fixed: #642 fix: remove unsupported fields `type` and `cache_control` across translators	2025-12-21 03:38:38 +08:00
Luis Pater	653439698e	Fixed: #606 fix: unify response field naming across translators Standardize `text` to `delta` and add missing `output` field in all response payloads for consistency across OpenAI, Claude, and Gemini translators.	2025-12-21 03:13:58 +08:00
Supra4E8C	24970baa57	management: allow prefix updates in provider PATCH handlers	2025-12-21 02:14:28 +08:00
Luis Pater	89254cfc97	Merge pull request #638 from neokotora/feat/add-gemini-3-flash fix: add gemini-3-flash-preview model definition in GetGeminiModels	2025-12-20 23:36:57 +08:00
Luis Pater	6bd9a034f7	Merge pull request #602 from ben-vargas/fix-antigravity-propertynames fix: remove propertyNames from JSON schema for Gemini compatibility	2025-12-20 23:32:51 +08:00
Luis Pater	26fc65b051	Merge pull request #633 from ben-vargas/fix-antigravity-applypayloadconfig feat(antigravity): add payload config support to Antigravity executor	2025-12-20 23:30:47 +08:00
Luis Pater	ed5ec5b55c	feat(amp): enhance model mapping and Gemini thinking configuration This commit introduces several improvements to the AMP (Advanced Model Proxy) module: - Model Mapping Logic: The `FallbackHandler` now uses a more robust approach for model mapping. It includes the extraction and preservation of dynamic "thinking suffixes" (e.g., `(xhigh)`) during mapping, ensuring that these configurations are correctly applied to the mapped model. A new `resolveMappedModel` function centralizes this logic for cleaner code. - ModelMapper Verification: The `ModelMapper` in `model_mapping.go` now verifies that the target model of a mapping has available providers after normalizing it. This prevents mappings to non-existent or unresolvable models. - Gemini Thinking Configuration Cleanup: In `gemini_thinking.go`, unnecessary `generationConfig.thinkingConfig.include_thoughts` and `generationConfig.thinkingConfig.thinkingBudget` fields are now deleted from the request body when applying Gemini thinking levels. This prevents potential conflicts or redundant configurations. - Testing: A new test case `TestModelMapper_MapModel_TargetWithThinkingSuffix` has been added to `model_mapping_test.go` to specifically cover the preservation of thinking suffixes during model mapping.	2025-12-20 22:19:35 +08:00
sheauhuu	df777650ac	feat: add gemini-3-flash-preview model definition in GetGeminiModels	2025-12-20 20:05:20 +08:00
Supra4E8C	9855615f1e	fix(gemini): avoid stale manual oauth prompt and accept schemeless callbacks	2025-12-20 19:03:38 +08:00
Supra4E8C	93414f1baa	feat (auth): CLI OAuth supports pasting callback URLs to complete login - Added callback URL resolution and terminal prompt logic - Codex/Claude/iFlow/Antigravity/Gemini login supports callback URL or local callback completion - Update Gemini login option signature and manager call - CLI default prompt function is compatible with null input to continue waiting	2025-12-20 18:25:55 +08:00
Luis Pater	10f8c795ac	Merge pull request #634 from router-for-me/amp fix(amp): add /docs routes to proxy	2025-12-20 17:08:07 +08:00
Luis Pater	3e4858a624	feat(config): add log file size limit configuration #535 This commit introduces a new configuration option `logs-max-total-size-mb` that allows users to set a maximum total size (in MB) for log files in the logs directory. When this limit is exceeded, the oldest log files will be automatically deleted to stay within the specified size. Setting this value to 0 (the default) disables this feature. This change enhances log management by preventing excessive disk space usage.	2025-12-20 15:52:59 +08:00
Ben Vargas	1231dc9cda	feat(antigravity): add payload config support to Antigravity executor Add applyPayloadConfig calls to all Antigravity executor paths (Execute, executeClaudeNonStream, ExecuteStream) to enable config.yaml payload overrides for Antigravity/Gemini-Claude models. This allows users to configure thinking budget and other parameters via payload.override in config.yaml for models like gemini-claude-opus-4-5*.	2025-12-19 22:30:44 -07:00
hkfires	c84ff42bcd	fix(amp): add /docs routes to proxy	2025-12-20 10:15:25 +08:00
Luis Pater	8a5db02165	Fixed: #607 refactor(config): re-export internal configuration types for SDK consumers	2025-12-20 04:49:02 +08:00
Luis Pater	d7afb6eb0c	fix(gemini): improve reasoning effort conversion for Gemini 3 models Refactors the reasoning effort conversion logic for Gemini models. The update specifically addresses how `reasoning_effort` is translated into Gemini 3 specific thinking configurations (`thinkingLevel`, `includeThoughts`) and ensures that numeric budgets are not incorrectly applied to level-based models. Changes include: - Differentiating conversion logic for Gemini 3 models versus other models. - Handling `none`, `auto`, and validated thinking levels for Gemini 3. - Maintaining existing conversion for models not using discrete thinking levels.	2025-12-20 03:11:28 +08:00
hkfires	2039062845	fix(gemini): add optional skip for gemini3 thinking conversion	2025-12-19 22:07:43 +08:00
Luis Pater	99478d13a8	Merge pull request #623 from router-for-me/remote-OAuth Remote OAuth	2025-12-19 18:29:09 +08:00
evann	bc6c4cdbfc	feat(antigravity): add logging for cached token setting errors in responses	2025-12-19 16:49:50 +07:00
Luis Pater	69d3a80fc3	Merge pull request #618 from router-for-me/amp fix(amp): add management auth skipper	2025-12-19 17:37:51 +08:00
evann	404546ce93	refactor(antigravity): regarding production endpoint caching	2025-12-19 16:36:54 +07:00
evann	6dd1cf1dd6	Merge branch 'main' into fix/antigravity-prompt-caching	2025-12-19 16:34:28 +07:00
evann	9058d406a3	feat(antigravity): enhance prompt caching support and update agent version	2025-12-19 16:33:41 +07:00
hkfires	9d9b9e7a0d	fix(amp): add management auth skipper	2025-12-19 13:57:47 +08:00
hkfires	13aa82f3f3	fix(util): disable default thinking for gemini 3 flash	2025-12-19 13:11:15 +08:00
Luis Pater	05e55d7dc5	feat(codex): update gpt-5.2 codex prompt instructions The prompt for the gpt-5.2 codex model has been updated with more comprehensive instructions. This includes detailed guidelines on general usage, editing constraints, the plan tool, sandboxing configurations, handling special user requests, frontend task considerations, and final message presentation. The updates aim to improve the model's understanding and execution of complex coding tasks by providing clearer directives and constraints.	2025-12-19 12:38:28 +08:00
Supra4E8C	1b358c931c	fix: restore get-auth-status ok fallback and document it	2025-12-19 12:15:22 +08:00
이대희	e04b02113a	refactor: Improve cache eviction ordering and clean up session ID usage Improve the cache eviction routine to sort entries by timestamp using the standard library sort routine (stable, clearer and faster than the prior manual selection/bubble logic), and remove a redundant request-derived session ID helper in favor of the centralized session ID function. Also drop now-unused crypto/encoding imports. This yields clearer, more maintainable eviction logic and removes duplicated/unused code and imports to reduce surface area and potential inconsistencies.	2025-12-19 13:14:51 +09:00
이대희	3275494fde	refactor: Use helper to extract wrapped "thinking" text Improve robustness when handling "thinking" content by using a dedicated helper to extract the thinking text. This ensures wrapped or nested thinking objects are handled correctly instead of relying on a direct string extraction, reducing parsing errors for complex payloads.	2025-12-19 13:09:57 +09:00
Luis Pater	ca09db21ff	feat(codex): add gpt-5.2 codex prompt handling This change introduces specific logic to load and use instructions for the 'gpt-5.2-codex' model variant by recognizing the 'gpt-5.2-codex_prompt.md' filename. This ensures the correct prompts are used when the '5.2-codex' model is identified, complementing the recent addition of its definition.	2025-12-19 11:39:51 +08:00
이대희	c1f8211acb	fix: Normalize Bash tool args and add signature caching support Normalize Bash tool arguments by converting a "command" key into "cmd" using JSON-aware parsing, avoiding brittle string replacements that could corrupt values. Apply this conversion in both streaming and non-streaming response paths so bash-style tool calls are emitted with the expected "cmd" field. Add support for accumulating thinking text and carrying session identifiers to enable signature caching/restore for unsigned thinking blocks, improving handling of thinking-state continuity across requests/responses. Also perform small cleanups: import logging, tidy comments and test descriptions. These changes make tool-argument handling more robust and enable reliable signature restoration for thinking blocks.	2025-12-19 11:12:16 +09:00
hkfires	fa70b220e9	feat(registry): add gpt 5.2 codex model definition	2025-12-19 09:53:03 +08:00
이대희	98fa2a1597	feat(translator/antigravity/claude): support interleaved thinking, signature restoration and system hint injection	2025-12-19 10:30:59 +09:00
이대희	0e7c79ba23	feat(translator/antigravity/claude): support interleaved thinking, signature restoration and system hint injection	2025-12-19 10:28:25 +09:00
이대희	b6ba15fcbd	fix(runtime/executor): Antigravity executor schema handling and Claude-specific headers	2025-12-19 10:28:23 +09:00
이대희	e44167d7a4	refactor(util/schema): rename and extend Gemini schema cleaning for Antigravity and add empty-schema placeholders	2025-12-19 10:28:17 +09:00
이대희	1bfa75f780	feat(util): add helper to detect Claude thinking models	2025-12-19 10:28:15 +09:00
이대희	bbcb5552f3	feat(cache): add signature cache for Claude thinking blocks	2025-12-19 10:28:12 +09:00
Ben Vargas	1b8cb7b77b	fix: remove propertyNames from JSON schema for Gemini compatibility Gemini API does not support the JSON Schema `propertyNames` keyword, causing 400 errors when Claude tool schemas containing this field are proxied through the Antigravity provider. Add `propertyNames` to the list of unsupported keywords removed by CleanJSONSchemaForGemini(), alongside existing removals like $ref, definitions, and additionalProperties.	2025-12-18 12:50:51 -07:00

1 2 3 4 5 ...

811 Commits