CLIProxyAPI

fix(gemini): add optional skip for gemini3 thinking conversion

hkfires · 2025-12-19 22:07:43 +08:00

2039062845

Merge pull request #623 from router-for-me/remote-OAuth

Remote OAuth

Luis Pater · 2025-12-19 18:29:09 +08:00

99478d13a8

Merge pull request #618 from router-for-me/amp

fix(amp): add management auth skipper

Luis Pater · 2025-12-19 17:37:51 +08:00

69d3a80fc3

fix(amp): add management auth skipper

hkfires · 2025-12-19 13:57:47 +08:00

9d9b9e7a0d

fix(util): disable default thinking for gemini 3 flash

hkfires · 2025-12-19 13:11:15 +08:00

13aa82f3f3

feat(codex): update gpt-5.2 codex prompt instructions

The prompt for the gpt-5.2 codex model has been updated with more comprehensive instructions. This includes detailed guidelines on general usage, editing constraints, the plan tool, sandboxing configurations, handling special user requests, frontend task considerations, and final message presentation. The updates aim to improve the model's understanding and execution of complex coding tasks by providing clearer directives and constraints.

Luis Pater · 2025-12-19 12:38:28 +08:00

05e55d7dc5

fix: restore get-auth-status ok fallback and document it

Supra4E8C · 2025-12-19 12:15:22 +08:00

1b358c931c

feat(codex): add gpt-5.2 codex prompt handling

This change introduces specific logic to load and use instructions for the 'gpt-5.2-codex' model variant by recognizing the 'gpt-5.2-codex_prompt.md' filename. This ensures the correct prompts are used when the '5.2-codex' model is identified, complementing the recent addition of its definition.

Luis Pater · 2025-12-19 11:39:51 +08:00

ca09db21ff

feat(registry): add gpt 5.2 codex model definition

hkfires · 2025-12-19 09:53:03 +08:00

fa70b220e9

feat(oauth): add remote OAuth callback support with session management

Introduce a centralized OAuth session store with TTL-based expiration
  to replace the previous simple map-based status tracking. Add a new
  /api/oauth/callback endpoint that allows remote clients to relay OAuth
  callback data back to the CLI proxy, enabling OAuth flows when the
  callback cannot reach the local machine directly.

  - Add oauth_sessions.go with thread-safe session store and validation
  - Add oauth_callback.go with POST handler for remote callback relay
  - Refactor auth_files.go to use new session management APIs
  - Register new callback route in server.go

Supra4E8C · 2025-12-19 00:38:29 +08:00

cfa8ddb59f

Merge pull request #582 from ben-vargas/fix-gemini-3-thinking-level

feat: use thinkingLevel for Gemini 3 models per Google documentation

Luis Pater · 2025-12-18 07:19:37 +08:00

13eb5268de

fix: require dot in gemini25Pattern regex for precise matching

Ben Vargas · 2025-12-17 16:09:50 -07:00

88798816f2

fix: apply thinkingLevel from model suffix metadata for Gemini 3

The previous commit added thinkingLevel support but didn't apply it
when the reasoning effort came from model name suffix (e.g., model(minimal)).

This was because ResolveThinkingConfigFromMetadata returns nil for
level-based models, bypassing the metadata application.

Changes:
- Add ApplyGemini3ThinkingLevelFromMetadata for standard Gemini API
- Add ApplyGemini3ThinkingLevelFromMetadataCLI for CLI API format
- Update gemini_cli_executor to apply Gemini 3 thinkingLevel from metadata
- Update antigravity_executor to apply Gemini 3 thinkingLevel from metadata
- Update aistudio_executor to apply Gemini 3 thinkingLevel from metadata
- Add comprehensive test coverage for Gemini 3 thinkingLevel functions

Ben Vargas · 2025-12-17 16:08:38 -07:00

598f0af19b

feat: use thinkingLevel for Gemini 3 models per Google documentation

Per Google's official documentation, Gemini 3 models should use
thinkingLevel (string) instead of thinkingBudget (number) for
optimal performance.

From Google's Gemini Thinking docs:
> Use the thinkingLevel parameter with Gemini 3 models. While
> thinkingBudget is accepted for backwards compatibility, using
> it with Gemini 3 Pro may result in suboptimal performance.

Changes:
- Add model family detection functions (IsGemini3Model, IsGemini25Model,
  IsGemini3ProModel, IsGemini3FlashModel)
- Add ApplyGeminiThinkingLevel and ApplyGeminiCLIThinkingLevel functions
  for applying thinkingLevel config
- Add ValidateGemini3ThinkingLevel for model-specific level validation
- Add ThinkingBudgetToGemini3Level for backward compatibility conversion
- Update NormalizeGeminiThinkingBudget to convert budget to level for
  Gemini 3 models
- Update ApplyDefaultThinkingIfNeeded to not set a default level for
  Gemini 3 (lets API use its dynamic default "high")
- Update ConvertThinkingLevelToBudget to preserve thinkingLevel for
  Gemini 3 models
- Add Levels field to all Gemini 3 model definitions:
  - Gemini 3 Pro: ["low", "high"]
  - Gemini 3 Flash: ["minimal", "low", "medium", "high"]

Backward compatibility:
- Gemini 2.5 models continue to use thinkingBudget as before
- If thinkingBudget is provided for Gemini 3, it's converted to the
  appropriate thinkingLevel
- Existing configurations continue to work

Ben Vargas · 2025-12-17 15:28:20 -07:00

a33f5d31fc

feat(antigravity): enable token counting via API with resilient routing

Introduces the capability to count tokens for Antigravity-backed requests. This implementation leverages the `countTokens` endpoint of the Antigravity API, replacing the prior unsupported stub.

Key aspects of this update include:

- **API Integration**: Direct integration with the Antigravity `countTokens` API, including necessary request payload translation and authentication.
- **Resilient Infrastructure**: A fallback mechanism has been established, allowing the system to attempt connections across multiple Antigravity base URLs to ensure request success even in the event of temporary service interruptions.
- **Model Aliasing**: Added mappings for `gemini-3-flash` and `gemini-3-flash-preview` to ensure compatibility with the latest model variants.
- **Robust Error Handling**: Comprehensive error handling and logging are in place to manage failures during API interactions.

Luis Pater · 2025-12-18 03:12:46 +08:00

68a27772b3

feat(antigravity): add Gemini 3 Flash Preview model definition with enhanced capabilities

Luis Pater · 2025-12-18 01:02:19 +08:00

f27672f6cf

refactor(antigravity): optimize response handling in Claude model with JSON manipulation

Luis Pater · 2025-12-17 23:57:41 +08:00

0bd221ff41

feat(antigravity): implement non-streaming execution for Claude model requests

Luis Pater · 2025-12-17 23:17:11 +08:00

5fda6f8ef3

feat(antigravity): add streaming support for Claude model requests

Luis Pater · 2025-12-17 22:16:57 +08:00

09923f654c

Merge pull request #577 from router-for-me/refactor-watcher-phase3

Refactor-watcher-phase3

Luis Pater · 2025-12-17 17:53:04 +08:00

ae7b972649

test(gemini): add test cases and improve compatibility for complex schema cases in CleanJSONSchemaForGemini function

Luis Pater · 2025-12-17 17:38:53 +08:00

47885e3710

Merge pull request #575 from soilSpoon/feature/antigravity-gemini-compat

feature: Improves Antigravity(gemini-claude) JSON schema compatibility

Luis Pater · 2025-12-17 16:53:06 +08:00

4b9a260b37

Merge pull request #572 from router-for-me/watcher

refactor(watcher): extract auth synthesizer to synthesizer package

Luis Pater · 2025-12-17 16:39:59 +08:00

2c743c8f0b

refactor(translator): replace client.Content structs with JSON-based content generation for more efficient handling of Claude requests

Luis Pater · 2025-12-17 16:39:32 +08:00

9f2c278ee6

feature: Improves schema flattening and tool use handling

Updates schema flattening logic to handle multiple non-null types, providing a more descriptive "Accepts" hint.

Removes redundant tracking of the current tool name in `Params` as it's no longer needed for streaming limits, simplifying the structure.

이대희 · 2025-12-17 17:30:23 +09:00

aea337cfe2

test(watcher): add comprehensive unit tests for watcher edge cases

Add extensive test coverage for watcher module including:
- Auth file handling for empty and missing files
- Persist async error paths and nil receiver handling
- Dispatch loop context cancellation scenarios
- Event processing for errors and channel closures
- Handle event cases: unrelated files, config changes, auth writes,
  remove debouncing, atomic replace detection
- Normalize auth path and debounce cleanup logic
- Runtime auth dispatch and refresh state
- Config reload with mirrored auth dir and OAuth provider filtering
- Start failure when auth dir is missing
- Auth equality comparison ignoring temporal fields
- Reload clients filtering without full rescan

hkfires · 2025-12-17 16:29:11 +08:00

811f8f8b4f

Update internal/util/translator.go

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

이대희 · 2025-12-17 17:15:11 +09:00

27734a23b1

feature: Improves Gemini JSON schema compatibility

Enhances compatibility with the Gemini API by implementing a schema cleaning process.

This includes:
- Centralizing schema cleaning logic for Gemini in a dedicated utility function.
- Converting unsupported schema keywords to hints within the description field.
- Flattening complex schema structures like `anyOf`, `oneOf`, and type arrays to simplify the schema.
- Handling streaming responses with empty tool names, which can occur in subsequent chunks after the initial tool use.

이대희 · 2025-12-17 17:10:53 +09:00

1b8e538a77

refactor(watcher): split watcher.go into focused modules

- Create dispatcher.go for auth update queue management
- Create events.go for fsnotify event handling
- Create config_reload.go for hot-reload logic
- Create clients.go for client lifecycle management
- Simplify watcher.go to core coordinator (~150 lines)
- Maintain 100% API backward compatibility
- All tests passing with 72%+ coverage

hkfires · 2025-12-17 15:53:28 +08:00

41c2385aca

refactor(watcher): extract auth synthesis logic into separate synthesizer package

hkfires · 2025-12-17 15:00:43 +08:00

d605985f45

fix(config): use correct formatting function for prefix change details

hkfires · 2025-12-17 15:00:43 +08:00

d52b28b147

Revert "Fix invalid thinking signature when proxying Claude via Antigravity"

Luis Pater · 2025-12-17 14:53:52 +08:00

7481c0eaa0

Fixed: #551

fix(translator): standardize content node handling across translators for assistant and tool calls

Luis Pater · 2025-12-17 13:16:07 +08:00

ffdfad8482

fix(translator): correct funcName extraction and ensure proper handling of function response data in Antigravity Claude requests

Luis Pater · 2025-12-17 03:57:35 +08:00

6586f08584

Merge pull request #570 from fuguiKz/fix/antigravity-thinking-signature

Fix invalid thinking signature when proxying Claude via Antigravity

Luis Pater · 2025-12-17 03:04:41 +08:00

f49e887fe6

test(config): add unit tests for model prefix changes in config diff

Luis Pater · 2025-12-17 02:31:16 +08:00

084558f200

Fix antigravity Claude thinking signature handling

kz · 2025-12-17 02:28:58 +08:00

b602eae215

feat(diff): add support for model prefix changes in config diff logic

Enhance the configuration diff logic to include detection and reporting of `prefix` changes for all model types. Update related struct naming for consistency across the watcher module.

Luis Pater · 2025-12-17 02:05:03 +08:00

d02bf9c243

Merge branch 'dev' into watcher

Luis Pater · 2025-12-17 01:48:11 +08:00

26a5f67df2

Merge pull request #564 from router-for-me/think

feat(thinking): unify budget/effort conversion logic and add iFlow thinking support

Luis Pater · 2025-12-17 01:21:24 +08:00

600fd42a83

fix(api): update route patterns to support wildcards for Gemini actions

Normalize action handling by accommodating wildcard patterns in route definitions for Gemini endpoints. Adjust `request.Action` parsing logic to correctly process routes with prefixed actions.

Luis Pater · 2025-12-17 01:17:02 +08:00

670685139a

feat(config): add support for model prefixes and prefix normalization

Refactor model management to include an optional `prefix` field for model credentials, enabling better namespace handling. Update affected configuration files, APIs, and handlers to support prefix normalization and routing. Remove unused OpenAI compatibility provider logic to simplify processing.

Luis Pater · 2025-12-17 01:07:26 +08:00

52b6306388

fix(watcher): simplify vertex apikey idKind to exclude base suffix

hkfires · 2025-12-16 22:55:38 +08:00

521ec6f1b8

refactor(diff): improve security and stability of config change detection

Introduce formatProxyURL helper to sanitize proxy addresses before
logging, stripping credentials and path components while preserving
host information. Rework model hash computation to sort and deduplicate
name/alias pairs with case normalization, ensuring consistent output
regardless of input ordering. Add signature-based identification for
anonymous OpenAI-compatible provider entries to maintain stable keys
across configuration reloads. Replace direct stdout prints with
structured logger calls for file change notifications.

hkfires · 2025-12-16 22:39:19 +08:00

b0c5d9640a

refactor(watcher): extract config diff helpers

Break out config diffing, hashing, and OpenAI compatibility utilities into a dedicated diff package, update watcher to consume them, and add comprehensive tests for diff logic and watcher behavior.

hkfires · 2025-12-16 21:45:33 +08:00

ef8e94e992

fix(thinking): align budget effort mapping across translators

Unify thinking budget-to-effort conversion in a shared helper, handle disabled/default thinking cases in translators, adjust zero-budget mapping, and drop the old OpenAI-specific helper with updated tests.

hkfires · 2025-12-16 18:34:43 +08:00

28a428ae2f

feat(iflow): add thinking support for iFlow models

hkfires · 2025-12-16 18:34:43 +08:00

b326ec3641

fix(translator): emit message_start on first chunk regardless of role field

Some OpenAI-compatible providers (like GitHub Copilot) may send tool_calls
in the first streaming chunk without including the role field. The previous
implementation only emitted message_start when the first chunk contained
role="assistant", causing Anthropic protocol violations when tool calls
arrived first.

This fix ensures message_start is always emitted on the very first chunk,
preventing 'content_block_start before message_start' errors in clients
that strictly validate Anthropic SSE event ordering.

Thong Van · 2025-12-16 13:01:09 +07:00

f4007f53ba

feat(remote-management): add support for custom GitHub repository for panel updates

Introduce `panel-github-repository` in the configuration to allow specifying a custom repository for management panel assets. Update dependency versions and enhance asset URL resolution logic to support overrides.

Luis Pater · 2025-12-16 13:09:26 +08:00

5a812a1e93

Merge pull request #549 from router-for-me/log

Improve Request Logging Efficiency and Standardize Error Responses

Luis Pater · 2025-12-15 20:43:12 +08:00

88b101ebf5

770 Commits