CLIProxyAPI

fix(antigravity): preserve finish_reason tool_calls across streaming chunks

When streaming responses with tool calls, the finish_reason was being
overwritten. The upstream sends functionCall in chunk 1, then
finishReason: STOP in chunk 2. The old code would set finish_reason
from every chunk, causing "tool_calls" to be overwritten by "stop".

This broke clients like Claude Code that rely on finish_reason to
detect when tool calls are complete.

Changes:
- Add SawToolCall bool to track tool calls across entire stream
- Add UpstreamFinishReason to cache the finish reason
- Only emit finish_reason on final chunk (has both finishReason + usage)
- Priority: tool_calls > max_tokens > stop

Includes 5 unit tests covering:
- Tool calls not overwritten by subsequent STOP
- Normal text gets "stop"
- MAX_TOKENS without tool calls gets "max_tokens"
- Tool calls take priority over MAX_TOKENS
- Intermediate chunks have no finish_reason

Fixes streaming tool call detection for Claude Code + Gemini models.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>

MohammadErfan Jabbari · 2026-01-05 18:45:25 +01:00

fe6043aec7

Merge pull request #623 from router-for-me/remote-OAuth

Remote OAuth

Luis Pater · 2025-12-19 18:29:09 +08:00

v6.6.30 99478d13a8

Merge pull request #618 from router-for-me/amp

fix(amp): add management auth skipper

Luis Pater · 2025-12-19 17:37:51 +08:00

69d3a80fc3

Merge pull request #619 from router-for-me/gemini

fix(util): disable default thinking for gemini 3 flash

Luis Pater · 2025-12-19 17:36:52 +08:00

9e268ad103

fix(amp): add management auth skipper

hkfires · 2025-12-19 13:57:47 +08:00

9d9b9e7a0d

fix(util): disable default thinking for gemini 3 flash

hkfires · 2025-12-19 13:11:15 +08:00

13aa82f3f3

feat(codex): update gpt-5.2 codex prompt instructions

The prompt for the gpt-5.2 codex model has been updated with more comprehensive instructions. This includes detailed guidelines on general usage, editing constraints, the plan tool, sandboxing configurations, handling special user requests, frontend task considerations, and final message presentation. The updates aim to improve the model's understanding and execution of complex coding tasks by providing clearer directives and constraints.

Luis Pater · 2025-12-19 12:38:28 +08:00

v6.6.29 05e55d7dc5

fix: restore get-auth-status ok fallback and document it

Supra4E8C · 2025-12-19 12:15:22 +08:00

1b358c931c

feat(codex): add gpt-5.2 codex prompt handling

This change introduces specific logic to load and use instructions for the 'gpt-5.2-codex' model variant by recognizing the 'gpt-5.2-codex_prompt.md' filename. This ensures the correct prompts are used when the '5.2-codex' model is identified, complementing the recent addition of its definition.

Luis Pater · 2025-12-19 11:39:51 +08:00

v6.6.28 ca09db21ff

Merge pull request #609 from router-for-me/codex

feat(registry): add gpt 5.2 codex model definition

Chén Mù · 2025-12-19 09:54:34 +08:00

v6.6.27 718ff7a73f

feat(registry): add gpt 5.2 codex model definition

hkfires · 2025-12-19 09:53:03 +08:00

fa70b220e9

Merge pull request #586 from router-for-me/chore

chore: ignore gemini metadata files

Luis Pater · 2025-12-19 01:00:30 +08:00

774f1fbc17

feat(oauth): add remote OAuth callback support with session management

Introduce a centralized OAuth session store with TTL-based expiration
  to replace the previous simple map-based status tracking. Add a new
  /api/oauth/callback endpoint that allows remote clients to relay OAuth
  callback data back to the CLI proxy, enabling OAuth flows when the
  callback cannot reach the local machine directly.

  - Add oauth_sessions.go with thread-safe session store and validation
  - Add oauth_callback.go with POST handler for remote callback relay
  - Refactor auth_files.go to use new session management APIs
  - Register new callback route in server.go

Supra4E8C · 2025-12-19 00:38:29 +08:00

cfa8ddb59f

chore: ignore gemini metadata files

hkfires · 2025-12-18 13:18:15 +08:00

393e38f2c0

chore(docs): remove legacy documentation and unused PR workflow file

Luis Pater · 2025-12-18 08:21:58 +08:00

v6.6.26 d1220de02d

Merge pull request #582 from ben-vargas/fix-gemini-3-thinking-level

feat: use thinkingLevel for Gemini 3 models per Google documentation

Luis Pater · 2025-12-18 07:19:37 +08:00

13eb5268de

fix: require dot in gemini25Pattern regex for precise matching

Ben Vargas · 2025-12-17 16:09:50 -07:00

88798816f2

fix: apply thinkingLevel from model suffix metadata for Gemini 3

The previous commit added thinkingLevel support but didn't apply it
when the reasoning effort came from model name suffix (e.g., model(minimal)).

This was because ResolveThinkingConfigFromMetadata returns nil for
level-based models, bypassing the metadata application.

Changes:
- Add ApplyGemini3ThinkingLevelFromMetadata for standard Gemini API
- Add ApplyGemini3ThinkingLevelFromMetadataCLI for CLI API format
- Update gemini_cli_executor to apply Gemini 3 thinkingLevel from metadata
- Update antigravity_executor to apply Gemini 3 thinkingLevel from metadata
- Update aistudio_executor to apply Gemini 3 thinkingLevel from metadata
- Add comprehensive test coverage for Gemini 3 thinkingLevel functions

Ben Vargas · 2025-12-17 16:08:38 -07:00

598f0af19b

feat: use thinkingLevel for Gemini 3 models per Google documentation

Per Google's official documentation, Gemini 3 models should use
thinkingLevel (string) instead of thinkingBudget (number) for
optimal performance.

From Google's Gemini Thinking docs:
> Use the thinkingLevel parameter with Gemini 3 models. While
> thinkingBudget is accepted for backwards compatibility, using
> it with Gemini 3 Pro may result in suboptimal performance.

Changes:
- Add model family detection functions (IsGemini3Model, IsGemini25Model,
  IsGemini3ProModel, IsGemini3FlashModel)
- Add ApplyGeminiThinkingLevel and ApplyGeminiCLIThinkingLevel functions
  for applying thinkingLevel config
- Add ValidateGemini3ThinkingLevel for model-specific level validation
- Add ThinkingBudgetToGemini3Level for backward compatibility conversion
- Update NormalizeGeminiThinkingBudget to convert budget to level for
  Gemini 3 models
- Update ApplyDefaultThinkingIfNeeded to not set a default level for
  Gemini 3 (lets API use its dynamic default "high")
- Update ConvertThinkingLevelToBudget to preserve thinkingLevel for
  Gemini 3 models
- Add Levels field to all Gemini 3 model definitions:
  - Gemini 3 Pro: ["low", "high"]
  - Gemini 3 Flash: ["minimal", "low", "medium", "high"]

Backward compatibility:
- Gemini 2.5 models continue to use thinkingBudget as before
- If thinkingBudget is provided for Gemini 3, it's converted to the
  appropriate thinkingLevel
- Existing configurations continue to work

Ben Vargas · 2025-12-17 15:28:20 -07:00

a33f5d31fc

ci(workflows): update pr-test-build workflow

Luis Pater · 2025-12-18 03:28:23 +08:00

506699fba1

feat(antigravity): enable token counting via API with resilient routing

Introduces the capability to count tokens for Antigravity-backed requests. This implementation leverages the `countTokens` endpoint of the Antigravity API, replacing the prior unsupported stub.

Key aspects of this update include:

- **API Integration**: Direct integration with the Antigravity `countTokens` API, including necessary request payload translation and authentication.
- **Resilient Infrastructure**: A fallback mechanism has been established, allowing the system to attempt connections across multiple Antigravity base URLs to ensure request success even in the event of temporary service interruptions.
- **Model Aliasing**: Added mappings for `gemini-3-flash` and `gemini-3-flash-preview` to ensure compatibility with the latest model variants.
- **Robust Error Handling**: Comprehensive error handling and logging are in place to manage failures during API interactions.

Luis Pater · 2025-12-18 03:12:46 +08:00

v6.6.25 68a27772b3

docs: add redirect info and disable Pull app auto-sync

Ben Vargas · 2025-12-17 12:06:39 -07:00

de87fb622b

feat(antigravity): add Gemini 3 Flash Preview model definition with enhanced capabilities

Luis Pater · 2025-12-18 01:02:19 +08:00

v6.6.24 f27672f6cf

Merge pull request #580 from router-for-me/chore

chore: ignore agent and bmad artifacts

Luis Pater · 2025-12-18 00:46:25 +08:00

28420c14e4

refactor(antigravity): optimize response handling in Claude model with JSON manipulation

Luis Pater · 2025-12-17 23:57:41 +08:00

v6.6.23 0bd221ff41

feat(antigravity): implement non-streaming execution for Claude model requests

Luis Pater · 2025-12-17 23:17:11 +08:00

5fda6f8ef3

chore: ignore agent and bmad artifacts

hkfires · 2025-12-17 23:15:15 +08:00

9b956f6338

feat(antigravity): add streaming support for Claude model requests

Luis Pater · 2025-12-17 22:16:57 +08:00

09923f654c

Merge pull request #577 from router-for-me/refactor-watcher-phase3

Refactor-watcher-phase3

Luis Pater · 2025-12-17 17:53:04 +08:00

ae7b972649

test(gemini): add test cases and improve compatibility for complex schema cases in CleanJSONSchemaForGemini function

Luis Pater · 2025-12-17 17:38:53 +08:00

47885e3710

Merge pull request #575 from soilSpoon/feature/antigravity-gemini-compat

feature: Improves Antigravity(gemini-claude) JSON schema compatibility

Luis Pater · 2025-12-17 16:53:06 +08:00

4b9a260b37

Merge pull request #572 from router-for-me/watcher

refactor(watcher): extract auth synthesizer to synthesizer package

Luis Pater · 2025-12-17 16:39:59 +08:00

v6.6.22 2c743c8f0b

refactor(translator): replace client.Content structs with JSON-based content generation for more efficient handling of Claude requests

Luis Pater · 2025-12-17 16:39:32 +08:00

9f2c278ee6

feature: Improves schema flattening and tool use handling

Updates schema flattening logic to handle multiple non-null types, providing a more descriptive "Accepts" hint.

Removes redundant tracking of the current tool name in `Params` as it's no longer needed for streaming limits, simplifying the structure.

이대희 · 2025-12-17 17:30:23 +09:00

aea337cfe2

test(watcher): add comprehensive unit tests for watcher edge cases

Add extensive test coverage for watcher module including:
- Auth file handling for empty and missing files
- Persist async error paths and nil receiver handling
- Dispatch loop context cancellation scenarios
- Event processing for errors and channel closures
- Handle event cases: unrelated files, config changes, auth writes,
  remove debouncing, atomic replace detection
- Normalize auth path and debounce cleanup logic
- Runtime auth dispatch and refresh state
- Config reload with mirrored auth dir and OAuth provider filtering
- Start failure when auth dir is missing
- Auth equality comparison ignoring temporal fields
- Reload clients filtering without full rescan

hkfires · 2025-12-17 16:29:11 +08:00

811f8f8b4f

Update internal/util/translator.go

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

이대희 · 2025-12-17 17:15:11 +09:00

27734a23b1

feature: Improves Gemini JSON schema compatibility

Enhances compatibility with the Gemini API by implementing a schema cleaning process.

This includes:
- Centralizing schema cleaning logic for Gemini in a dedicated utility function.
- Converting unsupported schema keywords to hints within the description field.
- Flattening complex schema structures like `anyOf`, `oneOf`, and type arrays to simplify the schema.
- Handling streaming responses with empty tool names, which can occur in subsequent chunks after the initial tool use.

이대희 · 2025-12-17 17:10:53 +09:00

1b8e538a77

refactor(watcher): split watcher.go into focused modules

- Create dispatcher.go for auth update queue management
- Create events.go for fsnotify event handling
- Create config_reload.go for hot-reload logic
- Create clients.go for client lifecycle management
- Simplify watcher.go to core coordinator (~150 lines)
- Maintain 100% API backward compatibility
- All tests passing with 72%+ coverage

hkfires · 2025-12-17 15:53:28 +08:00

41c2385aca

refactor(watcher): extract auth synthesis logic into separate synthesizer package

hkfires · 2025-12-17 15:00:43 +08:00

d605985f45

fix(config): use correct formatting function for prefix change details

hkfires · 2025-12-17 15:00:43 +08:00

d52b28b147

Merge pull request #571 from router-for-me/revert-570-fix/antigravity-thinking-signature

Revert "Fix invalid thinking signature when proxying Claude via Antigravity"

Luis Pater · 2025-12-17 14:56:29 +08:00

4afe1f42ca

Revert "Fix invalid thinking signature when proxying Claude via Antigravity"

Luis Pater · 2025-12-17 14:53:52 +08:00

7481c0eaa0

Fixed: #551

fix(translator): standardize content node handling across translators for assistant and tool calls

Luis Pater · 2025-12-17 13:16:07 +08:00

v6.6.21 ffdfad8482

fix(translator): correct funcName extraction and ensure proper handling of function response data in Antigravity Claude requests

Luis Pater · 2025-12-17 03:57:35 +08:00

v6.6.20 6586f08584

Merge pull request #570 from fuguiKz/fix/antigravity-thinking-signature

Fix invalid thinking signature when proxying Claude via Antigravity

Luis Pater · 2025-12-17 03:04:41 +08:00

f49e887fe6

Merge pull request #569 from router-for-me/watcher

Watcher Module Progressive Refactoring - Phase 1

Luis Pater · 2025-12-17 02:43:34 +08:00

v6.6.19 a5b3ff11fd

test(config): add unit tests for model prefix changes in config diff

Luis Pater · 2025-12-17 02:31:16 +08:00

084558f200

Fix antigravity Claude thinking signature handling

kz · 2025-12-17 02:28:58 +08:00

b602eae215

feat(diff): add support for model prefix changes in config diff logic

Enhance the configuration diff logic to include detection and reporting of `prefix` changes for all model types. Update related struct naming for consistency across the watcher module.

Luis Pater · 2025-12-17 02:05:03 +08:00

d02bf9c243

Merge branch 'dev' into watcher

Luis Pater · 2025-12-17 01:48:11 +08:00

26a5f67df2

1043 Commits