CLIProxyAPI

**fix(translator): rename responseSchema key for generationConfig**

- Renamed `generationConfig.responseSchema` to `generationConfig.responseJsonSchema` in Gemini request transformation to align with updated schema expectations.

Luis Pater · 2025-12-02 18:32:23 +08:00

v6.5.33 41ee44432d

**refactor(registry): remove Qwen3-Coder from model definitions**

Luis Pater · 2025-12-02 11:34:38 +08:00

v6.5.32 1434bc38e5

**refactor(cliproxy, config): remove vertex-compat flow, streamline Vertex API key handling**

- Removed `vertex-compat` executor and related configuration.
- Consolidated Vertex compatibility checks into `vertex` handling with `apikey`-based model resolution.
- Streamlined model generation logic for Vertex API key entries.

Luis Pater · 2025-12-02 09:18:24 +08:00

0fd2abbc3b

feat: Add support for VertexAI compatible service (#375 )

feat: consolidate Vertex AI compatibility with API key support in Gemini

Aero · 2025-12-02 08:14:22 +08:00

0ebb654019

Merge pull request #390 from NguyenSiTrung/main

feat(amp): add model mapping support for routing unavailable models to alternatives

Luis Pater · 2025-12-02 08:07:56 +08:00

08a1d2edf9

fix: enable hot reload for amp-model-mappings config

- Store ampModule in Server struct to access it during config updates
- Call ampModule.OnConfigUpdated() in UpdateClients() for hot reload
- Watch config directory instead of file to handle atomic saves (vim, VSCode, etc.)
- Improve config file event detection with basename matching
- Add diagnostic logging for config reload tracing

NguyenSiTrung · 2025-12-01 13:34:49 +07:00

3409f4e336

Merge branch 'router-for-me:main' into main

NguyenSiTrung · 2025-12-01 08:12:29 +07:00

9354b87e54

Merge pull request #386 from auroraflux/feat/dedupe-thinking-metadata-helpers

refactor(executor): dedupe thinking metadata helpers across Gemini executors

Luis Pater · 2025-12-01 09:00:27 +08:00

v6.5.31 54e24110ec

docs(readme): add CCS (Claude Code Switch) to projects list

Luis Pater · 2025-12-01 07:22:42 +08:00

717c703bff

refactor(executor): dedupe thinking metadata helpers across Gemini executors

Extract applyThinkingMetadata and applyThinkingMetadataCLI helpers to
payload_helpers.go and use them across all four Gemini-based executors:
- gemini_executor.go (Execute, ExecuteStream, CountTokens)
- gemini_cli_executor.go (Execute, ExecuteStream, CountTokens)
- aistudio_executor.go (translateRequest)
- antigravity_executor.go (Execute, ExecuteStream)

This eliminates code duplication introduced in the -reasoning suffix PR
and centralizes the thinking config application logic.

Net reduction: 28 lines of code.

auroraflux · 2025-11-30 15:20:15 -08:00

1c6f4be8ae

Merge pull request #379 from kaitranntt/docs/add-ccs-project

docs: add CCS (Claude Code Switch) to projects list

Luis Pater · 2025-12-01 07:20:04 +08:00

0de2560cee

fix: change AGY to Antigravity

Kai (Tam Nhu) Tran · 2025-11-30 12:43:12 -05:00

85eb926482

docs: add CCS to projects list

Kai (Tam Nhu) Tran · 2025-11-30 12:40:35 -05:00

c52ef08e67

Merge pull request #377 from router-for-me/gemini

feat(registry): add thinking support to gemini models

Luis Pater · 2025-11-30 21:27:54 +08:00

v6.5.30 cb580cd083

feat(registry): add thinking support to gemini models

hkfires · 2025-11-30 20:56:29 +08:00

75e278c7a5

Merge pull request #376 from auroraflux/feat/reasoning-suffix-support

feat(util): add -reasoning suffix support for Gemini models

Luis Pater · 2025-11-30 20:55:38 +08:00

73208c4e55

**feat(util): add -reasoning suffix support for Gemini models**

Adds support for the `-reasoning` model name suffix which enables
thinking/reasoning mode with dynamic budget. This allows clients to
request reasoning-enabled inference using model names like
`gemini-2.5-flash-reasoning` without explicit configuration.

The suffix is normalized to the base model (e.g., gemini-2.5-flash)
with thinkingBudget=-1 (dynamic) and include_thoughts=true.

Follows the existing pattern established by -nothinking and
-thinking-N suffixes.

auroraflux · 2025-11-30 01:18:57 -08:00

32d3809f8c

**fix(executor, auth): ensure index assignment consistency for auth objects**

- Updated `usage_helpers.go` to call `EnsureIndex()` for proper index assignment in reporter initialization.
- Adjusted `auth/manager.go` to assign auth indices inside a locked section when they are unassigned, ensuring thread safety and consistency.

Luis Pater · 2025-11-30 16:56:29 +08:00

v6.5.29 a748e93fd9

Merge pull request #371 from ben-vargas/test-amp-tools

fix(amp): add /threads.rss root-level route for AMP CLI

Luis Pater · 2025-11-30 15:18:23 +08:00

v6.5.28 54a9c4c3c7

Merge pull request #366 from router-for-me/blacklist

Add Model Blacklist

Luis Pater · 2025-11-30 15:17:46 +08:00

18b5c35dea

feat(api): add oauth excluded model management

hkfires · 2025-11-30 13:38:23 +08:00

7b7871ede2

docs(config): expand model exclusion examples

hkfires · 2025-11-30 11:55:47 +08:00

c4e3646b75

feat(cliproxy): support wildcard exclusions for models

hkfires · 2025-11-30 08:02:00 +08:00

022aa81be1

refactor(config): rename model blacklist fields to excluded models

hkfires · 2025-11-29 21:23:47 +08:00

c43f0ea7b1

fix(auth): fix runtime auth reload on oauth blacklist change

hkfires · 2025-11-29 20:30:11 +08:00

6a191358af

fix(amp): add /threads.rss root-level route for AMP CLI

AMP CLI requests /threads.rss at the root level, but the AMP module
only registered routes under /api/*. This caused a 404 error during
AMP CLI startup.

Add the missing root-level route with the same security middleware
(noCORS, optional localhost restriction) as other management routes.

Ben Vargas · 2025-11-29 05:01:19 -07:00

db1119dd78

docs: add model mapping documentation for Amp CLI integration

- Add model mapping feature to README.md Amp CLI section
- Add detailed Model Mapping Configuration section to amp-cli-integration.md
- Update architecture diagram to show model mapping flow
- Update Model Fallback Behavior to include mapping step
- Add Table of Contents entry for model mapping

Trung Nguyen · 2025-11-29 12:51:03 +07:00

33a5656235

feat(amp): add model mapping support for routing unavailable models to alternatives

- Add AmpModelMapping config to route models like 'claude-opus-4.5' to 'claude-sonnet-4'
- Add ModelMapper interface and DefaultModelMapper implementation with hot-reload support
- Enhance FallbackHandler to apply model mappings before falling back to ampcode.com
- Add structured logging for routing decisions (local provider, mapping, amp credits)
- Update config.example.yaml with amp-model-mappings documentation

Trung Nguyen · 2025-11-29 12:44:09 +07:00

2cd59806e2

feat(auth): add oauth provider model blacklist

hkfires · 2025-11-28 10:37:10 +08:00

5983e3ec87

feat(config): add per-key model blacklist for providers

hkfires · 2025-11-27 21:57:07 +08:00

f8cebb9343

**fix(translator): handle non-JSON output parsing for OpenAI function responses**

- Updated `antigravity_openai_request.go` to process non-JSON outputs gracefully by verifying and distinguishing between JSON and plain string formats.
- Ensured proper assignment of parsed or raw response to `functionResponse`.

Luis Pater · 2025-11-27 16:18:49 +08:00

v6.5.27 72c7ef7647

**feat(registry): add context length and update max tokens for Claude model configurations**

- Added `ContextLength` field with a value of 200,000 to all applicable Claude model definitions.
- Standardized `MaxCompletionTokens` values across models for consistency and alignment.

Luis Pater · 2025-11-27 16:13:25 +08:00

d2e4639b2a

Merge pull request #340 from nestharus/fix/339-thinking-openai-gemini-compat

fix(thinking): resolve OpenAI/Gemini compatibility for thinking model…

Luis Pater · 2025-11-27 16:03:24 +08:00

08321223c4

Fixed: #354

**fix(translator): add support for "xhigh" reasoning effort in OpenAI responses**

- Updated handling in `openai_openai-responses_request.go` to include the new "xhigh" reasoning effort level.

Luis Pater · 2025-11-27 15:59:15 +08:00

7e30157590

fix(claude): ensure max_tokens exceeds thinking budget for thinking models

Fixes an issue where Claude thinking models would return 400 errors when
the thinking.budget_tokens was greater than or equal to max_tokens.

Changes:
- Add MaxCompletionTokens: 128000 to all Claude thinking model definitions
- Add ensureMaxTokensForThinking() function in claude_executor.go that:
  - Checks if thinking is enabled with a budget_tokens value
  - Looks up the model's MaxCompletionTokens from the registry
  - Ensures max_tokens is set to at least the model's MaxCompletionTokens
  - Falls back to budget_tokens + 4000 buffer if registry lookup fails

This ensures Anthropic API constraint (max_tokens > thinking.budget_tokens)
is always satisfied when using extended thinking features.

Fixes: #339

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>

nestharus · 2025-11-26 22:31:05 -08:00

e73cdf5cff

**fix(translator): normalize function calls and outputs for consistent input processing**

- Implemented logic to pair consecutive function calls and their outputs, ensuring proper sequencing for processing.
- Adjusted `gemini_openai-responses_request.go` to normalize message structures and maintain expected flow.

Luis Pater · 2025-11-27 10:25:45 +08:00

v6.5.26 39621a0340

**fix(translator): handle non-JSON output gracefully in function call outputs**

- Updated handling of `output` in `gemini_openai-responses_request.go` to use `.Str` instead of `.Raw` when parsing non-JSON string outputs.
- Added checks to distinguish between JSON and non-JSON `output` types for accurate `functionResponse` construction.

Luis Pater · 2025-11-27 09:40:00 +08:00

346b663079

**fix(translator): preserve raw JSON encoding in function call outputs**

- Updated handling of `output` in `gemini_openai-responses_request.go` to use `.Raw` instead of `.String` for preserving original JSON encoding.
- Ensured proper setting of raw JSON output when constructing `functionResponse`.

Luis Pater · 2025-11-27 08:26:53 +08:00

v6.5.25 0bcae68c6c

**fix(translator): ensure partial content is retained while skipping encrypted thoughtSignature**

- Updated handling of `thoughtSignature` across all translator modules to retain other content payloads if present.
- Adjusted logic for `thought_signature` and `inline_data` keys for consistent processing.

Luis Pater · 2025-11-27 00:52:17 +08:00

v6.5.24 c8cee547fd

Merge pull request #343 from router-for-me/misc

style(amp): tidy whitespace in proxy module and tests

Luis Pater · 2025-11-26 19:03:07 +08:00

36755421fe

style(amp): tidy whitespace in proxy module and tests

hkfires · 2025-11-26 18:57:26 +08:00

6c17dbc4da

**feat(registry): add Gemini 3 Pro Image Preview model and remove Claude Sonnet 4.5 Thinking**

- Added new `Gemini 3 Pro Image Preview` model with detailed metadata and configuration.
- Removed outdated `Claude Sonnet 4.5 Thinking` model definition for cleanup and relevance.

Luis Pater · 2025-11-26 18:22:40 +08:00

v6.5.23 ee6429cc75

Fixed: #339

**feat(handlers, executor): add Gemini 3 Pro Preview support and refine Claude system instructions**

- Added support for the new "Gemini 3 Pro Preview" action in Gemini handlers, including detailed metadata and configuration.
- Removed redundant `cache_control` field from Claude system instructions for cleaner payload structure.

Luis Pater · 2025-11-26 11:42:57 +08:00

v6.5.22 a4a26d978e

Fixed: #337

**fix(executor): replace redundant commented code with `checkSystemInstructions` helper**

- Replaced commented-out `sjson.SetRawBytes` lines with the new `checkSystemInstructions` function.
- Centralized system instruction handling for better code clarity and reuse.
- Ensured consistent logic for managing `system` field across Claude executor flows.

Luis Pater · 2025-11-26 08:27:48 +08:00

v6.5.21 ed9f6e897e

Merge pull request #334 from nestharus/feat/claude-thinking-and-beta-headers

feat(claude): add thinking model variants and beta headers support

Luis Pater · 2025-11-26 02:17:02 +08:00

v6.5.20 9c1e3c0687

Merge branch 'dev' into feat/claude-thinking-and-beta-headers

Luis Pater · 2025-11-26 02:16:40 +08:00

2e5681ea32

**fix(executor): comment out redundant code for setting Claude system instructions**

- Commented out multiple instances of `sjson.SetRawBytes` for setting `system` key to Claude instructions as they are redundant.
- Code cleanup to improve clarity and maintainability without affecting functionality.

Luis Pater · 2025-11-26 02:06:16 +08:00

v6.5.19 52c17f03a5

feat(claude): add thinking model variants and beta headers support

- Add Claude thinking model definitions (sonnet-4-5-thinking, opus-4-5-thinking variants)
- Add Thinking support for antigravity models with -thinking suffix
- Add injectThinkingConfig() for automatic thinking budget based on model suffix
- Add resolveUpstreamModel() mappings for thinking variants to actual Claude models
- Add extractAndRemoveBetas() to convert betas array to anthropic-beta header
- Update applyClaudeHeaders() to merge custom betas from request body

Closes #324

nestharus · 2025-11-25 03:33:05 -08:00

d0e694d4ed

**fix(handlers): refactor API response capture to append data safely**

- Introduced `appendAPIResponse` helper to preserve and append data to existing API responses.
- Ensured newline inclusion when appending, if necessary.
- Improved `nil` and data type checks for response handling.
- Updated middleware to skip request logging for `GET` requests.

Luis Pater · 2025-11-25 11:37:02 +08:00

v6.5.18 506f1117dd

**fix(executor): update antigravity executor to enhance model metadata handling**

- Added additional metadata fields (`Name`, `Description`, `DisplayName`, `Version`) to `ModelInfo` struct initialization for better model representation.
- Removed unnecessary whitespace in the code.

Luis Pater · 2025-11-25 09:19:01 +08:00

v6.5.17 113db3c5bf

787 Commits