CLIProxyAPI

Fixes Claude API thinking block requirement

Addresses a Claude API requirement where assistant messages with tool use must have a thinking block when thinking is enabled.

This commit injects an empty thinking block into assistant messages that include tool use but lack a thinking block. This ensures compatibility with the Claude API when the thinking feature is enabled.

이대희 · 2026-02-02 14:04:29 +09:00

c548c5d49f

Fixes thinking signature validation errors

Addresses an issue where thinking signature validation fails due to model mapping and empty internal registry.

- Implements a fallback mechanism in the router to use the global model registry when the internal registry is empty. This ensures that models registered via API keys are correctly resolved even without local provider configurations.
- Modifies `GetModelGroup` to use registry-based grouping in addition to name pattern matching, covering cases where models are registered with API keys but lack provider names in their names.
- Updates signature validation to compare model groups instead of exact model names.

These changes resolve thinking signature validation errors and improve the accuracy of model resolution.

이대희 · 2026-02-02 12:50:33 +09:00

a424396a87

Merge remote-tracking branch 'upstream/main' into feature/ampcode-alias

이대희 · 2026-02-02 12:09:31 +09:00

24b4bee500

Merge branch 'dev' into codex

Luis Pater · 2026-02-01 20:20:49 +08:00

b927b0cc6c

Implements unified model routing

Migrates the AMP module to a new unified routing system, replacing the fallback handler with a router-based approach.

This change introduces a `ModelRoutingWrapper` that handles model extraction, routing decisions, and proxying based on provider availability and model mappings.
It provides a more flexible and maintainable routing mechanism by centralizing routing logic.

The changes include:
- Introducing new `routing` package with core routing logic.
- Creating characterization tests to capture existing behavior.
- Implementing model extraction and rewriting.
- Updating AMP module routes to utilize the new routing wrapper.
- Deprecating `FallbackHandler` in favor of the new `ModelRoutingWrapper`.

이대희 · 2026-02-01 16:58:32 +09:00

9299897e04

fix(codex): convert system role to developer for codex input

hkfires · 2026-02-01 15:37:37 +08:00

354f6582b2

Refactors AMP model mapping and error handling

Improves AMP request handling by consolidating model mapping logic into a helper function for better readability and maintainability.

Enhances error handling for premature client connection closures during reverse proxy operations by explicitly acknowledging and swallowing the ErrAbortHandler panic, preventing noisy stack traces.

Removes unused method `findProviderViaOAuthAlias` from the `DefaultModelMapper`.

이대희 · 2026-02-01 15:56:31 +09:00

527a269799

docs(translator): update Codex Claude request transform docs

hkfires · 2026-02-01 14:55:41 +08:00

fe3ebe3532

Refactors context keys for model routing

Uses centralized context keys for accessing mapped and fallback models.

This change deprecates the string-based context keys used in the AMP fallback handlers in favor of the `ctxkeys` package, promoting consistency and reducing the risk of typos.
The authentication conductor now retrieves fallback models using the shared `ctxkeys` constants.

이대희 · 2026-02-01 15:50:45 +09:00

2fe0b6cd2d

Merge remote-tracking branch 'upstream/main' into feature/ampcode-alias

이대희 · 2026-02-01 15:43:16 +09:00

eeb1812d60

refactor(codex): remove codex instructions injection support

hkfires · 2026-02-01 14:33:31 +08:00

ac802a4646

feat(config): track routing and cloak changes in config diff

hkfires · 2026-02-01 12:05:48 +08:00

6a258ff841

refactor(api): centralize config change logging

hkfires · 2026-02-01 11:31:44 +08:00

4649cadcb5

Merge pull request #874 from MohammadErfan-Jabbari/fix/streaming-finish-reason-tool-calls

fix(antigravity): preserve finish_reason tool_calls across streaming chunks

Luis Pater · 2026-02-01 07:05:39 +08:00

73db4e64f6

Merge pull request #859 from shunkakinoki/fix/objectstore-sync-race-condition

fix: prevent race condition in objectstore auth sync

Luis Pater · 2026-02-01 07:01:43 +08:00

69ca0a8fac

Merge pull request #1368 from sususu98/feat/configurable-error-logs-max-files

feat(logging): make error-logs-max-files configurable

Luis Pater · 2026-02-01 06:50:10 +08:00

3b04e11544

feat(config): add payload filter rules to remove JSON paths

Introduce `Filter` rules in the payload configuration to remove specified JSON paths from the payload. Update related helper functions and add examples to `config.example.yaml`.

Luis Pater · 2026-02-01 05:29:41 +08:00

6d8609e457

Fixed: #1372 #1366

fix(caching): ensure unique cache_control injection using count validation

Luis Pater · 2026-01-31 23:48:50 +08:00

d216adeffc

fix(config): add codex instructions enabled change to config change details

hkfires · 2026-01-31 22:44:25 +08:00

bb09708c02

fix(misc): update opencode instructions

hkfires · 2026-01-31 22:28:30 +08:00

1150d972a1

feat(logging): make error-logs-max-files configurable

- Add ErrorLogsMaxFiles config field with default value 10
- Support hot-reload via config file changes
- Add Management API: GET/PUT/PATCH /v0/management/error-logs-max-files
- Maintain SDK backward compatibility with NewFileRequestLogger (3 params)
- Add NewFileRequestLoggerWithOptions for custom error log retention

When request logging is disabled, forced error logs are retained up to
the configured limit. Set to 0 to disable cleanup.

sususu98 · 2026-01-31 17:48:40 +08:00

6db8d2a28e

fix(amp): update fallback_handlers_test.go for provider registration

Amp-Thread-ID: https://ampcode.com/threads/T-019c0f77-82b6-711c-9172-092bd2a2059d
Co-authored-by: Amp <amp@ampcode.com>

이대희 · 2026-01-31 13:55:44 +08:00

adedb16d35

feat(routing): implement unified model routing with OAuth and API key providers

- Added a new routing package to manage provider registration and model resolution.
- Introduced Router, Executor, and Provider interfaces to handle different provider types.
- Implemented OAuthProvider and APIKeyProvider to support OAuth and API key authentication.
- Enhanced DefaultModelMapper to include OAuth model alias handling and fallback mechanisms.
- Updated context management in API handlers to preserve fallback models.
- Added tests for routing logic and provider selection.
- Enhanced Claude request conversion to handle reasoning content based on thinking mode.

이대희 · 2026-01-31 13:55:43 +08:00

89907231c1

feature(ampcode): Improves AMP model mapping with alias support

Enhances the AMP model mapping functionality to support fallback mechanisms using .

This change allows the system to attempt alternative models (aliases) if the primary mapped model fails due to issues like quota exhaustion. It updates the model mapper to load and utilize the  configuration, enabling provider lookup via aliases. It also introduces context keys to pass fallback model names between handlers.

Additionally, this change introduces a fix to prevent ReverseProxy from panicking by swallowing ErrAbortHandler panics.

Amp-Thread-ID: https://ampcode.com/threads/T-019c0cd1-9e59-722b-83f0-e0582aba6914
Co-authored-by: Amp <amp@ampcode.com>

이대희 · 2026-01-31 13:55:43 +08:00

09044e8ccc

fix(misc): update user agent string for opencode

hkfires · 2026-01-31 11:23:08 +08:00

2854e04bbb

fix(translator): handle stop_reason and MAX_TOKENS for Claude responses

Luis Pater · 2026-01-31 04:03:01 +08:00

f99cddf97f

Merge pull request #1248 from shekohex/feat/responses-compact

feat(openai): add responses/compact support

Luis Pater · 2026-01-31 03:12:55 +08:00

f887f9985d

fix(translator): include token usage in message_delta for Claude responses

Luis Pater · 2026-01-31 02:55:27 +08:00

550da0cee8

fix(caching): ensure prompt-caching beta is always appended and add multi-turn cache control tests

Luis Pater · 2026-01-31 01:42:58 +08:00

7ff3936efe

Merge pull request #1294 from Darley-Wey/fix/claude2gemini

fix: skip empty text parts and messages to avoid Gemini API error

Luis Pater · 2026-01-31 01:05:41 +08:00

f36a5f5654

Merge pull request #1295 from SchneeMart/feature/claude-caching

feat(caching): implement Claude prompt caching with multi-turn support

Luis Pater · 2026-01-31 01:04:19 +08:00

c1facdff67

Merge pull request #1311 from router-for-me/fix/gemini-schema

fix(gemini): Removes unsupported extension fields

Luis Pater · 2026-01-30 23:55:56 +08:00

4ee46bc9f2

Merge pull request #1317 from yinkev/feat/gemini-tools-passthrough

feat(translator): add code_execution and url_context tool passthrough

Luis Pater · 2026-01-30 23:46:44 +08:00

c3e94a8277

feat(auth): add custom HTTP client with utls for Claude API authentication

Introduce a custom HTTP client utilizing utls with Firefox TLS fingerprinting to bypass Cloudflare fingerprinting on Anthropic domains. Includes support for proxy configuration and enhanced connection management for HTTP/2.

Luis Pater · 2026-01-30 21:29:41 +08:00

6b6d030ed3

feat(translator): add code_execution and url_context tool passthrough

Add support for Gemini's code_execution and url_context tools in the
request translators, enabling:

- Agentic Vision: Image analysis with Python code execution for
  bounding boxes, annotations, and visual reasoning
- URL Context: Live web page content fetching and analysis

Tools are passed through using the same pattern as google_search:
- code_execution: {} -> codeExecution: {}
- url_context: {} -> urlContext: {}

Tested with Gemini 3 Flash Preview agentic vision successfully.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

kyinhub · 2026-01-29 21:14:52 -08:00

538039f583

refactor(gemini): optimize removeExtensionFields with post-order traversal and DeleteBytes

Amp-Thread-ID: https://ampcode.com/threads/T-019c0d09-330d-7399-b794-652b94847df1
Co-authored-by: Amp <amp@ampcode.com>

이대희 · 2026-01-30 13:02:58 +09:00

ca796510e9

fix(gemini): Removes unsupported extension fields

Removes x-* extension fields from JSON schemas to ensure compatibility with the Gemini API.

These fields, while valid in OpenAPI/JSON Schema, are not recognized by the Gemini API and can cause issues.
The change recursively walks the schema, identifies these extension fields, and removes them, except when they define properties.

Amp-Thread-ID: https://ampcode.com/threads/T-019c0cd1-9e59-722b-83f0-e0582aba6914
Co-authored-by: Amp <amp@ampcode.com>

이대희 · 2026-01-30 12:31:26 +09:00

d0d66cdcb7

feat(caching): implement Claude prompt caching with multi-turn support

- Add ensureCacheControl() to auto-inject cache breakpoints
- Cache tools (last tool), system (last element), and messages (2nd-to-last user turn)
- Add prompt-caching-2024-07-31 beta header
- Return original payload on sjson error to prevent corruption
- Include verification test for caching logic

Enables up to 90% cost reduction on cached tokens.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Martin Schneeweiss · 2026-01-29 22:59:33 +01:00

3a43ecb19b

fix(config): ensure empty mapping persists for oauth-model-alias deletions #1305

Luis Pater · 2026-01-30 04:17:56 +08:00

a709e5a12d

Merge pull request #1300 from sususu98/feat/log-api-response-timestamp

fix(logging): add API response timestamp and fix request timestamp timing

Luis Pater · 2026-01-30 03:27:17 +08:00

f0ac77197b

Merge pull request #1298 from sususu98/fix/restore-usageMetadata-in-gemini-translator

fix(translator): restore usageMetadata in Gemini responses from Antigravity

Luis Pater · 2026-01-30 02:59:41 +08:00

da0bbf2a3f

fix(logging): capture streaming TTFB on first chunk and make timestamps required

- Add firstChunkTimestamp field to ResponseWriterWrapper for sync capture
- Capture TTFB in Write() and WriteString() before async channel send
- Add SetFirstChunkTimestamp() to StreamingLogWriter interface
- Make requestTimestamp/apiResponseTimestamp required in LogRequest()
- Remove timestamp capture from WriteAPIResponse() (now via setter)
- Fix Gemini handler to set API_RESPONSE_TIMESTAMP before writing response

This ensures accurate TTFB measurement for all streaming API formats
(OpenAI, Gemini, Claude) by capturing timestamp synchronously when
the first response chunk arrives, not when the stream finalizes.

sususu98 · 2026-01-29 22:32:24 +08:00

295f34d7f0

fix(logging): add API response timestamp and fix request timestamp timing

Previously:
- REQUEST INFO timestamp was captured at log write time (not request arrival)
- API RESPONSE had NO timestamp at all

This fix:
- Captures REQUEST INFO timestamp when request first arrives
- Adds API RESPONSE timestamp when upstream response arrives

Changes:
- Add Timestamp field to RequestInfo, set at middleware initialization
- Set API_RESPONSE_TIMESTAMP in appendAPIResponse() and gemini handler
- Pass timestamps through logging chain to writeNonStreamingLog()
- Add timestamp output to API RESPONSE section

This enables accurate measurement of backend response latency in error logs.

sususu98 · 2026-01-29 22:22:18 +08:00

c41ce77eea

fix(config): prune oauth-model-alias when preserving config

hkfires · 2026-01-29 14:06:52 +08:00

d0bada7a43

fix(translator): restore usageMetadata in Gemini responses from Antigravity

When using Gemini API format with Antigravity backend, the executor
renames usageMetadata to cpaUsageMetadata in non-terminal chunks.
The Gemini translator was returning this internal field name directly
to clients instead of the standard usageMetadata field.

Add restoreUsageMetadata() to rename cpaUsageMetadata back to
usageMetadata before returning responses to clients.

sususu98 · 2026-01-29 11:16:00 +08:00

9dc0e6d08b

fix(api): update amp module only on config changes

hkfires · 2026-01-29 09:28:49 +08:00

8510fc313e

fix: skip empty text parts and messages to avoid Gemini API error

When Claude API sends an assistant message with empty text content like:
{"role":"assistant","content":[{"type":"text","text":""}]}
The translator was creating a part object {} with no data field,
causing Gemini API to return error:
"required oneof field 'data' must have one initialized field"
This fix:
1. Skips empty text parts (text="") during translation
2. Skips entire messages when their parts array becomes empty
This ensures compatibility when clients send empty assistant messages
in their conversation history.

Darley · 2026-01-29 04:13:07 +08:00

2666708c30

Merge pull request #1276 from router-for-me/thinking

feat(thinking): enable thinking toggle for qwen3 and deepseek models

Luis Pater · 2026-01-28 11:16:54 +08:00

9e5b1d24e8

refactor: consolidate channel send logic with context-safe handlers

Optimize channel operations by introducing reusable context-aware send functions (`send` and `sendErr`) across `wsrelay`, `handlers`, and `cliproxy`. Ensure graceful handling of canceled contexts during stream operations.

Luis Pater · 2026-01-28 10:58:35 +08:00

e93e05ae25

feat(thinking): enable thinking toggle for qwen3 and deepseek models

Fix #1245

hkfires · 2026-01-28 09:54:05 +08:00

c8c27325dc

1159 Commits