CLIProxyAPI

refactor(thinking): use bracket tags for thinking meta

Align thinking suffix handling on a single bracket-style marker.

NormalizeThinkingModel strips a terminal `[value]` segment from
model identifiers and turns it into either a thinking budget (for
numeric values) or a reasoning effort hint (for strings). Emission
of `ThinkingIncludeThoughtsMetadataKey` is removed.

Executor helpers and the example config are updated so their
comments reference the new `[value]` suffix format instead of the
legacy dash variants.

BREAKING CHANGE: dash-based thinking suffixes (`-thinking`,
`-thinking-N`, `-reasoning`, `-nothinking`) are no longer parsed
for thinking metadata; only `[value]` annotations are recognized.

hkfires · 2025-12-11 18:17:28 +08:00

facfe7c518

fix(runtime): unify claude thinking config resolution

hkfires · 2025-12-11 17:20:44 +08:00

6285459c08

docs(runtime): document reasoning effort precedence

hkfires · 2025-12-11 16:35:36 +08:00

21bbceca0c

fix(runtime): validate thinking config in iflow and qwen

hkfires · 2025-12-11 16:21:50 +08:00

f6300c72b7

fix(util): do not strip thinking suffix on registered models

NormalizeThinkingModel now checks ModelSupportsThinking before removing
"-thinking" or "-thinking-<ver>", avoiding accidental parsing of model
names where the suffix is part of the official id (e.g., kimi-k2-thinking,
qwen3-235b-a22b-thinking-2507).

The registry adds ThinkingSupport metadata for several models and
propagates it via ModelInfo (e.g., kimi-k2-thinking, deepseek-r1,
qwen3-235b-a22b-thinking-2507, minimax-m2), enabling accurate detection
of thinking-capable models and correcting base model inference.

hkfires · 2025-12-11 15:52:14 +08:00

007572b58e

fix(runtime): unify reasoning effort metadata overrides

hkfires · 2025-12-11 14:35:05 +08:00

3a81ab22fd

fix(runtime): validate reasoning effort levels

hkfires · 2025-12-11 12:36:54 +08:00

519da2e042

fix(util): align reasoning effort handling with registry

hkfires · 2025-12-11 12:20:12 +08:00

169f4295d0

fix(util): centralize reasoning effort normalization

hkfires · 2025-12-11 12:14:51 +08:00

d06d0eab2f

feat(runtime): add thinking config normalization

hkfires · 2025-12-11 11:51:33 +08:00

3ffd120ae9

feat(registry): add thinking metadata for models

hkfires · 2025-12-11 11:28:44 +08:00

a03d514095

Merge pull request #479 from router-for-me/claude

fix(claude): prevent final events when no content streamed

Luis Pater · 2025-12-11 08:18:59 +08:00

1da03bfe15

feat(util): implement dynamic thinking suffix normalization and refactor budget resolution logic

- Added support for parsing and normalizing dynamic thinking model suffixes.
- Centralized budget resolution across executors and payload helpers.
- Retired legacy Gemini-specific thinking handlers in favor of unified logic.
- Updated executors to use metadata-based thinking configuration.
- Added `ResolveOriginalModel` utility for resolving normalized upstream models using request metadata.
- Updated executors (Gemini, Codex, iFlow, OpenAI, Qwen) to incorporate upstream model resolution and substitute model values in payloads and request URLs.
- Ensured fallbacks handle cases with missing or malformed metadata to derive models robustly.
- Refactored upstream model resolution to dynamically incorporate metadata for selecting and normalizing models.
- Improved handling of thinking configurations and model overrides in executors.
- Removed hardcoded thinking model entries and migrated logic to metadata-based resolution.
- Updated payload mutations to always include the resolved model.

Luis Pater · 2025-12-11 03:10:50 +08:00

v6.6.0 423ce97665

Fixed: #478

feat(antigravity): add support for inline image data in client responses

Luis Pater · 2025-12-10 23:55:53 +08:00

v6.5.65 e717939edb

fix(claude): prevent final events when no content streamed

hkfires · 2025-12-10 22:19:55 +08:00

a89514951f

fix(logging): update response aggregation logic to include all attempts

Luis Pater · 2025-12-10 16:53:48 +08:00

v6.5.64 94d61c7b2b

feat(responses): add unique identifiers for responses, function calls, and tool uses

Luis Pater · 2025-12-10 16:02:54 +08:00

v6.5.63 1249b07eb8

feat(antigravity): add unique identifier for tool use blocks in response

Luis Pater · 2025-12-10 15:27:57 +08:00

v6.5.62 6b37f33d31

fix(antigravity): remove references to autopush endpoint and update fallback logic

Luis Pater · 2025-12-10 00:13:20 +08:00

v6.5.61 f25f419e5a

Merge pull request #465 from router-for-me/think

Move thinking budget normalization from translators to executor

Luis Pater · 2025-12-09 21:10:33 +08:00

v6.5.60 b7e382008f

feat(amp): add /news.rss proxy route