CLIProxyAPI

feat(translator): support Claude thinking type adaptive

hkfires · 2026-02-10 16:20:32 +08:00

938a799263

fix(translator): correct gemini-cli log prefix

hkfires · 2026-02-07 08:40:09 +08:00

b7e4f00c5f

Add Kimi (Moonshot AI) provider support

- OAuth2 device authorization grant flow (RFC 8628) for authentication
- Streaming and non-streaming chat completions via OpenAI-compatible API
- Models: kimi-k2, kimi-k2-thinking, kimi-k2.5
- CLI `--kimi-login` command for device flow auth
- Token management with automatic refresh
- Thinking/reasoning effort support for thinking-enabled models

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

test · 2026-02-05 19:24:46 -05:00

f5f26f0cbe

fix(test): rename test function to reflect behavior change for builtin tools

hkfires · 2026-02-05 09:25:34 +08:00

075e3ab69e

feat(thinking): enable thinking toggle for qwen3 and deepseek models

Fix #1245

hkfires · 2026-01-28 09:54:05 +08:00

c8c27325dc

feat(registry): support provider-specific model info lookup

hkfires · 2026-01-20 10:01:17 +08:00

e641fde25c

fix(executor): stop rewriting thinkingLevel for gemini

hkfires · 2026-01-19 19:49:39 +08:00

1d2fe55310

fix(thinking): update ValidateConfig to include fromSuffix parameter and adjust budget validation logic

hkfires · 2026-01-18 16:37:14 +08:00

cb6caf3f87

refactor(thinking): add Gemini family provider grouping for strict validation

hkfires · 2026-01-18 11:30:53 +08:00

03005b5d29

test(thinking): split E2E coverage into suffix and body parameter test functions

Refactor thinking configuration tests by separating model name suffix-based
scenarios from request body parameter-based scenarios into distinct test
functions with independent case numbering.

Architectural improvements:
- Extract thinkingTestCase struct to package level for shared usage
- Add getTestModels() helper returning complete model fixture set
- Introduce runThinkingTests() runner with protocol-specific field detection
- Register level-subset-model fixture with constrained low/high level support
- Extend iflow protocol handling for glm-test and minimax-test models
- Add same-protocol strict boundary validation cases (80-89)
- Replace error responses with clamped values for boundary-exceeding budgets

hkfires · 2026-01-18 10:30:14 +08:00

97b67e0e49

refactor(thinking): extract antigravity logic into a dedicated provider

hkfires · 2026-01-15 19:08:22 +08:00

4ad6189487

feat(thinking): support zero as a valid thinking budget for capable models

hkfires · 2026-01-15 15:41:10 +08:00

ff4ff6bc2f

test(thinking): remove legacy unit and integration tests

hkfires · 2026-01-15 13:06:40 +08:00

33d66959e9

refactor: improve thinking logic

hkfires · 2026-01-15 13:06:39 +08:00

0b06d637e7

Merge pull request #553 from XInTheDark/fix/builtin-tools-web-search

fix(translator): preserve built-in tools (web_search) to Responses API

Luis Pater · 2026-01-09 04:40:13 +08:00

3d01b3cfe8

fix(thinking): fallback to upstream model for thinking support when alias not in registry

hkfires · 2025-12-31 18:07:13 +08:00

8bf3305b2b

feat(thinking): add numeric budget to thinkingLevel conversion fallback

hkfires · 2025-12-31 17:14:47 +08:00

d00e3ea973

feat(amp): add per-client upstream API key mapping support

hkfires · 2025-12-29 12:26:25 +08:00

225e2c6797

fix: apply thinkingLevel from model suffix metadata for Gemini 3

The previous commit added thinkingLevel support but didn't apply it
when the reasoning effort came from model name suffix (e.g., model(minimal)).

This was because ResolveThinkingConfigFromMetadata returns nil for
level-based models, bypassing the metadata application.

Changes:
- Add ApplyGemini3ThinkingLevelFromMetadata for standard Gemini API
- Add ApplyGemini3ThinkingLevelFromMetadataCLI for CLI API format
- Update gemini_cli_executor to apply Gemini 3 thinkingLevel from metadata
- Update antigravity_executor to apply Gemini 3 thinkingLevel from metadata
- Update aistudio_executor to apply Gemini 3 thinkingLevel from metadata
- Add comprehensive test coverage for Gemini 3 thinkingLevel functions

Ben Vargas · 2025-12-17 16:08:38 -07:00

598f0af19b

Revert "Fix invalid thinking signature when proxying Claude via Antigravity"

Luis Pater · 2025-12-17 14:53:52 +08:00

7481c0eaa0

Merge pull request #570 from fuguiKz/fix/antigravity-thinking-signature

Fix invalid thinking signature when proxying Claude via Antigravity

Luis Pater · 2025-12-17 03:04:41 +08:00

f49e887fe6

Fix antigravity Claude thinking signature handling

kz · 2025-12-17 02:28:58 +08:00

b602eae215

test(thinking): add effort to budget coverage

hkfires · 2025-12-16 18:34:43 +08:00

9df96a4bb4

fix(thinking): align budget effort mapping across translators

Unify thinking budget-to-effort conversion in a shared helper, handle disabled/default thinking cases in translators, adjust zero-budget mapping, and drop the old OpenAI-specific helper with updated tests.

hkfires · 2025-12-16 18:34:43 +08:00

28a428ae2f

fix(translator): preserve built-in tools across openai<->responses

- Pass through non-function tool definitions like web_search

- Translate tool_choice for built-in tools and function tools

- Add regression tests for built-in tool passthrough

Muzhen Gaming · 2025-12-15 21:18:54 +08:00

0b834fcb54

refactor(thinking): export thinking helpers

Expose thinking/effort normalization helpers from the executor package
so conversion tests use production code and stay aligned with runtime
validation behavior.

hkfires · 2025-12-15 09:16:15 +08:00

367a05bdf6

fix(thinking): normalize effort mapping

Route OpenAI reasoning effort through ThinkingEffortToBudget for Claude
translators, preserve "minimal" when translating OpenAI Responses, and
treat blank/unknown efforts as no-ops for Gemini thinking configs.

Also map budget -1 to "auto" and expand cross-protocol thinking tests.

hkfires · 2025-12-15 09:16:15 +08:00

d20b71deb9

test(thinking): expand conversion edge case coverage

hkfires · 2025-12-15 09:16:14 +08:00

a4a3274a55

fix(thinking): centralize reasoning_effort mapping

Move OpenAI `reasoning_effort` -> Gemini `thinkingConfig` budget logic into
shared helpers used by Gemini, Gemini CLI, and antigravity translators.

Normalize Claude thinking handling by preferring positive budgets, applying
budget token normalization, and gating by model support.

Always convert Gemini `thinkingBudget` back to OpenAI `reasoning_effort` to
support allowCompat models, and update tests for normalization behavior.

hkfires · 2025-12-15 09:16:14 +08:00

716aa71f6e

test(thinking): cover openai-compat reasoning passthrough

hkfires · 2025-12-15 09:16:14 +08:00

8496cc2444

fix(thinking): gate reasoning effort by model support

Only map OpenAI reasoning effort to Claude thinking for models that support
thinking and use budget tokens (not level-based thinking).

Also add "xhigh" effort mapping and adjust minimal/low budgets, with new
raw-payload conversion tests across protocols and models.

hkfires · 2025-12-15 09:16:14 +08:00

5ef2d59e05

fix(thinking): map budgets to effort levels

Ensure thinking settings translate correctly across providers:
- Only apply reasoning_effort to level-based models and derive it from numeric
  budget suffixes when present
- Strip effort string fields for budget-based models and skip Claude/Gemini
  budget resolution for level-based or unsupported models
- Default Gemini include_thoughts when a nonzero budget override is set
- Add cross-protocol conversion and budget range tests

hkfires · 2025-12-12 21:33:20 +08:00

374faa2640

refactor(api): simplify request body parsing in ampcode handlers

hkfires · 2025-12-08 14:45:35 +08:00

05cfa16e5f

feat(api): add comprehensive ampcode management endpoints

Add new REST API endpoints under /v0/management/ampcode for managing
ampcode configuration including upstream URL, API key, localhost
restriction, model mappings, and force model mappings settings.

- Move force-model-mappings from config_basic to config_lists
- Add GET/PUT/PATCH/DELETE endpoints for all ampcode settings
- Support model mapping CRUD with upsert (PATCH) capability
- Add comprehensive test coverage for all ampcode endpoints

hkfires · 2025-12-08 12:03:00 +08:00

93a6e2d920

refactor(config): improve OpenAI compatibility target matching logic

hkfires · 2025-12-03 12:41:17 +08:00

8c42b21e66

35 Commits