chuan/CLIProxyAPI - CLIProxyAPI - Penguin

chuan/CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-03-02 04:24:11 +08:00

fix(translator): handle empty item type and string content in OpenAI response parser

Luis Pater · 2025-12-15 20:35:52 +08:00

v6.6.16 d9a65745df
Merge pull request #547 from router-for-me/amp
```
feat(amp): require API key authentication for management routes
```
Luis Pater · 2025-12-15 16:14:49 +08:00

v6.6.15 f6720f8dfa
Merge pull request #543 from router-for-me/log
```
feat(auth): add proxy information to debug logs
```
Chén Mù · 2025-12-15 15:59:16 +08:00

e19ab3a066

feat(amp): require API key authentication for management routes

All Amp management endpoints (e.g., /api/user, /threads) are now protected by the standard API key authentication middleware. This ensures that all management operations require a valid API key, significantly improving security.

As a result of this change:
- The `restrict-management-to-localhost` setting now defaults to `false`. API key authentication provides a stronger and more flexible security control than IP-based restrictions, improving usability in containerized environments.
- The reverse proxy logic now strips the client's `Authorization` header after authenticating the initial request. It then injects the configured `upstream-api-key` for the request to the upstream Amp service.

BREAKING CHANGE: Amp management endpoints now require a valid API key for authentication. Requests without a valid API key in the `Authorization` header will be rejected with a 401 Unauthorized error.

hkfires · 2025-12-15 13:24:53 +08:00

8f1dd69e72

feat(auth): add proxy information to debug logs

hkfires · 2025-12-15 13:14:55 +08:00

f26da24a2f
Merge pull request #533 from router-for-me/think
```
refactor(thinking): centralize reasoning effort mapping and normalize budget values
```
Luis Pater · 2025-12-15 10:34:41 +08:00

v6.6.14 8e4fbcaa7d

fix(openai): forward reasoning.effort value

Drop the hardcoded effort mapping in request conversion so
unknown values are preserved instead of being coerced to `auto

hkfires · 2025-12-15 09:16:15 +08:00

09c339953d

refactor(thinking): export thinking helpers

Expose thinking/effort normalization helpers from the executor package
so conversion tests use production code and stay aligned with runtime
validation behavior.

hkfires · 2025-12-15 09:16:15 +08:00

367a05bdf6

fix(thinking): normalize effort mapping

Route OpenAI reasoning effort through ThinkingEffortToBudget for Claude
translators, preserve "minimal" when translating OpenAI Responses, and
treat blank/unknown efforts as no-ops for Gemini thinking configs.

Also map budget -1 to "auto" and expand cross-protocol thinking tests.

hkfires · 2025-12-15 09:16:15 +08:00

d20b71deb9

fix(thinking): drop unsupported none effort

When budget 0 maps to "none" for models that use thinking levels
but don't support that effort level, strip thinking fields instead
of setting an invalid reasoning_effort value.
Tests now expect removal for this edge case.

hkfires · 2025-12-15 09:16:14 +08:00

712ce9f781

test(thinking): expand conversion edge case coverage

hkfires · 2025-12-15 09:16:14 +08:00

a4a3274a55

fix(thinking): centralize reasoning_effort mapping

Move OpenAI `reasoning_effort` -> Gemini `thinkingConfig` budget logic into
shared helpers used by Gemini, Gemini CLI, and antigravity translators.

Normalize Claude thinking handling by preferring positive budgets, applying
budget token normalization, and gating by model support.

Always convert Gemini `thinkingBudget` back to OpenAI `reasoning_effort` to
support allowCompat models, and update tests for normalization behavior.

hkfires · 2025-12-15 09:16:14 +08:00

716aa71f6e

fix(thinking): map budgets to effort for level models

hkfires · 2025-12-15 09:16:14 +08:00

e8976f9898
test(thinking): cover openai-compat reasoning passthrough

hkfires · 2025-12-15 09:16:14 +08:00

8496cc2444

fix(thinking): gate reasoning effort by model support

Only map OpenAI reasoning effort to Claude thinking for models that support
thinking and use budget tokens (not level-based thinking).

Also add "xhigh" effort mapping and adjust minimal/low budgets, with new
raw-payload conversion tests across protocols and models.

hkfires · 2025-12-15 09:16:14 +08:00

5ef2d59e05

Merge pull request #542 from router-for-me/aistudio

Chén Mù · 2025-12-15 09:13:25 +08:00

v6.6.13 07bb89ae80

Fixed: #534

fix(aistudio): correct JSON string boundary detection for backslash sequences

hkfires · 2025-12-15 09:00:14 +08:00

27a5ad8ec2

Merge pull request #537 from sukakcoding/fix/function-response-fallback
```
fix: handle malformed json in function response parsing
```
Luis Pater · 2025-12-15 03:31:09 +08:00

707b07c5f5
refactor: extract parseFunctionResponse helper to reduce duplication

sukakcoding · 2025-12-15 01:05:36 +08:00

4a764afd76
fix: handle malformed json in function response parsing

sukakcoding · 2025-12-15 00:59:46 +08:00

ecf49d574b
Merge pull request #536 from AoaoMH/feature/auth-model-check
```
feat: using Client Model Infos;
```
Luis Pater · 2025-12-15 00:29:33 +08:00

v6.6.12 5a75ef8ffd
feat: using Client Model Infos;

Test · 2025-12-15 00:13:05 +08:00

07279f8746
fix(registry): remove unused ThinkingSupport from DeepSeek-R1 model

Luis Pater · 2025-12-14 21:30:17 +08:00

71f788b13a
fix(registry): correct DeepSeek-V3.2 experimental model ID

Luis Pater · 2025-12-14 21:27:43 +08:00

59c62dc580
Merge pull request #531 from AoaoMH/feature/auth-model-check
```
feat: add API endpoint to query models for auth credentials
```
Luis Pater · 2025-12-14 16:46:43 +08:00

v6.6.11 d5310a3300
fix(registry): update DeepSeek model definitions with new IDs and descriptions

Luis Pater · 2025-12-14 16:17:11 +08:00

v6.6.10 f0a3eb574e
feat: add API endpoint to query models for auth credentials

Test · 2025-12-14 15:16:26 +08:00

bb15855443
Merge pull request #449 from sususu98/fix/gemini-cli-429-retry-delay-parsing
```
fix(gemini-cli): enhance 429 retry delay parsing
```
Luis Pater · 2025-12-14 14:04:14 +08:00

14ce6aebd1
Merge pull request #515 from teeverc/fix/response-rewriter-streaming-flush
```
fix(amp): flush response buffer after each streaming chunk write
```
Luis Pater · 2025-12-14 13:26:05 +08:00

2fe83723f2
refactor: only flush stream response on successful write

teeverc · 2025-12-13 13:32:54 -08:00

cd8c86c6fb
fix: streaming for amp cli

teeverc · 2025-12-13 13:17:53 -08:00

52d5fd1a67
Merge pull request #498 from teeverc/fix/claude-streaming-flush
```
fix(claude): flush Claude SSE chunks immediately
```
Luis Pater · 2025-12-13 23:58:34 +08:00

v6.6.9 b6ad243e9e

fix(executor): add allowCompat support for reasoning effort normalization

Introduced `allowCompat` parameter to improve compatibility handling for reasoning effort in payloads across OpenAI and similar models.

Luis Pater · 2025-12-13 04:06:02 +08:00

660aabc437

Merge pull request #505 from router-for-me/think
```
fix(thinking): map budgets to effort levels
```
Luis Pater · 2025-12-12 22:17:11 +08:00

566120e8d5
Merge branch 'dev' into think

Luis Pater · 2025-12-12 22:16:44 +08:00

f3f0f1717d
Merge pull request #501 from huynguyen03dev/fix/openai-compat-model-alias-resolution
```
fix(openai-compat): prevent model alias from being overwritten
```
Luis Pater · 2025-12-12 21:58:15 +08:00

v6.6.8 7621ec609e

fix(executor): improve model compatibility handling for OpenAI-compatibility

Enhances payload handling by introducing OpenAI-compatibility checks and refining how reasoning metadata is resolved, ensuring broader model support.

Luis Pater · 2025-12-12 21:57:25 +08:00

9f511f0024

fix(thinking): map budgets to effort levels

Ensure thinking settings translate correctly across providers:
- Only apply reasoning_effort to level-based models and derive it from numeric
  budget suffixes when present
- Strip effort string fields for budget-based models and skip Claude/Gemini
  budget resolution for level-based or unsupported models
- Default Gemini include_thoughts when a nonzero budget override is set
- Add cross-protocol conversion and budget range tests

hkfires · 2025-12-12 21:33:20 +08:00

374faa2640

Merge pull request #502 from router-for-me/iflow
```
fix(auth): prevent duplicate iflow BXAuth tokens
```
Luis Pater · 2025-12-12 20:03:37 +08:00

v6.6.7 1c52a89535
fix(auth): prevent duplicate iflow BXAuth tokens

hkfires · 2025-12-12 19:57:19 +08:00

e7cedbee6e
Merge pull request #500 from router-for-me/think
```
fix(codex): raise default reasoning effort to medium
```
Luis Pater · 2025-12-12 18:35:26 +08:00

v6.6.6 b8194e717c

fix(openai-compat): prevent model alias from being overwritten by ResolveOriginalModel

When using OpenAI-compatible providers with model aliases (e.g., glm-4.6-zai -> glm-4.6),
the alias resolution was correctly applied but then immediately overwritten by
ResolveOriginalModel, causing 'Unknown Model' errors from upstream APIs.

This fix skips the ResolveOriginalModel override when a model alias has already
been resolved, ensuring the correct model name is sent to the upstream provider.

Co-authored-by: Amp <amp@ampcode.com>

huynguyen03.dev · 2025-12-12 17:20:24 +07:00

15c3cc3a50

fix(codex): raise default reasoning effort to medium

hkfires · 2025-12-12 18:18:48 +08:00

d131435e25

Fixed: #440

feat(watcher): normalize auth file paths and implement debounce for remove events

Luis Pater · 2025-12-12 16:50:56 +08:00

6e43669498

Update sdk/api/handlers/claude/code_handlers.go

thank you gemini

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

teeverc · 2025-12-12 00:26:01 -08:00

5ab3032335

fix: flush Claude SSE chunks immediately to match OpenAI behavior

- Write each SSE chunk directly to c.Writer and flush immediately
- Remove buffered writer and ticker-based flushing that caused delayed output
- Add 500ms timeout case for consistency with OpenAI/Gemini handlers
- Clean up unused bufio import

This fixes the 'not streaming' issue where small responses were held
in the buffer until timeout/threshold was reached.

Amp-Thread-ID: https://ampcode.com/threads/T-019b1186-164e-740c-96ab-856f64ee6bee
Co-authored-by: Amp <amp@ampcode.com>

teeverc · 2025-12-12 00:14:19 -08:00

1215c635a0

Merge pull request #494 from ben-vargas/fix-gpt-reasoning-none
```
fix(models): add "none" reasoning effort level to gpt-5.2
```
Luis Pater · 2025-12-12 08:53:19 +08:00

v6.6.5 fc054db51a
refactor(handlers): improve request logging and payload handling

Luis Pater · 2025-12-12 08:52:52 +08:00

6e2306a5f2

fix(models): add "none" reasoning effort level to gpt-5.2

Per OpenAI API documentation, gpt-5.2 supports reasoning_effort values
of "none", "low", "medium", "high", and "xhigh". The "none" level was
missing from the model definition.

Reference: https://platform.openai.com/docs/api-reference/chat/create#chat_create-reasoning_effort

Ben Vargas · 2025-12-11 15:26:23 -07:00

b09e2115d1

Fixed: #492

Luis Pater · 2025-12-12 04:08:11 +08:00

v6.6.4 a68c97a40f

1 2 3 4 5 ...