CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-02-02 12:30:50 +08:00

Author	SHA1	Message	Date
hkfires	c8c27325dc	feat(thinking): enable thinking toggle for qwen3 and deepseek models. Fix #1245	2026-01-28 09:54:05 +08:00
hkfires	e641fde25c	feat(registry): support provider-specific model info lookup	2026-01-20 10:01:17 +08:00
hkfires	1d2fe55310	fix(executor): stop rewriting thinkingLevel for gemini	2026-01-19 19:49:39 +08:00
hkfires	cb6caf3f87	fix(thinking): update ValidateConfig to include fromSuffix parameter and adjust budget validation logic	2026-01-18 16:37:14 +08:00
hkfires	03005b5d29	refactor(thinking): add Gemini family provider grouping for strict validation	2026-01-18 11:30:53 +08:00
hkfires	97b67e0e49	test(thinking): split E2E coverage into suffix and body parameter test functions. Refactor thinking configuration tests by separating model name suffix-based scenarios from request body parameter-based scenarios into distinct test functions with independent case numbering. Architectural improvements: extract thinkingTestCase struct to package level for shared usage; add getTestModels() helper returning complete model fixture set; introduce runThinkingTests() runner with protocol-specific field detection; register level-subset-model fixture with constrained low/high level support; extend iflow protocol handling for glm-test and minimax-test models; add same-protocol strict boundary validation cases (80-89); replace error responses with clamped values for boundary-exceeding budgets.	2026-01-18 10:30:14 +08:00
hkfires	4ad6189487	refactor(thinking): extract antigravity logic into a dedicated provider	2026-01-15 19:08:22 +08:00
hkfires	ff4ff6bc2f	feat(thinking): support zero as a valid thinking budget for capable models	2026-01-15 15:41:10 +08:00
hkfires	33d66959e9	test(thinking): remove legacy unit and integration tests	2026-01-15 13:06:40 +08:00
hkfires	0b06d637e7	refactor: improve thinking logic	2026-01-15 13:06:39 +08:00
Luis Pater	3d01b3cfe8	Merge pull request #553 from XInTheDark/fix/builtin-tools-web-search: fix(translator): preserve built-in tools (web_search) to Responses API	2026-01-09 04:40:13 +08:00
hkfires	8bf3305b2b	fix(thinking): fallback to upstream model for thinking support when alias not in registry	2025-12-31 18:07:13 +08:00
hkfires	d00e3ea973	feat(thinking): add numeric budget to thinkingLevel conversion fallback	2025-12-31 17:14:47 +08:00
hkfires	225e2c6797	feat(amp): add per-client upstream API key mapping support	2025-12-29 12:26:25 +08:00
Ben Vargas	598f0af19b	fix: apply thinkingLevel from model suffix metadata for Gemini 3. The previous commit added thinkingLevel support but didn't apply it when the reasoning effort came from model name suffix (e.g., model(minimal)). This was because ResolveThinkingConfigFromMetadata returns nil for level-based models, bypassing the metadata application. Changes: add ApplyGemini3ThinkingLevelFromMetadata for standard Gemini API; add ApplyGemini3ThinkingLevelFromMetadataCLI for CLI API format; update gemini_cli_executor to apply Gemini 3 thinkingLevel from metadata; update antigravity_executor to apply Gemini 3 thinkingLevel from metadata; update aistudio_executor to apply Gemini 3 thinkingLevel from metadata; add comprehensive test coverage for Gemini 3 thinkingLevel functions.	2025-12-17 16:08:38 -07:00
Luis Pater	7481c0eaa0	Revert "Fix invalid thinking signature when proxying Claude via Antigravity"	2025-12-17 14:53:52 +08:00
Luis Pater	f49e887fe6	Merge pull request #570 from fuguiKz/fix/antigravity-thinking-signature: Fix invalid thinking signature when proxying Claude via Antigravity	2025-12-17 03:04:41 +08:00
kz	b602eae215	Fix antigravity Claude thinking signature handling	2025-12-17 02:28:58 +08:00
hkfires	9df96a4bb4	test(thinking): add effort to budget coverage	2025-12-16 18:34:43 +08:00
hkfires	28a428ae2f	fix(thinking): align budget effort mapping across translators. Unify thinking budget-to-effort conversion in a shared helper, handle disabled/default thinking cases in translators, adjust zero-budget mapping, and drop the old OpenAI-specific helper with updated tests.	2025-12-16 18:34:43 +08:00
Muzhen Gaming	0b834fcb54	fix(translator): preserve built-in tools across openai<->responses. Changes: pass through non-function tool definitions like web_search; translate tool_choice for built-in tools and function tools; add regression tests for built-in tool passthrough.	2025-12-15 21:18:54 +08:00
hkfires	367a05bdf6	refactor(thinking): export thinking helpers. Expose thinking/effort normalization helpers from the executor package so conversion tests use production code and stay aligned with runtime validation behavior.	2025-12-15 09:16:15 +08:00
hkfires	d20b71deb9	fix(thinking): normalize effort mapping. Route OpenAI reasoning effort through ThinkingEffortToBudget for Claude translators, preserve "minimal" when translating OpenAI Responses, and treat blank/unknown efforts as no-ops for Gemini thinking configs. Also map budget -1 to "auto" and expand cross-protocol thinking tests.	2025-12-15 09:16:15 +08:00
hkfires	a4a3274a55	test(thinking): expand conversion edge case coverage	2025-12-15 09:16:14 +08:00
hkfires	716aa71f6e	fix(thinking): centralize reasoning_effort mapping. Move OpenAI `reasoning_effort` -> Gemini `thinkingConfig` budget logic into shared helpers used by Gemini, Gemini CLI, and antigravity translators. Normalize Claude thinking handling by preferring positive budgets, applying budget token normalization, and gating by model support. Always convert Gemini `thinkingBudget` back to OpenAI `reasoning_effort` to support allowCompat models, and update tests for normalization behavior.	2025-12-15 09:16:14 +08:00
hkfires	8496cc2444	test(thinking): cover openai-compat reasoning passthrough	2025-12-15 09:16:14 +08:00
hkfires	5ef2d59e05	fix(thinking): gate reasoning effort by model support. Only map OpenAI reasoning effort to Claude thinking for models that support thinking and use budget tokens (not level-based thinking). Also add "xhigh" effort mapping and adjust minimal/low budgets, with new raw-payload conversion tests across protocols and models.	2025-12-15 09:16:14 +08:00
hkfires	374faa2640	fix(thinking): map budgets to effort levels. Ensure thinking settings translate correctly across providers: only apply reasoning_effort to level-based models and derive it from numeric budget suffixes when present; strip effort string fields for budget-based models and skip Claude/Gemini budget resolution for level-based or unsupported models; default Gemini include_thoughts when a nonzero budget override is set; add cross-protocol conversion and budget range tests.	2025-12-12 21:33:20 +08:00
hkfires	05cfa16e5f	refactor(api): simplify request body parsing in ampcode handlers	2025-12-08 14:45:35 +08:00
hkfires	93a6e2d920	feat(api): add comprehensive ampcode management endpoints. Add new REST API endpoints under /v0/management/ampcode for managing ampcode configuration including upstream URL, API key, localhost restriction, model mappings, and force model mappings settings. Changes: move force-model-mappings from config_basic to config_lists; add GET/PUT/PATCH/DELETE endpoints for all ampcode settings; support model mapping CRUD with upsert (PATCH) capability; add comprehensive test coverage for all ampcode endpoints.	2025-12-08 12:03:00 +08:00
hkfires	8c42b21e66	refactor(config): improve OpenAI compatibility target matching logic	2025-12-03 12:41:17 +08:00

31 Commits