CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-02-03 04:50:52 +08:00

Author	SHA1	Message	Date
Luis Pater	73db4e64f6	Merge pull request #874 from MohammadErfan-Jabbari/fix/streaming-finish-reason-tool-calls fix(antigravity): preserve finish_reason tool_calls across streaming chunks	2026-02-01 07:05:39 +08:00
kyinhub	538039f583	feat(translator): add code_execution and url_context tool passthrough Add support for Gemini's code_execution and url_context tools in the request translators, enabling: - Agentic Vision: Image analysis with Python code execution for bounding boxes, annotations, and visual reasoning - URL Context: Live web page content fetching and analysis Tools are passed through using the same pattern as google_search: - code_execution: {} -> codeExecution: {} - url_context: {} -> urlContext: {} Tested with Gemini 3 Flash Preview agentic vision successfully. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-29 21:14:52 -08:00
Luis Pater	0d6ecb0191	Fixed: #1077 refactor(translator): improve tools handling by separating functionDeclarations and googleSearch nodes	2026-01-24 05:51:11 +08:00
Luis Pater	68b3565d7b	Merge branch 'main' into dev (PR #961)	2026-01-20 11:42:22 +08:00
Luis Pater	46433a25f8	fix(translator): add check for empty `text` to prevent invalid serialization in `gemini` and `antigravity`	2026-01-18 00:50:10 +08:00
Luis Pater	f8f3ad84fc	Fixed: #1064 feat(translator): improve system message handling and content indexing across translators - Updated logic for processing system messages in `claude`, `gemini`, `gemini-cli`, and `antigravity` translators. - Introduced indexing for `systemInstruction.parts` to ensure proper ordering and handling of multi-part content. - Added safeguards for accurate content transformation and serialization.	2026-01-17 05:40:56 +08:00
Luis Pater	cec4e251bd	feat(translator): preserve `text` field in serialized output during chat completions processing	2026-01-16 11:35:34 +08:00
hkfires	199cf480b0	refactor(thinking): remove support for non-standard thinking configurations This change removes the translation logic for several non-standard, proprietary extensions used to configure thinking/reasoning. Specifically, support for `extra_body.google.thinking_config` and the Anthropic-style `thinking` object has been dropped from the OpenAI request translators. This simplification streamlines the translators, focusing them on the standard `reasoning_effort` parameter. It also removes the need to look up model information from the registry within these components. BREAKING CHANGE: Support for non-standard thinking configurations via `extra_body.google.thinking_config` and the Anthropic-style `thinking` object has been removed. Clients should now use the standard `reasoning_effort` parameter to control reasoning.	2026-01-15 19:32:12 +08:00
hkfires	6e4a602c60	fix(thinking): map reasoning_effort to thinkingConfig	2026-01-15 13:06:40 +08:00
hkfires	0b06d637e7	refactor: improve thinking logic	2026-01-15 13:06:39 +08:00
extremk	0b5bbe9234	Add candidate count handling in OpenAI request	2026-01-10 18:49:29 +08:00
MohammadErfan Jabbari	fe6043aec7	fix(antigravity): preserve finish_reason tool_calls across streaming chunks When streaming responses with tool calls, the finish_reason was being overwritten. The upstream sends functionCall in chunk 1, then finishReason: STOP in chunk 2. The old code would set finish_reason from every chunk, causing "tool_calls" to be overwritten by "stop". This broke clients like Claude Code that rely on finish_reason to detect when tool calls are complete. Changes: - Add SawToolCall bool to track tool calls across entire stream - Add UpstreamFinishReason to cache the finish reason - Only emit finish_reason on final chunk (has both finishReason + usage) - Priority: tool_calls > max_tokens > stop Includes 5 unit tests covering: - Tool calls not overwritten by subsequent STOP - Normal text gets "stop" - MAX_TOKENS without tool calls gets "max_tokens" - Tool calls take priority over MAX_TOKENS - Intermediate chunks have no finish_reason Fixes streaming tool call detection for Claude Code + Gemini models. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2026-01-05 18:45:25 +01:00
Luis Pater	8f8dfd081b	Merge pull request #850 from can1357/main feat(translator): add developer role support for Gemini translators	2026-01-05 11:27:24 +08:00
Luis Pater	8edbda57cf	feat(translator): add `thoughtSignature` to node parts for Gemini and Antigravity requests Enhanced node structure by including `thoughtSignature` for inline data parts in Gemini OpenAI, Gemini CLI, and Antigravity request handlers to improve traceability of thought processes.	2026-01-05 09:25:17 +08:00
can1357	6762e081f3	feat(translator): add developer role support for Gemini translators Treat OpenAI's "developer" role the same as "system" role in request translation for gemini, gemini-cli, and antigravity backends.	2026-01-03 21:01:01 +01:00
Luis Pater	7646a2b877	Fixed: #749 fix(translators): ensure `gjson.String` content is non-empty before setting `parts` in OpenAI request logic	2025-12-28 00:54:26 +08:00
Luis Pater	33e53a2a56	fix(translators): ensure correct handling and output of multimodal assistant content across request handlers	2025-12-26 05:08:04 +08:00
Luis Pater	3f949b7f84	Merge pull request #704 from tinyc0der/add-index fix(openai): add index field to image response for LiteLLM compatibility	2025-12-25 21:35:12 +08:00
TinyCoder	a7fc2ee4cf	refactor(image): avoid using json.Marshal	2025-12-25 14:21:01 +07:00
Luis Pater	06ad527e8c	Fixed: #696 fix(translators): adjust prompt token calculation by subtracting cached tokens across Gemini, OpenAI, and Claude handlers	2025-12-24 23:29:18 +08:00
TinyCoder	671558a822	fix(openai): add index field to image response for LiteLLM compatibility LiteLLM's Pydantic model requires an index field in each image object. Without it, responses fail validation with "images.0.index Field required".	2025-12-24 17:43:31 +07:00
Luis Pater	7569320770	Merge branch 'dev' into fix/antigravity-prompt-caching	2025-12-24 03:49:46 +08:00
Luis Pater	24bc9cba67	Fixed: #639 fix(antigravity): validate function arguments before serialization Ensure `function.arguments` is valid JSON before setting raw bytes; fall back to setting it as parameterized content if invalid.	2025-12-23 03:49:45 +08:00
Luis Pater	5106caf641	Fixed: #654 feat: handle array input for system instructions in translators Enhanced Gemini, Gemini-CLI, and Antigravity translators to process array content for system instructions. Adds support for assigning roles and handling multiple content parts dynamically.	2025-12-23 02:24:26 +08:00
Luis Pater	a86d501dc2	refactor: replace `json.Marshal` and `json.Unmarshal` with `sjson` and `gjson` Optimized the handling of JSON serialization and deserialization by replacing redundant `json.Marshal` and `json.Unmarshal` calls with `sjson` and `gjson`. Introduced a `marshalJSONValue` utility for compact JSON encoding, improving performance and code simplicity. Removed unused `encoding/json` imports.	2025-12-22 11:44:06 +08:00
Evan Nguyen	24e8e20b59	Merge branch 'main' into fix/antigravity-prompt-caching	2025-12-21 19:43:24 +07:00
Luis Pater	d7afb6eb0c	fix(gemini): improve reasoning effort conversion for Gemini 3 models Refactors the reasoning effort conversion logic for Gemini models. The update specifically addresses how `reasoning_effort` is translated into Gemini 3 specific thinking configurations (`thinkingLevel`, `includeThoughts`) and ensures that numeric budgets are not incorrectly applied to level-based models. Changes include: - Differentiating conversion logic for Gemini 3 models versus other models. - Handling `none`, `auto`, and validated thinking levels for Gemini 3. - Maintaining existing conversion for models not using discrete thinking levels.	2025-12-20 03:11:28 +08:00
evann	bc6c4cdbfc	feat(antigravity): add logging for cached token setting errors in responses	2025-12-19 16:49:50 +07:00
evann	9058d406a3	feat(antigravity): enhance prompt caching support and update agent version	2025-12-19 16:33:41 +07:00
Luis Pater	ffdfad8482	Fixed: #551 fix(translator): standardize content node handling across translators for assistant and tool calls	2025-12-17 13:16:07 +08:00
hkfires	716aa71f6e	fix(thinking): centralize reasoning_effort mapping Move OpenAI `reasoning_effort` -> Gemini `thinkingConfig` budget logic into shared helpers used by Gemini, Gemini CLI, and antigravity translators. Normalize Claude thinking handling by preferring positive budgets, applying budget token normalization, and gating by model support. Always convert Gemini `thinkingBudget` back to OpenAI `reasoning_effort` to support allowCompat models, and update tests for normalization behavior.	2025-12-15 09:16:14 +08:00
hkfires	e8976f9898	fix(thinking): map budgets to effort for level models	2025-12-15 09:16:14 +08:00
Luis Pater	1249b07eb8	feat(responses): add unique identifiers for responses, function calls, and tool uses	2025-12-10 16:02:54 +08:00
hkfires	9b202b6c1c	fix(executor): centralize default thinking config	2025-12-09 21:05:06 +08:00
hkfires	5b6d201408	refactor(translator): remove thinking budget normalization across all translators	2025-12-09 21:05:06 +08:00
hkfires	a283545b6b	feat(antigravity): enforce thinking budget limits for Claude models	2025-12-08 20:36:17 +08:00
hkfires	a174d015f2	feat(openai): handle thinking.budget_tokens from Anthropic-style requests	2025-12-07 19:14:05 +08:00
Luis Pater	f383840cf9	fix(antigravity): update toolNode role from "tool" to "user" in chat completions	2025-12-07 02:37:46 +08:00
Luis Pater	7a628426dc	Fixed: #433 refactor(translator): normalize finish reason casing across all OpenAI response handlers	2025-12-07 01:48:24 +08:00
Luis Pater	412148af0e	feat(antigravity): add function ID to FunctionCall and FunctionResponse models	2025-12-05 23:05:35 +08:00
Luis Pater	72c7ef7647	fix(translator): handle non-JSON output parsing for OpenAI function responses - Updated `antigravity_openai_request.go` to process non-JSON outputs gracefully by verifying and distinguishing between JSON and plain string formats. - Ensured proper assignment of parsed or raw response to `functionResponse`.	2025-11-27 16:18:49 +08:00
Luis Pater	c8cee547fd	fix(translator): ensure partial content is retained while skipping encrypted thoughtSignature - Updated handling of `thoughtSignature` across all translator modules to retain other content payloads if present. - Adjusted logic for `thought_signature` and `inline_data` keys for consistent processing.	2025-11-27 00:52:17 +08:00
Luis Pater	9d50a68768	feat(translator): improve content processing and Antigravity request conversion - Refactored response translation logic to support mixed content types (`input_text`, `output_text`, `input_image`) with better role assignments and part handling. - Added image processing logic for embedding inline data with MIME type and base64 encoded content. - Updated Antigravity request conversion to replace Gemini CLI references for consistency.	2025-11-22 21:34:34 +08:00
hkfires	1061354b2f	fix: handle empty and non-JSON SSE chunks safely	2025-11-22 13:49:23 +08:00
hkfires	c29931e093	fix(translator): ignore empty JSON chunks in OpenAI responses	2025-11-22 13:09:16 +08:00
hkfires	b05cfd9f84	fix(translator): include empty text chunks in responses	2025-11-22 13:03:50 +08:00
hkfires	dc8d3201e1	feat(translator): support image size and googleSearch tools	2025-11-22 10:36:52 +08:00
Luis Pater	c1031e2d3f	feat(translator): add Antigravity translation logic - Introduced request and response translation functions to enable compatibility between OpenAI Chat Completions API and Antigravity. - Registered translation utilities for both streaming and non-streaming scenarios. - Added support for reasoning content, tool calls, and metadata handling. - Established request normalization and embedding for Antigravity-compatible payloads. - Added new fields to `Params` struct for better tracking of finish reasons, usage metadata, and tool usage. - Refactored handling of response transitions, final events, and state-driven logic in `ConvertAntigravityResponseToClaude`. - Introduced `appendFinalEvents` and `resolveStopReason` helper functions for cleaner separation of concerns. - Added `TotalTokenCount` field to `Params` struct for enhanced token tracking. - Updated token count calculations to fallback on `TotalTokenCount` when specific counts are missing. - Introduced `hasNonZeroUsageMetadata` function to validate presence of token data in `usage_metadata`.	2025-11-21 23:40:59 +08:00
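
Commit fe6043aec7 in the list above describes how finish_reason is resolved across streaming chunks: tool calls are tracked for the whole stream, the upstream finish reason is cached, and finish_reason is emitted only on the final chunk with the priority tool_calls > max_tokens > stop. A minimal Go sketch of that rule follows. The field names SawToolCall and UpstreamFinishReason come from the commit message; the streamState and chunk types, the observe method, and the simplified chunk shape are hypothetical and are not taken from the repository's code.

```go
package main

import "fmt"

// streamState is a hypothetical per-stream state holder. The two field
// names are the ones mentioned in the commit message.
type streamState struct {
	SawToolCall          bool   // set once any chunk carries a function call
	UpstreamFinishReason string // last finishReason seen from upstream
}

// chunk is a simplified, assumed view of one upstream streaming chunk.
type chunk struct {
	HasFunctionCall bool
	FinishReason    string // "STOP", "MAX_TOKENS", or "" when absent
	HasUsage        bool   // true when the chunk carries usage metadata
}

// observe records what the chunk carries and returns the finish_reason to
// emit. It returns "" for intermediate chunks, so nothing is overwritten.
func (s *streamState) observe(c chunk) string {
	if c.HasFunctionCall {
		s.SawToolCall = true
	}
	if c.FinishReason != "" {
		s.UpstreamFinishReason = c.FinishReason
	}
	// Treat a chunk as final only when it has both finishReason and usage.
	if c.FinishReason == "" || !c.HasUsage {
		return ""
	}
	// Priority: tool_calls > max_tokens > stop.
	switch {
	case s.SawToolCall:
		return "tool_calls"
	case s.UpstreamFinishReason == "MAX_TOKENS":
		return "max_tokens"
	default:
		return "stop"
	}
}

func main() {
	s := &streamState{}
	// Chunk 1 carries the function call, chunk 2 carries STOP plus usage.
	fmt.Printf("%q\n", s.observe(chunk{HasFunctionCall: true}))
	fmt.Printf("%q\n", s.observe(chunk{FinishReason: "STOP", HasUsage: true}))
}
```

With this shape, the sequence described in the commit (functionCall in chunk 1, finishReason STOP in chunk 2) yields no finish_reason for the first chunk and "tool_calls" for the last, instead of "stop" overwriting it.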

48 Commits
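
Commit 538039f583 describes passing the code_execution and url_context tools through to Gemini using the same pattern as google_search (code_execution: {} -> codeExecution: {}, url_context: {} -> urlContext: {}). The sketch below illustrates that key renaming in Go. It uses the standard library's encoding/json to stay self-contained, whereas the commit log indicates the repository itself uses gjson and sjson. The function name translatePassthroughTools, the passthroughTools table, the google_search -> googleSearch entry, and the choice to emit one output node per tool are assumptions made for illustration.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// passthroughTools maps request-side tool keys to their Gemini-side names.
// A slice (not a map) keeps the output order deterministic.
var passthroughTools = []struct{ from, to string }{
	{"google_search", "googleSearch"}, // assumed spelling of the existing pattern
	{"code_execution", "codeExecution"},
	{"url_context", "urlContext"},
}

// translatePassthroughTools copies each recognized tool's config unchanged
// under its renamed key. Tools with no recognized key are skipped here; a
// real translator would handle function declarations separately.
func translatePassthroughTools(tools []byte) ([]byte, error) {
	var in []map[string]json.RawMessage
	if err := json.Unmarshal(tools, &in); err != nil {
		return nil, err
	}
	out := []map[string]json.RawMessage{}
	for _, tool := range in {
		for _, p := range passthroughTools {
			if cfg, ok := tool[p.from]; ok {
				out = append(out, map[string]json.RawMessage{p.to: cfg})
			}
		}
	}
	return json.Marshal(out)
}

func main() {
	got, err := translatePassthroughTools(
		[]byte(`[{"code_execution":{}},{"url_context":{}}]`))
	if err != nil {
		panic(err)
	}
	fmt.Println(string(got))
}
```

Keeping the tool configuration as json.RawMessage means the body of each tool is forwarded byte-for-byte (after compaction), which matches the "passthrough" wording of the commit without assuming anything about the fields inside it.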