CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-02-02 20:40:52 +08:00

Author	SHA1	Message	Date
Luis Pater	73db4e64f6	Merge pull request #874 from MohammadErfan-Jabbari/fix/streaming-finish-reason-tool-calls fix(antigravity): preserve finish_reason tool_calls across streaming chunks	2026-02-01 07:05:39 +08:00
MohammadErfan Jabbari	fe6043aec7	fix(antigravity): preserve finish_reason tool_calls across streaming chunks When streaming responses with tool calls, the finish_reason was being overwritten. The upstream sends functionCall in chunk 1, then finishReason: STOP in chunk 2. The old code would set finish_reason from every chunk, causing "tool_calls" to be overwritten by "stop". This broke clients like Claude Code that rely on finish_reason to detect when tool calls are complete. Changes: - Add SawToolCall bool to track tool calls across entire stream - Add UpstreamFinishReason to cache the finish reason - Only emit finish_reason on final chunk (has both finishReason + usage) - Priority: tool_calls > max_tokens > stop Includes 5 unit tests covering: - Tool calls not overwritten by subsequent STOP - Normal text gets "stop" - MAX_TOKENS without tool calls gets "max_tokens" - Tool calls take priority over MAX_TOKENS - Intermediate chunks have no finish_reason Fixes streaming tool call detection for Claude Code + Gemini models. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2026-01-05 18:45:25 +01:00
Luis Pater	3f949b7f84	Merge pull request #704 from tinyc0der/add-index fix(openai): add index field to image response for LiteLLM compatibility	2025-12-25 21:35:12 +08:00
TinyCoder	a7fc2ee4cf	refactor(image): avoid using json.Marshal	2025-12-25 14:21:01 +07:00
Luis Pater	06ad527e8c	Fixed: #696 fix(translators): adjust prompt token calculation by subtracting cached tokens across Gemini, OpenAI, and Claude handlers	2025-12-24 23:29:18 +08:00
TinyCoder	671558a822	fix(openai): add index field to image response for LiteLLM compatibility LiteLLM's Pydantic model requires an index field in each image object. Without it, responses fail validation with "images.0.index Field required".	2025-12-24 17:43:31 +07:00
Luis Pater	7569320770	Merge branch 'dev' into fix/antigravity-prompt-caching	2025-12-24 03:49:46 +08:00
Luis Pater	a86d501dc2	refactor: replace `json.Marshal` and `json.Unmarshal` with `sjson` and `gjson` Optimized the handling of JSON serialization and deserialization by replacing redundant `json.Marshal` and `json.Unmarshal` calls with `sjson` and `gjson`. Introduced a `marshalJSONValue` utility for compact JSON encoding, improving performance and code simplicity. Removed unused `encoding/json` imports.	2025-12-22 11:44:06 +08:00
evann	bc6c4cdbfc	feat(antigravity): add logging for cached token setting errors in responses	2025-12-19 16:49:50 +07:00
evann	9058d406a3	feat(antigravity): enhance prompt caching support and update agent version	2025-12-19 16:33:41 +07:00
Luis Pater	1249b07eb8	feat(responses): add unique identifiers for responses, function calls, and tool uses	2025-12-10 16:02:54 +08:00
Luis Pater	7a628426dc	Fixed: #433 refactor(translator): normalize finish reason casing across all OpenAI response handlers	2025-12-07 01:48:24 +08:00
Luis Pater	c8cee547fd	fix(translator): ensure partial content is retained while skipping encrypted thoughtSignature - Updated handling of `thoughtSignature` across all translator modules to retain other content payloads if present. - Adjusted logic for `thought_signature` and `inline_data` keys for consistent processing.	2025-11-27 00:52:17 +08:00
hkfires	1061354b2f	fix: handle empty and non-JSON SSE chunks safely	2025-11-22 13:49:23 +08:00
hkfires	c29931e093	fix(translator): ignore empty JSON chunks in OpenAI responses	2025-11-22 13:09:16 +08:00
hkfires	b05cfd9f84	fix(translator): include empty text chunks in responses	2025-11-22 13:03:50 +08:00
Luis Pater	c1031e2d3f	feat(translator): add Antigravity translation logic - Introduced request and response translation functions to enable compatibility between OpenAI Chat Completions API and Antigravity. - Registered translation utilities for both streaming and non-streaming scenarios. - Added support for reasoning content, tool calls, and metadata handling. - Established request normalization and embedding for Antigravity-compatible payloads. - Added new fields to `Params` struct for better tracking of finish reasons, usage metadata, and tool usage. - Refactored handling of response transitions, final events, and state-driven logic in `ConvertAntigravityResponseToClaude`. - Introduced `appendFinalEvents` and `resolveStopReason` helper functions for cleaner separation of concerns. - Added `TotalTokenCount` field to `Params` struct for enhanced token tracking. - Updated token count calculations to fallback on `TotalTokenCount` when specific counts are missing. - Introduced `hasNonZeroUsageMetadata` function to validate presence of token data in `usage_metadata`.	2025-11-21 23:40:59 +08:00

17 Commits