CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-02-03 04:50:52 +08:00

Author	SHA1	Message	Date
hkfires	cf9daf470c	feat(translator): report cached token usage in Claude output	2026-01-19 11:23:44 +08:00
Luis Pater	140d6211cc	feat(translator): add reasoning state tracking and improve reasoning summary handling - Introduced `oaiToResponsesStateReasoning` to track reasoning data. - Enhanced logic for emitting reasoning summary events and managing state transitions. - Updated output generation to handle multiple reasoning entries consistently.	2026-01-19 03:58:28 +08:00
hkfires	d5ef4a6d15	refactor(translator): remove registry model lookups from thinking config conversions	2026-01-18 10:30:14 +08:00
hkfires	6e4a602c60	fix(thinking): map reasoning_effort to thinkingConfig	2026-01-15 13:06:40 +08:00
hkfires	0b06d637e7	refactor: improve thinking logic	2026-01-15 13:06:39 +08:00
Luis Pater	3d01b3cfe8	Merge pull request #553 from XInTheDark/fix/builtin-tools-web-search fix(translator): preserve built-in tools (web_search) to Responses API	2026-01-09 04:40:13 +08:00
Luis Pater	7815ee338d	fix(translator): adjust `message_delta` emission boundary in Claude-to-OpenAI conversion Fixed incorrect boundary logic for `message_delta` emission, ensuring proper handling of usage updates and `emitMessageStopIfNeeded` within the response loop.	2026-01-04 01:36:51 +08:00
hkfires	2d2f4572a7	fix(translator): remove unnecessary whitespace trimming in reasoning text collection	2026-01-01 12:39:09 +08:00
hkfires	8f4c46f38d	fix(translator): emit tool_result messages before user content in Claude-to-OpenAI conversion	2026-01-01 11:11:43 +08:00
hkfires	b6ba51bc2a	feat(translator): add thinking block and tool result handling for Claude-to-OpenAI conversion	2026-01-01 09:41:25 +08:00
Luis Pater	a86d501dc2	refactor: replace `json.Marshal` and `json.Unmarshal` with `sjson` and `gjson` Optimized the handling of JSON serialization and deserialization by replacing redundant `json.Marshal` and `json.Unmarshal` calls with `sjson` and `gjson`. Introduced a `marshalJSONValue` utility for compact JSON encoding, improving performance and code simplicity. Removed unused `encoding/json` imports.	2025-12-22 11:44:06 +08:00
Luis Pater	653439698e	Fixed: #606 fix: unify response field naming across translators Standardize `text` to `delta` and add missing `output` field in all response payloads for consistency across OpenAI, Claude, and Gemini translators.	2025-12-21 03:13:58 +08:00
hkfires	28a428ae2f	fix(thinking): align budget effort mapping across translators Unify thinking budget-to-effort conversion in a shared helper, handle disabled/default thinking cases in translators, adjust zero-budget mapping, and drop the old OpenAI-specific helper with updated tests.	2025-12-16 18:34:43 +08:00
Thong Van	f4007f53ba	fix(translator): emit message_start on first chunk regardless of role field Some OpenAI-compatible providers (like GitHub Copilot) may send tool_calls in the first streaming chunk without including the role field. The previous implementation only emitted message_start when the first chunk contained role="assistant", causing Anthropic protocol violations when tool calls arrived first. This fix ensures message_start is always emitted on the very first chunk, preventing 'content_block_start before message_start' errors in clients that strictly validate Anthropic SSE event ordering.	2025-12-16 13:01:09 +07:00
Muzhen Gaming	0b834fcb54	fix(translator): preserve built-in tools across openai<->responses - Pass through non-function tool definitions like web_search - Translate tool_choice for built-in tools and function tools - Add regression tests for built-in tool passthrough	2025-12-15 21:18:54 +08:00
Luis Pater	d9a65745df	fix(translator): handle empty item type and string content in OpenAI response parser	2025-12-15 20:35:52 +08:00
hkfires	09c339953d	fix(openai): forward reasoning.effort value Drop the hardcoded effort mapping in request conversion so unknown values are preserved instead of being coerced to `auto`	2025-12-15 09:16:15 +08:00
hkfires	d20b71deb9	fix(thinking): normalize effort mapping Route OpenAI reasoning effort through ThinkingEffortToBudget for Claude translators, preserve "minimal" when translating OpenAI Responses, and treat blank/unknown efforts as no-ops for Gemini thinking configs. Also map budget -1 to "auto" and expand cross-protocol thinking tests.	2025-12-15 09:16:15 +08:00
hkfires	716aa71f6e	fix(thinking): centralize reasoning_effort mapping Move OpenAI `reasoning_effort` -> Gemini `thinkingConfig` budget logic into shared helpers used by Gemini, Gemini CLI, and antigravity translators. Normalize Claude thinking handling by preferring positive budgets, applying budget token normalization, and gating by model support. Always convert Gemini `thinkingBudget` back to OpenAI `reasoning_effort` to support allowCompat models, and update tests for normalization behavior.	2025-12-15 09:16:14 +08:00
hkfires	e8976f9898	fix(thinking): map budgets to effort for level models	2025-12-15 09:16:14 +08:00
Luis Pater	1249b07eb8	feat(responses): add unique identifiers for responses, function calls, and tool uses	2025-12-10 16:02:54 +08:00
huynguyen03.dev	549c0c2c5a	fix: filter whitespace-only text content in Claude to OpenAI translation Remove redundant existence check since TrimSpace handles empty strings	2025-12-07 16:08:12 +07:00
huynguyen03.dev	f092801b61	fix: filter whitespace-only text in Claude to OpenAI translation Skip text content blocks that are empty or contain only whitespace when translating Claude messages to OpenAI format. This fixes GLM-4.6 and other strict OpenAI-compatible providers that reject empty text with error 'text cannot be empty'.	2025-12-07 15:39:58 +07:00
Luis Pater	7e30157590	Fixed: #354 fix(translator): add support for "xhigh" reasoning effort in OpenAI responses - Updated handling in `openai_openai-responses_request.go` to include the new "xhigh" reasoning effort level.	2025-11-27 15:59:15 +08:00
Luis Pater	1c815c58a6	fix(translator): simplify string handling in Gemini responses	2025-11-16 19:02:27 +08:00
Luis Pater	4eab141410	feat(translator): add support for reasoning/thinking content blocks in OpenAI-Claude and Gemini responses	2025-11-16 17:37:39 +08:00
Luis Pater	9875565339	fix(claude translator): ensure default token counts when usage data is missing	2025-11-16 13:18:21 +08:00
Luis Pater	5d806fcefc	fix(translator): support system instructions with parts and inline data in OpenAI Gemini requests Handle both `systemInstruction` and `system_instruction` keys, processing text and inline data parts (e.g., images) for system messages in Gemini.	2025-11-10 10:31:32 +08:00
Luis Pater	1afbc4dd96	fix(translator): separate tool calls from content in OpenAI Claude requests	2025-11-08 17:57:46 +08:00
Luis Pater	682c4598ee	fix(translator): handle gjson strings in OpenAI response formatting	2025-11-08 00:41:56 +08:00
Luis Pater	fd2b23592e	Fixed: #193 fix(translator): consolidate temperature and top_p conditionals in OpenAI Claude request Fixed: #169 fix(translator): adjust instruction strings in Codex Claude and OpenAI responses	2025-11-01 15:37:51 +08:00
tobwen	e5ed2cba4a	Add support for dynamic model providers Implements functionality to parse model names with provider information in the format "provider://model" This allows dynamic provider selection rather than relying only on predefined mappings. The change affects all execution methods to properly handle these dynamic model specifications while maintaining compatibility with the existing approach for standard model names.	2025-10-28 01:41:54 +01:00
Luis Pater	6f9c23af5e	#167 refactor(translator): consolidate Claude content handling logic - Unified logic for text and image content conversion to improve maintainability. - Introduced `convertClaudeContentPart` utility for consistent content transformation. - Replaced redundant string operations with streamlined JSON modifications. - Adjusted validation checks for message content generation.	2025-10-27 22:43:59 +08:00
Luis Pater	a552a45b81	Fixed: #140 #133 #80 feat(translator): add token counting functionality for Gemini, Claude, and CLI - Introduced `TokenCount` handling across various Codex translators (Gemini, Claude, CLI) with respective implementations. - Added utility methods for token counting and formatting responses. - Integrated `tiktoken-go/tokenizer` library for tokenization. - Updated CodexExecutor with token counting logic to support multiple models including GPT-5 variants. - Refined go.mod and go.sum to include new dependencies. feat(runtime): add token counting functionality across executors - Implemented token counting in OpenAICompatExecutor, QwenExecutor, and IFlowExecutor. - Added utilities for token counting and response formatting using `tiktoken-go/tokenizer`. - Integrated token counting into translators for Gemini, Claude, and Gemini CLI. - Enhanced multiple model support, including GPT-5 variants, for token counting. docs: update environment variable instructions for multi-model support - Added details for setting `ANTHROPIC_DEFAULT_OPUS_MODEL`, `ANTHROPIC_DEFAULT_SONNET_MODEL`, and `ANTHROPIC_DEFAULT_HAIKU_MODEL` for version 2.x.x. - Clarified usage of `ANTHROPIC_MODEL` and `ANTHROPIC_SMALL_FAST_MODEL` for version 1.x.x. - Expanded examples for setting environment variables across different models including Gemini, GPT-5, Claude, and Qwen3.	2025-10-26 05:39:15 +08:00
Luis Pater	243bf5c108	feat: enhance tool call handling in OpenAI response conversion	2025-10-21 20:04:24 +08:00
Luis Pater	9cdef937af	fix: initialize contentBlocks with an empty slice and improve content handling in ConvertOpenAIResponseToClaudeNonStream	2025-10-17 08:47:09 +08:00
Luis Pater	eb2549a782	fix(gemini): update response template to omit finishReason until known	2025-10-16 06:41:04 +08:00
Luis Pater	c419264a70	fix(responses): handle empty and invalid rawJSON in ConvertOpenAIChatCompletionsResponseToOpenAIResponses	2025-10-16 06:34:00 +08:00
Luis Pater	5ab0854b5b	fix(claude): track message_start event in streaming response Add a `MessageStarted` flag to `ConvertOpenAIResponseToAnthropicParams` to ensure the `message_start` event is emitted only once during streaming. Refactor response handling to detect streaming mode via the `stream` field instead of the `object` type, simplifying the branching logic. Update the streaming conversion to set `MessageStarted` after sending the `message_start` event, preventing duplicate starts. These changes improve correctness of streaming response handling for Claude integration.	2025-10-16 03:54:48 +08:00
Luis Pater	599986495b	feat(translator): enhance OpenAI Gemini request handling for mixed content - Replaced `contentParts` with `aggregatedParts` to support mixed content (text and inline data). - Introduced `textBuilder` for efficient text concatenation. - Added support for inline data processing, including base64-encoded image URLs. - Updated `msg["content"]` logic to handle both plain text and mixed content scenarios.	2025-10-13 02:15:55 +08:00
Luis Pater	40255b128e	feat(translator): add usage metadata aggregation for Claude and OpenAI responses - Integrated input, output, reasoning, and total token tracking in response processing for Claude and OpenAI. - Ensured support for usage details even when specific fields are missing in the response. - Enhanced completion outputs with aggregated usage details for accurate reporting.	2025-09-27 01:12:47 +08:00
Luis Pater	f5dc380b63	rebuild branch	2025-09-25 10:32:48 +08:00
Luis Pater	3f69254f43	remove all	2025-09-25 10:31:02 +08:00
Luis Pater	d41ff2076f	feat(translators): improve system instruction extraction and input handling for OpenAI and Claude responses - Enhanced support for extracting system instructions from input arrays. - Improved input message role and type determination logic for consistent message processing. - Refined instruction handling logic across translator types for better compatibility.	2025-09-23 23:12:34 +08:00
Luis Pater	9df04d71e2	feat(translators/claude): implement non-streaming response parsing for various translator types - Added `ConvertCodexResponseToClaudeNonStream`, `ConvertGeminiCLIResponseToClaudeNonStream`, `ConvertGeminiResponseToClaudeNonStream`, and `ConvertOpenAIResponseToClaudeNonStream` methods for handling non-streaming JSON response conversion. - Introduced logic for parsing and structuring content, handling reasoning, text, and tool usage blocks. - Enhanced support for stop reasons and refined token usage data aggregation.	2025-09-23 20:42:48 +08:00
hkfires	d6bb143978	refactor(translator): Remove unused logrus imports	2025-09-22 08:01:37 +08:00
Luis Pater	f81898c906	feat: introduce custom provider example and remove redundant debug logs - Added `examples/custom-provider/main.go` showcasing custom executor and translator integration using the SDK. - Removed redundant debug logs from translator modules to enhance code cleanliness. - Updated SDK documentation with new usage and advanced examples. - Expanded the management API with new endpoints, including request logging and GPT-5 Codex features.	2025-09-22 03:37:53 +08:00
Luis Pater	d9ad65622a	refactor: standardize constant naming and improve file-based auth handling - Renamed constants from uppercase to CamelCase for consistency. - Replaced redundant file-based auth handling logic with the new `util.CountAuthFiles` helper. - Fixed various error-handling inconsistencies and enhanced robustness in file operations. - Streamlined auth client reload logic in server and watcher components. - Applied minor code readability improvements across multiple packages.	2025-09-22 02:56:45 +08:00
Luis Pater	4999fce7f4	v6 version first commit	2025-09-22 01:40:24 +08:00
Luis Pater	9fce13fe03	Update internal module imports to use `v5` package path - Updated all `github.com/luispater/CLIProxyAPI/internal/...` imports to point to `github.com/luispater/CLIProxyAPI/v5/internal/...`. - Adjusted `go.mod` to specify `module github.com/luispater/CLIProxyAPI/v5`.	2025-09-13 23:34:32 +08:00


57 Commits
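
Commit e5ed2cba4a describes model names that carry provider information in the form "provider://model", with plain model names still going through the existing mappings. The sketch below shows one way such a name could be split. It is an illustration only, not the repository's code: the helper name `parseDynamicModel` and its return shape are assumptions, and how the real implementation treats malformed input is not stated in the log.

```go
package main

import (
	"fmt"
	"strings"
)

// parseDynamicModel is a hypothetical helper (not taken from the repository).
// It splits a name of the form "provider://model" into its two parts.
// When the separator is missing, or either side is empty, it reports ok=false
// and returns the input unchanged as the model, so a caller could fall back
// to the predefined provider mappings for standard model names.
func parseDynamicModel(name string) (provider, model string, ok bool) {
	p, m, found := strings.Cut(name, "://")
	if !found || p == "" || m == "" {
		return "", name, false
	}
	return p, m, true
}

func main() {
	// The example names are placeholders chosen for illustration.
	for _, name := range []string{"myprovider://some-model", "some-model", "://some-model"} {
		provider, model, ok := parseDynamicModel(name)
		fmt.Printf("input=%q provider=%q model=%q dynamic=%v\n", name, provider, model, ok)
	}
}
```

Returning the original name on failure keeps the standard-name path untouched, which matches the compatibility goal stated in the commit message.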