CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-02-28 21:05:49 +08:00

Author	SHA1	Message	Date
Kirill Turanskiy	1cc21cc45b	fix: prevent duplicate function call arguments when delta events precede done Non-spark codex models (gpt-5.3-codex, gpt-5.2-codex) stream function call arguments via multiple delta events followed by a done event. The done handler unconditionally emitted the full arguments, duplicating what deltas already streamed. This produced invalid double JSON that Claude Code couldn't parse, causing tool calls to fail with missing parameters and infinite retry loops. Add HasReceivedArgumentsDelta flag to track whether delta events were received. The done handler now only emits arguments when no deltas preceded it (spark models), while delta-based streaming continues to work for non-spark models.	2026-02-19 23:18:14 +03:00
Kirill Turanskiy	07cf616e2b	fix: handle response.function_call_arguments.done in codex→claude streaming translator Some Codex models (e.g. gpt-5.3-codex-spark) send function call arguments in a single "done" event without preceding "delta" events. The streaming translator only handled "delta" events, causing tool call arguments to be lost — resulting in empty tool inputs and infinite retry loops in clients like Claude Code. Emit the full arguments from the "done" event as a single input_json_delta so downstream clients receive the complete tool input.	2026-02-19 23:18:14 +03:00
hkfires	938a799263	feat(translator): support Claude thinking type adaptive	2026-02-10 16:20:32 +08:00
Luis Pater	80b5e79e75	fix(translator): normalize and restrict `stop_reason`/`finish_reason` usage - Standardized the handling of `stop_reason` and `finish_reason` across Codex and Gemini responses. - Restricted pass-through of specific reasons (`max_tokens`, `stop`) for consistency. - Enhanced fallback logic for undefined reasons.	2026-02-07 02:07:51 +08:00
Luis Pater	a5a25dec57	refactor(translator, executor): remove redundant `bytes.Clone` calls for improved performance - Replaced all instances of `bytes.Clone` with direct references to enhance efficiency. - Simplified payload handling across executors and translators by eliminating unnecessary data duplication.	2026-02-06 03:26:29 +08:00
hkfires	fe3ebe3532	docs(translator): update Codex Claude request transform docs	2026-02-01 14:55:41 +08:00
hkfires	ac802a4646	refactor(codex): remove codex instructions injection support	2026-02-01 14:33:31 +08:00
Luis Pater	f99cddf97f	fix(translator): handle stop_reason and MAX_TOKENS for Claude responses	2026-01-31 04:03:01 +08:00
hkfires	cf9daf470c	feat(translator): report cached token usage in Claude output	2026-01-19 11:23:44 +08:00
hkfires	d5ef4a6d15	refactor(translator): remove registry model lookups from thinking config conversions	2026-01-18 10:30:14 +08:00
Luis Pater	65b4e1ec6c	feat(codex): enable instruction toggling and update role terminology - Added conditional logic for Codex instruction injection based on configuration. - Updated role terminology from "user" to "developer" for better alignment with context.	2026-01-17 04:12:29 +08:00
hkfires	ed8b0f25ee	fix(thinking): use LookupModelInfo for model data	2026-01-15 13:06:41 +08:00
hkfires	0b06d637e7	refactor: improve thinking logic	2026-01-15 13:06:39 +08:00
hkfires	ac626111ac	feat(codex): add OpenCode instructions based on user agent	2026-01-11 13:36:35 +08:00
Luis Pater	d47b7dc79a	refactor(response): enhance parameter handling for Codex to Claude conversion	2026-01-09 05:20:19 +08:00
Luis Pater	a86d501dc2	refactor: replace `json.Marshal` and `json.Unmarshal` with `sjson` and `gjson` Optimized the handling of JSON serialization and deserialization by replacing redundant `json.Marshal` and `json.Unmarshal` calls with `sjson` and `gjson`. Introduced a `marshalJSONValue` utility for compact JSON encoding, improving performance and code simplicity. Removed unused `encoding/json` imports.	2025-12-22 11:44:06 +08:00
hkfires	28a428ae2f	fix(thinking): align budget effort mapping across translators Unify thinking budget-to-effort conversion in a shared helper, handle disabled/default thinking cases in translators, adjust zero-budget mapping, and drop the old OpenAI-specific helper with updated tests.	2025-12-16 18:34:43 +08:00
hkfires	e8976f9898	fix(thinking): map budgets to effort for level models	2025-12-15 09:16:14 +08:00
hkfires	d131435e25	fix(codex): raise default reasoning effort to medium	2025-12-12 18:18:48 +08:00
hkfires	1ba057112a	fix: use underscore suffix in short name mapping Replace the "~<n>" suffix with "_<n>" when generating unique short names in codex translators (Claude, Gemini, OpenAI chat). This avoids using a special character in identifiers, improving compatibility with downstream APIs while preserving length constraints.	2025-11-18 16:59:25 +08:00
lollipopkit🏳️‍⚧️	c73b3fa43b	fix(claude translator): guard tool schema properties	2025-11-15 19:14:13 +08:00
Luis Pater	fd2b23592e	Fixed: #193 fix(translator): consolidate temperature and top_p conditionals in OpenAI Claude request Fixed: #169 fix(translator): adjust instruction strings in Codex Claude and OpenAI responses	2025-11-01 15:37:51 +08:00
Luis Pater	a552a45b81	Fixed: #140 #133 #80 feat(translator): add token counting functionality for Gemini, Claude, and CLI - Introduced `TokenCount` handling across various Codex translators (Gemini, Claude, CLI) with respective implementations. - Added utility methods for token counting and formatting responses. - Integrated `tiktoken-go/tokenizer` library for tokenization. - Updated CodexExecutor with token counting logic to support multiple models including GPT-5 variants. - Refined go.mod and go.sum to include new dependencies. feat(runtime): add token counting functionality across executors - Implemented token counting in OpenAICompatExecutor, QwenExecutor, and IFlowExecutor. - Added utilities for token counting and response formatting using `tiktoken-go/tokenizer`. - Integrated token counting into translators for Gemini, Claude, and Gemini CLI. - Enhanced multiple model support, including GPT-5 variants, for token counting. docs: update environment variable instructions for multi-model support - Added details for setting `ANTHROPIC_DEFAULT_OPUS_MODEL`, `ANTHROPIC_DEFAULT_SONNET_MODEL`, and `ANTHROPIC_DEFAULT_HAIKU_MODEL` for version 2.x.x. - Clarified usage of `ANTHROPIC_MODEL` and `ANTHROPIC_SMALL_FAST_MODEL` for version 1.x.x. - Expanded examples for setting environment variables across different models including Gemini, GPT-5, Claude, and Qwen3.	2025-10-26 05:39:15 +08:00
Luis Pater	e6d7677373	docs: add GPT-5 Codex guidelines for internal usage - Added comprehensive instructions for Codex CLI harness, sandboxing, approvals, and editing constraints to `internal/misc/codex_instructions/`. - Clarified `approval_policy` configurations and scenarios requiring escalated permissions. - Provided detailed style and structure guidelines for presenting results in the Codex CLI.	2025-10-23 09:14:56 +08:00
Luis Pater	735b21394c	Fixed: #137 refactor: simplify ConvertCodexResponseToClaudeNonStream by removing bufio.Scanner usage and restructuring response parsing logic	2025-10-18 06:22:42 +08:00
Luis Pater	4576f9915b	Fixed: #121 feat(translator): map Claude web search tool type to Codex web_search - Added special handling to replace `web_search_20250305` tool type with `{"type":"web_search"}` in Claude request processing.	2025-10-15 09:32:12 +08:00
Luis Pater	c945e35983	feat(translator): improve Claude request handling with enhanced content processing - Introduced helper functions (`appendTextContent`, `appendImageContent`, etc.) for structured content construction. - Refactored message generation logic for better clarity, supporting mixed content scenarios (text, images, and function calls). - Added `flushMessage` to ensure proper grouping of message contents.	2025-10-14 23:58:37 +08:00
Luis Pater	bbdd68a8b4	feat(registry/runtime): add Gemini 2.5 model and increase buffer sizes - Added new "Gemini 2.5 Flash Image Preview" model definition, with enhanced image generation capabilities. - Increased scanner buffer size to 20,971,520 bytes across executors and translators to handle larger payloads.	2025-10-06 04:44:45 +08:00
Luis Pater	f5dc380b63	rebuild branch	2025-09-25 10:32:48 +08:00
Luis Pater	3f69254f43	remove all	2025-09-25 10:31:02 +08:00
Luis Pater	9df04d71e2	feat(translators/claude): implement non-streaming response parsing for various translator types - Added `ConvertCodexResponseToClaudeNonStream`, `ConvertGeminiCLIResponseToClaudeNonStream`, `ConvertGeminiResponseToClaudeNonStream`, and `ConvertOpenAIResponseToClaudeNonStream` methods for handling non-streaming JSON response conversion. - Introduced logic for parsing and structuring content, handling reasoning, text, and tool usage blocks. - Enhanced support for stop reasons and refined token usage data aggregation.	2025-09-23 20:42:48 +08:00
Luis Pater	f81898c906	feat: introduce custom provider example and remove redundant debug logs - Added `examples/custom-provider/main.go` showcasing custom executor and translator integration using the SDK. - Removed redundant debug logs from translator modules to enhance code cleanliness. - Updated SDK documentation with new usage and advanced examples. - Expanded the management API with new endpoints, including request logging and GPT-5 Codex features.	2025-09-22 03:37:53 +08:00
Luis Pater	d9ad65622a	refactor: standardize constant naming and improve file-based auth handling - Renamed constants from uppercase to CamelCase for consistency. - Replaced redundant file-based auth handling logic with the new `util.CountAuthFiles` helper. - Fixed various error-handling inconsistencies and enhanced robustness in file operations. - Streamlined auth client reload logic in server and watcher components. - Applied minor code readability improvements across multiple packages.	2025-09-22 02:56:45 +08:00
Luis Pater	4999fce7f4	v6 version first commit	2025-09-22 01:40:24 +08:00
Luis Pater	869a3dfbb4	feat: implement model-specific Codex instructions for GPT-5 - Added `CodexInstructions(modelName string)` function to dynamically select instructions based on the model (e.g., GPT-5 Codex). - Introduced `gpt_5_instructions.txt` and `gpt_5_codex_instructions.txt` for respective model configurations. - Updated translators to pass `modelName` and use the new instruction logic.	2025-09-19 08:47:54 +08:00
Luis Pater	9fce13fe03	Update internal module imports to use `v5` package path - Updated all `github.com/luispater/CLIProxyAPI/internal/...` imports to point to `github.com/luispater/CLIProxyAPI/v5/internal/...`. - Adjusted `go.mod` to specify `module github.com/luispater/CLIProxyAPI/v5`.	2025-09-13 23:34:32 +08:00
Luis Pater	ad943b2d4d	Add reverse mappings for original tool names and improve error logging - Introduced reverse mapping logic for tool names in translators to restore original names when shortened. - Enhanced error handling by logging API response errors consistently across handlers. - Refactored request and response loggers to include API error details, improving debugging capabilities. - Integrated robust tool name shortening and uniqueness mechanisms for OpenAI, Gemini, and Claude requests. - Improved handler retry logic to properly capture and respond to errors.	2025-09-04 02:39:56 +08:00
Luis Pater	3acdec51bd	Add OpenAI Responses support	2025-09-03 00:15:35 +08:00
Luis Pater	8c555c4e69	Refactor codebase	2025-08-25 16:58:16 +08:00
Luis Pater	de7b8501cc	Add openai codex support	2025-08-16 01:22:33 +08:00

40 Commits