CLIProxyAPI

fix(translator): improve error handling for function parameters schema transformation

- Added fallback to set default `parametersJsonSchema` when `parameters` key is absent.
- Enhanced logging to capture detailed errors during schema transformation.
- Refined tool declaration appending logic for robustness.

Luis Pater · 2025-10-28 22:57:26 +08:00

v6.2.38 0defb68c6c

Merge pull request #177 from router-for-me/aistudio

feat(registry): unify Gemini models and add AI Studio set

Luis Pater · 2025-10-28 21:57:18 +08:00

v6.2.37 d6272d3300

fix(aistudio): remove no-op executor unregister on WS disconnect

hkfires · 2025-10-28 19:51:05 +08:00

c99d0dfb33

fix(executor): pass authID to relay instead of provider

hkfires · 2025-10-28 19:28:26 +08:00

663b9b35ab

feat(registry): unify Gemini models and add AI Studio set

hkfires · 2025-10-28 19:00:25 +08:00

5dced4c0a6

docs(readme): clarify model definition and add usage example for undefined models

- Updated `README.md` and `README_CN.md` to include additional instructions on requesting undefined models using the `openrouter://` format.
- Added example for `moonshotai/kimi-k2:free` usage.

Luis Pater · 2025-10-28 09:09:19 +08:00

5891785125

Merge pull request #173 from tobwen/feature/dynamic-model-routing

Add support for dynamic model providers

Luis Pater · 2025-10-28 08:55:08 +08:00

ac3d47e8c0

Add support for dynamic model providers

Implements functionality to parse model names with provider information in the format "provider://model" This allows dynamic provider selection rather than relying only on predefined mappings.

The change affects all execution methods to properly handle these dynamic model specifications while maintaining compatibility with the existing approach for standard model names.

tobwen · 2025-10-28 01:41:54 +01:00

e5ed2cba4a

Fixed: #172

feat(runtime): add Brotli and Zstd compression support, improve response handling

- Implemented Brotli and Zstd decompression handling in `FileRequestLogger` and executor logic for enhanced compatibility.
- Added `decodeResponseBody` utility for streamlined multi-encoding support (Gzip, Deflate, Brotli, Zstd).
- Improved resource cleanup with composite readers for proper closure under all conditions.
- Updated dependencies in `go.mod` and `go.sum` to include Brotli and Zstd libraries.

Luis Pater · 2025-10-28 08:39:03 +08:00

v6.2.36 847c2502a5

feat(claude): add model alias mapping and improve key normalization

- Introduced model alias mapping for Claude configurations, enabling upstream and client-facing model name associations.
- Added `computeClaudeModelsHash` to generate a consistent hash for model aliases.
- Implemented `normalizeClaudeKey` function to standardize input API key configuration, including models.
- Enhanced executor to resolve model aliases to upstream names dynamically.
- Updated documentation and configuration examples to reflect new model alias support.

Luis Pater · 2025-10-28 00:14:19 +08:00

v6.2.35 c7196ba7dc

#167

refactor(translator): consolidate Claude content handling logic

- Unified logic for text and image content conversion to improve maintainability.
- Introduced `convertClaudeContentPart` utility for consistent content transformation.
- Replaced redundant string operations with streamlined JSON modifications.
- Adjusted validation checks for message content generation.

Luis Pater · 2025-10-27 22:43:59 +08:00

v6.2.34 6f9c23af5e

feat(registry): add Qwen3 Vision Model definition #164

Luis Pater · 2025-10-27 00:41:05 +08:00

v6.2.33 2d5d06c809

Merge pull request #163 from router-for-me/nb

fix(gemini): map responseModalities to uppercase IMAGE/TEXT

Luis Pater · 2025-10-26 22:41:18 +08:00

3e20b00357

fix(gemini-executor): uppercase responseModalities

hkfires · 2025-10-26 21:26:15 +08:00

e370f86f63

fix(aistudio): ensure colon-spaced JSON in responses

hkfires · 2025-10-26 20:21:45 +08:00

7f266aa19e

refactor(wsrelay): rename RoundTrip to NonStream

hkfires · 2025-10-26 20:01:46 +08:00

f3f31274e8

fix(gemini): map responseModalities to uppercase IMAGE/TEXT

hkfires · 2025-10-26 19:35:22 +08:00

7061cd6058

Merge pull request #161 from router-for-me/aistudio

Add websocket provider

Luis Pater · 2025-10-26 16:39:09 +08:00

5da5674ae2

fix(aistudio): remove generationConfig and tools when action is countTokens

hkfires · 2025-10-26 16:28:20 +08:00

7459c2c81a

fix(server): resolve incorrect variable usage in management asset paths

- Replaced `s.currentPath` with `s.configFilePath` for consistent handling of management asset paths.
- Adjusted calls to `managementasset.FilePath` and `StaticDir` to use the updated configuration path.

Luis Pater · 2025-10-26 12:44:57 +08:00

v6.2.32 cd4706f60e

feat(ws): add WebSocket auth

hkfires · 2025-10-26 07:46:04 +08:00

359b8de44e

fix(aistudio): strip usage metadata from non-final stream chunks

hkfires · 2025-10-26 07:46:04 +08:00

ea6065f1b1

feat(aistudio): support non-streaming responses

hkfires · 2025-10-26 07:46:04 +08:00

8aaed4cf09

feat(aistudio): track Gemini usage and improve stream errors

hkfires · 2025-10-26 07:46:04 +08:00

c32e013605

feat: add websocket routing and executor unregister API

- Introduce Server.AttachWebsocketRoute(path, handler) to mount websocket
  upgrade handlers on the Gin engine.
- Track registered WS paths via wsRoutes with wsRouteMu to prevent
  duplicate registrations; initialize in NewServer and import sync.
- Add Manager.UnregisterExecutor(provider) for clean executor lifecycle
  management.
- Add github.com/gorilla/websocket v1.5.3 dependency and update go.sum.

Motivation: enable services to expose WS endpoints through the core server
and allow removing auth executors dynamically while avoiding duplicate
route setup. No breaking changes.

hkfires · 2025-10-26 07:46:03 +08:00

3839d93ba0

Fixed: #140 #133 #80

feat(translator): add token counting functionality for Gemini, Claude, and CLI

- Introduced `TokenCount` handling across various Codex translators (Gemini, Claude, CLI) with respective implementations.
- Added utility methods for token counting and formatting responses.
- Integrated `tiktoken-go/tokenizer` library for tokenization.
- Updated CodexExecutor with token counting logic to support multiple models including GPT-5 variants.
- Refined go.mod and go.sum to include new dependencies.

feat(runtime): add token counting functionality across executors

- Implemented token counting in OpenAICompatExecutor, QwenExecutor, and IFlowExecutor.
- Added utilities for token counting and response formatting using `tiktoken-go/tokenizer`.
- Integrated token counting into translators for Gemini, Claude, and Gemini CLI.
- Enhanced multiple model support, including GPT-5 variants, for token counting.

docs: update environment variable instructions for multi-model support

- Added details for setting `ANTHROPIC_DEFAULT_OPUS_MODEL`, `ANTHROPIC_DEFAULT_SONNET_MODEL`, and `ANTHROPIC_DEFAULT_HAIKU_MODEL` for version 2.x.x.
- Clarified usage of `ANTHROPIC_MODEL` and `ANTHROPIC_SMALL_FAST_MODEL` for version 1.x.x.
- Expanded examples for setting environment variables across different models including Gemini, GPT-5, Claude, and Qwen3.

Luis Pater · 2025-10-26 05:39:15 +08:00

v6.2.31 a552a45b81

refactor(translator): remove unused log dependency and comment out debug logging

docs: add GPT-5 Codex guidelines for CLI usage

- Added detailed guidelines for GPT-5 Codex in Codex CLI.
- Expanded instructions on sandboxing, approvals, editing constraints, and style requirements.
- Included presentation and response formatting best practices.

fix(codex_instructions): update comparison logic to use prefix matching

- Changed system instructions comparison to use `strings.HasPrefix` for improved flexibility.

Luis Pater · 2025-10-24 12:15:15 +08:00

v6.2.30 f6cf784cd1

feat(executor): add debug logs for rate-limiting retries in Gemini CLI executor

Luis Pater · 2025-10-23 10:39:21 +08:00

e783923464

docs: add GPT-5 Codex guidelines for internal usage

- Added comprehensive instructions for Codex CLI harness, sandboxing, approvals, and editing constraints to `internal/misc/codex_instructions/`.
- Clarified `approval_policy` configurations and scenarios requiring escalated permissions.
- Provided detailed style and structure guidelines for presenting results in the Codex CLI.

Luis Pater · 2025-10-23 09:14:56 +08:00

v6.2.29 e6d7677373

feat: improve error handling with added status codes and headers

- Updated Execute methods to include enhanced error handling via `StatusCode` and `Headers` extraction.
- Introduced structured error responses for cooling down scenarios, providing additional metadata and retry suggestions.
- Refined quota management, allowing for differentiation between cool-down, disabled, and other block reasons.
- Improved model filtering logic based on client availability and suspension criteria.

Luis Pater · 2025-10-22 09:01:11 +08:00

d225558dae

feat: add DisableCooling configuration to manage quota cooldown behavior

Luis Pater · 2025-10-21 21:51:30 +08:00

v6.2.28 9678be7aa4

feat: enhance tool call handling in OpenAI response conversion

Luis Pater · 2025-10-21 20:04:24 +08:00

v6.2.27 243bf5c108

feat: enhance quota management with backoff levels and cooldown logic

Luis Pater · 2025-10-21 18:44:28 +08:00

v6.2.26 3569e5779a

Refactor executor error handling and usage reporting

- Updated the Execute methods in various executors (GeminiCLIExecutor, GeminiExecutor, IFlowExecutor, OpenAICompatExecutor, QwenExecutor) to return a response and error as named return values for improved clarity.
- Enhanced error handling by deferring failure tracking in usage reporters, ensuring that failures are reported correctly.
- Improved response body handling by ensuring proper closure and error logging for HTTP responses across all executors.
- Added failure tracking and reporting in the usage reporter to capture unsuccessful requests.
- Updated the usage logging structure to include a 'Failed' field for better tracking of request outcomes.
- Adjusted the logic in the RequestStatistics and Record methods to accommodate the new failure tracking mechanism.

Luis Pater · 2025-10-21 11:22:24 +08:00

20985d1a10

feat: implement management asset configuration and auto-updater

Luis Pater · 2025-10-21 09:01:58 +08:00

67f553806b

docs: add Subtitle Translator tool to README files

Luis Pater · 2025-10-21 02:48:08 +08:00

29044312a4

Merge pull request #151 from VjayC/add-subtitle-translator

docs: add Subtitle Translator to projects list

Luis Pater · 2025-10-21 02:44:50 +08:00

5b3fc092ee

docs: add Subtitle Translator to projects list

Vijay Chimmi · 2025-10-20 11:29:18 -07:00

792e8d09d7

Fixed: #148

feat(executor): add initial cache_helpers.go file

Luis Pater · 2025-10-20 10:17:29 +08:00

v6.2.25 eadccb229f

Merge pull request #147 from router-for-me/config

feat(mgmt): support YAML config retrieval and updates via /config.yaml

Luis Pater · 2025-10-19 22:26:38 +08:00

v6.2.24 fed6f3ecd7

feat(mgmt): support YAML config retrieval and updates via /config.yaml

hkfires · 2025-10-19 21:56:29 +08:00

f8dcd707a6

Merge pull request #145 from router-for-me/path

feat: prefer util.WritablePath() for logs and local storage

Luis Pater · 2025-10-19 20:50:44 +08:00

v6.2.23 0e91e95287

Merge pull request #146 from router-for-me/iflow

feat(iflow): add masked token logs; increase refresh lead to 24h

Luis Pater · 2025-10-19 20:49:40 +08:00

c5dcbc1c1a

feat(iflow): add masked token logs; increase refresh lead to 24h

hkfires · 2025-10-19 10:56:29 +08:00

4504ba5329

feat: prefer util.WritablePath() for logs and local storage

hkfires · 2025-10-19 10:19:55 +08:00

d16599fa1d

Merge pull request #139 from router-for-me/log

feat(logging): centralize sensitive header masking

Luis Pater · 2025-10-18 22:25:28 +08:00

v6.2.22 674393ec12

feat(logging): centralize sensitive header masking

hkfires · 2025-10-18 17:16:00 +08:00

9f45806106

refactor: streamline ConvertCodexResponseToGeminiNonStream by removing unnecessary buffer and improving response handling

Luis Pater · 2025-10-18 16:08:30 +08:00

v6.2.21 307ae76ed4

Fixed: #137

refactor: simplify ConvertCodexResponseToClaudeNonStream by removing bufio.Scanner usage and restructuring response parsing logic

Luis Pater · 2025-10-18 06:22:42 +08:00

v6.2.20 735b21394c

fix: initialize contentBlocks with an empty slice and improve content handling in ConvertOpenAIResponseToClaudeNonStream

Luis Pater · 2025-10-17 08:47:09 +08:00

v6.2.19 9cdef937af

545 Commits