CLIProxyAPI

Enhance logging for API requests and responses across executors

- Added detailed logging of upstream request metadata including URL, method, headers, and body for Codex, Gemini, IFlow, OpenAI Compat, and Qwen executors.
- Implemented error logging for API response failures to capture errors during HTTP requests.
- Introduced structured logging for authentication details (AuthID, AuthLabel, AuthType, AuthValue) to improve traceability.
- Updated response logging to include status codes and headers for better debugging.
- Ensured that all executors consistently log API interactions to facilitate monitoring and troubleshooting.

Luis Pater · 2025-10-17 04:12:38 +08:00

3dd0844b98

Feature: #103

feat(gemini): add Gemini thinking configuration support and metadata normalization

- Introduced logic to parse and apply `thinkingBudget` and `include_thoughts` configurations from metadata.
- Enhanced request handling to include normalized Gemini model metadata, preserving the original model identifier.
- Updated Gemini and Gemini-CLI executors to apply thinking configuration based on metadata overrides.
- Refactored handlers to support metadata extraction and cloning during request preparation.

Luis Pater · 2025-10-16 11:31:18 +08:00

ade279d1f2

fix: add Claude→Claude passthrough to prevent SSE event fragmentation

When from==to (Claude→Claude scenario), directly forward SSE stream
line-by-line without invoking TranslateStream. This preserves the
multi-line SSE event structure (event:/data:/blank) and prevents
JSON parsing errors caused by event fragmentation.

Resolves: JSON parsing error when using Claude Code streaming responses

fix: correct SSE event formatting in Handler layer

Remove duplicate newline additions (\n\n) that were breaking SSE event format.
The Executor layer already provides properly formatted SSE chunks with correct
line endings, so the Handler should forward them as-is without modification.

Changes:
- Remove redundant \n\n addition after each chunk
- Add len(chunk) > 0 check before writing
- Format error messages as proper SSE events (event: error\ndata: {...}\n\n)
- Add chunkIdx counter for future debugging needs

This fixes JSON parsing errors caused by malformed SSE event streams.

fix: update comments for clarity in SSE event forwarding

Adamcf · 2025-10-15 22:13:44 +08:00

15981aa412

feat(usage): add support for tracking request source in usage records

- Introduced `Source` field to usage-related structs for better origin tracking.
- Updated `newUsageReporter` to resolve and populate the `Source` attribute.
- Implemented `resolveUsageSource` to determine source from auth metadata or API key.

Luis Pater · 2025-10-14 02:11:43 +08:00

32a8102d71

refactor(provider): remove Gemini Web cookie-based provider

hkfires · 2025-10-11 12:53:03 +08:00

b895018ff5

feat(registry, executor, util): add support for gemini-2.5-flash-image-preview and improve aspect ratio handling

- Introduced `gemini-2.5-flash-image-preview` model to the registry with updated definitions.
- Enhanced Gemini CLI and API executors to handle image aspect ratio adjustments for the new model.
- Added utility function to create base64 white image placeholders based on aspect ratio configurations.

Luis Pater · 2025-10-10 01:49:58 +08:00

20787cd107

feat(registry, executor): add support for glm-4.6 model and enhance Gemini CLI token handling

- Added `glm-4.6` model to registry and documentation.
- Updated Gemini CLI executor to pass configuration to `prepareGeminiCLITokenSource` for improved token management.

Luis Pater · 2025-10-09 20:57:18 +08:00

b2cdbbdd47

feat(registry, executor): add support for gemini-2.5-flash-image model

- Introduced `gemini-2.5-flash-image` model with updated definitions in registry.
- Enhanced model marker detection in Gemini CLI executor to include support for the new model.

Luis Pater · 2025-10-09 10:06:10 +08:00

d45ebff66b

feat(gemini-web): Enable config hot-reload and fix Gem selection

hkfires · 2025-10-07 20:23:33 +08:00

9bb7df7af7

fix(gemini): Disable thinking config for incompatible models

hkfires · 2025-10-06 16:32:03 +08:00

c62ecc2442

feat(registry/runtime): add Gemini 2.5 model and increase buffer sizes

- Added new "Gemini 2.5 Flash Image Preview" model definition, with enhanced image generation capabilities.
- Increased scanner buffer size to 20,971,520 bytes across executors and translators to handle larger payloads.

Luis Pater · 2025-10-06 04:44:45 +08:00

bbdd68a8b4

feat(iflow): Add User-Agent header to API requests

hkfires · 2025-10-05 18:50:35 +08:00

c8029b7166

feat: Add support for iFlow provider

hkfires · 2025-10-05 15:51:09 +08:00

b839e351c4

feat(auth): improve OpenAI compatibility normalization and API key handling

- Refined trimming and normalization logic for `baseURL` and `apiKey` attributes.
- Updated `Authorization` header logic to omit empty API keys.
- Enhanced compatibility processing by handling empty `api-key-entries`.
- Improved legacy format fallback and added safeguards for empty credentials across executor paths.

Luis Pater · 2025-10-03 02:38:30 +08:00

2eef6875e9

feat(runtime): remove previous_response_id from Codex executor request body

- Implemented logic to delete `previous_response_id` property from the request body in Codex executor.
- Applied changes consistently across relevant Codex executor paths.

Luis Pater · 2025-10-02 12:00:06 +08:00

12c09f1a46

feat(gemini-web): Add support for custom auth labels

hkfires · 2025-09-30 12:21:51 +08:00

8858e07d8b

feat(gemini-web): Add conversation affinity selector

hkfires · 2025-09-30 12:21:51 +08:00

82187bffba

refactor(proxy): improve SOCKS5 proxy authentication handling

- Added nil check for proxy user credentials to prevent potential nil pointer dereference.
- Enhanced authentication logic for SOCKS5 proxies in `proxy_helpers.go` and `proxy.go`.

Luis Pater · 2025-09-30 11:23:39 +08:00

832268cae7

feat(runtime): introduce newProxyAwareHTTPClient for enhanced proxy handling

- Added `newProxyAwareHTTPClient` to centralize proxy configuration with priority on `auth.ProxyURL` and `cfg.ProxyURL`.
- Integrated enhanced proxy support across executors for HTTP, HTTPS, and SOCKS5 protocols.
- Refactored redundant HTTP client initialization to use `newProxyAwareHTTPClient` for consistent behavior.

Luis Pater · 2025-09-30 09:04:15 +08:00

de796ac1c2

refactor(runtime): move Anthropic-Beta header setting to applyClaudeHeaders for better header management

Luis Pater · 2025-09-29 20:51:36 +08:00

352a67857b

feat(provider/gemini-web): Prioritize explicit label for account identification

hkfires · 2025-09-27 10:56:15 +08:00

562a49a194

refactor(config): migrate to SDKConfig and streamline proxy handling

- Replaced `config.Config` with `config.SDKConfig` across components for simpler configuration management.
- Updated proxy setup functions and handlers to align with `SDKConfig` improvements.
- Reorganized handler imports to match new SDK structure.

Luis Pater · 2025-09-27 04:50:23 +08:00

57c9ba49f4

refactor(executor): remove redundant handling of "reasoning.effort" in gpt-5 and gpt-5-codex models

Luis Pater · 2025-09-26 18:13:28 +08:00

514add4b85

feat(gemini-web): Introduce stable account label for identification

hkfires · 2025-09-25 10:59:20 +08:00

2175a10932

rebuild branch

Luis Pater · 2025-09-25 10:32:48 +08:00

f5dc380b63

remove all

Luis Pater · 2025-09-25 10:31:02 +08:00

3f69254f43

chore(docs): add and refine package-level comments across modules

- Added detailed package-level comments to improve documentation coverage.
- Clarified parameter descriptions, return types, and functionality of exported methods across packages.
- Enhanced overall code readability and API documentation consistency.

Luis Pater · 2025-09-25 00:14:17 +08:00

0db0b03db9

fix(gemini): handle "[DONE]" chunk, trim "data:" prefix, and remove session_id from requests

- Adjusted stream handling to skip "[DONE]" chunks.
- Ensured "data:" prefix is trimmed for non-prefixed input in translation.
- Removed `session_id` from request bodies before processing.

Luis Pater · 2025-09-24 23:34:46 +08:00

48bbd9e214

Removed the cookie snapshot feature.

hkfires · 2025-09-24 22:12:29 +08:00

d4f5ec2492

refactor(gemini-web): Move provider logic to its own package

The Gemini Web API client logic has been relocated from `internal/client/gemini-web` to a new, more specific `internal/provider/gemini-web` package. This refactoring improves code organization and modularity by better isolating provider-specific implementations.

As a result of this move, the `GeminiWebState` struct and its methods have been exported (capitalized) to make them accessible from the executor. All call sites have been updated to use the new package path and the exported identifiers.

hkfires · 2025-09-24 22:12:29 +08:00

e9707c2f9e

refactor(executor): remove ClientAdapter and legacy fallback logic

- Deleted `ClientAdapter` implementation and associated fallback methods.
- Removed legacy executor logic from `codex`, `claude`, `gemini`, and `qwen` executors.
- Simplified `handlers` by eliminating `UnwrapError` handling and related dependencies.
- Cleaned up `model_registry` by removing logic associated with suspended clients.
- Updated `.gitignore` to ignore `.serena/` directory.

Luis Pater · 2025-09-24 21:09:36 +08:00

a2c5fdaf66

fix(codex): Remove reasoning.effort for default gpt-5-codex model

hkfires · 2025-09-24 13:17:19 +08:00

b86ed46845

feat(translators): add token counting support for Claude and Gemini responses

- Implemented `TokenCount` transform method across translators to calculate token usage.
- Integrated token counting logic into executor pipelines for Claude, Gemini, and CLI translators.
- Added corresponding API endpoints and handlers (`/messages/count_tokens`) for token usage retrieval.
- Enhanced translation registry to support `TokenCount` functionality alongside existing response types.

Luis Pater · 2025-09-24 11:59:38 +08:00

3dd5095792

feat(usage): implement usage tracking infrastructure across executors

- Added `LoggerPlugin` to log usage metrics for observability.
- Introduced a new `Manager` to handle usage record queuing and plugin registration.
- Integrated new usage reporter and detailed metrics parsing into executors, covering providers like OpenAI, Codex, Claude, and Gemini.
- Improved token usage breakdown across streaming and non-streaming responses.

Luis Pater · 2025-09-24 03:49:09 +08:00

3ade03f3b3

feat(translators): improve system instruction extraction and input handling for OpenAI and Claude responses

- Enhanced support for extracting system instructions from input arrays.
- Improved input message role and type determination logic for consistent message processing.
- Refined instruction handling logic across translator types for better compatibility.

Luis Pater · 2025-09-24 00:20:49 +08:00

5090d9853b

feat(translators): improve system instruction extraction and input handling for OpenAI and Claude responses

- Enhanced support for extracting system instructions from input arrays.
- Improved input message role and type determination logic for consistent message processing.
- Refined instruction handling logic across translator types for better compatibility.

Luis Pater · 2025-09-23 23:12:34 +08:00

d41ff2076f

fix(gemini): trim "data:" prefix in raw JSON and resolve variable shadowing in stream translation

Luis Pater · 2025-09-23 21:22:41 +08:00

b018072914

fix(gemini-web): Correct stream translation and reduce auth refresh lead

hkfires · 2025-09-23 20:51:55 +08:00

73cf491478

Merge pull request #58 from router-for-me/v6-test

refactor(gemini-web): Remove auto-refresh, auto-close, and caching

Luis Pater · 2025-09-23 18:20:47 +08:00

c159180589

refactor(gemini-web): Remove auto-refresh, auto-close, and caching

This commit simplifies the Gemini web client by removing several complex, stateful features. The previous implementation for auto-refreshing cookies and auto-closing the client involved background goroutines, timers, and file system caching, which made the client's lifecycle difficult to manage.

The following features have been removed:
- The cookie auto-refresh mechanism, including the background goroutine (`rotateCookies`) and related configuration fields.
- The file-based caching for the `__Secure-1PSIDTS` token. The `rotate1PSIDTS` function now fetches a new token on every call.
- The auto-close functionality, which used timers to close the client after a period of inactivity.
- Associated configuration options and methods (`WithAccountLabel`, `WithOnCookiesRefreshed`, `Close`, etc.).

By removing this logic, the client becomes more stateless and predictable. The responsibility for managing the client's lifecycle and handling token expiration is now shifted to the caller, leading to a simpler and more robust integration.

hkfires · 2025-09-23 12:48:30 +08:00

8e485e5868

feat(claude-executor): add ZSTD decoding support for Claude executor responses

- Integrated ZSTD decompression via `github.com/klauspost/compress` for responses with "zstd" content-encoding.
- Added helper `hasZSTDEcoding` to detect ZSTD-encoded responses.
- Updated response handling logic to initialize and use a ZSTD decoder when necessary.

refactor(api-handlers): split streaming and non-streaming response handling

- Introduced `handleNonStreamingResponse` for processing non-streaming requests in `ClaudeCodeAPIHandler`.
- Improved code clarity by separating streaming and non-streaming logic.

fix(service): remove redundant token refresh interval assignment logic in `cliproxy` service.

Luis Pater · 2025-09-23 12:44:44 +08:00

11b0efc38f

feat(gemini-web): Inject fallback text for image-only flash model responses

hkfires · 2025-09-23 10:05:59 +08:00

50c8f7f96f

feat(gemini-executor): implement CountTokens method with request translation and API integration

- Added `CountTokens` for token counting requests in Gemini executor.
- Integrated request translation via `sdktranslator` and response handling.
- Improved error handling, logging, and API request configuration with headers.

Luis Pater · 2025-09-23 02:45:08 +08:00

e313d39be8

feat(executor): add CountTokens support across all executors

- Introduced `CountTokens` method to Codex, Claude, Gemini, Qwen, OpenAI-compatible, and other executors.
- Implemented `ExecuteCount` in `AuthManager` for token counting via provider round-robin.
- Updated handlers to leverage `ExecuteCountWithAuthManager` for streamlined token counting.
- Added fallback and error handling logic for token counting requests.

Luis Pater · 2025-09-23 02:27:51 +08:00

ac59023abb

refactor(headers): centralize header logic using EnsureHeader utility

- Introduced `EnsureHeader` in `internal/misc/header_utils.go` to streamline header setting across executors.
- Updated Codex, Claude, and Gemini executors to utilize `EnsureHeader` for consistent header application.
- Incorporated Gin context headers (if available) into request header manipulation for better integration.

Luis Pater · 2025-09-23 02:01:57 +08:00

d32fc0400e

refactor(executor): centralize header application logic for executors

- Replaced repetitive header setting logic with helper methods (`applyCodexHeaders`, `applyClaudeHeaders`, `applyQwenHeaders`) in Codex, Claude, and Qwen executors.
- Ensured consistent headers in HTTP requests across all executors.
- Introduced UUID and additional structured headers for better traceability (e.g., session IDs, metadata).

Luis Pater · 2025-09-23 01:20:10 +08:00

7ea88358f0

chore(executor): add debug logging for API request errors

- Added detailed debug logs in all executors (Codex, Claude, Gemini, Qwen, OpenAI-compatible) to capture HTTP status and response body for failed API requests.

Luis Pater · 2025-09-22 23:37:53 +08:00

c6b391304d

feat(auth): standardize last_refresh metadata handling across executors

- Added `last_refresh` timestamp to metadata for Codex, Claude, Qwen, and Gemini executors.
- Implemented `extractLastRefreshTimestamp` utility for parsing diverse timestamp formats in management handlers.
- Ensured consistent update and preservation of `last_refresh` in file-based auth handling.

Luis Pater · 2025-09-22 23:23:31 +08:00

2e836cee88

feat(openai-compat): enhance provider key handling and model resolution

- Introduced dynamic `providerKey` resolution for OpenAI-compatible providers, incorporating attributes like `provider_key` and `compat_name`.
- Implemented upstream model overrides via `resolveUpstreamModel` and `overrideModel` methods in the OpenAI executor.
- Updated registry logic to correctly store provider mappings and register clients using normalized keys.
- Ensured consistency in handling empty or default provider names across components.

Luis Pater · 2025-09-22 22:54:21 +08:00

e41d127732

feat(gemini-web): Implement proactive PSIDTS cookie rotation

hkfires · 2025-09-22 21:54:52 +08:00

22a69333a0

56 Commits