CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-02-03 04:50:52 +08:00

Author	SHA1	Message	Date
Ben Vargas	db1119dd78	fix(amp): add /threads.rss root-level route for AMP CLI AMP CLI requests /threads.rss at the root level, but the AMP module only registered routes under /api/*. This caused a 404 error during AMP CLI startup. Add the missing root-level route with the same security middleware (noCORS, optional localhost restriction) as other management routes.	2025-11-29 05:01:19 -07:00
Luis Pater	72c7ef7647	fix(translator): handle non-JSON output parsing for OpenAI function responses - Updated `antigravity_openai_request.go` to process non-JSON outputs gracefully by verifying and distinguishing between JSON and plain string formats. - Ensured proper assignment of parsed or raw response to `functionResponse`.	2025-11-27 16:18:49 +08:00
Luis Pater	d2e4639b2a	feat(registry): add context length and update max tokens for Claude model configurations - Added `ContextLength` field with a value of 200,000 to all applicable Claude model definitions. - Standardized `MaxCompletionTokens` values across models for consistency and alignment.	2025-11-27 16:13:25 +08:00
Luis Pater	08321223c4	Merge pull request #340 from nestharus/fix/339-thinking-openai-gemini-compat fix(thinking): resolve OpenAI/Gemini compatibility for thinking model…	2025-11-27 16:03:24 +08:00
Luis Pater	7e30157590	Fixed: #354 fix(translator): add support for "xhigh" reasoning effort in OpenAI responses - Updated handling in `openai_openai-responses_request.go` to include the new "xhigh" reasoning effort level.	2025-11-27 15:59:15 +08:00
nestharus	e73cdf5cff	fix(claude): ensure max_tokens exceeds thinking budget for thinking models Fixes an issue where Claude thinking models would return 400 errors when the thinking.budget_tokens was greater than or equal to max_tokens. Changes: - Add MaxCompletionTokens: 128000 to all Claude thinking model definitions - Add ensureMaxTokensForThinking() function in claude_executor.go that: - Checks if thinking is enabled with a budget_tokens value - Looks up the model's MaxCompletionTokens from the registry - Ensures max_tokens is set to at least the model's MaxCompletionTokens - Falls back to budget_tokens + 4000 buffer if registry lookup fails This ensures Anthropic API constraint (max_tokens > thinking.budget_tokens) is always satisfied when using extended thinking features. Fixes: #339 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-26 22:31:05 -08:00
Luis Pater	39621a0340	fix(translator): normalize function calls and outputs for consistent input processing - Implemented logic to pair consecutive function calls and their outputs, ensuring proper sequencing for processing. - Adjusted `gemini_openai-responses_request.go` to normalize message structures and maintain expected flow.	2025-11-27 10:25:45 +08:00
Luis Pater	346b663079	fix(translator): handle non-JSON output gracefully in function call outputs - Updated handling of `output` in `gemini_openai-responses_request.go` to use `.Str` instead of `.Raw` when parsing non-JSON string outputs. - Added checks to distinguish between JSON and non-JSON `output` types for accurate `functionResponse` construction.	2025-11-27 09:40:00 +08:00
Luis Pater	0bcae68c6c	fix(translator): preserve raw JSON encoding in function call outputs - Updated handling of `output` in `gemini_openai-responses_request.go` to use `.Raw` instead of `.String` for preserving original JSON encoding. - Ensured proper setting of raw JSON output when constructing `functionResponse`.	2025-11-27 08:26:53 +08:00
Luis Pater	c8cee547fd	fix(translator): ensure partial content is retained while skipping encrypted thoughtSignature - Updated handling of `thoughtSignature` across all translator modules to retain other content payloads if present. - Adjusted logic for `thought_signature` and `inline_data` keys for consistent processing.	2025-11-27 00:52:17 +08:00
Luis Pater	36755421fe	Merge pull request #343 from router-for-me/misc style(amp): tidy whitespace in proxy module and tests	2025-11-26 19:03:07 +08:00
hkfires	6c17dbc4da	style(amp): tidy whitespace in proxy module and tests	2025-11-26 18:57:26 +08:00
Luis Pater	ee6429cc75	feat(registry): add Gemini 3 Pro Image Preview model and remove Claude Sonnet 4.5 Thinking - Added new `Gemini 3 Pro Image Preview` model with detailed metadata and configuration. - Removed outdated `Claude Sonnet 4.5 Thinking` model definition for cleanup and relevance.	2025-11-26 18:22:40 +08:00
Luis Pater	a4a26d978e	Fixed: #339 feat(handlers, executor): add Gemini 3 Pro Preview support and refine Claude system instructions - Added support for the new "Gemini 3 Pro Preview" action in Gemini handlers, including detailed metadata and configuration. - Removed redundant `cache_control` field from Claude system instructions for cleaner payload structure.	2025-11-26 11:42:57 +08:00
Luis Pater	ed9f6e897e	Fixed: #337 fix(executor): replace redundant commented code with `checkSystemInstructions` helper - Replaced commented-out `sjson.SetRawBytes` lines with the new `checkSystemInstructions` function. - Centralized system instruction handling for better code clarity and reuse. - Ensured consistent logic for managing `system` field across Claude executor flows.	2025-11-26 08:27:48 +08:00
Luis Pater	2e5681ea32	Merge branch 'dev' into feat/claude-thinking-and-beta-headers	2025-11-26 02:16:40 +08:00
Luis Pater	52c17f03a5	fix(executor): comment out redundant code for setting Claude system instructions - Commented out multiple instances of `sjson.SetRawBytes` for setting `system` key to Claude instructions as they are redundant. - Code cleanup to improve clarity and maintainability without affecting functionality.	2025-11-26 02:06:16 +08:00
nestharus	d0e694d4ed	feat(claude): add thinking model variants and beta headers support - Add Claude thinking model definitions (sonnet-4-5-thinking, opus-4-5-thinking variants) - Add Thinking support for antigravity models with -thinking suffix - Add injectThinkingConfig() for automatic thinking budget based on model suffix - Add resolveUpstreamModel() mappings for thinking variants to actual Claude models - Add extractAndRemoveBetas() to convert betas array to anthropic-beta header - Update applyClaudeHeaders() to merge custom betas from request body Closes #324	2025-11-25 03:33:05 -08:00
Luis Pater	506f1117dd	fix(handlers): refactor API response capture to append data safely - Introduced `appendAPIResponse` helper to preserve and append data to existing API responses. - Ensured newline inclusion when appending, if necessary. - Improved `nil` and data type checks for response handling. - Updated middleware to skip request logging for `GET` requests.	2025-11-25 11:37:02 +08:00
Luis Pater	113db3c5bf	fix(executor): update antigravity executor to enhance model metadata handling - Added additional metadata fields (`Name`, `Description`, `DisplayName`, `Version`) to `ModelInfo` struct initialization for better model representation. - Removed unnecessary whitespace in the code.	2025-11-25 09:19:01 +08:00
Ben Vargas	0895533400	fix(registry): correct Claude Opus 4.5 created timestamp Update epoch from 1730419200 (2024-11-01) to 1761955200 (2025-11-01).	2025-11-24 12:27:23 -07:00
Ben Vargas	43f007c234	feat(registry): add Claude Opus 4.5 model definition Add support for claude-opus-4-5-20251101 with 200K context window and 64K max output tokens.	2025-11-24 12:26:39 -07:00
hkfires	943a8c74df	feat(logs): add limit query param to cap returned logs	2025-11-24 19:59:24 +08:00
Luis Pater	0a47b452e9	fix(translator): add conditional check for key renaming in Gemini tools - Ensured `functionDeclarations` key renaming only occurs if the key exists in Gemini tools processing. - Prevented unnecessary JSON reassignment when the target key is absent.	2025-11-24 17:15:43 +08:00
Luis Pater	261f08a82a	fix(translator): adjust key renaming logic in Gemini request processing - Fixed parameter key renaming to correctly handle `functionDeclarations` and `parametersJsonSchema` in Gemini tools. - Resolved potential overwriting issue by reassigning JSON strings after each key rename.	2025-11-24 17:12:04 +08:00
Luis Pater	d114d8d0bd	feat(config): add TLS support for HTTPS server configuration - Introduced `TLSConfig` to support HTTPS configurations, including enabling TLS, specifying certificate and key files. - Updated HTTP server logic to handle HTTPS mode when TLS is enabled. - Enhanced `config.example.yaml` with TLS settings example. - Adjusted internal URL generation to respect protocol based on TLS state.	2025-11-24 10:41:29 +08:00
Luis Pater	7063a176f4	#293 feat(retry): add configurable retry logic with cooldown support - Introduced `max-retry-interval` configuration for cooldown durations between retries. - Added `SetRetryConfig` in `Manager` to handle retry attempts and cooldown intervals. - Enhanced provider execution logic to include retry attempts, cooldown management, and dynamic wait periods. - Updated API endpoints and YAML configuration to support `max-retry-interval`.	2025-11-24 09:55:15 +08:00
Luis Pater	e3082887a6	feat(logging, middleware): add error-based logging support and error log management - Introduced `logOnErrorOnly` mode to enable logging only for error responses when request logging is disabled. - Added endpoints to list and download error logs (`/request-error-logs`). - Implemented error log file cleanup to retain only the newest 10 logs. - Refactored `ResponseWriterWrapper` to support forced logging for error responses. - Enhanced middleware to capture data for upstream error persistence. - Improved log file naming and error log filename generation.	2025-11-23 22:41:57 +08:00
Luis Pater	ddb0c0ec1c	fix(translator): reintroduce `thoughtSignature` bypass logic for model parts - Restored `thoughtSignature` validator bypass for model-specific parts in Gemini content processing. - Removed redundant logic from the `executor` for cleaner handling.	2025-11-23 20:52:23 +08:00
hkfires	62bfd62871	fix(aistudio): strip Gemini generation config overrides Remove generationConfig.maxOutputTokens, generationConfig.responseMimeType and generationConfig.responseJsonSchema from the Gemini payload in translateRequest so we no longer send unsupported or conflicting response configuration fields. This lets the backend or caller control response formatting and output limits and helps prevent potential API errors caused by these keys.	2025-11-23 19:44:03 +08:00
Luis Pater	257621c5ed	chore(executor): update default agent version and simplify const formatting - Updated `defaultAntigravityAgent` to version `1.11.5`. - Adjusted const value formatting for improved readability. feat(executor): introduce fallback mechanism for Antigravity base URLs - Added retry logic with fallback order for Antigravity base URLs to handle request errors and rate limits. - Refactored base URL handling with `antigravityBaseURLFallbackOrder` and related utilities. - Enhanced error handling in non-streaming and streaming requests with retry support and improved metadata reporting. - Updated `buildRequest` to support dynamic base URL assignment.	2025-11-23 17:53:07 +08:00
Luis Pater	ac064389ca	feat(executor, translator): enhance token handling and payload processing - Improved Antigravity executor to handle `thinkingConfig` adjustments and default `thinkingBudget` when `thinkingLevel` is removed. - Updated translator response handling to set default values for output token counts when specific token data is missing.	2025-11-23 11:32:37 +08:00
Luis Pater	8d23ffc873	feat(executor): add model alias mapping and improve Antigravity payload handling - Introduced `modelName2Alias` and `alias2ModelName` functions for mapping between model names and aliases. - Improved Antigravity payload transformation to include alias-to-model name conversion. - Enhanced processing for Claude Sonnet models to adjust template parameters based on schema presence.	2025-11-23 03:16:14 +08:00
Luis Pater	4307f08bbc	feat(watcher): optimize auth file handling with hash-based change detection - Added `authFileUnchanged` to skip reloads for unchanged files based on SHA256 hash comparisons. - Introduced `isKnownAuthFile` to verify known files before handling removal events. - Improved event processing in `handleEvent` to reduce unnecessary reloads and enhance performance.	2025-11-23 01:22:16 +08:00
Luis Pater	9d50a68768	feat(translator): improve content processing and Antigravity request conversion - Refactored response translation logic to support mixed content types (`input_text`, `output_text`, `input_image`) with better role assignments and part handling. - Added image processing logic for embedding inline data with MIME type and base64 encoded content. - Updated Antigravity request conversion to replace Gemini CLI references for consistency.	2025-11-22 21:34:34 +08:00
hkfires	166fa9e2e6	fix(gemini): parse stream usage from JSON, skip thoughtSignature	2025-11-22 16:07:12 +08:00
hkfires	88e566281e	fix(gemini): filter SSE usage metadata in streams	2025-11-22 15:53:36 +08:00
hkfires	d32bb9db6b	fix(runtime): treat non-empty finishReason as terminal	2025-11-22 15:39:46 +08:00
hkfires	8356b35320	fix(executor): expire stop chunks without usage metadata	2025-11-22 15:27:47 +08:00
hkfires	19a048879c	feat(runtime): track antigravity usage and token counts	2025-11-22 14:04:28 +08:00
hkfires	1061354b2f	fix: handle empty and non-JSON SSE chunks safely	2025-11-22 13:49:23 +08:00
hkfires	46b4110ff3	fix: preserve SSE usage metadata-only trailing chunks	2025-11-22 13:25:25 +08:00
hkfires	c29931e093	fix(translator): ignore empty JSON chunks in OpenAI responses	2025-11-22 13:09:16 +08:00
hkfires	b05cfd9f84	fix(translator): include empty text chunks in responses	2025-11-22 13:03:50 +08:00
hkfires	8ce22b8403	fix(sse): preserve usage metadata for stop chunks	2025-11-22 12:50:23 +08:00
Luis Pater	d1cdedc4d1	Merge pull request #303 from router-for-me/image feat(translator): support image size and googleSearch tools	2025-11-22 11:20:58 +08:00
Luis Pater	d291eb9489	Fixed: #302 feat(executor): enhance WebSocket error handling and metadata logging - Added handling for stream closure before start with appropriate error recording. - Improved metadata logging for non-OK HTTP status codes in WebSocket responses. - Consolidated event processing logic with `processEvent` for better error handling and payload management. - Refactored stream initialization to include the first event handling for smoother execution flow.	2025-11-22 11:18:13 +08:00
hkfires	dc8d3201e1	feat(translator): support image size and googleSearch tools	2025-11-22 10:36:52 +08:00
Luis Pater	7757210af6	feat(auth): implement Antigravity OAuth authentication flow - Added new endpoint `/antigravity-auth-url` to initiate Antigravity authentication. - Implemented `RequestAntigravityToken` to manage the OAuth flow, including token exchange and user info retrieval. - Introduced `.oauth-antigravity` temporary file handling for state and code management. - Added `sanitizeAntigravityFileName` utility for safe token file names based on user email. - Registered `/antigravity/callback` endpoint for OAuth redirects.	2025-11-22 01:45:06 +08:00
Luis Pater	c1031e2d3f	feat(translator): add Antigravity translation logic - Introduced request and response translation functions to enable compatibility between OpenAI Chat Completions API and Antigravity. - Registered translation utilities for both streaming and non-streaming scenarios. - Added support for reasoning content, tool calls, and metadata handling. - Established request normalization and embedding for Antigravity-compatible payloads. - Added new fields to `Params` struct for better tracking of finish reasons, usage metadata, and tool usage. - Refactored handling of response transitions, final events, and state-driven logic in `ConvertAntigravityResponseToClaude`. - Introduced `appendFinalEvents` and `resolveStopReason` helper functions for cleaner separation of concerns. - Added `TotalTokenCount` field to `Params` struct for enhanced token tracking. - Updated token count calculations to fallback on `TotalTokenCount` when specific counts are missing. - Introduced `hasNonZeroUsageMetadata` function to validate presence of token data in `usage_metadata`.	2025-11-21 23:40:59 +08:00

1 2 3 4 5 ...

553 Commits