CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-02-03 04:50:52 +08:00

Author	SHA1	Message	Date
Trung Nguyen	33a5656235	docs: add model mapping documentation for Amp CLI integration - Add model mapping feature to README.md Amp CLI section - Add detailed Model Mapping Configuration section to amp-cli-integration.md - Update architecture diagram to show model mapping flow - Update Model Fallback Behavior to include mapping step - Add Table of Contents entry for model mapping	2025-11-29 12:51:03 +07:00
Trung Nguyen	2cd59806e2	feat(amp): add model mapping support for routing unavailable models to alternatives - Add AmpModelMapping config to route models like 'claude-opus-4.5' to 'claude-sonnet-4' - Add ModelMapper interface and DefaultModelMapper implementation with hot-reload support - Enhance FallbackHandler to apply model mappings before falling back to ampcode.com - Add structured logging for routing decisions (local provider, mapping, amp credits) - Update config.example.yaml with amp-model-mappings documentation	2025-11-29 12:44:09 +07:00
hkfires	5983e3ec87	feat(auth): add oauth provider model blacklist	2025-11-28 10:37:10 +08:00
hkfires	f8cebb9343	feat(config): add per-key model blacklist for providers	2025-11-27 21:57:07 +08:00
Luis Pater	72c7ef7647	fix(translator): handle non-JSON output parsing for OpenAI function responses - Updated `antigravity_openai_request.go` to process non-JSON outputs gracefully by verifying and distinguishing between JSON and plain string formats. - Ensured proper assignment of parsed or raw response to `functionResponse`. v6.5.27	2025-11-27 16:18:49 +08:00
Luis Pater	d2e4639b2a	feat(registry): add context length and update max tokens for Claude model configurations - Added `ContextLength` field with a value of 200,000 to all applicable Claude model definitions. - Standardized `MaxCompletionTokens` values across models for consistency and alignment.	2025-11-27 16:13:25 +08:00
Luis Pater	08321223c4	Merge pull request #340 from nestharus/fix/339-thinking-openai-gemini-compat fix(thinking): resolve OpenAI/Gemini compatibility for thinking model…	2025-11-27 16:03:24 +08:00
Luis Pater	7e30157590	Fixed: #354 fix(translator): add support for "xhigh" reasoning effort in OpenAI responses - Updated handling in `openai_openai-responses_request.go` to include the new "xhigh" reasoning effort level.	2025-11-27 15:59:15 +08:00
nestharus	e73cdf5cff	fix(claude): ensure max_tokens exceeds thinking budget for thinking models Fixes an issue where Claude thinking models would return 400 errors when the thinking.budget_tokens was greater than or equal to max_tokens. Changes: - Add MaxCompletionTokens: 128000 to all Claude thinking model definitions - Add ensureMaxTokensForThinking() function in claude_executor.go that: - Checks if thinking is enabled with a budget_tokens value - Looks up the model's MaxCompletionTokens from the registry - Ensures max_tokens is set to at least the model's MaxCompletionTokens - Falls back to budget_tokens + 4000 buffer if registry lookup fails This ensures Anthropic API constraint (max_tokens > thinking.budget_tokens) is always satisfied when using extended thinking features. Fixes: #339 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-26 22:31:05 -08:00
Luis Pater	39621a0340	fix(translator): normalize function calls and outputs for consistent input processing - Implemented logic to pair consecutive function calls and their outputs, ensuring proper sequencing for processing. - Adjusted `gemini_openai-responses_request.go` to normalize message structures and maintain expected flow. v6.5.26	2025-11-27 10:25:45 +08:00
Luis Pater	346b663079	fix(translator): handle non-JSON output gracefully in function call outputs - Updated handling of `output` in `gemini_openai-responses_request.go` to use `.Str` instead of `.Raw` when parsing non-JSON string outputs. - Added checks to distinguish between JSON and non-JSON `output` types for accurate `functionResponse` construction.	2025-11-27 09:40:00 +08:00
Luis Pater	0bcae68c6c	fix(translator): preserve raw JSON encoding in function call outputs - Updated handling of `output` in `gemini_openai-responses_request.go` to use `.Raw` instead of `.String` for preserving original JSON encoding. - Ensured proper setting of raw JSON output when constructing `functionResponse`. v6.5.25	2025-11-27 08:26:53 +08:00
Luis Pater	c8cee547fd	fix(translator): ensure partial content is retained while skipping encrypted thoughtSignature - Updated handling of `thoughtSignature` across all translator modules to retain other content payloads if present. - Adjusted logic for `thought_signature` and `inline_data` keys for consistent processing. v6.5.24	2025-11-27 00:52:17 +08:00
Luis Pater	36755421fe	Merge pull request #343 from router-for-me/misc style(amp): tidy whitespace in proxy module and tests	2025-11-26 19:03:07 +08:00
hkfires	6c17dbc4da	style(amp): tidy whitespace in proxy module and tests	2025-11-26 18:57:26 +08:00
Luis Pater	ee6429cc75	feat(registry): add Gemini 3 Pro Image Preview model and remove Claude Sonnet 4.5 Thinking - Added new `Gemini 3 Pro Image Preview` model with detailed metadata and configuration. - Removed outdated `Claude Sonnet 4.5 Thinking` model definition for cleanup and relevance. v6.5.23	2025-11-26 18:22:40 +08:00
Luis Pater	a4a26d978e	Fixed: #339 feat(handlers, executor): add Gemini 3 Pro Preview support and refine Claude system instructions - Added support for the new "Gemini 3 Pro Preview" action in Gemini handlers, including detailed metadata and configuration. - Removed redundant `cache_control` field from Claude system instructions for cleaner payload structure. v6.5.22	2025-11-26 11:42:57 +08:00
Luis Pater	ed9f6e897e	Fixed: #337 fix(executor): replace redundant commented code with `checkSystemInstructions` helper - Replaced commented-out `sjson.SetRawBytes` lines with the new `checkSystemInstructions` function. - Centralized system instruction handling for better code clarity and reuse. - Ensured consistent logic for managing `system` field across Claude executor flows. v6.5.21	2025-11-26 08:27:48 +08:00
Luis Pater	9c1e3c0687	Merge pull request #334 from nestharus/feat/claude-thinking-and-beta-headers feat(claude): add thinking model variants and beta headers support v6.5.20	2025-11-26 02:17:02 +08:00
Luis Pater	2e5681ea32	Merge branch 'dev' into feat/claude-thinking-and-beta-headers	2025-11-26 02:16:40 +08:00
Luis Pater	52c17f03a5	fix(executor): comment out redundant code for setting Claude system instructions - Commented out multiple instances of `sjson.SetRawBytes` for setting `system` key to Claude instructions as they are redundant. - Code cleanup to improve clarity and maintainability without affecting functionality. v6.5.19	2025-11-26 02:06:16 +08:00
nestharus	d0e694d4ed	feat(claude): add thinking model variants and beta headers support - Add Claude thinking model definitions (sonnet-4-5-thinking, opus-4-5-thinking variants) - Add Thinking support for antigravity models with -thinking suffix - Add injectThinkingConfig() for automatic thinking budget based on model suffix - Add resolveUpstreamModel() mappings for thinking variants to actual Claude models - Add extractAndRemoveBetas() to convert betas array to anthropic-beta header - Update applyClaudeHeaders() to merge custom betas from request body Closes #324	2025-11-25 03:33:05 -08:00
Luis Pater	506f1117dd	fix(handlers): refactor API response capture to append data safely - Introduced `appendAPIResponse` helper to preserve and append data to existing API responses. - Ensured newline inclusion when appending, if necessary. - Improved `nil` and data type checks for response handling. - Updated middleware to skip request logging for `GET` requests. v6.5.18	2025-11-25 11:37:02 +08:00
Luis Pater	113db3c5bf	fix(executor): update antigravity executor to enhance model metadata handling - Added additional metadata fields (`Name`, `Description`, `DisplayName`, `Version`) to `ModelInfo` struct initialization for better model representation. - Removed unnecessary whitespace in the code. v6.5.17	2025-11-25 09:19:01 +08:00
Luis Pater	1aa0b6cd11	Merge pull request #322 from ben-vargas/feat-claude-opus-4-5 feat(registry): add Claude Opus 4.5 model definition v6.5.16	2025-11-25 08:38:06 +08:00
Ben Vargas	0895533400	fix(registry): correct Claude Opus 4.5 created timestamp Update epoch from 1730419200 (2024-11-01) to 1761955200 (2025-11-01).	2025-11-24 12:27:23 -07:00
Ben Vargas	43f007c234	feat(registry): add Claude Opus 4.5 model definition Add support for claude-opus-4-5-20251101 with 200K context window and 64K max output tokens.	2025-11-24 12:26:39 -07:00
Luis Pater	0ceee56d99	Merge pull request #318 from router-for-me/log feat(logs): add limit query param to cap returned logs v6.5.15	2025-11-24 20:35:28 +08:00
hkfires	943a8c74df	feat(logs): add limit query param to cap returned logs	2025-11-24 19:59:24 +08:00
Luis Pater	0a47b452e9	fix(translator): add conditional check for key renaming in Gemini tools - Ensured `functionDeclarations` key renaming only occurs if the key exists in Gemini tools processing. - Prevented unnecessary JSON reassignment when the target key is absent. v6.5.14	2025-11-24 17:15:43 +08:00
Luis Pater	261f08a82a	fix(translator): adjust key renaming logic in Gemini request processing - Fixed parameter key renaming to correctly handle `functionDeclarations` and `parametersJsonSchema` in Gemini tools. - Resolved potential overwriting issue by reassigning JSON strings after each key rename.	2025-11-24 17:12:04 +08:00
Luis Pater	d114d8d0bd	feat(config): add TLS support for HTTPS server configuration - Introduced `TLSConfig` to support HTTPS configurations, including enabling TLS, specifying certificate and key files. - Updated HTTP server logic to handle HTTPS mode when TLS is enabled. - Enhanced `config.example.yaml` with TLS settings example. - Adjusted internal URL generation to respect protocol based on TLS state. v6.5.13	2025-11-24 10:41:29 +08:00
Luis Pater	bb9955e461	fix(auth): resolve index reassignment issue during auth management - Fixed improper handling of `indexAssigned` and `Index` during auth reassignment. - Ensured `EnsureIndex` is invoked after validating existing auth entries.	2025-11-24 10:10:09 +08:00
Luis Pater	7063a176f4	#293 feat(retry): add configurable retry logic with cooldown support - Introduced `max-retry-interval` configuration for cooldown durations between retries. - Added `SetRetryConfig` in `Manager` to handle retry attempts and cooldown intervals. - Enhanced provider execution logic to include retry attempts, cooldown management, and dynamic wait periods. - Updated API endpoints and YAML configuration to support `max-retry-interval`.	2025-11-24 09:55:15 +08:00
Luis Pater	e3082887a6	feat(logging, middleware): add error-based logging support and error log management - Introduced `logOnErrorOnly` mode to enable logging only for error responses when request logging is disabled. - Added endpoints to list and download error logs (`/request-error-logs`). - Implemented error log file cleanup to retain only the newest 10 logs. - Refactored `ResponseWriterWrapper` to support forced logging for error responses. - Enhanced middleware to capture data for upstream error persistence. - Improved log file naming and error log filename generation. v6.5.12	2025-11-23 22:41:57 +08:00
Luis Pater	ddb0c0ec1c	fix(translator): reintroduce `thoughtSignature` bypass logic for model parts - Restored `thoughtSignature` validator bypass for model-specific parts in Gemini content processing. - Removed redundant logic from the `executor` for cleaner handling.	2025-11-23 20:52:23 +08:00
Luis Pater	d1736cb29c	Merge pull request #315 from router-for-me/aistudio fix(aistudio): strip Gemini generation config overrides v6.5.11	2025-11-23 20:25:59 +08:00
hkfires	62bfd62871	fix(aistudio): strip Gemini generation config overrides Remove generationConfig.maxOutputTokens, generationConfig.responseMimeType and generationConfig.responseJsonSchema from the Gemini payload in translateRequest so we no longer send unsupported or conflicting response configuration fields. This lets the backend or caller control response formatting and output limits and helps prevent potential API errors caused by these keys.	2025-11-23 19:44:03 +08:00
Luis Pater	257621c5ed	chore(executor): update default agent version and simplify const formatting - Updated `defaultAntigravityAgent` to version `1.11.5`. - Adjusted const value formatting for improved readability. feat(executor): introduce fallback mechanism for Antigravity base URLs - Added retry logic with fallback order for Antigravity base URLs to handle request errors and rate limits. - Refactored base URL handling with `antigravityBaseURLFallbackOrder` and related utilities. - Enhanced error handling in non-streaming and streaming requests with retry support and improved metadata reporting. - Updated `buildRequest` to support dynamic base URL assignment. v6.5.10	2025-11-23 17:53:07 +08:00
Luis Pater	ac064389ca	feat(executor, translator): enhance token handling and payload processing - Improved Antigravity executor to handle `thinkingConfig` adjustments and default `thinkingBudget` when `thinkingLevel` is removed. - Updated translator response handling to set default values for output token counts when specific token data is missing. v6.5.9	2025-11-23 11:32:37 +08:00
Luis Pater	8d23ffc873	feat(executor): add model alias mapping and improve Antigravity payload handling - Introduced `modelName2Alias` and `alias2ModelName` functions for mapping between model names and aliases. - Improved Antigravity payload transformation to include alias-to-model name conversion. - Enhanced processing for Claude Sonnet models to adjust template parameters based on schema presence. v6.5.8	2025-11-23 03:16:14 +08:00
Luis Pater	4307f08bbc	feat(watcher): optimize auth file handling with hash-based change detection - Added `authFileUnchanged` to skip reloads for unchanged files based on SHA256 hash comparisons. - Introduced `isKnownAuthFile` to verify known files before handling removal events. - Improved event processing in `handleEvent` to reduce unnecessary reloads and enhance performance.	2025-11-23 01:22:16 +08:00
Luis Pater	9d50a68768	feat(translator): improve content processing and Antigravity request conversion - Refactored response translation logic to support mixed content types (`input_text`, `output_text`, `input_image`) with better role assignments and part handling. - Added image processing logic for embedding inline data with MIME type and base64 encoded content. - Updated Antigravity request conversion to replace Gemini CLI references for consistency. v6.5.7	2025-11-22 21:34:34 +08:00
Luis Pater	7c3c24addc	Merge pull request #306 from router-for-me/usage fix some bugs	2025-11-22 17:45:49 +08:00
hkfires	166fa9e2e6	fix(gemini): parse stream usage from JSON, skip thoughtSignature	2025-11-22 16:07:12 +08:00
hkfires	88e566281e	fix(gemini): filter SSE usage metadata in streams	2025-11-22 15:53:36 +08:00
hkfires	d32bb9db6b	fix(runtime): treat non-empty finishReason as terminal	2025-11-22 15:39:46 +08:00
hkfires	8356b35320	fix(executor): expire stop chunks without usage metadata	2025-11-22 15:27:47 +08:00
hkfires	19a048879c	feat(runtime): track antigravity usage and token counts	2025-11-22 14:04:28 +08:00
hkfires	1061354b2f	fix: handle empty and non-JSON SSE chunks safely	2025-11-22 13:49:23 +08:00


911 Commits
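
Note on commit e73cdf5cff: the message describes clamping `max_tokens` so that it stays above `thinking.budget_tokens`. The following is a minimal sketch of that rule as the message states it, not the repository's actual `ensureMaxTokensForThinking()` code. The function name `ensureMaxTokens`, the `maxCompletionTokens` map (a stand-in for the model registry), and the model key are all hypothetical and chosen only for illustration.

```go
package main

import "fmt"

// maxCompletionTokens is a hypothetical stand-in for the model registry.
// The 128000 value comes from the commit message; the key is illustrative.
var maxCompletionTokens = map[string]int{
	"claude-opus-4-5-thinking": 128000,
}

// ensureMaxTokens returns the max_tokens value to send upstream.
// A budget of 0 or less is treated here as "thinking disabled" (an assumption).
func ensureMaxTokens(model string, maxTokens, budget int) int {
	if budget <= 0 {
		return maxTokens // no thinking budget, leave the request as-is
	}
	floor, ok := maxCompletionTokens[model]
	if !ok {
		// Registry lookup failed: fall back to budget + 4000 buffer.
		floor = budget + 4000
	}
	if maxTokens < floor {
		return floor
	}
	return maxTokens
}

func main() {
	fmt.Println(ensureMaxTokens("claude-opus-4-5-thinking", 8192, 32000)) // 128000
	fmt.Println(ensureMaxTokens("unknown-model", 8192, 32000))            // 36000
	fmt.Println(ensureMaxTokens("unknown-model", 8192, 0))                // 8192
}
```

The sketch works on plain integers to keep the rule visible; the commit places the real logic in `claude_executor.go`, where it operates on the request payload.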