CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-02-03 04:50:52 +08:00

Author	SHA1	Message	Date
NguyenSiTrung	3409f4e336	fix: enable hot reload for amp-model-mappings config - Store ampModule in Server struct to access it during config updates - Call ampModule.OnConfigUpdated() in UpdateClients() for hot reload - Watch config directory instead of file to handle atomic saves (vim, VSCode, etc.) - Improve config file event detection with basename matching - Add diagnostic logging for config reload tracing	2025-12-01 13:34:49 +07:00
NguyenSiTrung	9354b87e54	Merge branch 'router-for-me:main' into main	2025-12-01 08:12:29 +07:00
Luis Pater	54e24110ec	Merge pull request #386 from auroraflux/feat/dedupe-thinking-metadata-helpers refactor(executor): dedupe thinking metadata helpers across Gemini executors v6.5.31	2025-12-01 09:00:27 +08:00
Luis Pater	717c703bff	docs(readme): add CCS (Claude Code Switch) to projects list	2025-12-01 07:22:42 +08:00
auroraflux	1c6f4be8ae	refactor(executor): dedupe thinking metadata helpers across Gemini executors Extract applyThinkingMetadata and applyThinkingMetadataCLI helpers to payload_helpers.go and use them across all four Gemini-based executors: - gemini_executor.go (Execute, ExecuteStream, CountTokens) - gemini_cli_executor.go (Execute, ExecuteStream, CountTokens) - aistudio_executor.go (translateRequest) - antigravity_executor.go (Execute, ExecuteStream) This eliminates code duplication introduced in the -reasoning suffix PR and centralizes the thinking config application logic. Net reduction: 28 lines of code.	2025-11-30 15:20:15 -08:00
Luis Pater	0de2560cee	Merge pull request #379 from kaitranntt/docs/add-ccs-project docs: add CCS (Claude Code Switch) to projects list	2025-12-01 07:20:04 +08:00
Kai (Tam Nhu) Tran	85eb926482	fix: change AGY to Antigravity	2025-11-30 12:43:12 -05:00
Kai (Tam Nhu) Tran	c52ef08e67	docs: add CCS to projects list	2025-11-30 12:40:35 -05:00
Luis Pater	cb580cd083	Merge pull request #377 from router-for-me/gemini feat(registry): add thinking support to gemini models v6.5.30	2025-11-30 21:27:54 +08:00
hkfires	75e278c7a5	feat(registry): add thinking support to gemini models	2025-11-30 20:56:29 +08:00
Luis Pater	73208c4e55	Merge pull request #376 from auroraflux/feat/reasoning-suffix-support feat(util): add -reasoning suffix support for Gemini models	2025-11-30 20:55:38 +08:00
auroraflux	32d3809f8c	feat(util): add -reasoning suffix support for Gemini models Adds support for the `-reasoning` model name suffix which enables thinking/reasoning mode with dynamic budget. This allows clients to request reasoning-enabled inference using model names like `gemini-2.5-flash-reasoning` without explicit configuration. The suffix is normalized to the base model (e.g., gemini-2.5-flash) with thinkingBudget=-1 (dynamic) and include_thoughts=true. Follows the existing pattern established by -nothinking and -thinking-N suffixes.	2025-11-30 01:18:57 -08:00
Luis Pater	a748e93fd9	fix(executor, auth): ensure index assignment consistency for auth objects - Updated `usage_helpers.go` to call `EnsureIndex()` for proper index assignment in reporter initialization. - Adjusted `auth/manager.go` to assign auth indices inside a locked section when they are unassigned, ensuring thread safety and consistency. v6.5.29	2025-11-30 16:56:29 +08:00
Luis Pater	54a9c4c3c7	Merge pull request #371 from ben-vargas/test-amp-tools fix(amp): add /threads.rss root-level route for AMP CLI v6.5.28	2025-11-30 15:18:23 +08:00
Luis Pater	18b5c35dea	Merge pull request #366 from router-for-me/blacklist Add Model Blacklist	2025-11-30 15:17:46 +08:00
hkfires	7b7871ede2	feat(api): add oauth excluded model management	2025-11-30 13:38:23 +08:00
hkfires	c4e3646b75	docs(config): expand model exclusion examples	2025-11-30 11:55:47 +08:00
hkfires	022aa81be1	feat(cliproxy): support wildcard exclusions for models	2025-11-30 08:02:00 +08:00
hkfires	c43f0ea7b1	refactor(config): rename model blacklist fields to excluded models	2025-11-29 21:23:47 +08:00
hkfires	6a191358af	fix(auth): fix runtime auth reload on oauth blacklist change	2025-11-29 20:30:11 +08:00
Ben Vargas	db1119dd78	fix(amp): add /threads.rss root-level route for AMP CLI AMP CLI requests /threads.rss at the root level, but the AMP module only registered routes under /api/*. This caused a 404 error during AMP CLI startup. Add the missing root-level route with the same security middleware (noCORS, optional localhost restriction) as other management routes.	2025-11-29 05:01:19 -07:00
Trung Nguyen	33a5656235	docs: add model mapping documentation for Amp CLI integration - Add model mapping feature to README.md Amp CLI section - Add detailed Model Mapping Configuration section to amp-cli-integration.md - Update architecture diagram to show model mapping flow - Update Model Fallback Behavior to include mapping step - Add Table of Contents entry for model mapping	2025-11-29 12:51:03 +07:00
Trung Nguyen	2cd59806e2	feat(amp): add model mapping support for routing unavailable models to alternatives - Add AmpModelMapping config to route models like 'claude-opus-4.5' to 'claude-sonnet-4' - Add ModelMapper interface and DefaultModelMapper implementation with hot-reload support - Enhance FallbackHandler to apply model mappings before falling back to ampcode.com - Add structured logging for routing decisions (local provider, mapping, amp credits) - Update config.example.yaml with amp-model-mappings documentation	2025-11-29 12:44:09 +07:00
hkfires	5983e3ec87	feat(auth): add oauth provider model blacklist	2025-11-28 10:37:10 +08:00
hkfires	f8cebb9343	feat(config): add per-key model blacklist for providers	2025-11-27 21:57:07 +08:00
Luis Pater	72c7ef7647	fix(translator): handle non-JSON output parsing for OpenAI function responses - Updated `antigravity_openai_request.go` to process non-JSON outputs gracefully by verifying and distinguishing between JSON and plain string formats. - Ensured proper assignment of parsed or raw response to `functionResponse`. v6.5.27	2025-11-27 16:18:49 +08:00
Luis Pater	d2e4639b2a	feat(registry): add context length and update max tokens for Claude model configurations - Added `ContextLength` field with a value of 200,000 to all applicable Claude model definitions. - Standardized `MaxCompletionTokens` values across models for consistency and alignment.	2025-11-27 16:13:25 +08:00
Luis Pater	08321223c4	Merge pull request #340 from nestharus/fix/339-thinking-openai-gemini-compat fix(thinking): resolve OpenAI/Gemini compatibility for thinking model…	2025-11-27 16:03:24 +08:00
Luis Pater	7e30157590	Fixed: #354 fix(translator): add support for "xhigh" reasoning effort in OpenAI responses - Updated handling in `openai_openai-responses_request.go` to include the new "xhigh" reasoning effort level.	2025-11-27 15:59:15 +08:00
nestharus	e73cdf5cff	fix(claude): ensure max_tokens exceeds thinking budget for thinking models Fixes an issue where Claude thinking models would return 400 errors when the thinking.budget_tokens was greater than or equal to max_tokens. Changes: - Add MaxCompletionTokens: 128000 to all Claude thinking model definitions - Add ensureMaxTokensForThinking() function in claude_executor.go that: - Checks if thinking is enabled with a budget_tokens value - Looks up the model's MaxCompletionTokens from the registry - Ensures max_tokens is set to at least the model's MaxCompletionTokens - Falls back to budget_tokens + 4000 buffer if registry lookup fails This ensures Anthropic API constraint (max_tokens > thinking.budget_tokens) is always satisfied when using extended thinking features. Fixes: #339 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-26 22:31:05 -08:00
Luis Pater	39621a0340	fix(translator): normalize function calls and outputs for consistent input processing - Implemented logic to pair consecutive function calls and their outputs, ensuring proper sequencing for processing. - Adjusted `gemini_openai-responses_request.go` to normalize message structures and maintain expected flow. v6.5.26	2025-11-27 10:25:45 +08:00
Luis Pater	346b663079	fix(translator): handle non-JSON output gracefully in function call outputs - Updated handling of `output` in `gemini_openai-responses_request.go` to use `.Str` instead of `.Raw` when parsing non-JSON string outputs. - Added checks to distinguish between JSON and non-JSON `output` types for accurate `functionResponse` construction.	2025-11-27 09:40:00 +08:00
Luis Pater	0bcae68c6c	fix(translator): preserve raw JSON encoding in function call outputs - Updated handling of `output` in `gemini_openai-responses_request.go` to use `.Raw` instead of `.String` for preserving original JSON encoding. - Ensured proper setting of raw JSON output when constructing `functionResponse`. v6.5.25	2025-11-27 08:26:53 +08:00
Luis Pater	c8cee547fd	fix(translator): ensure partial content is retained while skipping encrypted thoughtSignature - Updated handling of `thoughtSignature` across all translator modules to retain other content payloads if present. - Adjusted logic for `thought_signature` and `inline_data` keys for consistent processing. v6.5.24	2025-11-27 00:52:17 +08:00
Luis Pater	36755421fe	Merge pull request #343 from router-for-me/misc style(amp): tidy whitespace in proxy module and tests	2025-11-26 19:03:07 +08:00
hkfires	6c17dbc4da	style(amp): tidy whitespace in proxy module and tests	2025-11-26 18:57:26 +08:00
Luis Pater	ee6429cc75	feat(registry): add Gemini 3 Pro Image Preview model and remove Claude Sonnet 4.5 Thinking - Added new `Gemini 3 Pro Image Preview` model with detailed metadata and configuration. - Removed outdated `Claude Sonnet 4.5 Thinking` model definition for cleanup and relevance. v6.5.23	2025-11-26 18:22:40 +08:00
Luis Pater	a4a26d978e	Fixed: #339 feat(handlers, executor): add Gemini 3 Pro Preview support and refine Claude system instructions - Added support for the new "Gemini 3 Pro Preview" action in Gemini handlers, including detailed metadata and configuration. - Removed redundant `cache_control` field from Claude system instructions for cleaner payload structure. v6.5.22	2025-11-26 11:42:57 +08:00
Luis Pater	ed9f6e897e	Fixed: #337 fix(executor): replace redundant commented code with `checkSystemInstructions` helper - Replaced commented-out `sjson.SetRawBytes` lines with the new `checkSystemInstructions` function. - Centralized system instruction handling for better code clarity and reuse. - Ensured consistent logic for managing `system` field across Claude executor flows. v6.5.21	2025-11-26 08:27:48 +08:00
Luis Pater	9c1e3c0687	Merge pull request #334 from nestharus/feat/claude-thinking-and-beta-headers feat(claude): add thinking model variants and beta headers support v6.5.20	2025-11-26 02:17:02 +08:00
Luis Pater	2e5681ea32	Merge branch 'dev' into feat/claude-thinking-and-beta-headers	2025-11-26 02:16:40 +08:00
Luis Pater	52c17f03a5	fix(executor): comment out redundant code for setting Claude system instructions - Commented out multiple instances of `sjson.SetRawBytes` for setting `system` key to Claude instructions as they are redundant. - Code cleanup to improve clarity and maintainability without affecting functionality. v6.5.19	2025-11-26 02:06:16 +08:00
nestharus	d0e694d4ed	feat(claude): add thinking model variants and beta headers support - Add Claude thinking model definitions (sonnet-4-5-thinking, opus-4-5-thinking variants) - Add Thinking support for antigravity models with -thinking suffix - Add injectThinkingConfig() for automatic thinking budget based on model suffix - Add resolveUpstreamModel() mappings for thinking variants to actual Claude models - Add extractAndRemoveBetas() to convert betas array to anthropic-beta header - Update applyClaudeHeaders() to merge custom betas from request body Closes #324	2025-11-25 03:33:05 -08:00
Luis Pater	506f1117dd	fix(handlers): refactor API response capture to append data safely - Introduced `appendAPIResponse` helper to preserve and append data to existing API responses. - Ensured newline inclusion when appending, if necessary. - Improved `nil` and data type checks for response handling. - Updated middleware to skip request logging for `GET` requests. v6.5.18	2025-11-25 11:37:02 +08:00
Luis Pater	113db3c5bf	fix(executor): update antigravity executor to enhance model metadata handling - Added additional metadata fields (`Name`, `Description`, `DisplayName`, `Version`) to `ModelInfo` struct initialization for better model representation. - Removed unnecessary whitespace in the code. v6.5.17	2025-11-25 09:19:01 +08:00
Luis Pater	1aa0b6cd11	Merge pull request #322 from ben-vargas/feat-claude-opus-4-5 feat(registry): add Claude Opus 4.5 model definition v6.5.16	2025-11-25 08:38:06 +08:00
Ben Vargas	0895533400	fix(registry): correct Claude Opus 4.5 created timestamp Update epoch from 1730419200 (2024-11-01) to 1761955200 (2025-11-01).	2025-11-24 12:27:23 -07:00
Ben Vargas	43f007c234	feat(registry): add Claude Opus 4.5 model definition Add support for claude-opus-4-5-20251101 with 200K context window and 64K max output tokens.	2025-11-24 12:26:39 -07:00
Luis Pater	0ceee56d99	Merge pull request #318 from router-for-me/log feat(logs): add limit query param to cap returned logs v6.5.15	2025-11-24 20:35:28 +08:00
hkfires	943a8c74df	feat(logs): add limit query param to cap returned logs	2025-11-24 19:59:24 +08:00

... 3 4 5 6 7 ...

982 Commits