CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-02-18 04:10:51 +08:00

Author	SHA1	Message	Date
이대희	24b4bee500	Merge remote-tracking branch 'upstream/main' into feature/ampcode-alias	2026-02-02 12:09:31 +09:00
Luis Pater	b927b0cc6c	Merge branch 'dev' into codex	2026-02-01 20:20:49 +08:00
이대희	9299897e04	Implements unified model routing Migrates the AMP module to a new unified routing system, replacing the fallback handler with a router-based approach. This change introduces a `ModelRoutingWrapper` that handles model extraction, routing decisions, and proxying based on provider availability and model mappings. It provides a more flexible and maintainable routing mechanism by centralizing routing logic. The changes include: - Introducing new `routing` package with core routing logic. - Creating characterization tests to capture existing behavior. - Implementing model extraction and rewriting. - Updating AMP module routes to utilize the new routing wrapper. - Deprecating `FallbackHandler` in favor of the new `ModelRoutingWrapper`.	2026-02-01 16:58:32 +09:00
이대희	527a269799	Refactors AMP model mapping and error handling Improves AMP request handling by consolidating model mapping logic into a helper function for better readability and maintainability. Enhances error handling for premature client connection closures during reverse proxy operations by explicitly acknowledging and swallowing the ErrAbortHandler panic, preventing noisy stack traces. Removes unused method `findProviderViaOAuthAlias` from the `DefaultModelMapper`.	2026-02-01 15:56:31 +09:00
이대희	2fe0b6cd2d	Refactors context keys for model routing Uses centralized context keys for accessing mapped and fallback models. This change deprecates the string-based context keys used in the AMP fallback handlers in favor of the `ctxkeys` package, promoting consistency and reducing the risk of typos. The authentication conductor now retrieves fallback models using the shared `ctxkeys` constants.	2026-02-01 15:50:45 +09:00
이대희	eeb1812d60	Merge remote-tracking branch 'upstream/main' into feature/ampcode-alias	2026-02-01 15:43:16 +09:00
hkfires	ac802a4646	refactor(codex): remove codex instructions injection support	2026-02-01 14:33:31 +08:00
hkfires	4649cadcb5	refactor(api): centralize config change logging	2026-02-01 11:31:44 +08:00
sususu98	6db8d2a28e	feat(logging): make error-logs-max-files configurable - Add ErrorLogsMaxFiles config field with default value 10 - Support hot-reload via config file changes - Add Management API: GET/PUT/PATCH /v0/management/error-logs-max-files - Maintain SDK backward compatibility with NewFileRequestLogger (3 params) - Add NewFileRequestLoggerWithOptions for custom error log retention When request logging is disabled, forced error logs are retained up to the configured limit. Set to 0 to disable cleanup.	2026-01-31 17:48:40 +08:00
이대희	adedb16d35	fix(amp): update fallback_handlers_test.go for provider registration Amp-Thread-ID: https://ampcode.com/threads/T-019c0f77-82b6-711c-9172-092bd2a2059d Co-authored-by: Amp <amp@ampcode.com>	2026-01-31 13:55:44 +08:00
이대희	89907231c1	feat(routing): implement unified model routing with OAuth and API key providers - Added a new routing package to manage provider registration and model resolution. - Introduced Router, Executor, and Provider interfaces to handle different provider types. - Implemented OAuthProvider and APIKeyProvider to support OAuth and API key authentication. - Enhanced DefaultModelMapper to include OAuth model alias handling and fallback mechanisms. - Updated context management in API handlers to preserve fallback models. - Added tests for routing logic and provider selection. - Enhanced Claude request conversion to handle reasoning content based on thinking mode.	2026-01-31 13:55:43 +08:00
이대희	09044e8ccc	feature(ampcode): Improves AMP model mapping with alias support Enhances the AMP model mapping functionality to support fallback mechanisms using . This change allows the system to attempt alternative models (aliases) if the primary mapped model fails due to issues like quota exhaustion. It updates the model mapper to load and utilize the configuration, enabling provider lookup via aliases. It also introduces context keys to pass fallback model names between handlers. Additionally, this change introduces a fix to prevent ReverseProxy from panicking by swallowing ErrAbortHandler panics. Amp-Thread-ID: https://ampcode.com/threads/T-019c0cd1-9e59-722b-83f0-e0582aba6914 Co-authored-by: Amp <amp@ampcode.com>	2026-01-31 13:55:43 +08:00
Luis Pater	f887f9985d	Merge pull request #1248 from shekohex/feat/responses-compact feat(openai): add responses/compact support	2026-01-31 03:12:55 +08:00
sususu98	295f34d7f0	fix(logging): capture streaming TTFB on first chunk and make timestamps required - Add firstChunkTimestamp field to ResponseWriterWrapper for sync capture - Capture TTFB in Write() and WriteString() before async channel send - Add SetFirstChunkTimestamp() to StreamingLogWriter interface - Make requestTimestamp/apiResponseTimestamp required in LogRequest() - Remove timestamp capture from WriteAPIResponse() (now via setter) - Fix Gemini handler to set API_RESPONSE_TIMESTAMP before writing response This ensures accurate TTFB measurement for all streaming API formats (OpenAI, Gemini, Claude) by capturing timestamp synchronously when the first response chunk arrives, not when the stream finalizes.	2026-01-29 22:32:24 +08:00
sususu98	c41ce77eea	fix(logging): add API response timestamp and fix request timestamp timing Previously: - REQUEST INFO timestamp was captured at log write time (not request arrival) - API RESPONSE had NO timestamp at all This fix: - Captures REQUEST INFO timestamp when request first arrives - Adds API RESPONSE timestamp when upstream response arrives Changes: - Add Timestamp field to RequestInfo, set at middleware initialization - Set API_RESPONSE_TIMESTAMP in appendAPIResponse() and gemini handler - Pass timestamps through logging chain to writeNonStreamingLog() - Add timestamp output to API RESPONSE section This enables accurate measurement of backend response latency in error logs.	2026-01-29 22:22:18 +08:00
hkfires	8510fc313e	fix(api): update amp module only on config changes	2026-01-29 09:28:49 +08:00
hkfires	d18cd217e1	feat(api): add management model definitions endpoint	2026-01-27 18:33:12 +08:00
Shady Khalifa	95096bc3fc	feat(openai): add responses/compact support	2026-01-26 16:36:01 +02:00
hkfires	e95be10485	fix(auth): validate antigravity token userinfo email	2026-01-24 08:33:52 +08:00
hkfires	f3d58fa0ce	fix(auth): correct antigravity oauth redirect and expiry	2026-01-24 08:33:52 +08:00
hkfires	8c0eaa1f71	refactor(auth): export Gemini constants and use in handler	2026-01-24 08:33:52 +08:00
hkfires	405df58f72	refactor(auth): export Codex constants and slim down handler	2026-01-24 08:33:52 +08:00
hkfires	e7f13aa008	refactor(api): slim down RequestAnthropicToken to use internal/auth	2026-01-24 08:33:51 +08:00
hkfires	9aa5344c29	refactor(api): slim down RequestAntigravityToken to use internal/auth	2026-01-24 08:33:51 +08:00
hkfires	4a4dfaa910	refactor(auth): replace sanitizeAntigravityFileName with antigravity.CredentialFileName	2026-01-24 08:33:51 +08:00
Chén Mù	19b4ef33e0	Merge pull request #1102 from aldinokemal/main feat(management): add PATCH endpoint to enable/disable auth files	2026-01-23 09:05:24 +08:00
Luis Pater	9823dc35e1	feat(auth): hash account ID for improved uniqueness in credential filenames	2026-01-20 11:37:52 +08:00
Luis Pater	1fef90ff58	Merge pull request #877 from zhiqing0205/main feat(codex): include plan type in auth filename	2026-01-20 11:11:25 +08:00
Luis Pater	28726632a9	Merge pull request #861 from umairimtiaz9/fix/gemini-cli-backend-project-id fix(auth): use backend project ID for free tier Gemini CLI OAuth users	2026-01-20 10:32:17 +08:00
Aldino Kemal	2f6004d74a	perf(management): optimize auth lookup in PatchAuthFileStatus Use GetByID() for O(1) map lookup first, falling back to iteration only for FileName matching. Consistent with pattern in disableAuth().	2026-01-19 20:05:37 +07:00
Aldino Kemal	a1634909e8	feat(management): add PATCH endpoint to enable/disable auth files Add new PATCH /v0/management/auth-files/status endpoint that allows toggling the disabled state of auth files without deleting them. This enables users to temporarily disable credentials from the management UI.	2026-01-19 19:50:36 +07:00
Luis Pater	99c7abbbf1	Merge pull request #1067 from router-for-me/auth-files refactor(auth): simplify filename prefixes for qwen and iflow tokens	2026-01-18 13:41:59 +08:00
Luis Pater	62e2b672d9	refactor(logging): centralize log directory resolution logic - Introduced `ResolveLogDirectory` function in `logging` package to standardize log directory determination across components. - Replaced redundant logic in `server`, `global_logger`, and `handlers` with the new utility function.	2026-01-18 12:40:57 +08:00
hkfires	109cffc010	refactor(auth): simplify filename prefixes for qwen and iflow tokens	2026-01-17 12:20:58 +08:00
hkfires	48cba39a12	feat(codex): add config toggle for codex instructions injection	2026-01-16 12:30:12 +08:00
hkfires	fe5b3c80cb	refactor(config): rename oauth-model-mappings to oauth-model-alias	2026-01-15 18:03:26 +08:00
hkfires	0b06d637e7	refactor: improve thinking logic	2026-01-15 13:06:39 +08:00
hkfires	6494330c6b	feat(codex): add subscription date fields to ID token claims	2026-01-10 11:15:20 +08:00
Luis Pater	95f87d5669	Merge pull request #947 from pykancha/fix-memory-leak Resolve memory leaks causing OOM in k8s deployment	2026-01-10 00:40:47 +08:00
hemanta212	47dacce6ea	fix(server): resolve memory leaks causing OOM in k8s deployment - usage/logger_plugin: cap modelStats.Details at 1000 entries per model - cache/signature_cache: add background cleanup for expired sessions (10 min) - management/handler: add background cleanup for stale IP rate-limit entries (1 hr) - executor/cache_helpers: add mutex protection and TTL cleanup for codexCacheMap (15 min) - executor/codex_executor: use thread-safe cache accessors Add reproduction tests demonstrating leak behavior before/after fixes. Amp-Thread-ID: https://ampcode.com/threads/T-019ba0fc-1d7b-7338-8e1d-ca0520412777 Co-authored-by: Amp <amp@ampcode.com>	2026-01-09 13:33:46 +05:45
Luis Pater	ed28b71e87	refactor(amp): remove duplicate comments in response rewriter	2026-01-09 08:21:13 +08:00
Luis Pater	af2efa6f7e	Merge pull request #605 from soilSpoon/feature/amp-compat feature: Improves Amp client compatibility	2026-01-09 04:28:17 +08:00
LTbinglingfeng	5e5d8142f9	fix(auth): error when antigravity refresh token missing during refresh	2026-01-07 01:09:50 +08:00
LTbinglingfeng	b01619b441	fix(management): refresh antigravity token for api-call $TOKEN$	2026-01-07 00:14:02 +08:00
zhiqing0205	ac3ca0ad8e	feat(codex): include plan type in auth filename	2026-01-06 02:25:56 +08:00
CodeIgnitor	52760a4eaa	fix(auth): use backend project ID for free tier Gemini CLI OAuth users Fixes issue where free tier users cannot access Gemini 3 preview models due to frontend/backend project ID mapping. ## Problem Google's Gemini API uses a frontend/backend project mapping system for free tier users: - Frontend projects (e.g., gen-lang-client-) are user-visible - Backend projects (e.g., mystical-victor-) host actual API access - Only backend projects have access to preview models (gemini-3-) Previously, CLIProxyAPI ignored the backend project ID returned by Google's onboarding API and kept using the frontend ID, preventing access to preview models. ## Solution ### CLI (internal/cmd/login.go) - Detect free tier users (gen-lang-client- projects or FREE/LEGACY tier) - Show interactive prompt allowing users to choose frontend or backend - Default to backend (recommended for preview model access) - Pro users: maintain original behavior (keep frontend ID) ### Web UI (internal/api/handlers/management/auth_files.go) - Detect free tier users using same logic - Automatically use backend project ID (recommended choice) - Pro users: maintain original behavior (keep frontend ID) ### Deduplication (internal/cmd/login.go) - Add deduplication when user selects ALL projects - Prevents redundant API calls when multiple frontend projects map to same backend - Skips duplicate project IDs in activation loop ## Impact - Free tier users: Can now access gemini-3-pro-preview and gemini-3-flash-preview models - Pro users: No change in behavior (backward compatible) - Only affects Gemini CLI OAuth (not antigravity or API key auth) ## Testing - Tested with free tier account selecting single project - Tested with free tier account selecting ALL projects - Verified deduplication prevents redundant onboarding calls - Confirmed pro user behavior unchanged	2026-01-05 02:41:24 +05:00
Supra4E8C	cd22c849e2	feat(management): update the cleanup logic for OAuth model mappings to improve data safety	2026-01-04 17:57:34 +08:00
Supra4E8C	f0e73efda2	feat(management): add vertex api key and oauth model mappings endpoints	2026-01-04 17:32:00 +08:00
Supra4E8C	3156109c71	feat(management): support adjusting log size, forced prefix, and routing strategy via the management API	2026-01-04 12:21:49 +08:00
Luis Pater	ebec293497	feat(api): integrate `TokenStore` for improved auth entry management Replaced file-based auth entry counting with `TokenStore`-backed implementation, enhancing flexibility and context-aware token management. Updated related logic to reflect this change.	2026-01-03 04:53:47 +08:00

304 Commits