Commit Graph

785 Commits

  • feat(pricing): add GPT-5 Codex model pricing presets
    Add pricing configuration for GPT-5 Codex variants to support cost tracking for Codex-specific models.
    
    Changes:
    - Add gpt-5-codex model with standard GPT-5 pricing
    - Add gpt-5-1-codex model with standard GPT-5.1 pricing
    - Input: $1.25/M tokens, Output: $10/M tokens
    - Cache read: $0.125/M tokens, Cache creation: $0
    
    This ensures accurate cost calculation for Codex API requests using GPT-5 Codex models.
  • fix: add fallback for crypto.randomUUID() on older WebViews
    crypto.randomUUID() is not available on older systems (macOS < 12.3,
    Safari < 15.4). This caused "crypto.randomUUID is not a function"
    error when adding providers.
    
    - Add generateUUID() utility with getRandomValues() fallback
    - Provide user-friendly error message if crypto API unavailable
  • fix(ui): restore card borders in usage statistics panels
    Restore proper card styling for ModelTestConfigPanel and PricingConfigPanel by adding back border and rounded-lg classes. The transparent background styling was causing visual inconsistency.
    
    Changes:
    - Replace border-none bg-transparent shadow-none with border rounded-lg
    - Apply to both loading and error states for consistency
    - Format TypeScript code for better readability
    - Break long function signatures across multiple lines
    
    This ensures the usage statistics panels have consistent visual appearance with proper borders and rounded corners.
  • style(rust): apply clippy formatting suggestions
    Apply automatic clippy fixes for uninlined_format_args warnings across Rust codebase. Replace format string placeholders with inline variable syntax for improved readability.
    
    Changes:
    - Convert format!("{}", var) to format!("{var}")
    - Apply to model_test.rs, parser.rs, and usage_stats.rs
    - Fix line length issues by breaking long function calls
    - Improve code formatting consistency
    
    All changes are automatic formatting with no functional impact.
  • fix(pricing): standardize model ID format for pricing lookup
    Normalize model IDs by removing vendor prefixes and converting dots to hyphens to ensure consistent pricing lookups across different API response formats.
    
    Changes:
    - Update seed data to use hyphen format (e.g., gpt-5-1, gemini-2-5-pro)
    - Add normalize_model_id() function to strip vendor prefixes (anthropic/, openai/)
    - Convert dots to hyphens in model IDs (claude-haiku-4.5 → claude-haiku-4-5)
    - Try both original and normalized IDs for exact matching
    - Use normalized ID for suffix-based fallback matching
    - Add comprehensive test cases for prefix and dot handling
    - Add warning log when no pricing found
    
    This ensures pricing lookups work correctly for:
    - Models with vendor prefixes: anthropic/claude-haiku-4.5
    - Models with dots in version: claude-sonnet-4.5
    - Models with date suffixes: claude-haiku-4-5-20240229
  • fix(proxy): resolve token parsing for OpenRouter streaming responses
    Problem:
    - OpenRouter and similar third-party services return streaming responses
      where input_tokens appear in message_delta instead of message_start
    - The previous implementation only extracted input_tokens from message_start,
      causing input_tokens to be recorded as 0 for these providers
    
    Changes:
    - streaming.rs: Add prompt_tokens field to Usage struct and include
      input_tokens in the transformed message_delta event when converting
      OpenAI format to Anthropic format
    - parser.rs: Update from_claude_stream_events() to handle input_tokens
      from both message_start (native Claude API) and message_delta (OpenRouter)
      - Use if-let pattern instead of direct unwrap for safer parsing
      - Only update input_tokens from message_delta if not already set
    - logger.rs: Adjust test parameters to match updated function signature
    
    Tests:
    - Add test_openrouter_stream_parsing() for OpenRouter format validation
    - Add test_native_claude_stream_parsing() for native Claude API validation
  • feat(model-test): add provider model availability testing
    Implement standalone model testing feature to verify provider API connectivity:
    - Add ModelTestService for Claude/Codex/Gemini endpoint testing
    - Create model_test_logs table for test result persistence
    - Add test button to ProviderCard with loading state
    - Include ModelTestConfigPanel for customizing test parameters
  • Merge branch 'main' into feat/proxy-server
    # Conflicts:
    #	src/components/settings/SettingsPage.tsx
  • style(config): format mcpPresets code style
    Apply consistent formatting to createNpxCommand function and
    sequential-thinking server configuration.
  • refactor(ui): enhance RequestLogTable with filtering and pagination
    UI improvements:
    - Add filter bar with app type, provider name, model, status selectors
    - Add date range picker (startDate/endDate)
    - Add search/reset/refresh buttons
    
    Pagination:
    - Implement proper page-based pagination with page info display
    - Show total count and current page range
    - Add prev/next navigation buttons
    
    Features:
    - Default to last 24 hours filter
    - Streamlined table columns layout
    - Query invalidation on refresh
  • feat(frontend): update usage types and API for pagination support
    Types (usage.ts):
    - Add isStreaming, firstTokenMs, durationMs to RequestLog
    - Add PaginatedLogs interface with data, total, page, pageSize
    - Change LogFilters: providerId -> appType + providerName
    
    API (usage.ts):
    - Change getRequestLogs params from limit/offset to page/pageSize
    - Return PaginatedLogs instead of RequestLog[]
    - Pass filters object directly to backend
    
    Query (usage.ts):
    - Update usageKeys.logs key generation for pagination
    - Update useRequestLogs hook signature
  • feat(services): add pagination and enhanced filtering for request logs
    Usage stats service:
    - Change get_request_logs() from limit/offset to page/page_size pagination
    - Return PaginatedLogs with total count, page, and page_size
    - Add appType and providerName filters with LIKE search
    - Add is_streaming, first_token_ms, duration_ms to RequestLogDetail
    - Join with providers table for provider name lookup
    
    Commands:
    - Update get_request_logs command signature for pagination params
    
    Module exports:
    - Export PaginatedLogs struct
  • feat(proxy): enhance proxy server with session tracking and OpenAI route
    Error handling:
    - Add StreamIdleTimeout and AuthError variants for better error classification
    
    Module exports:
    - Export ResponseType, StreamHandler, NonStreamHandler from response_handler
    - Export ProxySession, ClientFormat from session module
    
    Server routing:
    - Add /v1/chat/completions route for OpenAI Chat Completions API
    
    Handlers:
    - Add log_usage_with_session() for enhanced usage tracking with session context
    - Add first_token_ms timing measurement for streaming responses
    - Use SseUsageCollector with start_time for accurate latency calculation
    - Track is_streaming flag in usage logs
  • feat(proxy/usage): enhance token parser and logger for multi-format support
    Parser enhancements:
    - Add OpenAI Chat Completions format parsing (prompt_tokens/completion_tokens)
    - Add model field to TokenUsage for actual model name extraction
    - Add from_codex_response_adjusted() for proper cache token handling
    - Add debug logging for better stream event tracing
    
    Logger enhancements:
    - Add first_token_ms, provider_type, is_streaming, cost_multiplier fields
    - Extend RequestLog struct with full metadata tracking
    - Update log_with_calculation() signature for new fields
    
    Calculator: Update tests with model field in TokenUsage.
  • feat(database): extend schema with streaming and timing fields
    Add new columns to proxy_request_logs table for enhanced usage tracking:
    - first_token_ms and duration_ms for performance metrics
    - provider_type and is_streaming for request classification
    - cost_multiplier for flexible pricing
    
    Update model pricing with accurate rates for Claude/GPT/Gemini models.
    Add ensure_model_pricing_seeded() call on database initialization.
    Add test for model pricing auto-seeding verification.
  • feat(proxy): add ProviderType enum for fine-grained provider detection
    Introduce ProviderType enum to distinguish between different provider
    implementations (Claude, ClaudeAuth, Codex, Gemini, GeminiCli, OpenRouter).
    This enables proper authentication handling and request transformation
    based on the actual provider type rather than just AppType.
    
    - Add ProviderType enum with detection logic from config
    - Enhance Claude adapter with OpenRouter detection
    - Enhance Gemini adapter with CLI mode detection
    - Add helper methods for provider type inference
  • deps: add recharts for charts and rust_decimal/uuid for usage tracking
    - recharts: Chart visualization for usage trends
    - rust_decimal: Precise cost calculations
    - uuid: Request ID generation
  • fix(ui): integrate usage dashboard and fix type errors
    Add usage dashboard tab to settings page.
    Fix UsageScriptModal TypeScript type annotations.
  • feat(ui): add usage dashboard components
    Add UsageDashboard with summary cards, trend chart, and data tables.
    Implement model pricing configuration panel.
    Add request log viewer with filtering and detail panel.
  • feat(api): add frontend usage API and query hooks
    Add TypeScript types for usage statistics.
    Implement usage API with Tauri invoke calls.
    Add TanStack Query hooks for usage data fetching.
  • feat(commands): add usage statistics Tauri commands
    Register usage commands for summary, trends, logs, and pricing.
    Expose usage stats service through Tauri command layer.
  • feat(proxy): integrate usage logging into request handlers
    Add usage logging to forwarder and streaming handlers.
    Track token usage and costs for each proxy request.
  • feat(proxy): implement usage tracking subsystem
    Add request logger with automatic cost calculation.
    Implement token parser for Claude/OpenAI/Gemini responses.
    Add cost calculator based on model pricing configuration.
  • feat(db): add usage tracking schema and types
    Add database tables for proxy request logs and model pricing.
    Extend Provider and error types to support usage statistics.
  • fix(mcp): use browser-compatible platform detection for MCP presets
    Replace Node.js process.platform check with navigator.userAgent-based
    detection from @/lib/platform. The previous implementation using
    process.platform failed in Tauri's browser environment, causing
    Windows platform to always return false and not apply the required
    'cmd /c' wrapper for npx commands.
    
    This fix ensures that on Windows, MCP presets like sequential-thinking
    correctly use 'cmd /c npx' format instead of direct 'npx', eliminating
    the warning: "Windows requires 'cmd /c' wrapper to execute npx"
    
    Changes:
    - Import isWindows() from @/lib/platform instead of defining locally
    - Remove incorrect process.platform-based detection
    - Now properly detects Windows in browser/WebView environment
  • feat(proxy): add streaming SSE transform and thinking parameter support
    New features:
    - Add OpenAI → Anthropic SSE streaming response transformation
    - Support thinking parameter detection for reasoning model selection
    - Add ANTHROPIC_REASONING_MODEL config option for extended thinking
    
    Changes:
    - streaming.rs: Implement SSE event parsing and Anthropic format conversion
    - transform.rs: Add thinking detection logic and reasoning model mapping
    - handlers.rs: Integrate streaming transform for OpenRouter compatibility
    - Cargo.toml: Add async-stream and bytes dependencies
  • fix(ui): prevent provider card hover scale from being clipped
    Move 1px of horizontal padding from outer container to inner scroll
    container, providing buffer space for the hover scale-[1.01] effect
    without being cut off by overflow-hidden.
  • feat(proxy): implement provider adapter pattern with OpenRouter support
    This major refactoring introduces a modular provider adapter architecture
    to support format transformation between different AI API formats.
    
    New features:
    - Add ProviderAdapter trait for unified provider abstraction
    - Implement Claude, Codex, and Gemini adapters with specific logic
    - Add Anthropic ↔ OpenAI format transformation for OpenRouter compatibility
    - Support model mapping from provider configuration (ANTHROPIC_MODEL, etc.)
    - Add OpenRouter preset to Claude provider presets
    
    Refactoring:
    - Extract authentication logic into auth.rs with AuthInfo and AuthStrategy
    - Move URL building and request transformation to individual adapters
    - Simplify ProviderRouter to only use proxy target providers
    - Refactor RequestForwarder to use adapter-based request/response handling
    - Use whitelist mode for header forwarding (only pass necessary headers)
    
    Architecture:
    - providers/adapter.rs: ProviderAdapter trait definition
    - providers/auth.rs: AuthInfo, AuthStrategy types
    - providers/claude.rs: Claude adapter with OpenRouter detection
    - providers/codex.rs: Codex (OpenAI) adapter
    - providers/gemini.rs: Gemini (Google) adapter
    - providers/models/: Anthropic and OpenAI API data models
    - providers/transform.rs: Bidirectional format transformation
  • fix(proxy): resolve clippy warnings for dead code and uninlined format args
    - Add #[allow(dead_code)] to unused ProviderUnhealthy variant
    - Inline format string arguments in handlers.rs and codex.rs log macros
    - Refactor error response handling to properly pass through upstream errors
    - Add URL deduplication logic for /v1/v1 paths in CodexAdapter
  • Merge branch 'main' into feat/proxy-server
    # Conflicts:
    #	src/App.tsx
  • refactor(proxy): remove unused request handlers and routes
    - Remove unused GET/DELETE request forwarding methods
    - Remove count_tokens, get/delete response handlers
    - Simplify router by removing unused endpoints
    - Keep only essential routes: /v1/messages, /v1/responses, /v1beta/*
  • fix(proxy): improve URL building and Gemini request handling
    - Refactor URL construction with version path deduplication (/v1, /v1beta)
    - Preserve query parameters for Gemini API requests
    - Support GOOGLE_GEMINI_API_KEY field name (with fallback)
    - Change default proxy port from 5000 to 15721
    - Fix test: use Option type for is_proxy_target field
  • fix(ui): add independent scroll containers to fix scroll wheel on Linux
    Providers page was using DndContext which may interfere with scroll wheel
    events on Linux/Ubuntu WebKitGTK. Added independent scroll containers
    with `overflow-y-auto` to all main pages, matching the pattern already
    used by the MCP panel which works correctly.
    
    Changes:
    - App.tsx: Wrap ProviderList in independent scroll container
    - SkillsPage: Use consistent h-[calc(100vh-8rem)] layout
    - SettingsPage: Add overflow-hidden and overflow-x-hidden for consistency
  • fix(config): update Codex default model and correct MiniMax URL
    - Update Codex preset model from gpt-5-codex to gpt-5.1-codex
    - Fix MiniMax English preset URL from minimaxi.io to minimax.io
  • fix(proxy): resolve UI/UX issues and database constraint error
    Simplify proxy control interface and fix database persistence issues:
    
    Backend Fixes:
    - Fix NOT NULL constraint error in proxy_config.created_at field
      * Use COALESCE to preserve created_at on updates
      * Ensure proper INSERT OR REPLACE behavior
    - Remove redundant enabled field validation on startup
      * Auto-enable when user clicks start button
      * Persist enabled state after successful start
    - Preserve enabled state during config updates
      * Prevent accidental service shutdown on config save
    
    Frontend Improvements:
    - Remove duplicate proxy enable switch from settings dialog
      * Keep only runtime toggle in ProxyPanel
      * Simplify user experience with single control point
    - Hide proxy target button when proxy service is stopped
      * Add isProxyRunning prop to ProviderCard
      * Conditionally render proxy controls based on service status
    - Update form schema to omit enabled field
      * Managed automatically by backend
    
    Files: 5 changed, 81 insertions(+), 94 deletions(-)
  • feat(proxy): implement local HTTP proxy server with multi-provider failover
    Add a complete HTTP proxy server implementation built on Axum framework,
    enabling local API request forwarding with automatic provider failover
    and load balancing capabilities.
    
    Backend Implementation (Rust):
    - Add proxy server module with 7 core components:
      * server.rs: Axum HTTP server lifecycle management (start/stop/status)
      * router.rs: API routing configuration for Claude/OpenAI/Gemini endpoints
      * handlers.rs: Request/response handling and transformation
      * forwarder.rs: Upstream forwarding logic with retry mechanism (652 lines)
      * error.rs: Comprehensive error handling and HTTP status mapping
      * types.rs: Shared types (ProxyConfig, ProxyStatus, ProxyServerInfo)
      * health.rs: Provider health check infrastructure
    
    Service Layer:
    - Add ProxyService (services/proxy.rs, 157 lines):
      * Manage proxy server lifecycle
      * Handle configuration updates
      * Track runtime status and metrics
    
    Database Layer:
    - Add proxy configuration DAO (dao/proxy.rs, 242 lines):
      * Persist proxy settings (listen address, port, timeout)
      * Store provider priority and availability flags
    - Update schema with proxy_config table (schema.rs):
      * Support runtime configuration persistence
    
    Tauri Commands:
    - Add 6 command endpoints (commands/proxy.rs):
      * start_proxy_server: Launch proxy server
      * stop_proxy_server: Gracefully shutdown server
      * get_proxy_status: Query runtime status
      * get_proxy_config: Retrieve current configuration
      * update_proxy_config: Modify settings without restart
      * is_proxy_running: Check server state
    
    Frontend Implementation (React + TypeScript):
    - Add ProxyPanel component (222 lines):
      * Real-time server status display
      * Start/stop controls
      * Provider availability monitoring
    - Add ProxySettingsDialog component (420 lines):
      * Configuration editor (address, port, timeout)
      * Provider priority management
      * Settings validation
    - Add React hooks:
      * useProxyConfig: Manage proxy configuration state
      * useProxyStatus: Poll and display server status
    - Add TypeScript types (types/proxy.ts):
      * Define ProxyConfig, ProxyStatus interfaces
    
    Provider Integration:
    - Extend Provider model with availability field (providers.rs):
      * Track provider health for failover logic
    - Update ProviderCard UI to display proxy status
    - Integrate proxy controls in Settings page
    
    Dependencies:
    - Add Axum 0.7 (async web framework)
    - Add Tower 0.4 (middleware and service abstractions)
    - Add Tower-HTTP (CORS layer)
    - Add Tokio sync primitives (oneshot, RwLock)
    
    Technical Details:
    - Graceful shutdown via oneshot channel
    - Shared state with Arc<RwLock<T>> for thread-safe config updates
    - CORS enabled for cross-origin frontend access
    - Request/response streaming support
    - Automatic retry with exponential backoff (forwarder)
    - API key extraction from multiple config formats (Claude/Codex/Gemini)
    
    File Statistics:
    - 41 files changed
    - 3491 insertions(+), 41 deletions(-)
    - Core modules: 1393 lines (server + forwarder + handlers)
    - Frontend UI: 642 lines (ProxyPanel + ProxySettingsDialog)
    - Database/DAO: 326 lines
    
    This implementation provides the foundation for advanced features like:
    - Multi-provider load balancing
    - Automatic failover on provider errors
    - Request logging and analytics
    - Usage tracking and cost monitoring
  • style: apply code formatting fixes
    - Fix Prettier formatting in claudeProviderPresets.ts
    - Fix cargo fmt trailing blank line in provider/mod.rs
  • fix(config): correct MiniMax apiKeyUrl to minimaxi.com
    Update apiKeyUrl domain to match official website URL.
    Also update screenshots for all locales (en, ja, zh).