mirror of https://github.com/microsoft/agent-framework.git synced 2026-06-16 21:04:09 +08:00

Files

T

Eduard van Valkenburg 0cd40f8354 Python: [BREAKING] Refactor middleware layering and split Anthropic raw client (#4746 )

* [BREAKING] Refactor middleware layering and raw clients

Reorder chat client layers so function invocation wraps chat middleware, and chat middleware stays outside telemetry while still running for each inner model call. Add middleware pipeline caching, refresh docs and samples, and split Anthropic into raw and public clients to match the standard layering model.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Tighten typing ignores in ancillary modules

Add targeted typing ignores in workflow visualization and lab modules so pyright stays clean alongside the middleware refactor work.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Fix categorize_middleware to unpack tuple/Sequence and use relative MRO assertions

- Broaden isinstance check in categorize_middleware from list to Sequence
  so tuples and other Sequence types are properly unpacked instead of
  being appended as a single item.
- Replace fragile hardcoded MRO index assertions in anthropic test with
  relative ordering via mro.index().
- Add regression tests for categorize_middleware with tuple, list, and
  None inputs.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Fix middleware string decomposition, add middleware param to FunctionInvocationLayer, and add tests (#4710)

- Guard categorize_middleware Sequence check against str/bytes to prevent
  character-by-character decomposition of accidentally passed strings
- Add explicit middleware parameter to FunctionInvocationLayer.get_response
  and merge it into client_kwargs before categorization, fixing the
  inconsistency where only OpenAIChatClient supported this parameter
- Add assertions that RawAnthropicClient does not inherit convenience layers
- Add chat middleware cache test with non-empty base middleware
- Add tests for single unwrapped middleware item and string input

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Apply pre-commit auto-fixes

* Apply pre-commit auto-fixes

* Address review feedback for #4710: review comment fixes

---------

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <copilot@github.com>

0cd40f8354 · 2026-03-20 00:43:37 +00:00

History

gaia

Python: [BREAKING] Refactor middleware layering and split Anthropic raw client (#4746 )

2026-03-20 00:43:37 +00:00

lightning

Python: [BREAKING] Refactor middleware layering and split Anthropic raw client (#4746 )

2026-03-20 00:43:37 +00:00

namespace/agent_framework

Python: Replace wildcard imports with explicit imports (#3908 )

2026-02-13 14:02:36 +00:00

tau2

Python: chore(python): improve dependency range automation (#4343 )

2026-03-13 12:32:37 +00:00

.gitignore

Python: Add Agent Framework Lab Lightning package with RL training examples (#937 )

2025-09-30 01:18:49 +00:00

AGENTS.md

.NET: Python: Add AGENTS.md files and update coding standards (#3644 )

2026-02-05 10:27:46 +00:00

LICENSE

Python: consolidate lab packages into a single one; update contribution guidelines (#940 )

2025-09-27 03:28:05 +00:00

pyproject.toml

Python: Simplify Python Poe tasks and unify package selectors (#4722 )

2026-03-18 18:39:11 +00:00

README.md

Python: Simplify Python Poe tasks and unify package selectors (#4722 )

2026-03-18 18:39:11 +00:00

README.md

Agent Framework Lab

This is the experimental package for Microsoft Agent Framework, agent-framework-lab, which contains various lab modules built on top of the core framework. Lab modules are not part of the core framework and may experience breaking changes or be deprecated in the future.

What are Lab Modules?

Lab modules are extensions to the core Agent Framework that fall into one of the following categories:

Incubation of new features that may get incorporated by the core framework.
Research prototypes built on the core framework.
Benchmarks and experimentation tools.

Lab Modules

gaia: Evaluate your agents using the GAIA benchmark for general assistant tasks
tau2: Evaluate your agents using the TAU2 benchmark for customer support tasks
lightning: RL training for agents using Agent Lightning

Repository Structure

agent-framework-lab/
├── pyproject.toml          # Single package configuration for agent-framework-lab
├── README.md               # This file
├── LICENSE                 # License file
├── namespace/              # Centralized namespace package files
│   └── agent_framework/
│       └── lab/
│           ├── gaia/       # Re-exports from agent_framework_lab_gaia
│           ├── lightning/  # Re-exports from agent_framework_lab_lightning
│           └── tau2/       # Re-exports from agent_framework_lab_tau2
├── gaia/                   # GAIA module implementation
│   └── agent_framework_lab_gaia/
├── lightning/              # Lightning module implementation
│   └── agent_framework_lab_lightning/
└── tau2/                   # TAU2 module implementation
    └── agent_framework_lab_tau2/

This structure maintains a single PyPI package agent-framework-lab while supporting modular imports through the namespace package mechanism.

Installation

To install each lab module, use the extras syntax with pip:

pip install "agent-framework-lab[gaia]"
pip install "agent-framework-lab[tau2]"
pip install "agent-framework-lab[lightning]"

Usage

Import and use lab modules from the agent_framework.lab namespace. For example, to use the GAIA module:

# Using GAIA module
from agent_framework.lab.gaia import GAIA

Running Tests Locally

For machine-safe local runs, prefer package-scoped commands first:

uv run --directory packages/lab poe test
uv run --directory packages/lab pytest -q -m "not integration"

When you need to run lab tests from the repository root, scope the root task to the lab package:

uv run poe test -P lab

Lightning observability tests intentionally exercise heavier tracing paths and are marked as resource_intensive:

uv run --directory packages/lab pytest lightning/tests/test_lightning.py -m "resource_intensive" -q

Should I consume Lab Modules?

If you are looking for stable and production-ready features, you should not use lab modules. Stick to the core framework.

If you are looking for experimentation, research, or want to benchmark different approaches -- most importantly, if you don't mind breaking changes and potential deprecations -- then lab modules are for you.

Contributing to Lab Modules

Microsoft-maintained modules

For Microsoft-maintained modules in this repository, please follow standard contribution guidelines and submit pull requests directly to this repository.

Community modules

If you want to contribute a community-maintained lab module:

Create a new repository on GitHub for your module
Tag your repository with agent-framework-lab for discoverability
Submit a PR to add a link to your repository in the Lab Modules section above
Use the PR title format: [New Lab Module] Your Module Name

We will review your submission based on the guidelines below.

Guidelines

Purpose: Community modules should fit into one of the three categories of lab modules (incubation, research, benchmarks)
Namespace: Community modules should avoid the agent_framework.lab namespace (reserved for modules maintained in this repository)
Dependencies: Minimize external dependencies, always include agent-framework as a base dependency
Documentation: Include comprehensive README with installation instructions and usage examples
Tests: Write comprehensive tests with good coverage
Type hints: Always include type hints and a py.typed file
Versioning: Use semantic versioning, start with 0.1.0 for initial releases