Files
agent-framework/dotnet/tests/Foundry.Hosting.IntegrationTests
T
Ben Thomas 03e14ca187 .NET: Update hosted agents (#6243)
* Updating to latest Foundry hosting packages.

* Re-applying .gitignore.

* Adding empty line at end of .gitignore

---------

Co-authored-by: Ben Thomas <25218250+alliscode@users.noreply.github.com>
03e14ca187 ยท 2026-06-01 21:27:29 +00:00
History
..

Foundry.Hosting.IntegrationTests

Integration tests for Microsoft.Agents.AI.Foundry.Hosting against real Foundry hosted agents.

How it works

Each test class is bound to a scenario fixture (e.g. HappyPathHostedAgentFixture, ToolCallingHostedAgentFixture). On InitializeAsync the fixture:

  1. Reads AZURE_AI_PROJECT_ENDPOINT and IT_HOSTED_AGENT_IMAGE from the environment.
  2. Targets a stable, scenario keyed agent name (e.g. it-happy-path). The agent is provisioned out of band by scripts/it-bootstrap-agents.ps1; tests only manage versions.
  3. Calls AgentAdministrationClient.CreateAgentVersionAsync with a HostedAgentDefinition that points at the image, sets IT_SCENARIO=<scenario> in the container env vars, and adds a per-run IT_RUN_ID so each run gets a fresh content-addressed version (Foundry deduplicates versions by definition hash).
  4. Polls until the agent reports AgentVersionStatus.Active (timeout: 5 minutes).
  5. Patches the agent endpoint with AgentEndpointConfig (Responses protocol, version selector pointing 100% at the new version).
  6. Builds a per-agent ProjectOpenAIClient with AgentName set on the options (this selects the /agents/{name}/endpoint/protocols/openai URL suffix; the cached projectClient.ProjectOpenAIClient cannot serve a hosted agent), wraps the ProjectResponsesClient as an AIAgent, and exposes it via Agent.

On DisposeAsync only the version created by this fixture is deleted. The agent itself is intentionally never deleted, because its managed identity must hold the pre-granted Azure AI User role on the project scope for inbound inference to succeed.

The container image is the same for every scenario. The IT_SCENARIO env var, set on the agent definition by each fixture, drives a switch in the test container's Program.cs to wire up the scenario specific behavior (tools, toolbox, custom storage, etc.).

Required environment variables

Variable Source Purpose
AZURE_AI_PROJECT_ENDPOINT Foundry project Where to provision the agent. Must be in a region that has the Hosted Agents preview enabled (e.g. East US 2).
AZURE_AI_MODEL_DEPLOYMENT_NAME Foundry project Model the agent uses. Defaults to gpt-4o inside the container.
IT_HOSTED_AGENT_IMAGE scripts/it-build-image.ps1 ACR image reference the agent points at.
AZURE_SEARCH_ENDPOINT Pre-provisioned Azure AI Search service Endpoint for the azure-search-rag scenario. The index it points at must already exist with the schema and content described under Azure AI Search index prerequisite below.
AZURE_SEARCH_INDEX_NAME Pre-provisioned Azure AI Search service Name of the pre-seeded index for the azure-search-rag scenario.

One-time bootstrap (per Foundry project)

Hosted agent invocation requires the agent's own managed identity to hold the Azure AI User role on the project scope. Because each agent's MI is created when the agent is first provisioned (and recycled on agent delete), the bootstrap creates the six stable scenario agents once and grants the role to each MI. The fixture then only manages versions under those existing agents, so the role grants survive across runs.

./scripts/it-bootstrap-agents.ps1 `
    -ProjectEndpoint "https://<account>.services.ai.azure.com/api/projects/<project>" `
    -Image "<acr>.azurecr.io/foundry-hosting-it:<tag>"

The script is idempotent. It requires Owner or User Access Administrator on the project scope (RBAC writes). Wait ~3 minutes after first-time grants for AAD propagation before running the tests.

Per-scenario data-plane RBAC (manual, one time per agent)

The bootstrap script grants only Azure AI User on the Foundry project scope, which is what every hosted agent needs to receive inbound inference traffic. Scenarios that read from external data services need an additional grant on that service to the agent's managed identity. Today only the azure-search-rag scenario falls into this category.

For it-azure-search-rag, after the first bootstrap run, grant Search Index Data Reader on the Azure AI Search service to the agent's managed identity:

# 1. Get the agent MI principal id
$tok = az account get-access-token --resource "https://ai.azure.com" --query accessToken -o tsv
$agent = Invoke-RestMethod `
    -Headers @{Authorization="Bearer $tok"; "Foundry-Features"="HostedAgents=V1Preview"} `
    -Uri "<project-endpoint>/agents/it-azure-search-rag?api-version=v1"
$mi = $agent.versions.latest.instance_identity.principal_id

# 2. Grant Search Index Data Reader on the search service
az role assignment create `
    --assignee-object-id $mi `
    --assignee-principal-type ServicePrincipal `
    --role "Search Index Data Reader" `
    --scope "/subscriptions/<sub>/resourceGroups/<rg>/providers/Microsoft.Search/searchServices/<search-service>"

Wait ~3 minutes after the grant for RBAC propagation before running the tests.

If the search service has authOptions = apiKeyOnly (default for older deployments), Entra auth will return 403 regardless of role assignments. Flip it to aadOrApiKey first:

az search service update -g <rg> -n <search-service> --auth-options aadOrApiKey --aad-auth-failure-mode http403

Azure AI Search index prerequisite (one time, out of band)

The azure-search-rag scenario assumes the index pointed at by AZURE_SEARCH_INDEX_NAME already exists with the schema and Contoso Outdoors content the test asserts against. See dotnet/samples/04-hosting/FoundryHostedAgents/responses/Hosted-AzureSearchRag/README.md for the schema and copy-pasteable provisioning snippet. Provisioning the index from your user identity needs Search Index Data Contributor on the search service scope. The search service itself is treated as pre-existing infrastructure shared with python-sample-validation.yml; no automated provisioning script ships in this repository.

Required user/SP roles for delegating data-plane grants

To self-serve the Search Index Data Reader grant above, you need User Access Administrator (or Owner) on the search service scope. To create/seed the index from your own identity, you need Search Index Data Contributor. These are typically granted once per onboarded engineer and reused for every new IT scenario that needs Search.

Building and pushing the test container image

The test container source lives at dotnet/tests/Foundry.Hosting.IntegrationTests.TestContainer. Build and push it with:

$env:IT_REGISTRY = "<your-acr>.azurecr.io"
$env:IT_HOSTED_AGENT_IMAGE = (./scripts/it-build-image.ps1 -Registry $env:IT_REGISTRY | Select-String IT_HOSTED_AGENT_IMAGE).Line.Split('=', 2)[1]

The script tags the image by content hash of the test container source. If you didn't change anything since the last build, the push is a no op.

The Foundry project's account MI and project MI both need AcrPull on the registry.

Running the tests locally

$env:AZURE_AI_PROJECT_ENDPOINT = "https://<your-account>.services.ai.azure.com/api/projects/<your-project>"
$env:AZURE_AI_MODEL_DEPLOYMENT_NAME = "gpt-4o"
# IT_HOSTED_AGENT_IMAGE was set above.

dotnet test dotnet/tests/Foundry.Hosting.IntegrationTests/Foundry.Hosting.IntegrationTests.csproj

Note: all tests are currently tagged [Fact(Skip = ...)] until end to end smoke verification has run against a live Foundry deployment. Once a scenario has been exercised and the assertions stabilized, remove the Skip annotation on its tests.

All test classes carry [Trait("Category", "FoundryHostedAgents")] so the CI workflow can route them to a separate Foundry project than the rest of the integration tests (see .github/workflows/dotnet-build-and-test.yml).

CI wiring

The main "Run Integration Tests" step excludes this category. Two extra steps run only on ubuntu-latest for this category, gated on paths-filter.outputs.foundryHostingChanges so they execute only when the project under test, its dependency chain, the test container, the test fixture, or their tooling changed:

  1. Build and push Foundry Hosted Agents test container invokes scripts/it-build-image.ps1 against vars.IT_HOSTED_AGENT_REGISTRY. The image is rebuilt every IT run; its tag is content-hashed across the test container source AND its referenced framework projects (Microsoft.Agents.AI.Foundry.Hosting, Microsoft.Agents.AI.Foundry, Microsoft.Agents.AI, Microsoft.Agents.AI.Abstractions), so unchanged content is a docker push no-op while any framework code change forces a fresh image. The script pipes its IT_HOSTED_AGENT_IMAGE=<tag> line into $GITHUB_ENV for the next step.

  2. Run Foundry Hosted Agents Integration Tests executes only --filter-trait "Category=FoundryHostedAgents" with the env vars below mapped onto the names the fixture reads. IT_HOSTED_AGENT_IMAGE is the value just exported by step 1.

GitHub env var Mapped to
IT_HOSTED_AGENT_PROJECT_ENDPOINT AZURE_AI_PROJECT_ENDPOINT
IT_HOSTED_AGENT_MODEL_DEPLOYMENT_NAME AZURE_AI_MODEL_DEPLOYMENT_NAME
IT_HOSTED_AGENT_REGISTRY (consumed by it-build-image.ps1; not passed to tests)
secrets.AZURE_SEARCH_ENDPOINT AZURE_SEARCH_ENDPOINT (shared with python-sample-validation.yml)
secrets.AZURE_SEARCH_INDEX_NAME AZURE_SEARCH_INDEX_NAME (shared with python-sample-validation.yml)

Like all integration tests in this workflow, the steps run only on push and merge-queue events, never on plain pull_request. The path-filter list lives in the paths-filter job in .github/workflows/dotnet-build-and-test.yml under filters.foundryHosting and must stay in sync with $hashedDirs in scripts/it-build-image.ps1.

The CI service principal that backs secrets.AZURE_CLIENT_ID needs:

  • Azure AI User on the hosted-agents Foundry project (to add/delete agent versions).
  • AcrPush on the registry referenced by IT_HOSTED_AGENT_REGISTRY (to push the image).

The Azure AI Search index referenced by secrets.AZURE_SEARCH_ENDPOINT and secrets.AZURE_SEARCH_INDEX_NAME is provisioned out of band (shared with python-sample-validation.yml); CI does not need write access to the search service.

The bootstrap script (and one-time AcrPull grants for the Foundry project's MIs) is a human-only operation; CI only adds and deletes versions under existing agents.

Scenarios

Fixture IT_SCENARIO Agent name What it tests
HappyPathHostedAgentFixture happy-path it-happy-path Round trip, streaming, multi turn (previous_response_id and conversation_id), stored=false flag in three combinations, instructions obeyed.
ToolCallingHostedAgentFixture tool-calling it-tool-calling Server side AIFunction invocation; arguments; multi turn referencing prior tool result.
ToolCallingApprovalHostedAgentFixture tool-calling-approval it-tool-calling-approval Approval requests raised, approved, denied.
McpToolboxHostedAgentFixture mcp-toolbox it-mcp-toolbox MCP backed tool invocation against https://learn.microsoft.com/api/mcp (placeholder).
CustomStorageHostedAgentFixture custom-storage it-custom-storage Round trip with custom IResponsesStorageProvider; multi turn reads from the custom store (placeholder).
AzureSearchRagHostedAgentFixture azure-search-rag it-azure-search-rag RAG against a real Azure AI Search index seeded with Contoso Outdoors documents; verifies the model cites the retrieved sources.
SessionFilesHostedAgentFixture session-files it-session-files End-to-end: upload via AgentSessionFiles (alpha) into a pinned agent_session_id, invoke the agent, assert it reads the file via the container's ReadFile tool.
AgentSkillsHostedAgentFixture agent-skills it-agent-skills Agent skills via AgentSkillsProvider: advertises two Contoso Outdoors skills (support-style, escalation-policy) in the system prompt, loads them on demand via load_skill, verifies canary tokens prove the skill was loaded.

The placeholder scenarios will be wired up in the test container Program.cs once the relevant Microsoft.Agents.AI.Foundry.Hosting API surfaces stabilize.