mirror of https://github.com/microsoft/agent-framework.git synced 2026-06-16 21:04:09 +08:00

Files

T

Eduard van Valkenburg b03cb324d5 Python: Add Hyperlight CodeAct package and docs (#5185 )

* initial work on code_mode

* updated samples

* updates to codeact

* udpated codeact

* Draft CodeAct ADR and sample updates

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* initial implementation and adr and feature

* Python: Limit Hyperlight wasm backend to Python <3.14

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Python: Fix CI for Hyperlight CodeAct PR

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Python: Run Hyperlight integration when available

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Python: Address Hyperlight review feedback

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Python: Simplify Hyperlight file mount inputs

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Python: Accept Path host paths in Hyperlight mounts

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Python: Fix Hyperlight mount typing for CI

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* temp run integration test

* Python: Strengthen Hyperlight real sandbox tests

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* added additional tests

* Python: Simplify Hyperlight CodeAct API

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* set tests as non-integration

* Retry Hyperlight allowed-domain registration

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Gate Hyperlight integration tests by runtime support

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Fix Hyperlight skip test on Python 3.14

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Delay Hyperlight runtime probe until test execution

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Relax Hyperlight Windows integration stdout assertion

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Scan Hyperlight output directory for artifacts

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Retry Hyperlight output artifact collection

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Harden Hyperlight integration output assertions

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Retry Hyperlight read-back check in integration test

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Simplify Hyperlight integration write assertion

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Avoid pathlib in Hyperlight integration sandbox

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Use socket network check in Hyperlight sandbox

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Replace blocked Azure AI Search blog link

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Clarify Hyperlight guest stdlib limits

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Use _socket in Hyperlight integration sandbox

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Handle Hyperlight mounted file paths

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Broaden Hyperlight sandbox path fallbacks

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Search Hyperlight guest mounts recursively

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Split Hyperlight mount coverage

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Split Hyperlight live network tests

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Fix Hyperlight file-write test on Windows

Enable the sandbox filesystem by providing a workspace_root so
/output is mounted. Remove os.path.exists assertion (unsupported
in WASM guest) and fix Content data assertion to use .uri.
Skip the network integration test on Windows where the WASM
sandbox lacks the encodings.idna codec.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Address PR review: ADR intro, manual wiring sample, doc clarifications

- Add CodeAct introduction section to ADR for unfamiliar readers
- Clarify 'less runtime efficient' con with specific overhead description
- Add note in Python impl doc clarifying ADR vs impl doc split
- Explain why before_run hooks must be per-run (CRUD, concurrency, approval)
- Rename code_interpreter variable to codeact in E2E sample
- Add manual static wiring sample (codeact_manual_wiring.py)
- Add 'when to use which pattern' guidance to samples README

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Address PR #5185 review comments and add .NET CodeAct design doc

- Fix async callback: _make_sandbox_callback returns sync wrapper with
  thread + asyncio.run() bridge (was broken with real Wasm FFI)
- Fix stale output: clear output_dir before each sandbox.run() call
- Fix blocking event loop: _run_code now async with asyncio.to_thread()
- Revert _agents.py options['tools'] injection (unnecessary; provider
  uses context.extend_tools())
- Revert SessionContext.options docstring back to read-only
- Add real-sandbox test fixtures (shared/restored/fresh)
- Add 8 new real-sandbox tests for callback round-trip, stale output,
  event loop non-blocking, basic execution, stdout/stderr, errors,
  snapshot/restore, and tool registration
- Add comprehensive .NET HyperlightCodeActProvider design document

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Update hyperlight README with code snippets and remove Public API section

Replace bare export list with Quick Start code examples covering the
context provider, standalone tool, manual static wiring, and file
mounts / network access patterns.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

---------

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

b03cb324d5 · 2026-04-17 00:49:44 +00:00

History

README.md

Python: Add Hyperlight CodeAct package and docs (#5185 )

2026-04-17 00:49:44 +00:00

search_context_agentic.py

fixes to azure ai search init, samples (#5021 )

2026-04-01 09:59:52 +00:00

search_context_semantic.py

Python: [BREAKING] Standardize model selection on model (#4999 )

2026-04-01 19:00:18 +00:00

README.md

Azure AI Search Context Provider Examples

Azure AI Search context provider enables Retrieval Augmented Generation (RAG) with your agents by retrieving relevant documents from Azure AI Search indexes. It supports two search modes optimized for different use cases.

This folder contains examples demonstrating how to use the Azure AI Search context provider with the Agent Framework.

Examples

File	Description
`search_context_agentic.py`	Agentic mode (recommended for most scenarios): Uses Knowledge Bases in Azure AI Search for query planning and multi-hop reasoning. Provides more accurate results through intelligent retrieval with automatic query reformulation. Slightly slower with more token consumption for query planning. Learn more
`search_context_semantic.py`	Semantic mode (fast queries): Fast hybrid search combining vector and keyword search with semantic ranking. Returns raw search results as context. Best for scenarios where speed is critical and simple retrieval is sufficient.

Installation

pip install agent-framework-foundry-search agent-framework-foundry

Prerequisites

Required Resources

Azure AI Search service with a search index containing your documents
- Create Azure AI Search service
- Create and populate a search index
Azure AI Foundry project with a model deployment
- Create Azure AI Foundry project
- Deploy a model (e.g., GPT-4o)
For Agentic mode only: Azure OpenAI resource for Knowledge Base model calls
- Create Azure OpenAI resource
- Note: This is separate from your Azure AI Foundry project endpoint

Authentication

Both examples support two authentication methods:

API Key: Set AZURE_SEARCH_API_KEY environment variable
Entra ID (Managed Identity): Uses DefaultAzureCredential when API key is not provided

Run az login if using Entra ID authentication.

Configuration

Environment Variables

Common (both modes):

AZURE_SEARCH_ENDPOINT: Your Azure AI Search endpoint (e.g., https://myservice.search.windows.net)
AZURE_SEARCH_INDEX_NAME: Name of your search index
FOUNDRY_PROJECT_ENDPOINT: Your Azure AI Foundry project endpoint
FOUNDRY_MODEL: Model deployment name (e.g., gpt-4o, defaults to gpt-4o)
AZURE_SEARCH_API_KEY: (Optional) Your search API key - if not provided, uses DefaultAzureCredential

Agentic mode only:

AZURE_SEARCH_KNOWLEDGE_BASE_NAME: Name of your Knowledge Base in Azure AI Search
AZURE_OPENAI_RESOURCE_URL: Your Azure OpenAI resource URL (e.g., https://myresource.openai.azure.com)
- Important: This is different from FOUNDRY_PROJECT_ENDPOINT - Knowledge Base needs the OpenAI endpoint for model calls

Example .env file

For Semantic Mode:

AZURE_SEARCH_ENDPOINT=https://myservice.search.windows.net
AZURE_SEARCH_INDEX_NAME=my-index
FOUNDRY_PROJECT_ENDPOINT=https://<resource-name>.services.ai.azure.com/api/projects/<project-name>
FOUNDRY_MODEL=gpt-4o
# Optional - omit to use Entra ID
AZURE_SEARCH_API_KEY=your-search-key

For Agentic Mode (add these to semantic mode variables):

AZURE_SEARCH_KNOWLEDGE_BASE_NAME=my-knowledge-base
AZURE_OPENAI_RESOURCE_URL=https://myresource.openai.azure.com

Search Modes Comparison

Feature	Semantic Mode	Agentic Mode
Speed	Fast	Slower (query planning overhead)
Token Usage	Lower	Higher (query reformulation)
Retrieval Strategy	Hybrid search + semantic ranking	Multi-hop reasoning with Knowledge Base
Query Handling	Direct search	Automatic query reformulation
Best For	Simple queries, speed-critical apps	Complex queries, multi-document reasoning
Additional Setup	None	Requires Knowledge Base + OpenAI resource

When to Use Semantic Mode

Simple queries where direct keyword/vector search is sufficient
Speed is critical and you need low latency
Straightforward retrieval from single documents
Lower token costs are important

When to Use Agentic Mode

Complex queries requiring multi-hop reasoning
Cross-document analysis where information spans multiple sources
Ambiguous queries that benefit from automatic reformulation
Higher accuracy is more important than speed
You need intelligent query planning and document synthesis

How the Examples Work

Semantic Mode Flow

User query is sent to Azure AI Search
Hybrid search (vector + keyword) retrieves relevant documents
Semantic ranking reorders results for relevance
Top-k documents are returned as context
Agent generates response using retrieved context

Agentic Mode Flow

User query is sent to the Knowledge Base
Knowledge Base plans the retrieval strategy
Multiple search queries may be executed (multi-hop)
Retrieved information is synthesized
Enhanced context is provided to the agent
Agent generates response with comprehensive context

Code Example

Semantic Mode

from agent_framework import Agent
from agent_framework.azure import AzureAISearchContextProvider
from agent_framework.foundry import FoundryChatClient
from azure.identity.aio import DefaultAzureCredential

# Create search provider with semantic mode (default)
search_provider = AzureAISearchContextProvider(
    endpoint=search_endpoint,
    index_name=index_name,
    api_key=search_key,  # Or use credential for Entra ID
    mode="semantic",  # Default mode
    top_k=3,  # Number of documents to retrieve
)

# Create agent with search context
async with FoundryChatClient(
    project_endpoint=project_endpoint,
    model=model_deployment,
    credential=DefaultAzureCredential(),
) as client:
    async with Agent(
        client=client,
        context_providers=[search_provider],
    ) as agent:
        response = await agent.run("What information is in the knowledge base?")

Agentic Mode

from agent_framework.azure import AzureAISearchContextProvider

# Create search provider with agentic mode
search_provider = AzureAISearchContextProvider(
    endpoint=search_endpoint,
    index_name=index_name,
    api_key=search_key,
    mode="agentic",  # Enable agentic retrieval
    knowledge_base_name=knowledge_base_name,
    azure_openai_resource_url=azure_openai_resource_url,
    top_k=5,
)

# Use with agent (same as semantic mode)
async with Agent(
    client=client,
    model=model_deployment,
    context_providers=[search_provider],
) as agent:
    response = await agent.run("Analyze and compare topics across documents")

Running the Examples

Set up environment variables (see Configuration section above)

Ensure you have an Azure AI Search index with documents:

# Verify your index exists
curl -X GET "https://myservice.search.windows.net/indexes/my-index?api-version=2024-07-01" \
     -H "api-key: YOUR_API_KEY"

For agentic mode: Create a Knowledge Base in Azure AI Search
- Knowledge Base documentation

Run the examples:

# Semantic mode (fast, simple)
python azure_ai_with_search_context_semantic.py

# Agentic mode (intelligent, complex)
python azure_ai_with_search_context_agentic.py

Key Parameters

Common Parameters

endpoint: Azure AI Search service endpoint
index_name: Name of the search index
api_key: API key for authentication (optional, can use credential instead)
credential: Azure credential for Entra ID auth (e.g., DefaultAzureCredential())
mode: Search mode - "semantic" (default) or "agentic"
top_k: Number of documents to retrieve (default: 3 for semantic, 5 for agentic)

Semantic Mode Parameters

semantic_configuration: Name of semantic configuration in your index (optional)
query_type: Query type - "semantic" for semantic search (default)

Agentic Mode Parameters

knowledge_base_name: Name of your Knowledge Base (required)
azure_openai_resource_url: Azure OpenAI resource URL (required)
max_search_queries: Maximum number of search queries to generate (default: 3)

Troubleshooting

Common Issues

Authentication errors
- Ensure AZURE_SEARCH_API_KEY is set, or run az login for Entra ID auth
- Verify your credentials have search permissions
Index not found
- Verify AZURE_SEARCH_INDEX_NAME matches your index name exactly
- Check that the index exists and contains documents
Agentic mode errors
- Ensure AZURE_SEARCH_KNOWLEDGE_BASE_NAME is correctly configured
- Verify AZURE_OPENAI_RESOURCE_URL points to your Azure OpenAI resource (not AI Foundry endpoint)
- Check that your OpenAI resource has the necessary model deployments
No results returned
- Verify your index has documents with vector embeddings (for semantic/hybrid search)
- Check that your queries match the content in your index
- Try increasing top_k parameter
Slow responses in agentic mode
- This is expected - agentic mode trades speed for accuracy
- Reduce max_search_queries if needed
- Consider semantic mode for speed-critical applications

Performance Tips

Use semantic mode as the default for most scenarios - it's fast and effective
Switch to agentic mode when you need multi-hop reasoning or complex queries
Adjust top_k based on your needs - higher values provide more context but increase token usage
Enable semantic configuration in your index for better semantic ranking
Use Entra ID authentication in production for better security

README.md

Azure AI Search Context Provider Examples

Examples

Installation

Prerequisites

Required Resources

Authentication

Configuration

Environment Variables

Example .env file

Search Modes Comparison

When to Use Semantic Mode

When to Use Agentic Mode

How the Examples Work

Semantic Mode Flow

Agentic Mode Flow

Code Example

Semantic Mode

Agentic Mode

Running the Examples

Key Parameters

Common Parameters

Semantic Mode Parameters

Agentic Mode Parameters

Troubleshooting

Common Issues

Performance Tips

Additional Resources