mirror of https://github.com/microsoft/agent-framework.git synced 2026-06-16 21:04:09 +08:00

Files

T

Giles Odigwe 0340b7596b Python: bump package versions for 1.3.0 release (#5706 )

* Python: bump package versions for 1.3.0 release

MINOR bump on the released cohort (agent-framework, agent-framework-core,
agent-framework-openai, agent-framework-foundry: 1.2.2 -> 1.3.0). All 22
beta packages stamp 1.0.0b260507 and all 3 alpha packages stamp
1.0.0a260507 per the lockstep convention. Date stamp reflects 2026-05-07
Pacific.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Address review: bump foundry_local openai floor, fix devui orchestrations pin, clarify breaking scope

- foundry_local: bump agent-framework-openai lower bound from >=1.1.0 to >=1.3.0
- devui: update stale agent-framework-orchestrations dev pin from 1.0.0b260402 to 1.0.0b260507
- CHANGELOG: clarify [BREAKING] applies to experimental skills API only

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Revert devui orchestrations pin to 1.0.0b260402 to avoid breaking DevUI

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

---------

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

0340b7596b · 2026-05-08 08:57:02 +09:00

History

agent_framework_azure_contentunderstanding

[Python] Add agent-framework-azure-ai-contentunderstanding package (#4829 )

2026-04-28 20:55:59 +00:00

samples

Python: bump package versions for 1.2.2 release (#5561 )

2026-04-29 17:51:48 +09:00

tests/cu

[Python] Add agent-framework-azure-ai-contentunderstanding package (#4829 )

2026-04-28 20:55:59 +00:00

.gitignore

[Python] Add agent-framework-azure-ai-contentunderstanding package (#4829 )

2026-04-28 20:55:59 +00:00

AGENTS.md

[Python] Add agent-framework-azure-ai-contentunderstanding package (#4829 )

2026-04-28 20:55:59 +00:00

LICENSE

[Python] Add agent-framework-azure-ai-contentunderstanding package (#4829 )

2026-04-28 20:55:59 +00:00

pyproject.toml

Python: bump package versions for 1.3.0 release (#5706 )

2026-05-08 08:57:02 +09:00

README.md

[Python] Add agent-framework-azure-ai-contentunderstanding package (#4829 )

2026-04-28 20:55:59 +00:00

README.md

Get Started with Azure Content Understanding in Microsoft Agent Framework

Please install this package via pip:

pip install agent-framework-azure-contentunderstanding --pre

Azure Content Understanding Integration

Prerequisites

Before using this package, you need an Azure Content Understanding resource:

An active Azure subscription (create one for free)
A Microsoft Foundry resource created in a supported region
Default model deployments configured for your resource (GPT-4.1, GPT-4.1-mini, text-embedding-3-large)

Follow the prerequisites section in the Azure Content Understanding quickstart for setup instructions.

Introduction

The Azure Content Understanding integration provides a context provider that automatically analyzes file attachments (documents, images, audio, video) using Azure Content Understanding and injects structured results into the LLM context.

Document & image analysis: State-of-the-art OCR with markdown extraction, table preservation, and structured field extraction — handles scanned PDFs, handwritten content, and complex layouts
Audio & video analysis: Transcription, speaker diarization, and per-segment summaries
Background processing: Configurable timeout with async background fallback for large files
file_search integration: Optional vector store upload for token-efficient RAG on large documents

Learn more about Azure Content Understanding capabilities at https://learn.microsoft.com/azure/ai-services/content-understanding/

Basic Usage Example

See the samples directory which demonstrates:

Single PDF upload and Q&A (01_document_qa)
Multi-turn sessions with cached results (02_multi_turn_session)
PDF + audio + video parallel analysis (03_multimodal_chat)
Structured field extraction with prebuilt-invoice (04_invoice_processing)
CU extraction + OpenAI vector store RAG (05_large_doc_file_search)
Interactive web UI with DevUI (02-devui)

import asyncio
from agent_framework import Agent, AgentSession, Message, Content
from agent_framework.foundry import FoundryChatClient
from agent_framework.foundry import ContentUnderstandingContextProvider
from azure.identity import AzureCliCredential

credential = AzureCliCredential()

cu = ContentUnderstandingContextProvider(
    endpoint="https://my-resource.cognitiveservices.azure.com/",
    credential=credential,
    max_wait=None,  # block until CU extraction completes before sending to LLM
)

client = FoundryChatClient(
    project_endpoint="https://your-project.services.ai.azure.com",
    model="gpt-4.1",
    credential=credential,
)

async def main():
    async with cu:
        agent = Agent(
            client=client,
            name="DocumentQA",
            instructions="You are a helpful document analyst.",
            context_providers=[cu],
        )
        session = AgentSession()

        response = await agent.run(
            Message(role="user", contents=[
                Content.from_text("What's on this invoice?"),
                Content.from_uri(
                    "https://raw.githubusercontent.com/Azure-Samples/"
                    "azure-ai-content-understanding-assets/main/document/invoice.pdf",
                    media_type="application/pdf",
                    additional_properties={"filename": "invoice.pdf"},
                ),
            ]),
            session=session,
        )
        print(response.text)

asyncio.run(main())

Supported File Types

Category	Types
Documents	PDF, DOCX, XLSX, PPTX, HTML, TXT, Markdown
Images	JPEG, PNG, TIFF, BMP
Audio	WAV, MP3, M4A, FLAC, OGG
Video	MP4, MOV, AVI, WebM

For the complete list of supported file types and size limits, see Azure Content Understanding service limits.

Environment Variables

The provider supports automatic endpoint resolution from environment variables. When endpoint is not passed to the constructor, it is loaded from AZURE_CONTENTUNDERSTANDING_ENDPOINT:

# Endpoint auto-loaded from AZURE_CONTENTUNDERSTANDING_ENDPOINT env var
cu = ContentUnderstandingContextProvider(credential=credential)

Set these in your shell or in a .env file:

AZURE_CONTENTUNDERSTANDING_ENDPOINT=https://your-cu-resource.cognitiveservices.azure.com/
AZURE_AI_PROJECT_ENDPOINT=https://your-project.services.ai.azure.com
AZURE_OPENAI_DEPLOYMENT_NAME=gpt-4.1

You also need to be logged in with az login (for AzureCliCredential).

Next steps

Explore the samples directory for complete code examples
Read the Azure Content Understanding documentation for detailed service information
Learn more about the Microsoft Agent Framework