mirror of https://github.com/microsoft/agent-framework.git synced 2026-06-16 21:04:09 +08:00

westey e224f06e60 .NET: Update models used in dotnet samples to gpt-5.4-mini (#5080)

* Update models used in dotnet samples to gpt-5.4-mini

* Fix additional missed sample

e224f06e60 · 2026-04-07 15:34:00 +00:00

Using Images with AI Agents

This sample demonstrates how to send images to an AI agent as multimodal input. It shows how to create a vision-enabled agent that uses Azure OpenAI to analyze and describe images.

What this sample demonstrates

Creating a persistent AI agent with vision capabilities
Sending both text and image content to an agent in a single message
Using UriContent to reference images by URI
Processing multimodal input (text + image) with an AI agent

Key features

Vision Agent: Creates an agent specifically instructed to analyze images
Multimodal Input: Combines a text question with an image URI in a single message
Azure OpenAI Integration: Uses Azure OpenAI LLM agents

Prerequisites

Before running this sample, ensure you have:

An Azure OpenAI project set up
A compatible model deployment (e.g., gpt-5.4-mini)
Azure CLI installed and authenticated

Environment Variables

Set the following environment variables (PowerShell syntax):

$env:AZURE_OPENAI_ENDPOINT="https://your-resource.openai.azure.com/" # Replace with your Azure OpenAI endpoint
$env:AZURE_OPENAI_DEPLOYMENT_NAME="gpt-5.4-mini" # Replace with your model deployment name (optional, defaults to gpt-5.4-mini)
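
The `$env:` commands above are PowerShell syntax. On macOS or Linux, a rough bash/zsh equivalent might look like the sketch below. It assumes a POSIX-style shell and reuses the same placeholder endpoint. The `:-` fallback mirrors the default described above and is optional, since the deployment name is documented as optional.

```shell
# Bash/zsh sketch of the PowerShell commands above (assumes a POSIX-style shell).
# Replace the placeholder with your Azure OpenAI endpoint.
export AZURE_OPENAI_ENDPOINT="https://your-resource.openai.azure.com/"

# Optional: keep any value already set, otherwise fall back to the documented default.
export AZURE_OPENAI_DEPLOYMENT_NAME="${AZURE_OPENAI_DEPLOYMENT_NAME:-gpt-5.4-mini}"

# Echo the values so a typo is easy to spot before running the sample.
echo "endpoint:   $AZURE_OPENAI_ENDPOINT"
echo "deployment: $AZURE_OPENAI_DEPLOYMENT_NAME"
```

Variables exported this way last only for the current shell session, so they need to be set again in a new terminal.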

Run the sample

Navigate to the sample directory and run:

cd Agent_Step08_UsingImages
dotnet run
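
Before running, it can help to confirm that the prerequisites are in place. The sketch below is a hypothetical pre-flight helper for bash/zsh. The function name `preflight` is an illustration and not part of the sample. It checks that the endpoint variable is set, that the `az` command is on the PATH, and that `az account show` succeeds, which is used here as a rough indicator of an authenticated Azure CLI session.

```shell
# Hypothetical pre-flight check; not part of the sample itself.
preflight() {
  # The sample needs the endpoint variable to be set.
  if [ -z "${AZURE_OPENAI_ENDPOINT:-}" ]; then
    echo "AZURE_OPENAI_ENDPOINT is not set" >&2
    return 1
  fi
  # The Azure CLI must be installed...
  if ! command -v az >/dev/null 2>&1; then
    echo "Azure CLI (az) not found on PATH" >&2
    return 1
  fi
  # ...and logged in; 'az account show' fails when there is no active login.
  if ! az account show >/dev/null 2>&1; then
    echo "Azure CLI is not logged in; run 'az login'" >&2
    return 1
  fi
  return 0
}

# Usage: only start the sample when the checks pass.
# preflight && dotnet run
```

The checks return early with a message on standard error, so a missing prerequisite is reported before `dotnet run` starts.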

Expected behavior

The sample will:

Create a vision-enabled agent named "VisionAgent"
Send a message containing both text ("What do you see in this image?") and a URI-referenced image of a green walk
Have the agent analyze the image and provide a description
Clean up resources by deleting the thread and agent