Agents can process images, audio, video, and files as input, and generate images and audio as output. This section introduces Multimodal I/O. Check out the full guide for more details.Documentation Index
Fetch the complete documentation index at: https://agno-v2-shaloo-ai-support-link.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
Media Classes
| Class | Parameters |
|---|---|
Image | url, filepath, content (bytes) |
Audio | url, filepath, content (bytes), format |
Video | url, filepath, content (bytes) |
File | url, filepath, content (bytes) |
Quickstart
- Input
- Output
- Combined I/O
Select Media Type:
- Image
- Audio
- Video
- File
Pass images via URL, file path, or base64 content: