feat(types): add native audio content block support #1515

strands-agent · 2026-01-20T02:10:39Z

Description

Add native audio content block support to the SDK's type system, following the established pattern for video, image, and document content.

Changes

Types (src/strands/types/media.py):
- Add AudioFormat literal type (mp3, wav, flac, ogg, webm)
- Add AudioSource TypedDict with bytes attribute
- Add AudioContent TypedDict with format and source attributes
ContentBlock (src/strands/types/content.py):
- Add audio: AudioContent field to the ContentBlock TypedDict
Bedrock Provider (src/strands/models/bedrock.py):
- Add audio handling in _format_request_message_content() following the video pattern
LlamaCpp Provider (src/strands/models/llamacpp.py):
- Update to use native AudioContent types instead of cast(Dict[str, Any], content)
Tests (tests/strands/models/test_bedrock.py):
- Add test_format_request_filters_audio_content_blocks test

Benefits

Type Safety: No more cast(Dict[str, Any], content) workarounds
Consistency: Audio follows the same pattern as video, image, and document
Model Support: Enables type-safe audio handling for Bedrock (Nova Sonic), LlamaCpp (Qwen2.5-Omni), and future providers

Usage Example

from strands.types.content import ContentBlock
from strands.types.media import AudioContent

audio_block: ContentBlock = {
    "audio": {
        "format": "mp3",
        "source": {"bytes": audio_bytes}
    }
}

response = agent([audio_block, {"text": "What do you hear in this audio?"}])

Related Issues

Closes #866

Documentation PR

No documentation changes required - this adds types that follow existing patterns.

Type of Change

New feature

Testing

I ran hatch run prepare (formatter + linter + mypy + tests)
Unit test passes: test_format_request_filters_audio_content_blocks
All existing tests continue to pass

Checklist

I have read the CONTRIBUTING document
I have added any necessary tests that prove my fix is effective or my feature works
I have updated the documentation accordingly
I have added an appropriate example to the documentation to outline the feature, or no new docs are needed
My changes generate no new warnings
Any dependent changes have been merged and published

Adds AudioContent TypedDict to support audio input in messages, following the established pattern for image and video content. Changes: - Add AudioFormat Literal type with common audio formats (mp3, wav, flac, ogg, aac, webm) - Add AudioSource TypedDict for audio binary content - Add AudioContent TypedDict with format and source fields - Add 'audio' field to ContentBlock TypedDict - Add audio handling in BedrockModel._format_request_message_content() - Add unit test for audio content block filtering This enables type-safe audio input for model providers that support multimodal audio content, such as Bedrock (Nova Sonic) and LlamaCpp (Qwen2.5-Omni). Closes strands-agents#866

strands-agent force-pushed the feat/audio-content-block branch from 292e851 to 07b34c5 Compare January 20, 2026 02:10

github-actions bot added size/s and removed size/s labels Jan 20, 2026

strands-agent requested a deployment to manual-approval January 20, 2026 02:10 — with GitHub Actions Waiting

github-actions bot added the size/s label Jan 20, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(types): add native audio content block support #1515

feat(types): add native audio content block support #1515

strands-agent commented Jan 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

feat(types): add native audio content block support #1515

Are you sure you want to change the base?

feat(types): add native audio content block support #1515

Conversation

strands-agent commented Jan 20, 2026

Description

Changes

Benefits

Usage Example

Related Issues

Documentation PR

Type of Change

Testing

Checklist

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant