fix: Compatibility with transformers 4.49+ and 5.0+ #137

xander1421 · 2026-02-01T02:37:29Z

Summary

Fix compatibility issues with newer transformers versions (4.49+ and 5.0+)

Problem

DiffRhythm fails with transformers >= 4.49 due to two issues:

Missing num_attention_heads in LlamaConfig (transformers 4.49+)
```
RuntimeError: The size of tensor a (32) must match the size of tensor b (64) at non-singleton dimension 3
```
The rotary embeddings have incorrect dimensions because head_dim is calculated wrong.
LlamaDecoderLayer output format changed (transformers 5.0+)
- Old: Returns tuple (hidden_states, present_key_value, ...)
- New: Returns tensor directly
The line x, *_ = block(...) unpacks a tensor by iterating over its first dimension, effectively doing x = tensor[0] and losing the batch dimension.

Solution

Add explicit num_attention_heads to LlamaConfig:

num_attention_heads = dim // dim_head
llama_config = LlamaConfig(
    ...
    num_attention_heads=num_attention_heads,
)

Change output handling to work with both formats:

# Before (breaks on transformers 5.0):
x, *_ = block(x, ...)

# After (works on all versions):
x = block(x, ...)

Testing

Tested successfully with:

Python 3.14
PyTorch 2.7
transformers 5.0.0

Generated 95s audio samples without issues.

🤖 Generated with Claude Code

Two fixes for newer transformers versions: 1. Add explicit num_attention_heads to LlamaConfig - transformers 4.49+ requires this for correct head_dim calculation - head_dim = hidden_size // num_attention_heads - Without this, rotary embeddings have wrong dimensions 2. Fix LlamaDecoderLayer output handling for transformers 5.0+ - In transformers 5.0, LlamaDecoderLayer returns a tensor directly - Previously returned tuple (hidden_states, ...) - Using `x, *_ = block(...)` on a tensor iterates over first dimension, effectively doing x = tensor[0] and losing the batch dimension - Changed to `x = block(...)` which works for both old and new versions Tested with: - Python 3.14 - PyTorch 2.7 - transformers 5.0.0 Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: Compatibility with transformers 4.49+ and 5.0+ #137

fix: Compatibility with transformers 4.49+ and 5.0+ #137

Uh oh!

xander1421 commented Feb 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

fix: Compatibility with transformers 4.49+ and 5.0+ #137

Are you sure you want to change the base?

fix: Compatibility with transformers 4.49+ and 5.0+ #137

Uh oh!

Conversation

xander1421 commented Feb 1, 2026

Summary

Problem

Solution

Testing

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant