All checks were successful
Build and Push RunPod Docker Image / build-and-push (push) Successful in 14s
Previous approach patched DiT.__init__ at runtime, but models were already instantiated and cached. This version patches LlamaConfig globally BEFORE any DiffRhythm imports, ensuring all model instances use the correct config. Key changes: - Created PatchedLlamaConfig subclass that auto-calculates attention heads - Replaced LlamaConfig in transformers.models.llama module at import time - Patch applies to all LlamaConfig instances, including pre-loaded models This should finally fix the tensor dimension mismatch error. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>