8x AMD MI50 32GB at 26 t/s (tg) with MiniMax-M2.1 and 15 t/s (tg) with GLM 4.7 (vllm-gfx906)
🔴 r/LocalLLaMA by /u/ai-infos
technical
View Original Post ↗ No analysis available for this story.
This story was indexed before article generation was enabled.
🤖 Classification Details
Detailed technical post with specific hardware setup, benchmarked throughput metrics, reproducible configuration with GitHub link, and practical inference performance data for local LLM deployment.