MLX-LM – v0.29.0

Written by

MLX-LM – v0.29.0

🚀 MLX LM v0.29.0 is live — and it’s a beast!

Batch generation just got 2x faster thanks to `wired_limit` fixes — your server will thank you.
RoPE & SuScaledRoPE fixed for `rnj-1` and others — smoother attention, less drift.
Dequantize bug squashed ✅ Now using the right function — cleaner outputs, better precision.
Repetition penalty defaults to 0.0 — less annoying repetition from day one. 🎯
DSV32 & Gemma3 — bugs gone, stable and ready to deploy.
SSM batching fixed — state-space models now behave on the server. 💡
Nemotron 3 added! 🎉 Go ahead, test it.
Devstral-2 now works properly — no more surprises. 👏

Big shoutout to first-time contributors: @otarkhan, @devnamrits, @DePasqualeOrg, and @inferencers — welcome to the crew! 🙌

Update now — your LLMs are ready for a speed run. 🛠️

Full changelog: [v0.28.4…v0.29.0](link)

More posts