Ollama – v0.13.0
Ollama v0.13.0 is live, and it's a game-changer for local LLM folks!
Meet DeepSeek-V3.1 (the `deepseek2` architecture under the hood), now officially supported with 128K context, razor-sharp reasoning, and killer coding skills. But here's the kicker: it runs on Ollama's brand-new engine with MLA (Multi-head Latent Attention), which means faster token generation, lower latency, and no more sluggish long-context hangs.
What's new?
- DeepSeek-V3.1 support: perfect for complex prompts, multilingual tasks, and code generation
- New MLA-enabled engine: smoother, faster inference on both CPU and GPU (NVIDIA/AMD)
- Optimized streaming: ideal for chat apps, agents, and real-time LLM workflows
Just run `ollama pull deepseek-v3.1` and feel the difference. No more waiting. Just pure, local LLM power.
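Want to see the streaming in action? Here's a minimal sketch (not from the release notes) that talks to a local Ollama server through its standard REST API (`POST /api/chat` on the default port 11434). It assumes you've already pulled `deepseek-v3.1` as above; `stream_chat` is just an illustrative helper, not part of Ollama.

```python
# Minimal sketch: stream a chat completion from a local Ollama server.
# Assumes Ollama is running on the default port (11434) and the model
# "deepseek-v3.1" has already been pulled.
import json
import requests

def stream_chat(prompt: str, model: str = "deepseek-v3.1") -> None:
    """Send a chat request and print tokens as they stream back."""
    resp = requests.post(
        "http://localhost:11434/api/chat",
        json={
            "model": model,
            "messages": [{"role": "user", "content": prompt}],
            "stream": True,  # token-by-token streaming (the default for /api/chat)
        },
        stream=True,
    )
    resp.raise_for_status()
    # The streaming response is newline-delimited JSON: one object per chunk.
    for line in resp.iter_lines():
        if not line:
            continue
        chunk = json.loads(line)
        # Each chunk carries a partial assistant message until "done" is true.
        print(chunk.get("message", {}).get("content", ""), end="", flush=True)
        if chunk.get("done"):
            print()
            break

if __name__ == "__main__":
    stream_chat("Write a Python function that reverses a linked list.")
```

Because every line of the response is a standalone JSON object, the loop can print tokens as they arrive instead of waiting for the full completion, which is exactly what chat apps and agent loops want.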
