Ollama – v0.18.0-rc1
Ollama v0.18.0-rc1 is here, and it's packing some serious upgrades!
Anthropic Model Fixes
- Fixed parsing of `close_thinking` blocks before `tool_use`, especially when no intermediate text is present; this is critical for clean tool invocations in Claude-style models.
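To make the fix concrete, here is a hedged sketch of the content-block shape involved: an assistant turn whose thinking block is closed immediately before a `tool_use` block, with no text block in between. The function and field names below are illustrative, not Ollama's actual parser.

```python
# Illustrative sketch only -- not Ollama's internal parser. Shows the
# block ordering the release notes describe: a thinking block closed
# right before tool_use, with no intermediate text block.

def extract_tool_calls(content_blocks):
    """Collect tool_use blocks even when a thinking block immediately
    precedes them with no text block in between."""
    calls = []
    for block in content_blocks:
        if block.get("type") == "tool_use":
            calls.append({"name": block["name"], "input": block["input"]})
    return calls

turn = [
    {"type": "thinking", "thinking": "I should call the weather tool."},
    # no intermediate text block here -- the case the fix targets
    {"type": "tool_use", "name": "get_weather", "input": {"city": "Paris"}},
]
print(extract_tool_calls(turn))
```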
Tool Use & Function Calling
- Major improvements to structured outputs and function calling, enabling smoother integrations with `claude-3.5-sonnet` and similar models.
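For readers new to function calling, here is a minimal sketch of a tool definition in the OpenAI-style JSON schema that chat APIs such as Ollama's accept. The `get_weather` tool and its parameters are made up for illustration.

```python
import json

# Hedged sketch: an OpenAI-style function schema for a tool definition.
# The tool name and parameters are hypothetical examples.
weather_tool = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Look up the current weather for a city",
        "parameters": {
            "type": "object",
            "properties": {
                "city": {"type": "string", "description": "City name"},
            },
            "required": ["city"],
        },
    },
}

# The tool list rides along with the messages in the chat request body.
request_body = {
    "model": "claude-3.5-sonnet",
    "messages": [{"role": "user", "content": "What's the weather in Paris?"}],
    "tools": [weather_tool],
}
print(json.dumps(request_body, indent=2))
```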
Performance & Stability Boosts
- Optimized context handling & reduced memory footprint.
- Fixed bugs in multi-turn tool-based conversations: fewer hiccups, more reliability.
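A multi-turn tool conversation alternates roles like this sketch, which follows the common chat-API convention (user turn, assistant tool call, a `tool` message carrying the result, then the next user turn). Exact field shapes vary by API version, so treat the structure as illustrative.

```python
# Hedged sketch of a multi-turn tool conversation's message history.
# Role and field names follow common chat-API conventions; the tool
# name and payloads are hypothetical.
messages = [
    {"role": "user", "content": "What's the weather in Paris?"},
    {
        "role": "assistant",
        "content": "",
        "tool_calls": [
            {"function": {"name": "get_weather", "arguments": {"city": "Paris"}}}
        ],
    },
    # The tool's result goes back in as its own message...
    {"role": "tool", "content": '{"temp_c": 18, "conditions": "cloudy"}'},
    # ...and the conversation continues with full history replayed.
    {"role": "user", "content": "And tomorrow?"},
]
roles = [m["role"] for m in messages]
print(roles)
```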
Platform Love
- Updated CUDA & Metal backends for faster inference.
- Better Apple Silicon (M-series) support, plus improved WSL2 & native Windows performance.
CLI/API Tweaks
- New flags for `ollama run` & `ollama chat`, including fine-grained streaming control.
- Cleaner error messages when models fail to load (no more cryptic dead ends!).
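The release notes don't name the new flags, so this sketch shows streaming control the way Ollama's documented REST API already exposes it: a `stream` field in the request body, where `true` yields incremental JSON chunks and `false` returns one complete object.

```python
import json

# Streaming control via the documented REST request body (POST to
# /api/generate on a local Ollama server). The model name is illustrative.
payload = {
    "model": "llama3",
    "prompt": "Why is the sky blue?",
    "stream": False,  # one complete response object instead of chunks
}
print(json.dumps(payload))
```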
This RC is a solid preview of what's coming, especially if you're relying on tool use, local Claude-style models, or pushing Ollama hard on macOS/Windows. Try it out and let us know what you think!
Download v0.18.0-rc1
#Ollama #LLMs #AIEnthusiasts
