• Ollama – v0.15.0-rc0: x/imagegen: remove qwen_image and qwen_image_edit models (#13827)

    🚀 Ollama v0.15.0-rc0 just dropped — and it’s a clean sweep! 🧹

    The Qwen image generation (`qwen_image`) and editing (`qwen_image_edit`) models have been temporarily removed to tidy up the codebase. Not gone forever — just taking a breather before coming back better.

    What’s new:

    • 🗑️ Deleted 15 files from `x/imagegen/models/qwen_image/` and `qwen_image_edit/`
    • 🚫 Removed CLI flags & imports tied to Qwen image models
    • ✏️ Cleaned up old comments in `cache/step.go`

    No breaking changes — just housekeeping! 💪

    In the meantime, try `flux`, `dalle3`, or `stable-diffusion` for your image gen fixes.

    Image tools are getting a glow-up — stay tuned! 🎨

    🔗 View Release

  • Ollama – v0.14.3

    🚀 Ollama v0.14.3 is live — and it’s a quiet game-changer for power users!

    🖼️ Image generation now respects `OLLAMA_MODELS` — finally, your custom model directory is honored for manifests and blobs. No more hidden paths or messy defaults. Whether you’re running Ollama in containers, on a remote server, or just meticulously organizing your models, everything stays where you put it.

    No breaking changes. Just clean, predictable, and beautifully configurable storage.

    Perfect for devs who want control without the clutter. Update, reorganize, and keep building! 🛠️
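    Curious what "respects `OLLAMA_MODELS`" means in practice? A minimal Python sketch of the env-var-first lookup, assuming the common per-user default of `~/.ollama/models` — illustrative only, not Ollama's actual code:

    ```python
    import os
    from pathlib import Path

    def models_dir() -> Path:
        """Resolve the model storage root: OLLAMA_MODELS wins if set,
        otherwise fall back to the per-user default location."""
        override = os.environ.get("OLLAMA_MODELS")
        if override:
            return Path(override).expanduser()
        return Path.home() / ".ollama" / "models"

    # Manifests and blobs both live under the same resolved root.
    manifests = models_dir() / "manifests"
    blobs = models_dir() / "blobs"
    ```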

    🔗 View Release

  • Lemonade – v9.1.4

    🔥 Lemonade v9.1.4 just dropped — your local LLM game just leveled up!

    AMD GPU users, this one’s for you: GLM-4.7-Flash-GGUF now runs on ROCm & Vulkan — no more NVIDIA-only FOMO.

    ✨ New features:

    • Install for ALL USERS on Windows & Linux — perfect for shared rigs and servers.
    • Direct local GGUF paths in `pull` CLI — ditch the workarounds, point & run.
    • LFM2.5 models are live! Faster reasoning, leaner inference.
    • Perplexica added to apps — explore & benchmark models with a slick UI.
    • Server load + bench tools in dev CLI — test performance like a pro, right from terminal.
    • ROCm detection fixed for Strix Halo on Ubuntu 24.04 OEM kernel — finally, it just works.
    • Docker & Linux fixes: caching, health checks, docs — all polished.
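    How might a `pull` command tell a local GGUF file from a registry model name? One common trick is checking for the GGUF magic bytes — `GGUF` really is the four-byte header from the GGUF spec, but the function itself is a sketch, not Lemonade's actual code:

    ```python
    import os

    GGUF_MAGIC = b"GGUF"  # the four bytes every GGUF file starts with

    def looks_like_local_gguf(target: str) -> bool:
        """Heuristic a pull command might use: treat the argument as a
        local GGUF checkpoint only if it's an existing file with the
        GGUF magic header; otherwise fall through to registry lookup."""
        if not os.path.isfile(target):
            return False
        with open(target, "rb") as f:
            return f.read(4) == GGUF_MAGIC
    ```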

    📦 Cleaned up the recipe system, added the libasound2t64 dependency for smoother audio integration — and a warm welcome to our new contributors: @sofiageo, @goodtiding5, @ItzCrazyKn!

    Upgrade now — your local LLM stack just got a serious power-up. 🚀

    🔗 View Release

  • Ollama – v0.14.3-rc3: model: add lfm2 architecture and LFM2.5-1.2B-Thinking support (#13792)

    Big news for AI tinkerers! 🚀

    Ollama v0.14.3-rc3 just dropped with native support for the brand-new LFM2 architecture and its first model: LFM2.5-1.2B-Thinking — a lean 1.2B parameter model built for reasoning, not just generation.

    🧠 Think step-by-step problem solving, code reasoning, and complex QA — all running locally with zero cloud latency.

    Pull it in seconds:

    `ollama pull lfm2.5:1.2b-thinking`

    No more waiting for APIs — now you’ve got a tiny, thinking LLM on your machine. Perfect for dev experiments, edge deployments, or just geeking out in privacy.

    #Ollama #LLMs #LocalAI #LFM2

    🔗 View Release

  • ComfyUI – v0.10.0

    ComfyUI v0.10.0 just dropped—and it’s a game-changer 🎨⚡

    • Native WebUI Integration: Drag & drop your Stable Diffusion WebUI models directly. No more conversion headaches.
    • Dynamic Prompts in Nodes: Use `{prompt}`, `{seed}`, or `{CFG}` inside inputs—batch test variations without cloning nodes.
    • 30% Faster Workflows: Smarter node caching = quicker loads on massive pipelines.
    • New “Batch Sampler” Node: Generate 50+ variations in one go—randomize seeds, styles, CFG—all from a single node.
    • Dark Mode Upgrades: Smoother, higher contrast—perfect for late-night prompt tinkering.
    • Linux ARM64 Support: Raspberry Pi 5, ARM64 servers, or an M-series Mac running Linux? You’re now fully supported. 🍏🧠

    Pro tip: Combine Batch Sampler + Dynamic Prompts to auto-generate character concepts in seconds. Perfect for artists, devs, and AI tinkerers.
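    The placeholder idea is easy to picture with plain string templating — a toy sketch, not ComfyUI's implementation (the `expand` helper and its defaults are hypothetical):

    ```python
    import random

    def expand(template, prompt, seed=None, cfg=7.5):
        """Substitute {prompt}, {seed}, and {CFG} placeholders inside a
        node input; a missing seed gets randomized, which is what makes
        batch variation testing cheap."""
        if seed is None:
            seed = random.randrange(2**32)
        return template.format(prompt=prompt, seed=seed, CFG=cfg)

    # One template, many variations — no cloned nodes needed.
    batch = [
        expand("a portrait of {prompt}, seed={seed}, cfg={CFG}",
               prompt="a knight", seed=s)
        for s in range(3)
    ]
    ```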

    Update now. The nodes are alive. 🎮✨

    🔗 View Release

  • Ollama – v0.14.3-rc2

    🚀 Ollama v0.14.3-rc2 just dropped — and it’s a quiet hero for your RAM!

    💥 Bug squashed: image generation models no longer get loaded into memory during model deletion. They now stay out of your way until you actually call them.

    🧠 Why it rocks:

    • Less RAM bloat = faster model swaps
    • Smoother performance on laptops & tiny servers
    • Cleaner shutdowns + smarter cleanup of unused vision models

    Perfect if you’re juggling multimodal AI or running vision models in prod. Still a release candidate, but solid — keep those GPUs cool and your memory free! 🖥️✨
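    The fix boils down to a classic lazy-loading pattern: keep metadata around, load weights only on first use, and let deletion skip the load entirely. A minimal sketch of that idea — illustrative only, not Ollama's code:

    ```python
    class LazyModel:
        """Hold only metadata until inference actually happens, so that
        operations like delete never pay the cost of loading weights."""

        def __init__(self, name, loader):
            self.name = name
            self._loader = loader    # invoked only on first generate()
            self._weights = None

        @property
        def loaded(self):
            return self._weights is not None

        def generate(self, prompt):
            if self._weights is None:   # first real use triggers the load
                self._weights = self._loader()
            return f"{self.name}({prompt})"

        def delete(self):
            # Removing manifests/blobs needs no weights in memory at all.
            self._weights = None
            self._loader = None
    ```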

    🔗 View Release

  • MLX-LM – v0.30.4

    MLX LM v0.30.4 just dropped and it’s a beast 🚀

    • AWQ/GPTQ weight transforms now live — convert quantized models in one line.
    • Nemotron Super 49B v1.5 and GLM4 MoE Lite added — big brains, bigger performance on Apple silicon.
    • Batch generation? Fixed. MambaCache, CacheList, IQuestLoopCoder — all smoothed out.
    • New continuous batching server benchmark — measure your throughput like a pro.
    • LongCat Flash now supports sharding + extended context — longer prompts, zero headaches.
    • GPT-OSS & Minimax tensor sharding — distributed inference just got way easier.
    • SwiGLU compiled, Falcon H1 embeddings fixed, tokenizer errors now warn instead of crash.
    • Huge shoutout to new contributors: Eric, Nikhil, Solarpunkin, Evanev7 & Andrew! 🎉

    All powered by the latest MLX + smarter caching. Upgrade, benchmark, and go build something wild.
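    That "warn instead of crash" tokenizer change follows a familiar degrade-gracefully pattern — here's a generic sketch (not MLX-LM's actual code; the config key and default below are hypothetical):

    ```python
    import warnings

    def load_tokenizer_config(raw):
        """On a malformed tokenizer config, emit a warning and fall back
        to a usable default instead of raising and killing the run."""
        try:
            eos = raw["eos_token"]
        except (KeyError, TypeError):
            warnings.warn("tokenizer config missing eos_token; using default")
            eos = "</s>"
        return {"eos_token": eos}
    ```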

    🔗 View Release

  • MLX-LM – v0.30.3

    MLX LM v0.30.3 just dropped and it’s a beast 🚀

    • AWQ & GPTQ quantization now fully supported — load quantized models like it’s nothing.
    • New models: IQuest Coder V1 Loop (code gen on steroids) + GLM4 MoE Lite (lightweight but mighty).
    • Nemotron Super 49B v1.5 and Falcon H1 with tied embeddings & muP scaling — optimized for peak performance.
    • Batching got a massive overhaul: sliding window + cache handling fixed, `CacheList`/`ArraysCache` now batchable, empty caches? Handled.
    • First-ever server benchmark for continuous batching — real-world throughput numbers, not just synthetic microbenchmarks.
    • LongCat Flash now sharded + extended context — generate longer texts without choking.
    • Minimax tensor sharding + GPT-OSS sharding — scale your models smarter, not harder.
    • SwiGLU fixed, tokenizer errors now use `warnings`, MLX updated to latest — all the polish you didn’t know you needed.

    Massive thanks to @ericcurtin, @nikhilmitrax, @tibbes, @solarpunkin, @AndrewTan517, and @Evanev7 for the wins!

    Update. Run. Build something wild. 🤖💻

    🔗 View Release

  • Ollama – v0.14.3-rc1: MLX – dynamic loading of mlx-c (#13735)

    🚀 Ollama v0.14.3-rc1 just dropped — and it’s a game-changer for Mac & Linux tinkerers!

    MLX is now dynamically loaded via `dlopen` — meaning:

    ✅ Ollama starts even if MLX isn’t installed

    ✅ Swap MLX paths on the fly (perfect for custom builds or multi-env setups)

    ✅ Graceful fallbacks — no more crashing if dependencies are missing

    No more “why won’t it start?!” headaches. Just pure, flexible local LLM power.

    Perfect if you’re running M-series Macs or Linux with custom CUDA/MLX builds.

    Tests fixed, reviews addressed — clean, stable, and ready for your next experiment.

    Try it out. If MLX isn’t there? Ollama just shrugs… and keeps going. 😎
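    That dlopen-and-fall-back pattern is easy to sketch with Python's `ctypes` — an analogue of the idea, not Ollama's actual implementation, and the library name below is hypothetical:

    ```python
    import ctypes

    def load_optional(libname):
        """dlopen-style optional dependency: return a handle if the
        shared library is present, None otherwise — so startup never
        fails just because an accelerator library is missing."""
        try:
            return ctypes.CDLL(libname)
        except OSError:
            return None

    mlx = load_optional("libmlxc.so")   # hypothetical library name
    if mlx is None:
        # Graceful fallback: keep running without MLX acceleration.
        pass
    ```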

    🔗 View Release

  • Ollama – v0.14.3-rc0

    🚀 Ollama v0.14.3-rc0 just dropped — and macOS users, this one’s for you!

    No more ghost processes after rebooting. The Ollama app now properly shuts down during logouts and restarts — clean, quiet, and respectful of your system’s power management. 🍎💤

    Under the hood:

    • Smoother background cleanup
    • Better memory & resource handling on shutdown
    • Minor stability tweaks (zero breaking changes)

    This is a release candidate — stable, tested, and perfect for Mac folks tired of unresponsive apps after a reboot.

    Grab it if you run Ollama locally and value a clean, hassle-free experience. 🛠️✨

    🔗 View Release