Ollama – v0.22.0
Ollama Update Alert!
If you're running your local LLMs on Apple Silicon, listen up! The latest release (v0.22.0-rc1) is officially here, and it's bringing some massive performance optimizations via an MLX update. This is a huge deal for anyone trying to squeeze every bit of juice out of their Mac hardware.
Here's the breakdown of what's new:
- Batch Processing Power: The `mlxrunner` now supports batching the sampler across multiple sequences. If you're working with large datasets or need to generate multiple outputs at once, this is a massive efficiency win!
- NVIDIA & MLX Bridge: In a super cool move for cross-platform workflows, MLX now supports importing models optimized via NVIDIA TensorRT. This makes it way easier to move your heavy-duty workflows between NVIDIA and Apple hardware without the headache.
- Precision Tokenization: A bug fix for multi-regex BPE offset handling is included, ensuring your tokenization stays precise and error-free during complex text processing tasks.
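Since the sampler batching happens server-side, the client-side win is simply fanning out concurrent requests. Here's a minimal sketch assuming Ollama's standard `/api/generate` HTTP endpoint and its documented payload fields (`model`, `prompt`, `stream`); the model name is a placeholder, and `send()` is stubbed so the sketch runs standalone:

```python
import json
from concurrent.futures import ThreadPoolExecutor

# The sampler batching lives in the server; clients just issue requests
# concurrently. In a real client each payload would be POSTed to
# OLLAMA_URL (e.g. with urllib or requests); send() is stubbed here so
# the sketch is self-contained.
OLLAMA_URL = "http://localhost:11434/api/generate"

def make_payload(prompt, model="llama3"):  # model name is a placeholder
    return json.dumps({"model": model, "prompt": prompt, "stream": False})

def send(payload):
    # Stub: pretend the server answered; a real client would POST
    # `payload` to OLLAMA_URL and decode the JSON response.
    return {"done": True, "request": json.loads(payload)}

prompts = ["Summarize MLX.", "Summarize batching.", "Summarize BPE."]
with ThreadPoolExecutor(max_workers=len(prompts)) as pool:
    results = list(pool.map(send, map(make_payload, prompts)))

for r in results:
    print(r["request"]["prompt"], "->", r["done"])
```

The heavy lifting (batching the sampler across sequences) is invisible from this side; the point is that multiple in-flight generations are now where the efficiency shows up.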
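To see why multi-regex offset handling is easy to get wrong, here's an illustrative sketch (not Ollama's actual code): when a second regex pass runs over a slice produced by the first, its match offsets are relative to the slice and must be shifted back into original-string coordinates, or downstream token offsets silently drift:

```python
import re

# Illustrative only: BPE-style pre-tokenization often runs more than
# one regex pass. The inner pattern's offsets are relative to the
# chunk it matched in, so they must be shifted by the chunk's start
# to index into the original string.
WORDS = re.compile(r"\S+")
DIGITS = re.compile(r"\d+")

def digit_spans(text):
    spans = []
    for w in WORDS.finditer(text):
        for d in DIGITS.finditer(w.group()):
            # shift slice-relative offsets back by the chunk's start
            spans.append((w.start() + d.start(), w.start() + d.end()))
    return spans

text = "mlx 0.22 beats 0.21"
print(digit_spans(text))  # each span indexes into `text` itself
```

Forgetting the `w.start()` shift is exactly the class of bug this kind of fix addresses: the tokens look right, but their reported positions point at the wrong characters.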
Time to pull that update and start benchmarking!
