Lemonade – v10.5.1
🍋 Lemonade SDK v10.5.1 is officially here!
If you’re obsessed with running high-performance LLMs locally without relying on the cloud, this update is a must-have for your toolkit. Lemonade is all about squeezing every bit of power out of your hardware—specifically leveraging NPUs and GPUs (via Vulkan) to make local inference snappy and responsive.
This latest release focuses on keeping your backend integrations rock-solid as the underlying engines evolve:
- llama.cpp Upgrade: The SDK now supports `llama.cpp` build b9213. If you rely on high-performance C++ inference for your GGUF or ONNX models, this is a huge win for stability and speed! 🚀
- AMD/ROCm Optimizations: For my fellow AMD enthusiasts pushing the limits of Ryzen AI or Radeon hardware, we’ve got critical compatibility updates:
- `rocm-stable` has been bumped to build b9211.
- `rocm-nightly` has been updated to build b127.
Time to pull the latest version and keep those local models running smooth! 🛠️
