Lemonade – v9.0.6

Written by

Lemonade – v9.0.6

🚀 Lemonade v9.0.6 just dropped — and it’s a game-changer for local LLM folks!

Now you can load multiple models at once — LLMs, embeddings, and rerankers — all running in parallel. No more restarting to switch contexts. 🤖🧠

✨ New goodies:

Run concurrent requests across models → smoother, faster workflows
Linux logs? Less spam. More chill. 🐧
`run` command now works even if the server’s already up — no more “port in use” headaches
Selective tray unloading keeps RAM sane (bye-bye, memory bloat!)
Better docs + venv testing + more robust system info

Try the live demo: open `examples/demos/multi-model-tester.html` in your browser and juggle 3 models like a pro.

Perfect for devs running RAG pipelines, local agents, or just tinkering with multiple models side-by-side.

Full changelog: [v9.0.5…v9.0.6](link)

More posts