Lemonade – v9.0.6

Lemonade – v9.0.6

πŸš€ Lemonade v9.0.6 just dropped β€” and it’s a game-changer for local LLM folks!

Now you can load multiple models at once β€” LLMs, embeddings, and rerankers β€” all running in parallel. No more restarting to switch contexts. πŸ€–πŸ§ 

✨ New goodies:

  • Run concurrent requests across models β†’ smoother, faster workflows
  • Linux logs? Less spam. More chill. 🐧
  • `run` command now works even if the server’s already up β€” no more “port in use” headaches
  • Selective tray unloading keeps RAM sane (bye-bye, memory bloat!)
  • Better docs + venv testing + more robust system info

Try the live demo: open `examples/demos/multi-model-tester.html` in your browser and juggle 3 models like a pro.

Perfect for devs running RAG pipelines, local agents, or just tinkering with multiple models side-by-side.

Full changelog: [v9.0.5…v9.0.6](link)

πŸ”— View Release