Lemonade – v9.0.6
π Lemonade v9.0.6 just dropped β and itβs a game-changer for local LLM folks!
Now you can load multiple models at once β LLMs, embeddings, and rerankers β all running in parallel. No more restarting to switch contexts. π€π§
β¨ New goodies:
- Run concurrent requests across models β smoother, faster workflows
- Linux logs? Less spam. More chill. π§
- `run` command now works even if the serverβs already up β no more “port in use” headaches
- Selective tray unloading keeps RAM sane (bye-bye, memory bloat!)
- Better docs + venv testing + more robust system info
Try the live demo: open `examples/demos/multi-model-tester.html` in your browser and juggle 3 models like a pro.
Perfect for devs running RAG pipelines, local agents, or just tinkering with multiple models side-by-side.
Full changelog: [v9.0.5…v9.0.6](link)
