Text Generation WebUI – v3.21
Text Generation WebUI v3.21 just dropped, and it's lighter, faster, smarter!
The portable builds are now leaner: the bulky llama.cpp symlinks (a workaround for Python .whl quirks) no longer ship in the archive. They are recreated automatically on first launch: clean, efficient, zero hassle.
Backend upgrades galore:
- llama.cpp – updated to the latest ggml-org commit (5c8a717): smoother inference, fewer crashes
- ExLlamaV3 v0.0.18 – better quantization and smarter memory use
- safetensors v0.7 – faster load times, tighter security
- triton-windows 3.5.1.post22 – smoother CUDA ops on Windows
Portable builds now come in 4 flavors:
- `cuda12.4` (NVIDIA GPUs)
- `vulkan` (AMD/Intel GPUs)
- `cpu` (no GPU? no problem)
- `macos-arm64` (Apple Silicon)
Updating? Just unzip the new build and carry over only your `user_data/` folder. All your models, settings, and themes stay untouched. No reconfiguring. No stress.
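The update flow can be sketched in a couple of shell commands. The directory names below are placeholders, not the real archive names; the unzip step is shown as a comment since the actual filename depends on which flavor you downloaded:

```shell
# 1. Unzip the new build next to the old one, e.g.:
#    unzip textgen-portable-3.21-cuda12.4.zip -d textgen-3.21
# 2. Carry your data over; everything else stays stock:
mkdir -p textgen-old/user_data textgen-3.21   # stand-ins for the old and new installs
cp -r textgen-old/user_data textgen-3.21/     # models, settings, themes move as one unit
```

Since `user_data/` is the only folder you own, nothing else from the old install needs to survive the swap.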
Perfect for tinkerers who want power without the install drama. Grab it, unzip, and start generating.
