Text Generation Webui – v4.3.2
text-generation-webui v4.3.2 is officially live! π This Gradio-based powerhouse is the go-to interface for running LLMs locally, and this update brings some serious heavy-hitting performance boosts and expanded model support for all you tinkerers out there.
Here is the breakdown of whatβs new in this release:
Core Model & Backend Upgrades
- Gemma 4 Support: You can now run Gemma 4 with full tool-calling capabilities enabled in both the API and the UI. π
- New `ik_llama.cpp` Backend: A massive addition for performance enthusiasts! This backend offers superior KV cache quantization using Hadamard rotation, better optimizations for MoE models, and improved CPU inference.
- Transformers Enhancements: The engine now auto-detects `torch_dtype` from model configs rather than forcing half-precision, making the model loading process much smarter.
API & UI Improvements
- Enhanced Completions API: The `/v1/completions` endpoint now supports `echo` and `logprobs`, allowing you to see token-level probabilities and IDs. π
- Snappier Interface: A custom Gradio fork has been optimized to save up to 50ms per UI event, making button clicks and transitions feel much smoother.
- Smarter Templates: Instruction templates are now detected via model metadata instead of relying on old filename patterns.
Security & Stability Fixes
- Hardened Security: Fixed an ACL bypass in the Gradio fork for Windows/macOS and added server-side validation for various input groups like Dropdowns and Radio buttons. π‘οΈ
- SSRF Protection: Added URL validation to `superbooga` extensions to block requests to private or internal networks.
- Bug Squashing: Resolved several critical issues, including crashes related to Gemma 4 templates in llama.cpp and loading failures for Qwen3.5 MoE models.
Portable Builds & Updates
New self-contained packages are available for Windows, Linux, Mac, and various GPU architectures (NVIDIA CUDA, AMD Vulkan/ROCm, and Intel). If you’re using the portable version, updating is easier than everβyou can now use a shared `user_data` folder across multiple installs! π
