Text Generation Webui – v4.3.2

Text Generation Webui – v4.3.2

text-generation-webui v4.3.2 is officially live! πŸš€ This Gradio-based powerhouse is the go-to interface for running LLMs locally, and this update brings some serious heavy-hitting performance boosts and expanded model support for all you tinkerers out there.

Here is the breakdown of what’s new in this release:

Core Model & Backend Upgrades

  • Gemma 4 Support: You can now run Gemma 4 with full tool-calling capabilities enabled in both the API and the UI. πŸ†•
  • New `ik_llama.cpp` Backend: A massive addition for performance enthusiasts! This backend offers superior KV cache quantization using Hadamard rotation, better optimizations for MoE models, and improved CPU inference.
  • Transformers Enhancements: The engine now auto-detects `torch_dtype` from model configs rather than forcing half-precision, making the model loading process much smarter.

API & UI Improvements

  • Enhanced Completions API: The `/v1/completions` endpoint now supports `echo` and `logprobs`, allowing you to see token-level probabilities and IDs. πŸ“Š
  • Snappier Interface: A custom Gradio fork has been optimized to save up to 50ms per UI event, making button clicks and transitions feel much smoother.
  • Smarter Templates: Instruction templates are now detected via model metadata instead of relying on old filename patterns.

Security & Stability Fixes

  • Hardened Security: Fixed an ACL bypass in the Gradio fork for Windows/macOS and added server-side validation for various input groups like Dropdowns and Radio buttons. πŸ›‘οΈ
  • SSRF Protection: Added URL validation to `superbooga` extensions to block requests to private or internal networks.
  • Bug Squashing: Resolved several critical issues, including crashes related to Gemma 4 templates in llama.cpp and loading failures for Qwen3.5 MoE models.

Portable Builds & Updates

New self-contained packages are available for Windows, Linux, Mac, and various GPU architectures (NVIDIA CUDA, AMD Vulkan/ROCm, and Intel). If you’re using the portable version, updating is easier than everβ€”you can now use a shared `user_data` folder across multiple installs! πŸ“‚

πŸ”— View Release