Text Generation Webui – v4.3.3 – Gemma 4 support!
text-generation-webui just dropped a massive update! If you’re looking for the “AUTOMATIC1111” experience for local LLMs, this Gradio-based powerhouse is now even more capable and snappy. 🚀
Here is the breakdown of what’s new in this release:
🔧 New Model & Backend Support
- Gemma 4 Integration: Full support is officially live! You can now run Gemma 4 with full tool-calling capabilities via both the UI and the API.
- ik_llama.cpp Backend: A brand new backend option has arrived, offering much more accurate KV cache quantization (via Hadamard rotation) and specialized optimizations for MoE models and CPU inference.
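To see what tool calling looks like against the webui's OpenAI-compatible API, here is a minimal sketch. The endpoint URL, port, model name, and the `get_weather` tool are all assumptions for illustration; adjust them to your local setup, and note that actually POSTing requires the server to be running with the API enabled.

```python
# Hedged sketch: a tool-calling request to the local OpenAI-compatible
# endpoint. URL, port, model name, and the tool itself are assumptions.
import json
import urllib.request

API_URL = "http://127.0.0.1:5000/v1/chat/completions"  # assumed default

payload = {
    "model": "gemma-4",  # placeholder; use whatever model you have loaded
    "messages": [
        {"role": "user", "content": "What's the weather in Paris?"}
    ],
    "tools": [{
        "type": "function",
        "function": {
            "name": "get_weather",  # hypothetical tool for this example
            "description": "Look up current weather for a city",
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        },
    }],
}

def post_chat(url: str = API_URL) -> dict:
    """POST the payload; only works with the local server running."""
    req = urllib.request.Request(
        url,
        data=json.dumps(payload).encode(),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)
```

If the model decides to call the tool, the response's first choice should carry a `tool_calls` entry instead of plain text, which your client then executes and feeds back as a `tool` message.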
🛠️ API & Transformer Enhancements
- Enhanced Completions: The `/v1/completions` endpoint now supports `echo` and `logprobs`, giving you deep visibility into token-level probabilities.
- Smarter Model Loading: The system now auto-detects `torch_dtype` from model configs, offering far more flexibility than the previous forced half-precision default.
- Metadata-Driven Templates: Instruction templates are now intelligently detected via model metadata instead of relying on filename patterns.
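As a concrete illustration of the new `echo`/`logprobs` support, here is a hedged sketch of a `/v1/completions` request payload plus a tiny helper for pulling tokens out of the response. The field names follow the OpenAI completions schema that the endpoint mirrors; the specific values are assumptions.

```python
# Hedged sketch: request token-level log-probabilities, with the prompt
# echoed back, from the /v1/completions endpoint. Values are examples.

payload = {
    "prompt": "The capital of France is",
    "max_tokens": 5,
    "echo": True,    # include the prompt's own tokens in the response
    "logprobs": 5,   # return the top-5 log-probabilities per token
}

def completion_tokens(response: dict) -> list[str]:
    """Extract the token strings from a completions-style response."""
    return response["choices"][0]["logprobs"]["tokens"]
```

With `echo` on, the returned token and logprob lists cover the prompt as well as the generated continuation, which is handy for scoring a fixed string under the model.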
⚡ Performance & UI Polish
- Snappier Interface: A custom Gradio fork has been tuned to save up to 50ms per UI event, making the whole experience feel much more responsive.
- Critical Bug Fixes: Resolved several issues including dropdown crashes, API parsing errors for non-dict JSON tool calls, and `llama.cpp` template parsing bugs.
🛡️ Security & Stability
- Hardened Protections: Implemented ACL/SSRF fixes for extensions, patched path-matching bypasses on Windows/macOS, and added filename sanitization to prevent manipulation during prompt file operations.
📦 Portable Build Upgrades
New self-contained packages are available for NVIDIA, AMD, Intel, Apple Silicon, and CPU users! Pro tip: You can now move your `user_data` folder one level up to easily share settings across multiple version installs. 🛠️
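The folder move in that tip can be sketched as a tiny script. This is purely illustrative: the helper name is made up, and it assumes sibling per-version install directories (e.g. `v4.3.2/`, `v4.3.3/`) that would then share the promoted folder.

```python
# Hedged sketch: promote user_data one level up so sibling installs of
# different versions can share it. Helper name and layout are assumptions.
import shutil
from pathlib import Path

def promote_user_data(install_dir: Path) -> Path:
    """Move install_dir/user_data into install_dir's parent directory."""
    src = install_dir / "user_data"
    dst = install_dir.parent / "user_data"
    if src.is_dir() and not dst.exists():
        shutil.move(str(src), str(dst))
    return dst
```

Run it once against your current install directory before unpacking the next version alongside it; back up `user_data` first, since this moves the live folder.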
