Text Generation Webui – v4.6.2

text-generation-webui v4.6.2 is officially live, and it’s bringing some massive quality-of-life upgrades for your local LLM playground! 🚀 If you’ve been looking for more control over agentic workflows or better context management, this is the update you’ve been waiting for.

Tool Call Control & MCP Support

  • Manual Approval: No more rogue tool calls! You can now toggle “Confirm tool calls” in the Chat tab to manually approve or reject actions with inline buttons. 🛡️
  • Stdio MCP Servers: Huge win for interoperability! You can now configure local subprocess-based MCP servers via `mcp.json`, making it much easier to sync your setup with Claude Desktop or Cursor.
  • Performance Boost: Tool discovery is now cached, so you won’t be re-querying servers every single time a generation runs.
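
For a rough idea of what a stdio server entry could look like, here is a minimal Python sketch that builds and writes an `mcp.json`. It assumes the file follows the `mcpServers` layout used by Claude Desktop and Cursor, which the release note implies but does not spell out. The server name `filesystem`, the `npx` command, and the helper names are illustrative placeholders, not part of the release.

```python
import json
from pathlib import Path


def build_mcp_config(servers):
    """Return a config dict in the Claude Desktop style `mcpServers` layout.

    `servers` maps a server name to (command, args). The layout is an
    assumption; check the project's documentation for the exact schema.
    """
    return {
        "mcpServers": {
            name: {"command": command, "args": list(args)}
            for name, (command, args) in servers.items()
        }
    }


def write_mcp_config(path, servers):
    """Serialize the config to `path` as pretty-printed JSON."""
    config = build_mcp_config(servers)
    Path(path).write_text(json.dumps(config, indent=2), encoding="utf-8")
    return config


# Hypothetical example entry: a filesystem server launched as a subprocess.
EXAMPLE_SERVERS = {
    "filesystem": ("npx", ["-y", "@modelcontextprotocol/server-filesystem", "/tmp"]),
}
```

Calling `write_mcp_config("mcp.json", EXAMPLE_SERVERS)` would produce a file you could compare against an existing Claude Desktop config before pointing the web UI at it.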

Enhanced Reasoning & Context Management

  • Preserve Thinking: A new `--preserve-thinking` flag (and UI checkbox) lets you decide if thinking blocks from previous turns stay in your context window. 🧠
  • Smart UI: The “Reasoning effort” and “Enable thinking” controls now only appear for models that actually support them, keeping your interface clutter-free.
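
To make the preserve-thinking trade-off concrete, here is a small sketch of how a chat history might be prepared with and without earlier thinking blocks. It is not the project's implementation: the `<think>...</think>` delimiters, the message format, and the `build_context` helper are all assumptions made for illustration.

```python
import re

# Assumption: thinking blocks are delimited by <think>...</think> tags.
THINK_BLOCK = re.compile(r"<think>.*?</think>\s*", re.DOTALL)


def build_context(history, preserve_thinking=False):
    """Return a copy of `history` ready to send as context.

    `history` is a list of {"role": ..., "content": ...} dicts. When
    `preserve_thinking` is False, thinking blocks are stripped from
    assistant messages; the original list is never modified.
    """
    if preserve_thinking:
        return [dict(message) for message in history]

    cleaned = []
    for message in history:
        message = dict(message)
        if message["role"] == "assistant":
            message["content"] = THINK_BLOCK.sub("", message["content"]).strip()
        cleaned.append(message)
    return cleaned
```

Stripping keeps the context window smaller, while preserving lets the model see its earlier reasoning, so the right setting likely depends on the model and the task.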

UI & UX Overhaul

  • Persistent Sidebars: Sidebars now toggle independently and remember their state even after a page refresh.
  • Visual Polishing: Improved light mode borders, fixed code block copy buttons, and better spacing in the past chats menu.

Under the Hood & Security

  • Security Patch: Fixed SSRF vulnerabilities in URL fetching to keep your local environment safer during web-based tasks.
  • llama.cpp Updates: Includes new defaults for speculative decoding (`--draft-min 48`) and updated dependencies for `ik_llama.cpp` and `ExLlamaV3`.
  • New Portable Builds: Self-contained packages are available for Windows, Linux, and macOS (Apple Silicon/Intel), covering everything from CUDA 13.1 to CPU-only setups.
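
If you are wondering what an SSRF guard for URL fetching typically involves, the sketch below shows one common approach: resolve the host and refuse anything that is not a public address. This is a generic illustration, not the actual patch shipped in this release; `is_safe_url` and `resolve_host` are hypothetical names.

```python
import ipaddress
import socket
from urllib.parse import urlparse


def resolve_host(host):
    """Resolve a hostname to the set of IP address strings it maps to."""
    infos = socket.getaddrinfo(host, None)
    return {info[4][0] for info in infos}


def is_safe_url(url, resolver=resolve_host):
    """Return True only for http(s) URLs whose host resolves to public IPs."""
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https") or not parsed.hostname:
        return False
    try:
        addresses = resolver(parsed.hostname)
    except OSError:
        # Resolution failed, so there is nothing safe to fetch.
        return False
    if not addresses:
        return False
    for address in addresses:
        # Drop any IPv6 zone suffix such as "%eth0" before parsing.
        ip = ipaddress.ip_address(address.split("%")[0])
        if not ip.is_global:
            # Loopback, private, and link-local ranges are all rejected.
            return False
    return True
```

The `resolver` parameter exists so the check can be exercised without network access. A check like this does not by itself cover redirects or DNS changes between the check and the fetch, so a real fix usually validates those as well.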

Pro Tip: If you’re updating a portable install, just swap your `user_data` folder into the new version to keep all your models and settings exactly where they belong! 🛠️
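
If you would rather script that folder swap, here is a minimal sketch. It assumes both installs keep `user_data` at the top level of the install directory; `migrate_user_data` is a hypothetical helper, and it copies rather than moves so the old install stays intact as a backup.

```python
import shutil
from pathlib import Path


def migrate_user_data(old_install, new_install):
    """Copy `user_data` from an old portable install into a new one."""
    source = Path(old_install) / "user_data"
    target = Path(new_install) / "user_data"
    if not source.is_dir():
        raise FileNotFoundError(f"No user_data folder found at {source}")
    # dirs_exist_ok (Python 3.8+) merges into the target if it already exists.
    shutil.copytree(source, target, dirs_exist_ok=True)
    return target
```

Because the copy merges into an existing folder, files with the same name in the new install are overwritten by your old ones.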
