• Deep-Live-Cam – 2.7-RC1

    Deep-Live-Cam – 2.7-RC1

    🚀 Deep-Live-Cam 2.7-RC1 Update Alert! 🚀

    If you’ve been looking for ways to level up your real-time face swapping and live streaming game, this release is a massive leap forward in speed and hardware integration. Whether you’re tinkering with local GPUs or cloud-based diffusion, there is some serious magic happening here.

    New Live Modes & Engines:

    • Decart Live: You can now stream your webcam directly through Decart’s cloud GPUs using real-time diffusion. Just plug in your API key and choose between the Lucy 2.0 or 2.1 variants! ☁️

    FLUX Live (prompt-edit): A powerhouse feature for high-end rigs! This enables real-time, prompt-driven face editing at a smooth 30fps using FluxRT. Warning: You’ll need an RTX 5090 or RTX 6000-class GPU to handle this beast.*

    • ReSwapper: A brand new face-swap engine has joined the lineup alongside Inswapper and Hyperswap, giving you more options for your swaps.

    UI/UX & Hardware Perks:

    • RTX Upscaler: If you have compatible hardware, a new toggle in settings lets you use RTX power to upscale your output beyond the source resolution. 💎
    • View Toggles: You can now switch between “Classic” and “Modern” UI layouts with a single click from the status bar.
    • Cleaner Live View: The floating player overlay now hides automatically during live streams, giving you a much cleaner viewport for focusing on your content.

    Global Accessibility & Polish:

    • Full Localization: The interface is officially multilingual! You can switch between English, Chinese, Japanese, French, and Arabic in the settings (just remember to restart the app to apply changes). 🌍
    • Improved Window Projection: Resizing during live sessions is now automatic, and a new dedicated close button has been added to the overlay for easier management.

    All your favorite tools like RIFE Interpolation, Lip Mask, and Face Enhancer are still baked in and ready to go! 🛠️

    🔗 View Release

  • Text Generation Webui – v4.9

    Text Generation Webui – v4.9

    🚀 Major Update Alert: text-generation-webui v4.9 is here!

    If you’ve been looking for the “AUTOMATIC1111” experience for your local LLMs, this update is a massive win for efficiency and workflow smoothness. We’re seeing some serious upgrades to how the UI handles web data and inference speed! 🛠️

    Smart Web Search Enhancements

    • Snippet Support: The `web_search` tool now pulls text excerpts directly from search results. Your model can grab answers without the heavy lifting of parsing entire pages!
    • Token Efficiency: The `fetch_webpage` tool is much leaner now, stripping out raw URLs to keep your context window clean and focused on the actual content.
    • Polished UI: Enjoy a new loading spinner during searches and much prettier result rendering in your chat interface.

    Performance & Inference Tweaks

    • MTP Speculative Decoding: New support for `draft-mtpas` is live! It auto-enables when loading MTP GGUFs (like Qwen 3.6 MoE), which can significantly boost generation speeds.
    • Live Stats: Monitor your hardware in real-time with live tokens/s tracking and context size monitoring during generation.
    • Auto-mmproj Detection: No more manual hunting! The app now automatically detects and selects sibling `mmproj` files when you load a vision model.

    UI & Workflow Improvements

    • Drag-and-Drop: You can now drag files directly into the chat input for lightning-fast uploads. 📂
    • Refined Sidebar: A reorganized sidebar (Mode/Character/Chat) and hidden reasoning controls in simple mode mean much less clutter while you work.
    • Electron Upgrades: New “Check for updates” button, a dedicated model directory folder picker, and a handy right-click context menu for easy text copying.

    Security & Stability Fixes

    • Hardened Security: CORS is now restricted to `localhost` by default, and character name loading has been sanitized to prevent path traversal attacks. 🛡️
    • Windows Reliability: Fixed the bug where `llama-server` would hang after the parent process closed on Windows.
    • Dependency Refresh: The engine has been bumped up with the latest updates from `llama.cpp`, `ik_llama.cpp`, and `ExLlamaV3`.

    🔗 View Release

  • ComfyUI – v0.22.0

    ComfyUI – v0.22.0

    ComfyUI v0.22.0 is officially live! 🚀

    If you’re looking for ultimate control over your Stable Diffusion pipelines, ComfyUI remains the gold standard for node-based generative AI. It lets you stitch together complex workflows—from upscaling to inpainting—without writing a single line of code.

    This update marks a major milestone within the expanding Comfy-Org ecosystem! While the release focuses heavily on structural improvements, here is what’s driving this new era for your workflows:

    • Enhanced Workflow Stability: Expect much smoother interactions between nodes and smarter memory management, which is a lifesaver when you’re running massive, multi-stage generations. 🛠️
    • Ecosystem Scaling: As part of the official Comfy-Org push, there is a heavy focus on better integration with custom node managers and more streamlined update processes.
    • Performance Tweaks: Ongoing optimizations are rolling out to ensure faster sampling and more efficient execution across your hardware.

    Whether you’re running SDXL, ControlNet, or experimenting with LCM models, this version is all about making those complex graphs more reliable and scalable. 🎨

    🔗 View Release

  • Wyoming Openai – Python 3.13 support (0.4.4)

    Wyoming Openai – Python 3.13 support (0.4.4)

    Wyoming OpenAI (v0.4.4) 🚀

    If you’re building local voice assistants or tinkering with Home Assistant, this update is a massive win for efficiency and stability! This proxy middleware bridges the Wyoming protocol with OpenAI-compatible ASR and TTS services—allowing you to use heavy hitters like GPT-4o, Whisper, and Kokoro without direct API integration.

    What’s New:

    • Python 3.13 Support: The project is officially ready for the latest and greatest! Full support is now included across CI, Docker, and packaging metadata.
    • Smart STT/TTS Modes: No more wasted resources! If you aren’t using a specific mode (STT or TTS), the client now skips the unused side entirely instead of attempting backend autodetection. This makes startup much cleaner when you only need one direction of audio processing.

    Key Fixes & Improvements:

    • Fixed Stereo Streaming Timestamps: No more “fast-forward” audio! A bug that caused timestamps to advance too quickly during stereo TTS streaming has been squashed by calculating timing based on PCM frame count. 🛠️
    • Robust WAV Header Handling: Fixed an issue where fragmented WAV headers in streaming TTS could leak bytes or cause incorrect audio starts. Now, chunks are buffered until a complete header is ready to be parsed.
    • OpenAI Compatibility Patch: A lifesaver for local devs! Recent changes in the `openai` library (v2.34+) were breaking keyless local backends like Speaches and LocalAI. The client now bypasses credential enforcement to ensure your local, keyless setups keep running smoothly without needing fake bearer tokens.

    🔗 View Release

  • Ollama – v0.30.0-rc21

    Ollama – v0.30.0-rc21

    Ollama v0.30.0-rc21 🦬

    If you’re running LLMs locally, you know Ollama is the go-to for getting models like Llama 3 and DeepSeek-R1 up and running with zero friction. This latest release candidate is a targeted update focused on stability for our friends over on Windows! 🪟

    What’s new:

    • Windows Bug Fix: The primary focus of this release is an improvement to exit error logs on Windows. If you’ve been seeing messy or unhelpful logs when closing out sessions, this should make your debugging process much smoother.

    It’s a small but mighty update to keep those local environments running clean! 🛠️

    🔗 View Release

  • Lemonade – v10.5.1

    Lemonade – v10.5.1

    🍋 Lemonade SDK v10.5.1 is officially here!

    If you’re obsessed with running high-performance LLMs locally without relying on the cloud, this update is a must-have for your toolkit. Lemonade is all about squeezing every bit of power out of your hardware—specifically leveraging NPUs and GPUs (via Vulkan) to make local inference snappy and responsive.

    This latest release focuses on keeping your backend integrations rock-solid as the underlying engines evolve:

    • llama.cpp Upgrade: The SDK now supports `llama.cpp` build b9213. If you rely on high-performance C++ inference for your GGUF or ONNX models, this is a huge win for stability and speed! 🚀
    • AMD/ROCm Optimizations: For my fellow AMD enthusiasts pushing the limits of Ryzen AI or Radeon hardware, we’ve got critical compatibility updates:
    • `rocm-stable` has been bumped to build b9211.
    • `rocm-nightly` has been updated to build b127.

    Time to pull the latest version and keep those local models running smooth! 🛠️

    🔗 View Release

  • Ollama – v0.30.0-rc20

    Ollama – v0.30.0-rc20

    Ollama just dropped v0.30.0-rc20, and it looks like a quick but important fix for the AMD crowd! 🛠️

    If you aren’t using Ollama yet, it is the ultimate framework for running powerful LLMs like Llama 3, DeepSeek-R1, and Mistral locally on your own machine. It handles all the heavy lifting of downloading and configuring models so you can focus on building.

    What’s new in this release:

    • ROCm Build Fix: This update specifically addresses a “cache miss” issue encountered during the ROCm build process.
    • Better Stability for AMD Users: If you are running LLMs on AMD hardware, this patch helps ensure your build process is much more reliable and efficient.

    If you’ve been tinkering with local models on an AMD GPU, definitely grab this release candidate to keep your environment running smoothly! 🚀

    🔗 View Release

  • Ollama – v0.30.0-rc19

    Ollama – v0.30.0-rc19

    Ollama v0.30.0-rc19 is here! 🛠️

    If you’ve been looking for a way to run heavy-hitting models like Llama 3, DeepSeek-R1, or Phi-4 directly on your own hardware without relying on the cloud, Ollama remains the gold standard for local LLM orchestration. It handles all the heavy lifting of model downloading and configuration so you can get straight to prompting.

    This latest release candidate (rc19) is part of the ongoing push to refine the local inference experience. While we are still waiting on the full granular changelog for this specific RC, these incremental updates typically focus on:

    • Performance Tweaks: Optimization for smoother inference speeds across macOS, Windows, and Linux.
    • Stability Fixes: Polishing the REST API and CLI to ensure seamless integration with your custom dev workflows.
    • Model Compatibility: Refinements to GGUF support and memory management when swapping between large models.

    Keep an eye on this space—as soon as the full breakdown of these specific bug fixes hits the repo, I’ll let you know exactly what’s under the hood! 🥔✨

    🔗 View Release

  • Ollama – v0.30.0-rc18

    Ollama – v0.30.0-rc18

    Ollama v0.30.0-rc18 🛠️

    If you’re running local LLMs, you know Ollama is the go-to for getting models like Llama 3, DeepSeek-R1, and Mistral up and running on your own machine with zero friction. It’s a powerhouse for anyone wanting to experiment with open-source models privately and locally.

    This latest release candidate is a quick version bump, keeping the momentum going as the team polishes the latest features for the upcoming stable release!

    What’s new:

    • Version Bump: This update moves the needle to `v0.30.0-rc18`, focusing on stability and preparing the codebase for the next major milestone. 🚀

    Keep an eye on this one—RC updates are usually where we see the final bug squashing before a big rollout!

    🔗 View Release

  • Tater – Tater v80

    Tater – Tater v80

    🥔 Tater v80 — “Wake Word Wars” is here! 😏🎙️

    The Tater ecosystem just leveled up from a simple tool to a full-blown laboratory for voice activation! This update turns your local AI setup into a modular powerhouse, allowing you to mix and match different wake engines across all your satellites.

    What’s New:

    • Multi-Engine Support: You can now run `microWakeWord`, `openWakeWord`, and the brand new `NanoWakeWord` all within the same ecosystem. The best part? You can switch between them live via firmware entities without breaking a sweat. 🎙️
    • Dedicated NanoWakeWord Server: We’ve added a standalone server for NanoWakeWord! It includes Docker/NVIDIA Docker support and dedicated WebSocket streaming.

    Pro-tip:* Use this to stream audio from your satellites to a separate backend, offloading the heavy processing from your main Tater app! 🛰️

    • NanoWakeWord Trainer: For the true tinkerers, there is now a dedicated training flow. It features synthetic sample generation, validation handling, and seamless ONNX + PyTorch model syncing. You can build and deploy your own custom models directly into the ecosystem. 🏋️
    • openWakeWord Cleanup: We’ve dialed back the noise by making diagnostic detection logging optional and disabling it by default. Your logs will finally stop screaming at you! 🔧

    The Big Picture: 🧠

    Tater is officially modular. Whether you want local detection, remote detection, or a custom-trained model running on a dedicated server, you can now build a highly customized, distributed voice ecosystem. We didn’t just add an engine; we started a collection!

    🔗 View Release