A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice synthesis directly within your ComfyUI workflows.
Updated Feb 18, 2026 - Python
🔊 Kokoro Web: Free AI text-to-speech, online or self-hosted, OpenAI compatible!
ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio
AI Plugin is a powerful extension for the Payload CMS, integrating advanced AI capabilities to enhance content creation and management.
ComfyUI node for highly expressive speech and realistic zero-shot voice cloning
A multi-voice AI audiobook generator built on Qwen3-TTS — annotate scripts with an LLM, assign unique voices to each character, per-line style instructions for delivery, clone voices from reference audio, design new voices from text descriptions, train custom voices with LoRA fine-tuning, and export to MP3 or Audacity multi-track projects
EDUMCP is a protocol that integrates the Model Context Protocol (MCP) with applications in the education field, dedicated to achieving seamless interconnection and interoperability among different AI models, educational applications, smart hardware, and teaching AGENTs.
Beautiful voice app: record or upload to train a voice, generate speech from text or files, save & download voices.
Official AllVoiceLab Model Context Protocol (MCP) server, supporting interaction with powerful text-to-speech and video translation APIs.
✨ NovelAI API Python SDK, easy to use, modern and user-friendly.
🗣️🎤 elevenlabs-api is an open source Java wrapper around the ElevenLabs Voice Synthesis and Cloning Web API.
ComfyUI node for modular, human‑like Kani TTS. Generate natural, high‑quality speech from text
OpenAI TTS Compatible Ukrainian TTS StyleTTS2 Pipeline
TTSLab is THE place to easily test ANY text-to-speech model on your own PC at zero cost
AI generates conversational podcast for ANY research paper, vividly!
A Docker-based OpenAI-compatible Text-to-Speech API server powered by Kyutai's TTS models with GPU acceleration support.
Voice Alignment and Conversion with Neural Networks and the WORLD codec.
Local, portable GUI for Qwen3-TTS. Optimized for NVIDIA RTX 50 Series (CUDA 12.8). One-click install.
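EasyTTS is a convenient tool for calling OpenAI's text-to-speech (TTS) functionality through third-party API services. EasyTTS lets users enter text and choose among different models, voices, and formats to generate audio files.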