VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
-
Updated
Apr 10, 2026 - Python
VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
AI-powered multi-voice audiobook generator — LLM script annotation, voice cloning, voice design, LoRA training, per-line style control, and export to MP3, chaptered M4B, or Audacity multi-track. Built on Qwen3-TTS.
A programmable version of Neil Thapen's Pink Trombone
Open-source AI audiobook studio. A free, private alternative to ElevenLabs. 3 voice modes, per-sentence voice & emotion control, LLM smart character analysis, mixed-voice generation. Runs 100% locally on your GPU with zero API costs.
Qwen3-TTS Audiobook Studio: Ultimate local multi-role AI audiobook generator. Built-in 3s Voice Clone & Design. Portable one-click launch for Mac/Win. 极致本地 AI 有声书制作工坊。
🎙️ Qwen3-TTS-DubFlow: An open-source, human-in-the-loop AI dubbing workbench for novels, games, podcasts, and more. Features a "Design-then-Clone" workflow powered by Qwen3-TTS to achieve consistent identity and context-aware emotional performance.
Native macOS text-to-speech app powered by Qwen3-TTS and Apple Silicon (MLX). Voice cloning, voice design, and custom voices — all running locally.
🎤 Transform text into captivating audio dramas using AI-powered workflows for scripts, character voices, and high-quality audio synthesis.
🎤 Run Qwen3-TTS locally on Apple Silicon for offline AI text-to-speech, offering voice cloning and customization features without cloud reliance.
Fine-tuning toolkit for Maya1, a 3B-parameter open-source TTS model by Maya Research. Supports full fine-tuning with SNAC neural codec, natural language voice design, and inline emotion tags. Features YAML-based config, offline preprocessing, and automatic audio sample generation at every checkpoint.
Add a description, image, and links to the voice-design topic page so that developers can more easily learn about it.
To associate your repository with the voice-design topic, visit your repo's landing page and select "manage topics."