Qwen3-TTS-Studio

AI voice generator powered by Qwen3-TTS

Python

Delivery

Source-first

Browse code, README, and release notes on GitHub.

Primary lane

Python lane

The clearest adjacent context for this project inside the portfolio.

Freshness

May 18, 2026

Updated May 18, 2026

Latest release

No tag yet

README is the clearest project overview right now.

Source at github.com/SysAdminDoc/Qwen3-TTS-Studio.

README

Cached at build time, cleaned up for in-site reading, and linked back to the canonical GitHub source.

Contents

Qwen3-TTS Studio
Features
Requirements
Installation
Option 1: Download Release (Recommended)
Option 2: Clone Repository
First-Time Setup
Usage
Basic Text-to-Speech
Voices
Style Presets
Custom Style Instructions
File Locations
Troubleshooting
"Network error. Check your internet connection."
Generation is slow
Out of memory errors
Setup Wizard won't close
Configuration
Credits
License

Qwen3-TTS Studio

A professional, turnkey AI voice generator powered by Alibaba's Qwen3-TTS

Features

Zero Configuration — One-click setup installs everything automatically (Python, dependencies, models)
8 Built-in Voices — English, Chinese, Japanese, and Korean speakers
Style Control — Presets for Natural, Cheerful, Calm, Professional, and more
Custom Instructions — Fine-tune voice emotion and delivery with text prompts
Recording Library — Browse, play, and manage your generated audio files
Modern Dark UI — Clean, professional interface inspired by Spotify and Discord
GPU Accelerated — Automatic NVIDIA CUDA detection for faster generation
Offline Ready — Works completely offline after initial setup
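
The README does not show how the GPU detection works. A minimal sketch of one common approach, assuming the app relies on PyTorch's CUDA check (the `pick_device` helper is hypothetical and not taken from the project):

```python
import importlib.util


def pick_device() -> str:
    """Return "cuda" when PyTorch can see an NVIDIA GPU, otherwise "cpu"."""
    # PyTorch may not be installed yet (for example, before first-time setup),
    # so fall back to CPU instead of raising ImportError.
    if importlib.util.find_spec("torch") is None:
        return "cpu"
    import torch

    return "cuda" if torch.cuda.is_available() else "cpu"


print(f"Generating on: {pick_device()}")
```

On a machine without an NVIDIA GPU, or without PyTorch installed, this falls back to `"cpu"`, which matches the slower CPU mode listed under Requirements.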

Requirements

OS: Windows 10/11 (64-bit)
RAM: 8GB minimum, 16GB recommended
Storage: 15GB free space for models
GPU: NVIDIA GPU with 6GB+ VRAM recommended (CPU mode available but slower)

Note: No Python installation required! The app downloads and manages its own embedded Python environment.
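
To check a machine against these requirements before installing, something like the following could be used. It is a sketch using only the standard library; the function names are illustrative, and it covers the OS and storage requirements only (RAM and VRAM checks would need third-party packages).

```python
import platform
import shutil
from pathlib import Path

REQUIRED_FREE_GB = 15  # storage figure from the Requirements list


def free_space_gb(path: Path) -> float:
    """Free space on the drive holding `path`, in decimal gigabytes."""
    return shutil.disk_usage(path).free / 1e9


def meets_requirements(path: Path) -> dict:
    """Check the OS and storage requirements; `path` must already exist."""
    return {
        "windows": platform.system() == "Windows",
        # Reports "AMD64" or "ARM64" on 64-bit Windows.
        "64bit": platform.machine().endswith("64"),
        "storage": free_space_gb(path) >= REQUIRED_FREE_GB,
    }


print(meets_requirements(Path.home()))
```

Pass the drive where the models will be stored; by default that is the one holding `%LOCALAPPDATA%`.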

Installation

Option 1: Download Release (Recommended)

Download the latest release from the Releases page
Extract to any folder
Run qwen3_tts_studio.py

Option 2: Clone Repository

git clone https://github.com/SysAdminDoc/Qwen3-TTS-Studio.git
cd Qwen3-TTS-Studio
python qwen3_tts_studio.py

First-Time Setup

On first launch, the Setup Wizard will guide you through installation:

Python Environment — Downloads embedded Python 3.12 (~25MB)
AI Components — Installs PyTorch, Transformers, etc. (~3GB)
Voice Tokenizer — Required for all voices (~500MB)
Voice Model — Standard quality model (~7GB)

Click "Install Everything" and wait. Setup takes roughly 10-20 minutes, depending on your internet speed.
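
The four components add up to roughly 10.5 GB, so the 10-20 minute estimate corresponds to a connection of about 70-140 Mbps. The arithmetic can be sketched as follows, assuming decimal units and ignoring unpacking and installation time:

```python
# Component sizes in GB, taken from the setup list above.
COMPONENTS_GB = {
    "Python Environment": 0.025,
    "AI Components": 3.0,
    "Voice Tokenizer": 0.5,
    "Voice Model": 7.0,
}


def total_download_gb() -> float:
    """Sum of all component sizes in gigabytes."""
    return sum(COMPONENTS_GB.values())


def download_minutes(speed_mbps: float) -> float:
    """Minutes to download everything at `speed_mbps` megabits per second."""
    megabits = total_download_gb() * 8 * 1000  # GB -> gigabits -> megabits
    return megabits / speed_mbps / 60


for speed in (50, 100, 200):
    print(f"{speed} Mbps: about {download_minutes(speed):.0f} min")
```

At 100 Mbps the download alone comes to about 14 minutes. Slower connections will exceed the stated range.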

Usage

Basic Text-to-Speech

Type or paste text in the input box
Select a voice from the dropdown
Choose a style preset (or enter custom instructions)
Click "Generate Speech"
Audio plays automatically and saves to your library

Voices

Voice	Language	Description
Ryan	English	Dynamic male voice with strong rhythm
Aiden	English	Sunny American male accent
Vivian	Chinese	Bright, youthful female voice
Serena	Chinese	Warm and gentle female voice
Dylan	Chinese	Youthful Beijing male accent
Eric	Chinese	Lively Sichuan male accent
Anna	Japanese	Playful and expressive female
Sohee	Korean	Warm and friendly female

Style Presets

Natural — Clear, everyday speech
Cheerful — Enthusiastic and happy
Calm — Relaxed and soothing
Professional — Confident business tone
Excited — High energy delivery
Gentle — Soft and tender
News Anchor — Broadcast style
Storytelling — Narrative delivery
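
A preset and a custom instruction both end up as a text prompt describing the delivery. The sketch below is purely illustrative: it assumes a custom instruction takes priority over the selected preset and reuses the preset descriptions as prompt text. The project's actual prompt strings and its `resolve_instruction` equivalent (a hypothetical name) are not shown in the README.

```python
# Preset names and descriptions copied from the Style Presets list.
STYLE_PRESETS = {
    "Natural": "Clear, everyday speech",
    "Cheerful": "Enthusiastic and happy",
    "Calm": "Relaxed and soothing",
    "Professional": "Confident business tone",
    "Excited": "High energy delivery",
    "Gentle": "Soft and tender",
    "News Anchor": "Broadcast style",
    "Storytelling": "Narrative delivery",
}


def resolve_instruction(preset: str, custom: str = "") -> str:
    """Return the custom text when given, otherwise the preset's description."""
    custom = custom.strip()
    if custom:
        return custom  # assumption: custom text overrides the preset
    return STYLE_PRESETS[preset]


print(resolve_instruction("Calm"))
print(resolve_instruction("Calm", "Speak slowly with a mysterious tone"))
```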

Custom Style Instructions

For fine-grained control, enter custom instructions like:

"Speak slowly with a mysterious tone"
"Sound excited but slightly out of breath"
"Whisper softly as if telling a secret"

File Locations

Content	Location
Application Data	`%LOCALAPPDATA%\Qwen3-TTS\`
Voice Models	`%LOCALAPPDATA%\Qwen3-TTS\models\`
Recordings	`Documents\Qwen3-TTS Recordings\`
Configuration	`%LOCALAPPDATA%\Qwen3-TTS\studio_config.json`
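
To locate these folders from a script, the paths can be rebuilt from the `LOCALAPPDATA` environment variable. This is a sketch: the `studio_paths` helper is hypothetical, and it assumes the Documents folder sits directly under the user's home directory, which is not the case when Documents is redirected (for example to OneDrive).

```python
import os
from pathlib import Path


def studio_paths(local_appdata: str, home: Path) -> dict:
    """Build the locations from the table above."""
    app_data = Path(local_appdata) / "Qwen3-TTS"
    return {
        "app_data": app_data,
        "models": app_data / "models",
        "recordings": home / "Documents" / "Qwen3-TTS Recordings",
        "config": app_data / "studio_config.json",
    }


# LOCALAPPDATA is only set on Windows; elsewhere the paths come out relative.
paths = studio_paths(os.environ.get("LOCALAPPDATA", ""), Path.home())
for name, location in paths.items():
    print(f"{name}: {location}")
```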

Troubleshooting

"Network error. Check your internet connection."

Verify your internet connection is working
Try disabling your VPN if one is active
Check whether a firewall is blocking Python
The installer retries automatically 3 times
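
The installer's retry logic is not shown in the README. A generic retry loop of this kind might look like the sketch below, which assumes three attempts in total and a short, growing pause between them (`with_retries` is an illustrative name):

```python
import time


def with_retries(action, attempts=3, delay_seconds=1.0):
    """Call `action()` until it succeeds or `attempts` calls have failed."""
    for attempt in range(1, attempts + 1):
        try:
            return action()
        except OSError:
            # Standard-library network failures are OSError subclasses
            # (ConnectionError, TimeoutError, urllib.error.URLError).
            if attempt == attempts:
                raise  # out of attempts: surface the last error
            time.sleep(delay_seconds * attempt)  # wait a little longer each time
```

Wrapping a download call, for example `with_retries(lambda: urllib.request.urlretrieve(url, target))`, would re-run it after a dropped connection and raise only once the final attempt fails.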

Generation is slow

With GPU: The first generation loads the model (~1-2 min); subsequent generations are faster
Without GPU: CPU mode is significantly slower; consider using the lighter CustomVoice-0.6B model

Out of memory errors

Close other applications to free RAM
Use the lighter CustomVoice-0.6B model (Settings → Run Setup Wizard)
Reduce text length per generation
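
One way to reduce text length per generation is to split long input at sentence boundaries and generate each piece separately. The sketch below assumes a 300-character limit, which is an arbitrary illustrative value and not a limit documented by the app. It only recognizes `.`, `!`, and `?` as sentence endings, so Chinese, Japanese, and Korean punctuation would need extra handling.

```python
import re


def split_text(text: str, max_chars: int = 300) -> list[str]:
    """Group sentences into chunks of roughly `max_chars` characters.

    A single sentence longer than `max_chars` is kept whole as its own chunk.
    """
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        if not sentence:
            continue
        candidate = f"{current} {sentence}".strip()
        if len(candidate) <= max_chars or not current:
            current = candidate  # still fits, or first sentence of a chunk
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks


print(split_text("One. Two. Three.", max_chars=9))
```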

Setup Wizard won't close

Ensure all 4 checkmarks are green
Click "Launch Studio" or "Continue to Studio"
If stuck, close and reopen the application

Configuration

Settings are stored in studio_config.json:

{
  "selected_voice": "Ryan (English Male)",
  "selected_model": "CustomVoice-1.7B",
  "style_preset": "Natural",
  "volume": 0.8,
  "auto_play": true,
  "setup_complete": true
}
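
A script that reads this file should tolerate a missing or damaged config. The sketch below merges the stored values over defaults. The keys mirror the example above, but the `load_config` helper and the `setup_complete: False` default are assumptions, not the project's code.

```python
import json
from pathlib import Path

# Defaults mirror the example studio_config.json, except setup_complete,
# which is assumed to start as False on a fresh install.
DEFAULTS = {
    "selected_voice": "Ryan (English Male)",
    "selected_model": "CustomVoice-1.7B",
    "style_preset": "Natural",
    "volume": 0.8,
    "auto_play": True,
    "setup_complete": False,
}


def load_config(path: Path) -> dict:
    """Read the config file, falling back to defaults for anything missing."""
    try:
        stored = json.loads(path.read_text(encoding="utf-8"))
    except (OSError, json.JSONDecodeError):
        stored = {}  # missing or unreadable file: use defaults only
    if not isinstance(stored, dict):
        stored = {}
    return {**DEFAULTS, **stored}
```

Keys present in the file win over the defaults, so a config that only sets `"volume"` still yields a complete settings dictionary.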

Credits

Qwen3-TTS — Alibaba Qwen Team
Models — Hugging Face
UI Framework — CustomTkinter

License

This project is licensed under the MIT License; see the LICENSE file for details.

The underlying Qwen3-TTS models are licensed under Apache 2.0 by Alibaba Cloud.

Made with ❤️ for the AI voice community

Read on GitHub → github.com/SysAdminDoc/Qwen3-TTS-Studio
