Qwen3-TTS 1.7B

Name: Qwen3-TTS-12Hz-1.7B-CustomVoice
Author: 阿里巴巴

Audio modelQwen3

Qwen3-TTS-12Hz-1.7B-CustomVoice

Release date: 2026-01-22Updated: 2026-01-25 10:19:43750

Live demo GitHub Hugging Face Compare

Parameters

1.7B

Context length

Chinese support

Supported

Reasoning ability

Qwen3-TTS-12Hz-1.7B-CustomVoice is an AI model published by 阿里巴巴, released on 2026-01-22, for Audio model, with 1.7B parameters, and 8K context length, requiring about 3.4GB storage, under the Apache 2.0 license.

Data sourced primarily from official releases (GitHub, Hugging Face, papers), then benchmark leaderboards, then third-party evaluators. Learn about our data methodology

Qwen3-TTS 1.7B

Model basics

Reasoning traces

Not supported

Thinking modes

Thinking modes not supported

Context length

8K tokens

Max output length

4K tokens

Model type

Audio model

Modality (in / out)

Text → Audio

Release date

2026-01-22

Model file size

3.4GB

MoE architecture

Total params / Active params

1.7B / N/A

Knowledge cutoff

No data

Qwen3-TTS 1.7B

Open source & experience

Code license

Apache 2.0

Weights license

Apache 2.0- 免费商用授权

GitHub repo

https://github.com/QwenLM/Qwen

Hugging Face

https://huggingface.co/Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice

Live demo

https://huggingface.co/spaces/Qwen/Qwen3-TTS-Demo

Qwen3-TTS 1.7B

Official resources

Paper

Qwen3-TTS Family is Now Open Sourced: Voice Design, Clone, and Generation!

DataLearnerAI blog

https://www.datalearner.com/blog/1051769091773677

Qwen3-TTS 1.7B

API details

API speed

5/5

No public API pricing yet.

Qwen3-TTS 1.7B

Benchmark Results

No benchmark data to show.

Compare with other models

No curated comparisons for this model yet.

Want a custom combination? Open the compare tool

Qwen3-TTS 1.7B

Publisher

阿里巴巴

View publisher details

Qwen3-TTS-12Hz-1.7B-CustomVoice

Model Overview

DataLearner on WeChat

Follow DataLearner on WeChat for AI model updates and research notes.