AI-RADAR.IT · AI-RADAR.NET · AI-RADAR.TECH

News & analysis on local LLMs, stack & on-prem hardware.

📁 LLM AI generated

Qwen3: Voice embeddings for cloning and modifying voices

Published on 2026-02-23 03:11 ℹ️ LocalLLaMA 📰 Read the original source article →

Qwen3: voice embedding per clonare e modificare voci

Voice embedding in Qwen3

Qwen3 introduces a voice embedding feature in its Text-to-Speech (TTS) model, opening new possibilities in voice cloning and manipulation. The system transforms a voice into a 1024-dimensional vector (or 2048 for the 1.7 billion parameter model), allowing the voice to be recreated based solely on this vector.

Voice manipulation through mathematics

The most interesting aspect is the ability to modify voices through mathematical operations. You can combine different voices, alter gender or pitch, and even create an emotional space. This technique also enables semantic voice search.

Implementation and resources

The voice embedding model is a small encoder, with only a few million parameters. It has been made available in a standalone version, with ONNX models optimized for web and front-end inference. Inference via voice embedding is supported in specific forks of vLLM.

AI-Radar Takeaway

Qwen3 Text-to-Speech (TTS) model utilizes voice embeddings for voice cloning. Voice is transformed into a vector (1024 or 2048 dimensions for the 1.7b version), enabling voice modification through mathematical operations, such as gender swapping, pitch alteration, or creation of emotional spaces. An encoder has been extracted for standalone use, with ONNX models available for optimized inference.

🤖 Ask AI about this

Want to dive deeper? Read the full article from the source:

📖 READ THE ORIGINAL ARTICLE

💻 Need GPU Cloud Infrastructure?

For running LLM inference, training models, or testing hardware configurations, check out this platform:

⚡

RunPod GPU Cloud Platform

Flexible GPU cloud with pay-per-second billing. Deploy instantly with Docker support, auto-scaling, and a wide selection of GPU types from RTX 4090 to H100.

✓ No commitments ✓ Instant deployment ✓ Production-ready

🔗 This is an affiliate link - we may earn a commission at no extra cost to you.

AI-RADAR NEWSLETTER

Stay ahead — get AI signals in your inbox

Daily or weekly digest of the most important AI news. 160+ readers, no spam.

💬 Comments (0)

🔒 Log in or register to comment on articles.

No comments yet. Be the first to comment!

🔍 Continue Exploring

SECTION

Explore LLM On-Premise

Complete guide to running AI models locally: hardware, stack, and privacy.

Read →

LLM Jan 22

Qwen3-TTS: Open-Sourced Family of Models for Text-to-Speech

Qwen has open-sourced the full Qwen3-TTS model family, including VoiceDesign, CustomVoice, and Base. Five models are available in two sizes (0.6B & 1.8B), suppo

Read →

Altro May 30

Moss TTS 1.5: Voice Cloning Advances, Between Licensing and On-Premise Deployment

The new Text-to-Speech model Moss TTS v1.5, developed by the OpenMOSS team, is generating interest for its voice cloning capabilities. User preference over alte

Read →

LLM Apr 17

DeepL Launches Real-Time Voice-to-Voice Translation in 40+ Languages

DeepL, the Cologne-based company known for its text translation tools, has unveiled a comprehensive suite for real-time voice-to-voice translation, supporting o

Read →

LLM Jan 29

Qwen3-ASR: Open-Source Models for Multilingual Speech Recognition

The Qwen3-ASR family includes 1.7B and 0.6B parameter models, capable of identifying the language and transcribing audio in 52 languages and dialects. The large

Read →

LLM Jan 24

Qwen3-TTS: Ultra-Low Latency, Voice Cloning & OpenAI-Compatible API

The Qwen team has released Qwen3-TTS, an open-source speech synthesis system offering low latency (97ms), voice cloning, and OpenAI API compatibility. It suppor

Read →

Qwen3: Voice embeddings for cloning and modifying voices

Voice embedding in Qwen3

Voice manipulation through mathematics

Implementation and resources

💻 Need GPU Cloud Infrastructure?

Stay ahead — get AI signals in your inbox

💬 Comments (0)

🔍 Continue Exploring

More in LLM

👥 Join 160+ AI explorers