AI-RADAR.IT · AI-RADAR.NET · AI-RADAR.TECH

News & analysis on local LLMs, stack & on-prem hardware.

📁 LLM AI generated

KaniTTS2: open-source TTS model with voice cloning, 3GB VRAM footprint

Published on 2026-02-14 19:16 ℹ️ LocalLLaMA 📰 Read the original source article →

🏷️ LLM On-Premise 🏷️ Fine-Tuning 🏷️ DevOps

KaniTTS2: modello TTS open-source con voice cloning, VRAM da 3GB

KaniTTS2 is an open-source text-to-speech (TTS) model designed for real-time conversational applications. With 400 million parameters, this model offers voice cloning capabilities and supports several languages, including English and Spanish, with plans for future expansion.

Technical Specifications

Parameters: 400 million (BF16)
Sample rate: 22kHz
Voice Cloning: Yes
VRAM requirement: 3GB
Training time: 6 hours on 8x H100

A particularly interesting aspect is the availability of the complete pre-training code. This allows users to develop custom TTS models for specific languages, accents, or domains. The pre-trained model and code are available on Hugging Face and GitHub under the Apache 2.0 license.

For those evaluating on-premise deployments, there are trade-offs to consider. AI-RADAR offers analytical frameworks on /llm-onpremise to evaluate these aspects.

AI-Radar Takeaway

KaniTTS2 is a 400M parameter open-source text-to-speech (TTS) model designed for real-time conversational use cases. It supports voice cloning and runs with only 3GB of VRAM. The pre-training code is included, allowing users to develop custom TTS models.

🤖 Ask AI about this

Want to dive deeper? Read the full article from the source:

📖 READ THE ORIGINAL ARTICLE

💻 Need GPU Cloud Infrastructure?

For running LLM inference, training models, or testing hardware configurations, check out this platform:

🚀

PeerPush AI Community Platform

Discover and share AI tools and projects. Connect with developers, get feedback, and grow your AI startup in a vibrant community of innovators.

✓ AI Community ✓ Project Showcase ✓ Developer Network

🔗 This is an affiliate link - we may earn a commission at no extra cost to you.

AI-RADAR NEWSLETTER

Stay ahead — get AI signals in your inbox

Daily or weekly digest of the most important AI news. 160+ readers, no spam.

💬 Comments (0)

🔒 Log in or register to comment on articles.

No comments yet. Be the first to comment!

🔍 Continue Exploring

SECTION

Explore LLM On-Premise

Complete guide to running AI models locally: hardware, stack, and privacy.

Read →

LLM Jan 24

LuxTTS: Efficient voice cloning with a compact TTS model

LuxTTS, a diffusion-based text-to-speech model with only 120 million parameters, has been released. It stands out for its high-quality voice cloning capabilitie

Read →

LLM Jan 22

Qwen3 TTS: New Open-Source Text-to-Speech Model Released

Qwen3 TTS, a new open-source text-to-speech (TTS) model, has been released. The project is available on GitHub and Hugging Face, offering developers new options

Read →

LLM Feb 19

Kitten TTS V0.8: New SOTA Super-tiny TTS Model (Less than 25 MB)

Kitten ML has released Kitten TTS V0.8, a series of super-tiny open-source text-to-speech (TTS) models, with the smallest model taking up less than 25 MB. These

Read →

LLM Feb 05

SoproTTS v1.5: Zero-Shot Voice Cloning TTS for ~$100

SoproTTS v1.5 is a 135M parameter TTS (text-to-speech) model offering zero-shot voice cloning. Trained for approximately $100 on a single GPU, the model achieve

Read →

Frameworks Feb 01

Kanade Tokenizer: real-time voice cloning on CPU

A developer has presented Kanade Tokenizer, a voice cloning tool optimized for speed, with a real-time factor exceeding RVC. It also runs on CPU. A fork with a

Read →

KaniTTS2: open-source TTS model with voice cloning, 3GB VRAM footprint

Technical Specifications

💻 Need GPU Cloud Infrastructure?

Stay ahead — get AI signals in your inbox

💬 Comments (0)

🔍 Continue Exploring

More in LLM

👥 Join 160+ AI explorers