Intermediate | Python, PyTorch, Conda + pip

CosyVoice: Open-source, multilingual text-to-speech (TTS)

CosyVoice is a mature open-source text-to-speech (TTS) solution that supports multilingual and cross-lingual synthesis, emotion control, zero-shot voice cloning, and low-latency streaming synthesis. The project is built primarily in Python, making it suitable for deployment in cloud or local server environments, and it supports Docker-based production deployment.

21.4K Stars
2.5K forks
799 issues
Apache-2.0

Project Overview

CosyVoice is an open-source multilingual speech generation model, positioned as a "full-stack solution for large-scale speech generation and deployment". It generates natural speech from text and supports cross-lingual voice cloning and emotion control, which suits it to applications such as text-to-speech services, voice assistants, and podcast synthesis.


CosyVoice is developed by the FunAudioLLM organization and released under the Apache-2.0 open-source license. Its repository is listed with about 21.4K stars, 2.5K forks, and 799 issues, which points to an active community.


1. Core Highlights


Multilingual & Dialect Support

Supports mainstream languages such as Chinese, English, Japanese, and Korean.

Offers production-level support for several Chinese dialects, such as Cantonese, Sichuanese, and Shanghainese.

Supports cross-lingual and mixed-language voice cloning and synthesis.
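
To make "mixed-language" input concrete, here is a minimal sketch that splits a sentence into runs by writing system. It is only an illustration of what such input looks like; it is not how CosyVoice processes text internally, and the function names `script_of` and `split_by_script` are hypothetical. The classification is coarse: Japanese kanji, for example, fall into the same "han" bucket as Chinese characters.

```python
from typing import List, Tuple


def script_of(ch: str) -> str:
    """Classify one character by Unicode block (coarse, illustrative only)."""
    code = ord(ch)
    if 0x4E00 <= code <= 0x9FFF:
        return "han"      # CJK Unified Ideographs (Chinese, Japanese kanji)
    if 0x3040 <= code <= 0x30FF:
        return "kana"     # Hiragana and Katakana
    if 0xAC00 <= code <= 0xD7A3:
        return "hangul"   # Hangul syllables
    if ch.isascii() and ch.isalpha():
        return "latin"
    return "other"        # spaces, digits, punctuation


def split_by_script(text: str) -> List[Tuple[str, str]]:
    """Group consecutive characters that share a script into runs."""
    runs: List[Tuple[str, str]] = []
    for ch in text:
        script = script_of(ch)
        if runs and (script == "other" or runs[-1][0] in (script, "other")):
            # Extend the current run. Neutral characters stay with the run
            # they follow; a leading neutral run adopts the next script.
            prev_script, prev_text = runs[-1]
            new_script = prev_script if script == "other" else script
            runs[-1] = (new_script, prev_text + ch)
        else:
            runs.append((script, ch))
    return runs


for script, run in split_by_script("Hello, 你好! 안녕 world"):
    print(f"{script:7s} {run!r}")
```

A front end could use runs like these to log or inspect which languages a request contains before passing the full text to the synthesizer.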


⚡ Low-Latency Real-Time Generation

Introduces a bidirectional streaming inference mechanism, with first-packet latency as low as about 150 ms.

Maintains fluency in scenarios where speech is output while text is still being input.
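
First-packet latency is the time from the start of a request until the first audio chunk arrives. The sketch below shows one way to measure it for any chunk stream while text is still arriving piece by piece. Everything here is an assumption made for illustration: `incoming_text` and `fake_synthesize` are stand-ins with simulated delays, not CosyVoice functions, so the printed numbers say nothing about the real model.

```python
import time
from typing import Iterable, Iterator, Tuple


def incoming_text() -> Iterator[str]:
    """Stand-in for text that arrives piece by piece (e.g. from an LLM)."""
    for piece in ["Hello, ", "this is ", "a streaming ", "test."]:
        time.sleep(0.01)  # simulated delay between text pieces
        yield piece


def fake_synthesize(text_pieces: Iterable[str]) -> Iterator[bytes]:
    """Hypothetical synthesizer: emits one audio chunk per text piece.

    A real model would return waveform data; here each chunk is silence
    whose length depends on the text length.
    """
    for piece in text_pieces:
        time.sleep(0.005)  # simulated model compute time
        yield b"\x00\x00" * (len(piece) * 100)


def measure_stream(chunks: Iterable[bytes]) -> Tuple[float, float, int]:
    """Return (first_packet_latency_s, total_time_s, chunk_count)."""
    start = time.perf_counter()
    first = None
    count = 0
    for _ in chunks:
        if first is None:
            first = time.perf_counter() - start  # time to first audio chunk
        count += 1
    total = time.perf_counter() - start
    if first is None:
        raise ValueError("stream produced no audio chunks")
    return first, total, count


first_packet, total, n_chunks = measure_stream(fake_synthesize(incoming_text()))
print(f"first packet after {first_packet * 1000:.1f} ms, "
      f"{n_chunks} chunks in {total * 1000:.1f} ms")
```

Because both stages are generators, the first audio chunk is produced as soon as the first text piece is available, which is the property the streaming description above relies on. The same `measure_stream` helper could wrap a real streaming call in place of the stand-in.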


High Naturalness & Controllability

Improves pronunciation accuracy, with overall quality significantly better than earlier versions.

Supports control tags for emotion, speaking rate, volume, etc. (configurable at the API or service level).
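
As a rough idea of what service-level control parameters could look like, the sketch below validates a request before it is sent to a synthesis service. The field names (`emotion`, `rate`, `volume`), the allowed emotion set, and the numeric ranges are all hypothetical; the page does not specify CosyVoice's actual tag syntax or parameter names, so treat this as a shape to adapt, not as the project's API.

```python
import json
from dataclasses import asdict, dataclass

# Hypothetical set of emotions; a real service defines its own.
ALLOWED_EMOTIONS = {"neutral", "happy", "sad", "angry"}


@dataclass
class SynthesisRequest:
    """Illustrative request payload with basic validation."""
    text: str
    emotion: str = "neutral"
    rate: float = 1.0    # speaking-rate multiplier (assumed range 0.5-2.0)
    volume: float = 1.0  # volume multiplier (assumed range 0.0-2.0)

    def __post_init__(self) -> None:
        if not self.text.strip():
            raise ValueError("text must not be empty")
        if self.emotion not in ALLOWED_EMOTIONS:
            raise ValueError(f"unsupported emotion: {self.emotion!r}")
        if not 0.5 <= self.rate <= 2.0:
            raise ValueError("rate must be between 0.5 and 2.0")
        if not 0.0 <= self.volume <= 2.0:
            raise ValueError("volume must be between 0.0 and 2.0")

    def to_json(self) -> str:
        """Serialize the request, keeping non-ASCII text readable."""
        return json.dumps(asdict(self), ensure_ascii=False)


request = SynthesisRequest(text="Good morning!", emotion="happy", rate=1.1)
print(request.to_json())
```

Validating on the client side keeps malformed control values from reaching the model, where they would be harder to diagnose.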


Zero-Shot Voice Cloning

Generates speech in a voice similar to the speaker in a short reference audio clip, without requiring a large set of samples from that speaker.
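
Since zero-shot cloning depends on a single short reference clip, it can help to check that clip before using it. The sketch below inspects a WAV file with Python's standard `wave` module. The helper name `check_reference_clip`, the mono requirement, and the 3 to 30 second bounds are illustrative assumptions, not limits documented for CosyVoice; the cloning call itself is outside the scope of this example.

```python
import os
import tempfile
import wave


def check_reference_clip(path: str, min_s: float = 3.0, max_s: float = 30.0) -> float:
    """Return the clip duration in seconds, or raise ValueError.

    The duration bounds and the mono requirement are illustrative
    assumptions, not limits documented by the project.
    """
    with wave.open(path, "rb") as wav:
        channels = wav.getnchannels()
        duration = wav.getnframes() / wav.getframerate()
    if channels != 1:
        raise ValueError(f"expected mono audio, got {channels} channels")
    if not min_s <= duration <= max_s:
        raise ValueError(f"clip is {duration:.1f}s, expected {min_s}-{max_s}s")
    return duration


# Demo: write 5 seconds of 16 kHz mono silence and validate it.
demo_path = os.path.join(tempfile.mkdtemp(), "reference.wav")
with wave.open(demo_path, "wb") as wav:
    wav.setnchannels(1)
    wav.setsampwidth(2)       # 16-bit samples
    wav.setframerate(16000)
    wav.writeframes(b"\x00\x00" * 16000 * 5)

clip_seconds = check_reference_clip(demo_path)
print(f"reference clip OK: {clip_seconds:.1f}s")
```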


2. Core Features & Modules


| Feature | Description |
| --- | --- |
| TTS Speech Synthesis | Directly converts text into high-fidelity speech |
| Zero-shot Voice Cloning | Clones a voice using a small number of audio samples |
| Emotional Speech Control | Allows setting parameters for expression, mood, tone, etc. |
| Cross-Lingual Synthesis | Supports output in different languages and mixed languages |
| Streaming Output Mechanism | Enables low-latency real-time speech generation |

Tags: Multilingual Speech Generation, TTS (Text-to-Speech), Voice Cloning, Real-time Speech Synthesis, Low-latency Speech Models, PyTorch Speech Models, Open-source Speech Synthesis, AI Speech API Capabilities, Docker Deployment for TTS, LLM Voice Integration

Frequently Asked Questions

What is CosyVoice?

CosyVoice is a mature open-source text-to-speech (TTS) solution that supports multilingual and cross-lingual synthesis, emotion control, zero-shot voice cloning, and low-latency streaming synthesis. It is built primarily in Python, runs in cloud or local server environments, and supports Docker-based production deployment.

What language is CosyVoice written in?

CosyVoice is written primarily in Python. It builds on PyTorch, and its environment is set up with Conda and pip.

What license is CosyVoice under?

CosyVoice is released under the Apache-2.0 license.
