Add STG Support for Video Diffusion in CosyVoice Audio by primepake · Pull Request #1391 · FunAudioLLM/CosyVoice

primepake · 2025-06-23T15:11:27Z

This PR introduces Stage-Guided (STG) support to CosyVoice, inspired by the video diffusion framework from STGuidance. The changes enhance the text-to-speech pipeline by integrating stage-guided techniques, improving [e.g., generation quality, efficiency, or compatibility with diffusion-based workflows].

Changes Made

Updated cosyvoice/flow/decoder.py to [e.g., "incorporate stage-guided decoding logic for better alignment with diffusion processes"].
Modified cosyvoice/flow/flow_matching.py to [e.g., "adapt flow matching to support STG’s stage-based optimization"].
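
For readers unfamiliar with the idea, the general shape of skip-layer guidance can be sketched roughly as follows. This is a toy illustration, not the code in this PR. It assumes that guidance runs the decoder twice, once normally and once with selected layers bypassed, and then pushes the normal prediction away from the weaker one. The helpers `run_decoder`, `stg_combine`, and `guided_velocity` and the parameter `stg_scale` are hypothetical names. `stg_applied_layers_idx` is borrowed from the discussion below, and its meaning here (indices of layers to skip) is an assumption.

```python
from typing import Callable, Iterable, List, Sequence

Vector = List[float]
Layer = Callable[[Vector], Vector]


def run_decoder(x: Vector, layers: Sequence[Layer], skip_idx: Iterable[int] = ()) -> Vector:
    """Apply residual layers in order, bypassing any layer whose index is in skip_idx."""
    skip = set(skip_idx)
    h = list(x)
    for i, layer in enumerate(layers):
        if i in skip:
            continue  # a skipped layer acts as identity (only the residual path remains)
        update = layer(h)
        h = [a + b for a, b in zip(h, update)]
    return h


def stg_combine(full: Vector, skipped: Vector, stg_scale: float) -> Vector:
    """Move the full prediction away from the layer-skipped (weaker) prediction."""
    return [f + stg_scale * (f - s) for f, s in zip(full, skipped)]


def guided_velocity(
    x: Vector,
    layers: Sequence[Layer],
    stg_applied_layers_idx: Iterable[int],  # assumed meaning: which layers to skip
    stg_scale: float,                       # hypothetical guidance strength
) -> Vector:
    """Two forward passes (normal and layer-skipped), then the guided combination."""
    full = run_decoder(x, layers)
    skipped = run_decoder(x, layers, skip_idx=stg_applied_layers_idx)
    return stg_combine(full, skipped, stg_scale)


if __name__ == "__main__":
    # Two toy residual layers standing in for decoder blocks.
    toy_layers = [
        lambda h: [0.5 * v for v in h],
        lambda h: [1.0 for _ in h],
    ]
    print(guided_velocity([2.0], toy_layers, stg_applied_layers_idx=[1], stg_scale=1.0))
```

With `stg_scale` set to 0 the result reduces to the ordinary forward pass, so in a sketch like this the guidance can be switched off without touching the decoder.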

Motivation

The addition of STG support aims to [e.g., "leverage stage-guided diffusion techniques to enhance the quality and speed of speech synthesis, aligning CosyVoice with advanced video diffusion methodologies"]. This builds on the concepts from junhahyung/STGuidance, adapted for audio generation.

johnwick123f · 2025-06-24T21:18:24Z

Looks interesting, but may I ask, what are the effects of adding STG? Better voice cloning quality or better emotion?

primepake · 2025-06-25T01:39:14Z

Yes, it improves model quality. For example, flow matching in the flow model sometimes struggles to keep the speaker consistent: the voice identity can change from male to female within the same audio. With STG, this is improved.

HaiFengZeng · 2025-11-12T06:17:21Z

I'd like to ask: how is stg_applied_layers_idx set here?

primepake added 2 commits June 23, 2025 14:43

adding STG for audio inference

8fdbd7f

add stg

3199349

primepake wants to merge 2 commits into FunAudioLLM:main from primepake:audio-stg
