text-video-to-audio

Here are 4 public repositories matching this topic...

FunAudioLLM / ThinkSound

[NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.

tta video-to-audio text-to-audio foley-sound-synthesis aigc-audio text-video-to-audio

Updated Apr 3, 2026
Python

Tencent-Hunyuan / HunyuanVideo-Foley

Star

HunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation.

tta video-to-audio text-to-audio text-to-video foley-sound-synthesis foley-art aigc-audio text-video-to-audio

Updated Sep 28, 2025
Python

xiaomi-research / controlfoley

Star

ControlFoley: Unified and Controllable Video-to-Audio Generation with Cross-Modal Conflict Handling

text-to-audio audio-generation foley-sound-synthesis foley-art text-video-to-audio text-controlled-video-to-audio audio-controlled-video-to-audio

Updated Apr 21, 2026
Python

YJX-Research / ControlFoley_test

Star

ControlFoley: Unified and Controllable Video-to-Audio Generation with Cross-Modal Conflict Handling

text-to-audio audio-generation foley-sound-synthesis foley-art text-video-to-audio text-controlled-video-to-audio audio-controlled-video-to-audio

Updated Apr 17, 2026
Python

Improve this page

Add a description, image, and links to the text-video-to-audio topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the text-video-to-audio topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

text-video-to-audio

Here are 4 public repositories matching this topic...

FunAudioLLM / ThinkSound

Tencent-Hunyuan / HunyuanVideo-Foley

xiaomi-research / controlfoley

YJX-Research / ControlFoley_test

Improve this page

Add this topic to your repo