TTS
🏆 Foundation Models
HunyuanVideo-Foley: Tencent has released an AI-powered sound design tool for video creators. It generates professional-grade sound effects that synchronize precisely with video content, even in complex scenes. Using multimodal semantic balancing, the system analyzes both visual and textual inputs to produce personalized, context-aware audio. Potential applications include short-form video creation, filmmaking, advertising, and game development.
📹 Videos: HunyuanVideo-Foley video
Marvis TTS: An advanced conversational speech model designed for real-time voice cloning and streaming text-to-speech synthesis.