- [浏览需要 0 积分] 发布于18小时前
IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
论文链接 Demo 链接赞评论浏览 8 - [浏览需要 0 积分] 发布于23天前
FLOW2GAN: HYBRID FLOW MATCHING AND GAN WITH MULTI-RESOLUTION NETWORK FOR FEW-STEP HIGH-FIDELITY AUDIO GENERATION
论文链接 代码链接赞评论浏览 67 - [浏览需要 0 积分] 发布于24天前
【ASR+WFST的第二春】IKFST: IOO and KOO Algorithms for Accelerated and Precise WFST-based End-to-End Automatic Speech Recognition
论文链接赞 1评论浏览 203 - [浏览需要 0 积分] 发布于27天前赞评论浏览 42
- [浏览需要 0 积分] 发布于2026-01-06 11:18:31
MULTILINGUAL VISUAL SPEECH RECOGNITION WITH A SINGLE MODEL BY LEARNING WITH DISCRETE VISUAL SPEECH UNITS
论文链接赞评论浏览 57