- [浏览需要 0 积分] 发布于1天前
IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
论文链接 Demo 链接赞评论浏览 13 - [浏览需要 0 积分] 发布于24天前
FLOW2GAN: HYBRID FLOW MATCHING AND GAN WITH MULTI-RESOLUTION NETWORK FOR FEW-STEP HIGH-FIDELITY AUDIO GENERATION
论文链接 代码链接赞评论浏览 69 - [浏览需要 0 积分] 发布于25天前
【ASR+WFST的第二春】IKFST: IOO and KOO Algorithms for Accelerated and Precise WFST-based End-to-End Automatic Speech Recognition
论文链接赞 1评论浏览 205 - [浏览需要 0 积分] 发布于28天前赞评论浏览 44
- [浏览需要 0 积分] 发布于2026-01-06 11:18:31
MULTILINGUAL VISUAL SPEECH RECOGNITION WITH A SINGLE MODEL BY LEARNING WITH DISCRETE VISUAL SPEECH UNITS
论文链接赞评论浏览 59