- [浏览需要 0 积分] 发布于3天前
IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
论文链接 Demo 链接赞评论浏览 18 - [浏览需要 0 积分] 发布于26天前
FLOW2GAN: HYBRID FLOW MATCHING AND GAN WITH MULTI-RESOLUTION NETWORK FOR FEW-STEP HIGH-FIDELITY AUDIO GENERATION
论文链接 代码链接赞评论浏览 72 - [浏览需要 0 积分] 发布于27天前
【ASR+WFST的第二春】IKFST: IOO and KOO Algorithms for Accelerated and Precise WFST-based End-to-End Automatic Speech Recognition
论文链接赞 1评论浏览 207 - [浏览需要 0 积分] 发布于2026-01-09 17:25:16
《让炼丹更科学一些(五):基于梯度精调学习率》
https://kexue.fm/archives/11530 这篇文章开始,我们考虑基于梯度的学习率调度,它有助于我们了解诸如 Warmup、Decay 等学习率策略的原理,也能为各种自适应学习率优化器提供有益的参考。赞评论浏览 70 - [浏览需要 0 积分] 发布于2026-01-09 14:07:52赞评论浏览 47
- [浏览需要 0 积分] 发布于2026-01-06 11:18:31
MULTILINGUAL VISUAL SPEECH RECOGNITION WITH A SINGLE MODEL BY LEARNING WITH DISCRETE VISUAL SPEECH UNITS
论文链接赞评论浏览 62