- [浏览需要 0 积分] 发布于2025-03-06 14:10:10赞 2评论 1浏览 192
- [浏览需要 0 积分] 发布于2024-11-08 10:48:20赞 2评论 1浏览 519
- [浏览需要 0 积分] 发布于2024-11-14 14:57:38
【Conference Paper】Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM
论文链接 代码链接赞 2评论浏览 496 - [浏览需要 0 积分] 发布于2025-02-07 17:23:54赞 2评论浏览 368
- [浏览需要 0 积分] 发布于2025-02-05 11:33:08
【TR】FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration
论文链接赞 2评论浏览 365 - [浏览需要 0 积分] 发布于2025-01-14 10:48:48
【CP】A Transcription Prompt-based Efficient Audio Large Language Model for Robust Speech Recognition
论文链接赞 2评论浏览 397 - [浏览需要 0 积分] 发布于2024-11-28 19:19:13
【Conference Paper】Contextual Biasing Speech Recognition in Speech-enhanced Large Language Model
论文链接赞 2评论浏览 390 - [浏览需要 0 积分] 发布于2024-11-26 20:28:00
【Conference Paper】Tiny-Align: Bridging Automatic Speech Recognition and Large Language Model on the Edge
论文链接赞 2评论浏览 389 - [浏览需要 0 积分] 发布于2024-11-25 17:06:13赞 2评论浏览 455
- [浏览需要 0 积分] 发布于2024-11-20 11:12:26
【Conference Paper】Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data
论文链接 代码链接 官方链接赞 2评论浏览 514 - [浏览需要 0 积分] 发布于2024-11-19 15:50:25赞 2评论浏览 472
- [浏览需要 0 积分] 发布于2024-10-18 17:35:42
【Tech Report】MooER: LLM-based Speech Recognition and Translation Models from Moore Threads
论文链接 代码链接赞 2评论浏览 869 - [浏览需要 0 积分] 发布于2024-11-13 19:23:16赞 2评论浏览 526
- [浏览需要 0 积分] 发布于2024-11-12 19:56:49
【Conference Paper】Improving Robustness of LLM-based Speech Synthesis by Learning Monotonic Alignment
论文链接赞 2评论浏览 481 - [浏览需要 0 积分] 发布于2024-11-13 13:17:00赞 2评论浏览 484
- [浏览需要 0 积分] 发布于2024-11-05 15:14:23
【Conference Paper】SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs
论文链接赞 2评论浏览 457 - [浏览需要 0 积分] 发布于2024-11-05 10:04:21
【Conference Paper】Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models
论文链接赞 2评论浏览 427 - [浏览需要 0 积分] 发布于2024-10-31 17:13:36赞 2评论浏览 757
- [浏览需要 0 积分] 发布于2024-10-24 16:26:57赞 1评论浏览 606