- Large Language Models Are Strong Audio-Visual Speech Recognition Learners arxiv.org/abs/2409.12319
- EFL-PEFT: A communication Efficient Federated Learning framework using PEFT sparsification for ASR
- Large Language Models Are Strong Audio-Visual Speech Recognition Learners arxiv.org/abs/2409.12319
- EFL-PEFT: A communication Efficient Federated Learning framework using PEFT sparsification for ASR