Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/
Yifan Peng
pyf98
AI & ML interests
Multimodal LLMs, Speech-to-Speech, Speech Recognition
Organizations
models
48
pyf98/DPHuBERT
Updated
•
4
pyf98/fisher_callhome_spanish_e_branchformer
Automatic Speech Recognition
•
Updated
•
5
pyf98/fisher_callhome_spanish_conformer
Automatic Speech Recognition
•
Updated
•
3
pyf98/slurp_entity_e_branchformer
Automatic Speech Recognition
•
Updated
•
1
pyf98/aidatatang_200zh_e_branchformer_e16
Automatic Speech Recognition
•
Updated
•
2
pyf98/librispeech_100_transducer_e_branchformer
Automatic Speech Recognition
•
Updated
•
7
pyf98/librispeech_100_transducer_conformer
Automatic Speech Recognition
•
Updated
•
4
•
1
pyf98/jsut_e_branchformer
Automatic Speech Recognition
•
Updated
•
2
pyf98/aishell_ctc_e_branchformer_e12
Automatic Speech Recognition
•
Updated
•
1
pyf98/aishell_ctc_conformer_e15_linear1024
Automatic Speech Recognition
•
Updated
•
2
•
2
datasets
0
None public yet