Speech-to-text translation or S2T are translate speech to text with different language.
|Fleurs is the speech version of the FLoRes machine translation benchmark. We use 2009 n-way parallel sentences from the FLoRes dev and devtest publicly available sets, in 102 languages.
|Whisper is a general-purpose speech recognition model. (include S2T X->English)