A notebook is available to try with your microphone on Colab here: https://colab...

bambax · on Sept 21, 2022

Thanks! I played with this in French and posted the results as replies to this comment: https://news.ycombinator.com/item?id=32928643

It's sometimes close to perfect, and sometimes goes off the rails; I think the model may try to establish some sort of consistency within each sentence, so if it starts wrong on the first few words of a sentence, it can't build the rest properly.

But it's super fun.

orionsbelt · on Sept 22, 2022

How do you get this to translate instead of just transcribe?

tekacs · on Sept 22, 2022

To be more specific than the above:

1. Make sure you're using a model that isn't suffixed with `.en` (`base`, not `base.en`).
2. Use `model.transcribe(your_input_audio, language='Japanese', task='translate')` ... with the appropriate input language.

paraschopra · on Sept 22, 2022

Just specify the language and record audio in another language.

>result = model.transcribe("audio.wav", language="english")

orionsbelt · on Sept 22, 2022

That actually seems to set the language for it to transcribe (as opposed to it guessing), with the following triggering a translation to English:

result = model.transcribe("audio.wav", task="translate")

But your post helped me figure out the above, so thank you!
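
Putting the replies above together, here is a minimal sketch of the transcribe-versus-translate distinction, assuming the `whisper` Python package is installed. The helper names `build_transcribe_kwargs` and `transcribe_or_translate` are hypothetical and only illustrate the thread's points: `language` names the language spoken in the audio, `task="translate"` is what triggers English output, and `.en` models should be avoided for translation.

```python
from typing import Optional


def build_transcribe_kwargs(
    model_name: str,
    source_language: Optional[str] = None,
    translate: bool = False,
) -> dict:
    """Hypothetical helper: assemble keyword arguments for model.transcribe()."""
    # Per the thread, models suffixed with ".en" are English-only,
    # so use e.g. "base" rather than "base.en" when translating.
    if translate and model_name.endswith(".en"):
        raise ValueError("use a multilingual model (e.g. 'base') to translate")

    # task="translate" requests English output; "transcribe" keeps the source language.
    kwargs = {"task": "translate" if translate else "transcribe"}

    # language is the language of the audio; if omitted, the model guesses it.
    if source_language is not None:
        kwargs["language"] = source_language
    return kwargs


def transcribe_or_translate(
    audio_path: str,
    model_name: str = "base",
    source_language: Optional[str] = None,
    translate: bool = False,
) -> str:
    """Hypothetical wrapper around whisper.load_model() and model.transcribe()."""
    kwargs = build_transcribe_kwargs(model_name, source_language, translate)

    # Imported lazily so the helper above can be used without the package installed.
    import whisper

    model = whisper.load_model(model_name)
    result = model.transcribe(audio_path, **kwargs)
    return result["text"]
```

For example, `transcribe_or_translate("audio.wav", source_language="Japanese", translate=True)` would correspond to the call tekacs describes, while leaving out `source_language` matches the shorter form orionsbelt found.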