Anecdotally I've found ctranslate2 to be even faster than insanely-fast-whisper. On an L4, using ctranslate2 with a batch size as low as 4 beats all their benchmarks except the A100 with flash attention 2.
It's a shame faster-whisper never landed batch mode, as I think that's preventing folks from trying ctranslate2 more easily.
It's a shame faster-whisper never landed batch mode, as I think that's preventing folks from trying ctranslate2 more easily.