We have tried Microsoft Speech Service and found it to be way too complicated.
The Azure OpenAI whisper deployment has a pretty low quota.
Running it using ggeranov's whisper on a Mac works fairly well but it's not in our corporate network.
I really need to batch transcribe these calls. I am a few weeks behind.
I have access to a server with 2x RTX 4090. It is all up and ready to go with the Nvidia drivers.
By the way, these calls are an average of 90s. Not long.
reply