This reminds me of the podcast HowSound, a semi-instructional show about the craft of storytelling for podcasts and radio. I have enjoyed several of their episodes.
Upload a full podcast or a small part, we transcribe it, and you can add a title + subtitle, and get an audio wave animation + word-by-word captions. You can export as various dimensions for social media.
I was looking for something similar recently for a video clip and had a good experience with gifs.com. Their real-time preview was impressive, helpful. Any way you could offer that I can paste a youtube url instead of vid upload?
Edit: think actually y'all are in a different report category with a different comparable set of companies, but added it to the list for when that time comes
https://transom.org/topics/howsound/