I like the idea of this. When I tried a couple of things, it didn't work: either it took too long or nothing happened after I clicked the 2nd button. Maybe an example page would be nice.
Thanks a lot for the feedback! Really appreciate you trying it out. For longer videos, transcription can definitely take a bit... I'm working on moving processing to more powerful servers (it's currently running off my home machine).
For a quick overview of how it works and to get past those initial steps, this demo video of ragsplain might help: