BetterWhisper's comments

I developed https://www.videototextai.com/ for exactly this reason, as it was nearly impossible to search videos otherwise. You can also copy the transcript into an LLM and ask questions about the video content that way.


All of it. I have tested it and am currently looking to automate it.

Edit: the biggest hurdle previously was the lack of a Midjourney API. Now that DALL-E 3 is released, it is good enough and has an API.


And what is this unique opportunity to lead in AI?


Right now: voice-bot telephony. So, immediately have a Twilio number that you can call/text to talk to a GPT, much like I do with the ChatGPT app interface. This could rapidly couple into an actions API for DoorDash, Amazon, etc., so Twilio would be handling all this scalable AI-telephony data. They have the infrastructure and engineering for it.

Long term: own the infrastructure for infinitely scaling call centers for any task you would need to do over SMS/GSM voice.


Still no API...


Literally spent the last hour trying to figure out why my app was failing. The only thing I was seeing on my side was "permission denied".

Why can't Google be bothered to put outages in the Firebase console... or allow us to see better logs?

Pretty sure this affected Hacker News login as well


Literally spent the last hour trying to figure out why my app was failing, when Google cannot even be bothered to put outages in the Firebase console...

Pretty sure this affected Hacker News login as well


While YouTube's auto-generated transcripts seem to be hit or miss, I wonder if feeding them through an LLM could fix the mistakes and still get the video's idea out of them.


I've found that to be the case. I typically don't want a full transcript -- I want the materials list, or a summary, or a counterargument. I've found it is totally sufficient to just plop the transcript into an LLM and ask for my desired output. No need to clean up the transcript ahead of time.


Wow, why are they so expensive? Even the regular Whisper API by OpenAI is less expensive.

This is also why I decided to create https://www.betterwhisperapi.com/. I believe most companies are charging pretty insane amounts for transcriptions...


Deepgram is really good and around your price point too. They also have $200 in free credit, which should be more than enough for most hobby projects.

