Show HN: EnfinBref- {GPT3-5|Mistral-7B} YouTube summaries, segment by segment

SushiHippie · 2023-11-01T01:34:48

Cool I really like this, I've used https://www.summarize.tech/ very often, will try your site next time I need it, too. Thanks

el_isma · 2023-10-31T14:40:20

How do you break down the segments/sections? Is it just fixed time? What happens if there are more than one topic discussed in the segment?

Are you using both chatgpt and mistral? Do you use them for different tasks?

bclavie · 2023-10-31T15:54:33

> How do you break down the segments/sections? Is it just fixed time? What happens if there are more than one topic discussed in the segment?

Currently it's just a dumb fixed time rule, based on max video length (3 or 5mn segments). I played around a bit and it's the easiest way to implement things that works remarkably well. If there are multiple topics, there a few branching paths in the code, but a lot of it comes down to believing in the LLM's ability to make sense of it. I've got some ideas to improve, but would need a bunch of work to implement well.

> Are you using both chatgpt and mistral? Do you use them for different tasks?

There's a degree of A/B testing (well, "A/B testing", since we're not collecting feedback) where some of the summaries are GPT, some of them are mistral, mixed together for the same video. Mistral being superbly fast means it's also really useful to support the branching coding logic (e.g. something I'm working on right now is having an entirely different summarisation style if a video is about sports, and while a logistic regression would do that pretty well, it's not particularly robust, and won't tell me what sport it is if the transcript is full of typo) or to clean up the video transcripts.

isaacfrond · 2023-10-31T08:31:30

Now this could actually be useful.

From time to time information is only provided in the form of video. But watching video is much less convenient for me. Even if useful, it just doesn't vibe with office work.

bclavie · 2023-10-31T15:55:49

Thanks! I agree -- I find it much easier to skim a few paragraphs than to skim through a video when trying to consume information quickly if I'm not sure I want to commit to a full, long vid. Hoping to make it useful enough that it ends up paying for its own server costs so I can keep it around!

zdimension · 2023-10-31T12:11:47

Pardon my French, but merde, this is impressive. I've tried it on 20-40min French videos and the summary and section-dividing is spot on.

bclavie · 2023-10-31T15:54:55

Merci! It's early on but I'm quite happy with how the first prototype turned out.

bionsystem · 2023-10-31T08:56:40

Impressive, thanks. How could one run something like that on local videos ?

Btw I love the name.

bclavie · 2023-10-31T11:26:56

> Impressive, thanks. How could one run something like that on local videos ?

It depends how involved you'd want it to be really. You can get a very simple summary using something like Whisper to transcribe a video and having basic LLM calls. More involved summaries/segment breakdown/fine-tuned models would be a lot more work, but might not be needed for something quick with local vids?

> Btw I love the name.

Thank you! This is actually a project I had back in 2018ish, which fizzled out because I didn't have enough time to get good enough summaries going during the pre-LLM era. I let the domain name expire and a few weeks ago realised it was still free, so got building again and re-bought it!

bionsystem · 2023-10-31T15:36:44

Merci pour les conseils ! Je pense que je vais m'abonner dès que je retrouve un taf :)

bclavie · 2023-10-31T15:51:03

Haha merci! Prends ton temps, pour le moment le tout tourne sur des crédits cloud gratuits alors la seule utilité du bouton premium c'est de faire de la lumière. Bonne chance pour ta recherche!