Hacker News | pinter69's comments

1000 true fans and Seth Godin's approach to marketing - that you should focus on the smallest group of people in your market and amaze them

These concepts helped me a lot with marketing to consumers and SMBs

Also, follow @mtlynch's content - even though he says he's not an expert, I feel that, at least when it comes to marketing to devs and builders, he is one of the best out there


Thanks for your work Michael!


You mean Vulkan? The blog post has a reference to all Vulkan-supported platforms

If you mean an ffmpeg build with whisper - from memory, I didn't see ffmpeg-builds for Mac, so you will probably need to compile it yourself


This is a 3-part series; the first post discusses the new native whisper integration. And correct, for the first post - the point is to show that you can use only ffmpeg to transcribe and embed subtitles in a video


While there's appeal in having one tool do several things, I'm more a fan of the traditional UNIX philosophy that a tool should do one thing, do it extremely well, and allow for chaining of several tools together to achieve a multi-step process.


I tend to agree. The thing I like most about version 8 is actually pad_cuda - nice performance boost for resizing video with an Nvidia GPU


Author here, available for questions


Whisper will hallucinate on audio segments that don't have any speech. VAD mitigates that. Expect worse results without it, especially on non-English audio.
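To make the VAD point concrete, here is a minimal sketch that builds an ffmpeg command line with and without a VAD model. It assumes the option names of the ffmpeg 8 `whisper` audio filter as I remember them (`model`, `language`, `destination`, `format`, `vad_model`), so check them against `ffmpeg -h filter=whisper` on your build. The file names and the helper functions are hypothetical and only there for illustration.

```python
import shlex


def whisper_filter(model, destination, vad_model=None, language="en", fmt="srt"):
    """Build a whisper filter spec string.

    Option names are assumed from the ffmpeg 8 whisper filter docs;
    verify them on your build before relying on this.
    """
    opts = {
        "model": model,
        "language": language,
        "destination": destination,
        "format": fmt,
    }
    if vad_model is not None:
        # Voice activity detection: skip segments without speech,
        # which is where whisper tends to hallucinate.
        opts["vad_model"] = vad_model
    return "whisper=" + ":".join(f"{key}={value}" for key, value in opts.items())


def build_command(input_path, filter_spec):
    """Return an argv list that runs the audio filter and discards the output."""
    return ["ffmpeg", "-i", input_path, "-vn", "-af", filter_spec, "-f", "null", "-"]


# Hypothetical file names, for illustration only.
with_vad = build_command(
    "input.mp4",
    whisper_filter("ggml-base.bin", "out.srt", vad_model="ggml-silero-v5.1.2.bin"),
)
without_vad = build_command("input.mp4", whisper_filter("ggml-base.bin", "out.srt"))

print(shlex.join(with_vad))
print(shlex.join(without_vad))
```

The sketch only prints the commands, it does not run ffmpeg. Paths that contain `:` or other filtergraph special characters would need escaping, which this helper does not handle.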


This is great, I'll have to give it a shot


"Lenovo laptop with Nvidia RTX 4040" 4060?


Correct. I fixed the typo


People in the community can be hardcore sometimes, r/ffmpeg especially. But there are online communities and information resources that help.

This is a nice resource: https://amiaopensource.github.io/ffmprovisr/

And also I've written this cheatsheet, which is designed to be used alongside an LLM: https://github.com/rendi-api/ffmpeg-cheatsheet

Let me know if you're interested in more resources


Love the cheat sheet, forked it after reading a couple sections that were instantly useful lol

Can't promise it'll be soon but I may be able to expand on a couple of your repo's "possible future topics list" items.

I've been working on a personal project involving doing object detection on multiple camera feed inputs that all have different resolutions, frame rates, and encodings and sending a single consolidated and annotated feed to a remote streaming service.

That sent me down a really interesting rabbit hole and I've got tons of notes and links along with some Gemini chats that I'm gonna go through and see if there's anything there that might be worth including.


Thanks Dan, as usual, deep and insightful


Your question is very interesting.

Going over the comments, the only plausible explanation I could see is that the KV cache is extremely useful - I don't know if that is really the case.

Would love to know the true answer to the question.


Author here. Happy to take any questions or feedback.


It doesn't look to be self-hosted


It is, but they don't make it obvious on the home page. They have self-hosted getting started docs here: https://ghost.org/docs/

