
Is there an LLM that can solve the following simple coding task?

    Make a simple HTML page which
    uses the VideoEncoder API to
    create a video that the user
    can download.
Since the VideoEncoder API is made for this exact use case and is publicly documented, it should be possible for an LLM to figure it out. But I have yet to see an LLM answer with a working solution.


Took 4 prompts and ChatGPT-4o decided to use a different API, but I got it to make a thing that generates a 3 second webm to download. https://chatgpt.com/share/67531a38-4bfc-8009-bc58-9c823230bf...

Detractors will claim that it didn't complete the assignment because it didn't use the prescribed VideoEncoder API, but the end result, a simple HTML page that generates a 3 second long webm file, speaks for itself.


The problem with the MediaRecorder API is that it saves the current timestamp with each frame, so the video plays back at the speed at which it was created. Therefore you can't use the MediaRecorder API for video processing. That's why I referenced the VideoEncoder API in the prompt.
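To make the timestamp point concrete, here is a minimal sketch of deriving each frame's presentation time from its index rather than from the wall clock. The helper names `frameTimestampMicros` and `frameDurationMicros` are made up for illustration; the only API fact relied on is that a `VideoFrame` takes its `timestamp` and `duration` in microseconds.

```typescript
// Hypothetical helpers: timestamps come from the frame index, not from the
// wall clock, so a frame that takes 2 seconds to render still occupies only
// 1/fps seconds of playback.
const MICROS_PER_SECOND = 1000000;

function frameTimestampMicros(frameIndex: number, fps: number): number {
  // Round per frame from the exact value so errors do not accumulate.
  return Math.round((frameIndex * MICROS_PER_SECOND) / fps);
}

function frameDurationMicros(frameIndex: number, fps: number): number {
  // Duration is the gap to the next frame's timestamp.
  return frameTimestampMicros(frameIndex + 1, fps) - frameTimestampMicros(frameIndex, fps);
}

// In a browser, these values would be passed along the lines of:
//   new VideoFrame(canvas, {
//     timestamp: frameTimestampMicros(i, 30),
//     duration: frameDurationMicros(i, 30),
//   });
```

With this scheme, 30 frames at 30 fps always span exactly one second of video, however long the page took to draw them.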


what are you really trying to do?


Create videos in the browser which the user then can download and play on their device.


No, you can't do that with just the VideoEncoder API, which only produces raw encoded frames. You need container muxing to create something playable, which is far from a "simple coding task".
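As a rough illustration of what "container muxing" means here, the sketch below wraps already-encoded frames in IVF, about the simplest container there is for VP8/VP9/AV1. This is an assumption picked for brevity, not what anyone in this thread used: IVF can be read by tools such as ffmpeg, but it is not a format you can expect every device's player to open, and muxing WebM or MP4 is considerably more work. The function names `ivfFileHeader` and `ivfFrame` are hypothetical.

```typescript
// Hypothetical helpers writing the IVF layout: a 32-byte file header followed
// by one 12-byte header per frame. All fields are little-endian.
function writeAscii(target: Uint8Array, offset: number, text: string): void {
  for (let i = 0; i < text.length; i++) target[offset + i] = text.charCodeAt(i);
}

function ivfFileHeader(fourcc: string, width: number, height: number,
                       fps: number, frameCount: number): Uint8Array {
  if (fourcc.length !== 4) throw new Error("fourcc must be 4 characters");
  const bytes = new Uint8Array(32);
  const view = new DataView(bytes.buffer);
  writeAscii(bytes, 0, "DKIF");           // signature
  view.setUint16(4, 0, true);             // version
  view.setUint16(6, 32, true);            // header length in bytes
  writeAscii(bytes, 8, fourcc);           // codec, e.g. "VP80"
  view.setUint16(12, width, true);
  view.setUint16(14, height, true);
  view.setUint32(16, fps, true);          // timebase denominator
  view.setUint32(20, 1, true);            // timebase numerator
  view.setUint32(24, frameCount, true);
  return bytes;                           // bytes 28..31 stay unused (zero)
}

function ivfFrame(payload: Uint8Array, pts: number): Uint8Array {
  const out = new Uint8Array(12 + payload.length);
  const view = new DataView(out.buffer);
  view.setUint32(0, payload.length, true);                 // frame size
  view.setUint32(4, pts >>> 0, true);                      // pts, low 32 bits
  view.setUint32(8, Math.floor(pts / 4294967296), true);   // pts, high 32 bits
  out.set(payload, 12);
  return out;
}

// In a browser, each payload would come from the VideoEncoder output callback,
// e.g. by allocating new Uint8Array(chunk.byteLength) and calling chunk.copyTo(it).
```

With a timebase of 1/fps, the `pts` of each frame is simply its index. The downloadable file would then be the header followed by every `ivfFrame` in order, for example gathered into a `Blob`.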

Also, how is this relevant to the submission?


I got it to work with this prompt using GPT-o1:

Make a HTML page which uses the VideoEncoder API to create a video that the user can download. Make sure to incorporate a roll your own container muxing. Do not limit yourself on the header or data.

https://chatgpt.com/share/67531f7c-56cc-800b-ac7c-d3860d1cf9...


Yay, I just tried it on my iPad and it works!

When you say "GPT-o1", do you mean the model "o1-preview"? Because I think that is the only o1 I can access via the API.


I believe they may have just changed GPT-o1-preview to GPT-o1 today.


> Also, how is this relevant to the submission?

The title of the submission states "Auto pick LLMs based on your prompt".

The GP provided a prompt where auto picking an LLM would possibly help. Seems relevant to me. Even if the answer from the best LLM is, "This isn't directly possible, here are alternatives".



