Hacker News | selvan's comments


Ironically, no sound here on iOS.

Very very poor format support on Apple devices. I have this problem when re-encoding videos for re-upload to Discord, SimpleX, and other destinations which have iOS users.

Only fix seems to be ditching Apple. I can only recommend GrapheneOS for [Pixel] phones (never Samsung), and a favorite flavor of GNU/Linux for general purpose (me: Qubes OS). Apple wants to sell a beautiful but limited walled garden, for a premium price... not worth it!


What are you possibly encoding for social media that doesn’t work on iOS?

Also, the project in the original link is Mac-only (hence the irony), so evangelizing Graphene and Qubes seems way out of left field.


Build and play AI musical instruments on your laptop! It is a live, interactive model that you can control with MIDI and audio, in addition to text.

Creating an AI-native solution to manage the workflows of my live-streaming business (https://www.cheerarena.com)

Most workflow software is complex to extend and customize. I am building an AI-native, structured workflow orchestrator from scratch for the agentic era.

As a starting point, I have designed and implemented an AI-native data store that holds the semantically linked, structured input and output data of workflow steps/tasks. These structured inputs and outputs act as the spec and guardrails for the workflow tasks.
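
A rough sketch of how structured input/output can act as spec and guardrails for a step. Every name here (StepSpec, run_step, the fields) is hypothetical and only for illustration; it is not the actual data store. Each step declares the fields it expects and produces, and anything that does not match is rejected before the next step sees it.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class StepSpec:
    """Hypothetical spec for one workflow step: field name -> expected type."""
    name: str
    inputs: dict
    outputs: dict


def check(fields: dict, data: dict, label: str) -> None:
    """Raise ValueError if data is missing a field or a field has the wrong type."""
    for key, expected in fields.items():
        if key not in data:
            raise ValueError(f"{label}: missing field '{key}'")
        if not isinstance(data[key], expected):
            raise ValueError(f"{label}: field '{key}' should be {expected.__name__}")


def run_step(spec: StepSpec, task, payload: dict) -> dict:
    """Validate the input, run the task (e.g. an agent call), validate the output."""
    check(spec.inputs, payload, f"{spec.name} input")
    result = task(payload)
    check(spec.outputs, result, f"{spec.name} output")
    return result


# Illustrative step for a live-streaming workflow (made-up fields).
schedule_spec = StepSpec(
    name="schedule_stream",
    inputs={"event": str, "start_hour": int},
    outputs={"stream_title": str, "confirmed": bool},
)


def schedule_task(payload: dict) -> dict:
    # Stand-in for an LLM/agent call that must return structured data.
    return {"stream_title": f"Live: {payload['event']}", "confirmed": True}
```

Calling run_step(schedule_spec, schedule_task, {"event": "Finals", "start_hour": 18}) passes both checks, while a payload without start_hour fails fast with a ValueError instead of reaching the task.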


From the blog " Gemini CLI spawns a new process within a pseudo-terminal in the background, leveraging the node-pty library...So how does this virtual terminal running in the background show up on your screen? Think of it like a video stream. Our new serializer takes a snapshot of the pseudo terminal at every moment—capturing every piece of text, every color, and even the cursor's position. These snapshots are then streamed to you, allowing you to see and interact with the terminal application in real-time. It's not just a stream of text; it's a live feed."

Terminal serializer code: https://github.com/google-gemini/gemini-cli/blob/main/packag...

It uses the @xterm/headless npm package.
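
To make the "snapshot" idea from the excerpt concrete, here is a toy sketch. It is not the Gemini CLI serializer and not the @xterm/headless API; Screen, Cell, and snapshot are made-up names, and escape codes and scrolling are ignored. It only shows the shape of the technique: keep a grid of cells in memory, then serialize the text, colors, and cursor position into one frame that could be streamed to a UI.

```python
from dataclasses import dataclass, field


@dataclass
class Cell:
    char: str = " "
    color: str = "default"


@dataclass
class Screen:
    """Toy stand-in for a headless terminal buffer (hypothetical, not the xterm API)."""
    rows: int
    cols: int
    cursor: tuple = (0, 0)  # (row, col)
    grid: list = field(init=False, default_factory=list)

    def __post_init__(self):
        self.grid = [[Cell() for _ in range(self.cols)] for _ in range(self.rows)]

    def write(self, text: str, color: str = "default") -> None:
        """Write text at the cursor; handles newlines, ignores escape codes and scrolling."""
        row, col = self.cursor
        for ch in text:
            if ch == "\n":
                row, col = row + 1, 0
                continue
            if row < self.rows and col < self.cols:
                self.grid[row][col] = Cell(ch, color)
            col += 1
        self.cursor = (row, col)


def snapshot(screen: Screen) -> dict:
    """Serialize text, per-cell colors, and cursor position into one frame."""
    return {
        "lines": ["".join(c.char for c in r).rstrip() for r in screen.grid],
        "colors": [[c.color for c in r] for r in screen.grid],
        "cursor": screen.cursor,
    }
```

A consumer would call snapshot() after each chunk of process output and redraw from the returned frame, which is what makes it feel like a live feed rather than a scrolling text log.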


That excerpt sounds like it was written by an LLM.


Your link 404s


Thanks. Fixed it.


An MCP server exposes tools that a model can call during a conversation and returns results according to the tool contracts. Those results can include extra metadata—such as inline HTML—that the Apps SDK uses to render rich UI components (widgets) alongside assistant messages.

More: https://github.com/openai/openai-apps-sdk-examples?tab=readm...
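
A minimal sketch of the shape described above, using plain dicts rather than any SDK. The function name and the "example/widgetHtml" metadata key are assumptions for illustration; the real key names come from the SDK docs. The tool returns text for the model plus inline HTML in metadata, and escapes any user-submitted text before putting it into markup.

```python
import html


def build_tool_result(title: str, user_comment: str) -> dict:
    """Return a tool result: plain text for the model plus widget HTML in metadata."""
    # Escape user-submitted text before it goes into markup that a client may render.
    safe_title = html.escape(title)
    safe_comment = html.escape(user_comment)
    widget_html = f"<div class='card'><h3>{safe_title}</h3><p>{safe_comment}</p></div>"
    return {
        "content": [{"type": "text", "text": f"{title}: {user_comment}"}],
        # Hypothetical metadata key; the real key names are defined by the SDK.
        "_meta": {"example/widgetHtml": widget_html},
    }
```

The split matters because the text part is what the model reads, while the HTML part is what a client might render, so only the latter needs markup escaping.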


Imagine rendering content from an app with user submitted data.


Maybe personalization for narration? A different narration style for each learner, based on their own interests.

Edit: Their demo video shows that learners can set a different narration style based on their interests.


Maybe we are a couple of years away from patent-free video codecs based on deep learning.

DCVC-RT (https://github.com/microsoft/DCVC), a deep-learning-based video codec, claims to deliver 21% more compression than H.266.

One of the compelling edge AI use cases is to create deep-learning-based audio/video codecs on consumer hardware.

One of the large/enterprise AI use cases is to create a coding model that generates deep-learning-based audio/video codecs for consumer hardware.


Cursor - co-pilot/AI pair programming use cases.

Claude Code - agentic/autonomous coding use cases.

Both have their own place in programming, though there are overlaps.


Ship AI Agents as a web page :-)


CheerArena - Your Own TV-Grade Live Channel on YouTube

I have created a real-time media-mixing mobile app that helps set up a TV-grade live channel on YouTube/Facebook/Twitch/Instagram.

Our product scales from individuals to institutions, from a single phone camera to a network of cameras, and from indoor to outdoor sports and events.

Details: https://www.cheerarena.com/

Realtime mixing studio - https://play.google.com/store/apps/details?id=com.cheerarena...

