More

selvan · 2026-06-05T09:29:55 1780651795

Feature demo videos: https://magenta.withgoogle.com/mrt2

thenthenthen · 2026-06-05T09:43:12 1780652592

Ironically no sound here on ios

rustcleaner · 2026-06-05T11:52:23 1780660343

Very very poor format support on Apple devices. I have this problem when re-encoding videos for re-upload to Discord, SimpleX, and other destinations which have iOS users.

Only fix seems to be ditching Apple. I can only recommend GrapheneOS for [Pixel] phones (never Samsung), and a favorite flavor of GNU/Linux for general purpose (me: Qubes OS). Apple wants to sell a beautiful but limited walled garden, for a premium price... not worth it!

ZekeSulastin · 2026-06-05T12:42:09 1780663329

What are you possibly encoding for social media that doesn’t work on iOS?

Also the project in the original link is Mac only (hence the irony), so evangelizing Graphene and Qubes seems a lot out of left field.

selvan · 2026-06-05T04:09:53 1780632593

Build and play AI musical instruments on your laptop!. It is a live, interactive model that you can control with MIDI and audio, in addition to text.

selvan · 2026-05-11T06:45:02 1778481902

Creating an AI native solution to manage workflows of my live streaming business (https://www.cheerarena.com)

Most workflow softwares are complex to extend & customize. Building an AI native, structured workflow orchestrator from scratch for agentic era.

As a starting point, have designed and implemented an AI native data store to store semantic linked structured input & output data of workflow steps/tasks. These structured input/output act as spec and guard rails for the workflow tasks.

selvan · 2025-10-23T04:59:05 1761195545

From the blog " Gemini CLI spawns a new process within a pseudo-terminal in the background, leveraging the node-pty library...So how does this virtual terminal running in the background show up on your screen? Think of it like a video stream. Our new serializer takes a snapshot of the pseudo terminal at every moment—capturing every piece of text, every color, and even the cursor's position. These snapshots are then streamed to you, allowing you to see and interact with the terminal application in real-time. It's not just a stream of text; it's a live feed."

Terminal serializer code: https://github.com/google-gemini/gemini-cli/blob/main/packag...

Uses @xterm/headless npm package.

breakingcups · 2025-10-24T08:37:53 1761295073

That excerpt sounds like it was written by an LLM.

ffsm8 · 2025-10-23T05:09:20 1761196160

Your link 404s

selvan · 2025-10-23T05:36:23 1761197783

Thanks. Fixed it.

selvan · 2025-10-07T06:55:41 1759820141

An MCP server exposes tools that a model can call during a conversation and returns results according to the tool contracts. Those results can include extra metadata—such as inline HTML—that the Apps SDK uses to render rich UI components (widgets) alongside assistant messages.

More: https://github.com/openai/openai-apps-sdk-examples?tab=readm...

ares623 · 2025-10-07T08:01:33 1759824093

Imagine rendering content from an app with user submitted data.

selvan · 2025-09-19T03:44:52 1758253492

May be personalization for narration ?. Different narration style, based on their own interest.

edit: Their demo video shows they allow learners to set different narration style based on their interest.

selvan · 2025-08-07T13:06:46 1754572006

May be, we are couple of years away from experiencing patent free video codecs based on deep learning.

DCVC-RT (https://github.com/microsoft/DCVC) - A deep learning based video codec claims to deliver 21% more compression than h266.

One of the compelling edge AI usecases is to create deep learning based audio/video codecs on consumer hardwares.

One of the large/enterprise AI usecases is to create a coding model that generates deep learning based audio/video codecs for consumer hardwares.

selvan · 2025-07-12T08:33:46 1752309226

Cursor - co-pilot/AI pair programming usecases.

Claude Code - Agentic/Autonomous coding usecases.

Both have their own place in programming, though there are overlaps.

selvan · 2025-07-04T07:14:28 1751613268

Ship AI Agents as a web page :-)

selvan · 2025-06-30T03:50:25 1751255425

CheerArena - Your Own TV Grade Live Channel on Youtube

Have created a real-time media mixing mobile app that helps to setup TV grade Live channel on Youtube/Facebook/Twitch/Instagram.

Our product scales from individual to institutions, camera in mobiles to network of cameras, indoor to outdoor sports and events.

Details: https://www.cheerarena.com/

Realtime mixing studio - https://play.google.com/store/apps/details?id=com.cheerarena...