OpenAI Spring Update [video] (youtube.com)
56 points by georgehill 16 days ago | 17 comments



Anyone else find it a bit odd how they quickly set it up a day before Google I/O?

They clearly want to steal the thunder from Google's announcements. But they could only do that if they had inside information about what Google is launching. Otherwise it would be smarter to wait and react. Remember, they already said this is not a search engine or GPT-5. So while this is probably cool, it's not a silver bullet and probably won't overshadow what Google launches.

Unless it's almost exactly the same main announcement. In which case a day would make a difference.

Chatter seems to suggest they're announcing a voice-to-voice assistant, an Apple partnership, and Google Drive integration. All things Google is also rumoured to announce.

It definitely feels like the big guys are panicking to find their moat.


I wasn’t prepared to be this impressed.

Voice-to-voice ChatGPT. You can interrupt it any time (don’t have to wait for it to finish), it’s real time (no 2-3 second lag), and it picks up on emotion (!).


https://cdn.openai.com/hello-gpt-4o/coins-01.jpg

It's GPT-4o. The "o" stands for "omni", which I'm guessing means multimodal input and output. There are rumors of an end-to-end, audio-to-audio, real-time voice demo; that's what I'd really like to see.

Source: https://x.com/btibor91/status/1790053718416605335


I wonder what the justification is for buying ChatGPT Plus now that they're making GPT-4 available for free.

I also wonder if they got a Scarlett Johansson impersonator to do the voice. It sounds eerily like the AI in Her, except for the still-present problem with voices dropping out or glitching.

Nonetheless the real-time interaction is extremely impressive. So is the desktop app. Their optimization work must have been incredible to both speed things up and free up enough GPU capacity to make it freely available.


It's just a high-quality generic female voice. I understand it's easy to read a lot into it when we've all wanted to get the "Her"-style AI for a while, but that's not it.

Source: I've listened to a lot of AI voice samples, including many inspired by Scarlett's voice (we're adding a voice-controlled AI assistant to our software), and I can tell you that this sounded nothing like her :-)


I'm not sure I do want a "Her"-style AI :) I kinda like the professional personality used in text-mode chat. The overly emotional tone used in the demo would get annoying or weird to me quite fast. Like, did it really sigh when he asked it to sing?


Paid users get 5x more usage; the free version will have rate limits like the current one.


The different voices were incredible!

I'm dreaming of the possibilities of an audio-to-audio model for any sort of sound. I want to be able to do things like: "What's the tune I'm humming?" "Why is my car making this noise?" "Can you separate the speech from the instruments in this clip?" "Can you make the sound of a steel drum in a hailstorm?"


I really appreciate that the demos were shown live, mistakes and all. This is in stark contrast to Google's Gemini demos, which were heavily edited and cherry-picked.


I am observing an extremely high rate of hallucinations with gpt-4o (gpt-4o-2024-05-13) as tested via the API. I advise extreme caution with it. In contrast, I see no such concern with gpt-4-turbo-preview (gpt-4-0125-preview).


Anyone watching the OpenAI livestream: did they "paste" the code after hitting Ctrl+C? Or did the desktop app just read the clipboard?

Edit: I'm asking because of the obvious data security implications of having your desktop app read from the clipboard _in the live demo_... That would definitely put a damper on my fanboyish enthusiasm about that desktop app.


It just read the clipboard.


Spike Jonze should be flattered. The conversation mode is a straight copy of the OS in Her.


Sam Altman seems like a world class douche, is anyone else getting this vibe?

I need to know if my spidey senses are on point.


Well my spidey senses tingle when I see that guy.


Something must be done about him, quite frankly. He’s too creepy.


This is a common Silicon Valley illness. He's just lost all perspective outside his bubble.



