More

yobid20 · 2025-03-04T13:11:51 1741093911

This is not specific to Apple. Its the modern "agile" culture of hacking shit script kiddies pushing early, regardless of known bugs and broken features, under direction of management. Then management forcing you to move onto the next hack without allowing you to go back and clean up your previous work. Its probkem is now endemic to the modern era of software development. Agile is the worst fucking thing ever created for our industry.

yobid20 · 2025-03-02T15:38:55 1740929935

I have so many questions. Is the model running client side? I was expecting to see webrtc used to send audio to a backend service, but instead i think i the audio waveform processing is done client side? Is it sending audio tokens over websockets to a backend service that is hosting the model? 1/16 slices are enough to accurately be able to recreate an audible sentence? Or is a speech to text model also running client side and are both text and tokens being sent to backend service? Is the backend sending audio tokens back or just text , with the text to speech running 100% client side? Is this using mimi codec or facebook's encodec?

yobid20 · 2025-02-28T21:29:49 1740778189

The people of the US do not stand behind trump. Trump's word is not the word of the American people. Zelensky might not have Trump's support, but know that he does have overwhelming support from pretty much everyone else aside from the Trump clan.

yobid20 · 2025-02-28T07:36:44 1740728204

Cash grab because they see the writing on the wall. OpenAI is collapsing. Their models suck now.

yobid20 · 2025-02-20T14:11:12 1740060672

I didnt even know this existed.

yobid20 · 2025-02-05T02:35:58 1738722958

We need an OpenOpenAI to open source OpenAI who should actually be called ClosedAI, since there's nothing open about them other than their banks to take all your money.

yobid20 · 2025-01-28T12:46:17 1738068377

The power requirement for 5x5090s is 10x higher , so you'll spend far more than $6000 in electricity over time.

yobid20 · 2025-01-28T12:43:29 1738068209

I canceled my OpenAI subscription last night, as did many many others. There were some threads in reddit with everyone chiming in they all just canceled too. imo OpenAI is done, and will go through massive cuts and probably acquired by the end of the year for a very tiny fraction of its current value.

Voloskaya · 2025-01-28T13:39:11 1738071551

You want to bet? The panic around deepseek is getting completely disconnected from reality.

Don’t get me wrong what DS did is great, but anyone thinking this reshape the fundamental trend of scaling laws and make compute irrelevant is dead wrong. I’m sure OpenAI doesn’t really enjoy the PR right now, but guess what OpenAI/Google/Meta/Anthropic can do if you give them a recipe for 11x more efficient training ? They can scale it to their 100k GPUs clusters and still blow everything. This will be textbook Jevons paradox.

Compute is still king and OpenAI has worked on their training platform longer than anyone.

Of course as soon as the next best model is released, we can train on its output and catch up at a fraction of the cost, and thus the infinite bunny hopping will continue.

But OpenAI is very much alive.

lolinder · 2025-01-28T14:12:56 1738073576

> The panic around deepseek is getting completely disconnected from reality.

This entire hype cycle has long been completely disconnected from reality. I've watched a lot of hype waves, and I've never seen one that oscillates so wildly.

I think you're right that OpenAI isn't as hurt by DeepSeek as the mass panic would lead one to believe, but it's also true that DeepSeek exposes how blown out of proportion the initial hype waves were and how inflated the valuations are for this tech.

Meta has been demonstrating for a while that models are a commodity, not a product you can build a business on. DeepSeek proves that conclusively. OpenAI isn't finished, but they need to continue down the path they've already started and give up the idea that "getting to AGI" is a business model that doesn't require them to think about product.

regularfry · 2025-01-28T14:28:34 1738074514

In a sense it doesn't, in that if DeepSeek can do this, making OpenAI-type capabilities available for Llama-type infrastructure costs, then if you apply OpenAI scale infrastructure again to a much more efficient training/evaluation system, everything multiplies back up. I think that's where they'll have to head: using their infrastructure moat (such as it is) to apply these efficiency learnings to allow much more capable models at the top end. Yes, they can't sleep-walk into it, but I don't think that was ever the game.

throwup238 · 2025-01-28T15:15:40 1738077340

> The panic around deepseek is getting completely disconnected from reality.

Couldn’t agree more! Nobody here read the manual. The last paragraph of DeepSeek’s R1 paper:

> Software Engineering Tasks: Due to the long evaluation times, which impact the efficiency of the RL process, large-scale RL has not been applied extensively in software engineering tasks. As a result, DeepSeek-R1 has not demonstrated a huge improvement over DeepSeek-V3 on software engineering benchmarks. Future versions will address this by implementing rejection sampling on software engineering data or incorporating asynchronous evaluations during the RL process to improve efficiency.

Just based on my evaluations so far, R1 is not even an improvement on V3 in terms of real world coding problems because it gets stuck in stupid reasoning loops like whether “write C++ code to …” means it can use a C library or has to find a C++ wrapper which doesn’t exist.

csomar · 2025-01-29T05:53:16 1738129996

OpenAI issue might be that it is extremely inefficient with money (high salaries, high compute costs, high expenses, etc..). This is fine when you have an absolute monopoly as investors will throw money your way (open ai is burning cash) but once an alternative is clear, you can no longer do that.

OpenAI doesn't have an advantage in compute more than Google, Microsoft or someone with a few billions of $$.

miroljub · 2025-01-28T16:48:45 1738082925

> You want to bet?

Why would anyone bet? They can just short the OpenAI / MS stocks, and see in a few months if they were right or not.

IAmGraydon · 2025-01-28T17:08:40 1738084120

OpenAI isn't publicly traded and MSFT's stake is so minor compared to their other business that it will have a negligible impact on their stock price.

icedchai · 2025-01-28T17:08:28 1738084108

1) OpenAI isn't public, so not possible. 2) MS is one of the most well diversified tech companies, so, if anything, this will be a positive.

skeaker · 2025-01-28T22:01:23 1738101683

How is that any different from a bet?

dimgl · 2025-01-28T20:25:13 1738095913

Deepseek is not the only reason. I cancelled my OpenAI subscription because I've replaced it wholesale with Anthropic.

joseda-hg · 2025-01-29T03:05:22 1738119922

I replaced that with kagi, unliminted access to multiple models including Claude, O1 and V3/R1 + you also get Kagi, which was already a good deal

hnben · 2025-01-29T13:24:02 1738157042

oh wow. I have been using kagi premium for months, and never noticed, that their AI assistant now has all the good AIs too. I was using kagi exclusively for search, and perplexity for ai stuff. I guess I can cut down on my subscriptions too. Thanks for your hint. (Also I noticed that kagi has a pwa for their ai assistent, which is also cool)

coliveira · 2025-01-28T14:55:28 1738076128

Computing is not king, DeepSeek just demonstrated otherwise. And yes, OpenAI will have to reinvent itself to copy DS, but this means they'll have to throw away a lot of their investment in existing tech. They might recover but it is not a minor hiccup as you suggest.

nlh · 2025-01-28T15:42:22 1738078942

I just don't see how this is true. OpenAI has a massive cash & hardware pile -- they'll adapt and learn from what DeepSeek has done and be in a position to build and train 10x-50x-100x (or however) faster and better. They are getting a wake-up call for sure but I don't think much is going to be thrown away.

diedyesterday · 2025-01-30T20:02:37 1738267357

...until they distill that "11x efficient training" again ...

generalizations · 2025-01-28T13:55:03 1738072503

In my experience with deepseek and o1, openai's big talk about (and investment into) hallucination avoidance might save their hides here. Deepseek may be smarter, and understand complex problems better, but it also seems to make mistakes more often. (It's as if it's comprehension is better, but it's worse at memorization/recall.)

Need an LLM to one-shot some complex network scripting? as of last night, o1 is still where its at.

flir · 2025-01-28T14:26:09 1738074369

My experience gels with yours. Given the same code sample, DeepSeek has better, more creative suggestions about how to improve it, but it can't implement them without breaking the code. o1, generally, can implement DeepSeek's suggestions successfully. I think chaining them together might have quite interesting results.

wordpad25 · 2025-01-28T15:13:30 1738077210

Is there a tool that can automate chaining like that?

throwup238 · 2025-01-28T15:18:50 1738077530

Aider has an architect mode where it asks one model to plan out the changes and another to actually write the code.

manmal · 2025-01-28T22:33:24 1738103604

I've used it today, with R1 as architect and Sonnet as editor model. So far, this works great. There's no need to use a reasoning model as editor IMO.

Alex (https://alexcodes.app) also does this now btw.

HarHarVeryFunny · 2025-01-28T15:30:22 1738078222

That's ok if all you want to know is which model should I use today, but a test like that is totally dependent on training data, and there is no reason to expect that either DeepSeek-V3 (the base model for R1) or the additional training data for R1 is that same as what OpenAI used for O1 and whatever base model it was built on.

The benchmark comparisons are perhaps, for now, the best way to compare reasoning prowess of R1 vs O1, since it seems pretty certain they both trained for those cases.

I think the real significance of R1 isn't the released model/weights itself, but more the paper detailing (sans training data) how to replicate it, and how effective "distillation" (i.e. generate synthetic reasoning data for SFT) can be to enhance reasoning even without using RL.

wqaatwt · 2025-01-28T14:06:50 1738073210

IMHO o1 it’s still comparable to a lot better for accomplishing actual stuff than DeepSeek. At least for my use cases.

Of course cost is incomparably higher since plus has a very low limit. Which of course is a huge deal.

conradfr · 2025-01-28T14:21:11 1738074071

Why every time there is a new model all the other competitors are declared immediately dead?

HarHarVeryFunny · 2025-01-28T15:36:40 1738078600

The big deal here isn't that R1 makes any other models obsolete in terms of performance, but how cheap it is $2 vs $60 per million output tokens compared to O1 (which it matches in benchmark performance).

O1 vs R1 performance on specific non-benchmark problems is also not that relevant until people have replicated R1 and/or tried fine-tuning it with additional data. What would be interesting to see is whether (given the different usage of RL) there is any difference in how well R1 vs O1 generalize to reasoning capability over domains they were not specifically trained for. I'd expect that neither do that well, but not knowing details of what they were trained on makes it hard to test.

osigurdson · 2025-01-28T14:51:53 1738075913

Because we like drama.

nicce · 2025-01-28T20:32:35 1738096355

1. You can get all the models by buying Kagi subscription (excluding o1). Includes DeepSeek models. You can also feed the assistant with search data that you can filter.

2. If you have GitHub Copilot, you get o1 chat also there.

I haven't seen much value with OpenAI subscription for ages.

sureglymop · 2025-01-29T00:46:36 1738111596

I have Kagi Ultimate and it is nice for this. But a cheaper suggestion would be to use OpenRouter and then use these models via Fireworks or TogetherAI. It also integrates into much more applications. AFAIK Kagi doesn't document a user facing API for the assistant feature.

phaedrix · 2025-01-29T03:46:42 1738122402

Unfortunately those are both 10-15x the cost of deepseek direct.

Deepinfra is pretty cheap though as a deepseek provider.

sureglymop · 2025-01-29T16:28:30 1738168110

Sure. I meant moreso that this would be cheaper than Kagi while providing the same selection of models.

As for deepseek, I couldn't even sign up because my email domain is not on their whitelist. To just try it out for now I don't mind the increased cost.

pants2 · 2025-01-29T06:46:35 1738133195

That's because DeepSeek is subsidizing their API massively to get more training data.

kebaman · 2025-01-28T13:39:37 1738071577

Doesn't Microsoft own 49% of OpenAI? They'll end up with it all as a division of Microsoft.

wqaatwt · 2025-01-28T14:08:23 1738073303

I think they “own” 49% of OpenAI’s net income until a certain very high amount. Not a share of the actual company.

nicce · 2025-01-28T20:34:04 1738096444

They "own" even 75% of profits until Microsoft has recouped its $13 billion investment. 49% comes after that.

yieldcrv · 2025-01-28T17:28:12 1738085292

I disagree, I don't really need "conversational chat responses", I need multimodal

ChatGPT is the king of the multimodal experience still. Anthropic is a distant second, only because it lets you upload images from the clipboard and responds to them, but it can't do anything else like generate images - sometimes it will do a flowchat which is kind of cool, GPT won't do that - but will it speak to you, have tones, listen to you? no.

And in the open source side, this area has been stagnant for like 18 months. There is no cohesive multimodal experience yet. Just a couple vision models with chat capabilities and pretty pathetic GUIs to support them. You have to still do everything yourself there.

There is a huge utility for me, and many others that dont know it yet, if we could just load a couple models at once that work together seamlessly in a single seamless GUI like how ChatGPT works.

yobid20 · 2025-01-26T13:44:50 1737899090

OpenAI is bust and will go bankrupt. The red flags have been there the whole time. Now it is just glaringly obvious. The AI bubble has burst!!!

1W6MIC49CYX9GAP · 2025-01-26T18:16:02 1737915362

> Red flags have been there

Pun intended?

dyauspitr · 2025-01-27T15:15:43 1737990943

Burst? If anything AI has been solidified.

qoez · 2025-01-26T13:50:13 1737899413

They just got 500 billion and they'll probably make that back in military contracts so this is unlikely (unfortunately)

whimsicalism · 2025-01-26T17:06:27 1737911187

that would be like 75%+ of the entire military budget

motoboi · 2025-01-26T18:51:50 1737917510

… in a year. Theirs is for 4 years.

kcb · 2025-01-27T14:32:58 1737988378

The AI bubble is not bursting because of more AI.

yobid20 · 2025-01-22T00:29:55 1737505795

Oh but crypto mining was bad lol wheres the power going to come from