Hacker News | smcleod's comments

LiteLLM requires you to call them to discuss pricing to enable SSO... sure, part of it is open source, but not the part you need to scale it.

16GB? In 2025 that seems very small for anything other than gaming.

These are gaming cards. If you think AMD is lacking here with just 16GB I can only assume you never bought NVIDIA gaming cards in the last 10 years.

You can buy a used NVIDIA Tesla P40 with 24GB from 2016, and unlike AMD's cards of that era, they still have CUDA support. The only thing they're missing is support for the latest data types like fp8.

I don't know if I'm in a parallel universe or hallucinating, but these are graphics cards, designed and created for people to install in their desktop PCs and run games, just as they have been doing for decades now.

What's with the P40, CUDA and fp8? Seriously people, chill. AI has been using graphics cards because that was the best available route at the time, not the other way around.

Otherwise I must question why you don't talk about DisplayPort 2.1, DirectX 12, FidelityFX, and the dozen other graphics-related features.


Because, unless you are a Big Corp, you can't afford to buy H100s. Being able to run AI on consumer hardware is essential for many AI hackers out there. And no, renting hardware on "AI clouds" is not reliable: it's still outrageously expensive, and the rug can be pulled out from under you at any moment.

My rig is 4x 4090s, and it already cost me a fortune (still less expensive than a single H100). I would have happily used cheaper AMD cards, but they are not available, for reasons like this.
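Back-of-the-envelope arithmetic for why VRAM is the binding constraint (a rough sketch; the 20% overhead factor and the example model size are illustrative assumptions, not measurements):

```python
def vram_needed_gb(params_b: float, bits_per_weight: int, overhead: float = 1.2) -> float:
    """Rough VRAM estimate: weights at the given quantisation,
    plus an assumed ~20% overhead for activations and KV cache."""
    weight_gb = params_b * bits_per_weight / 8  # billions of params -> GB
    return weight_gb * overhead

# A 70B model at 4-bit needs roughly 42 GB: more than any single
# consumer card holds, but within reach of two 24GB cards.
print(round(vram_needed_gb(70, 4), 1))
```

By this estimate a 4x 24GB rig (96GB) covers 70B-class models comfortably, which is exactly the niche where an affordable high-VRAM AMD card would have landed.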

Last time I checked, this site was called "Hacker News", not "Bigtech Corporate Employee News".


> I don't know if I'm in a parallel universe or hallucinating, but these are graphics cards, designed and created for people to install in their desktop PCs and run games.

Graphics cards have been used for more than running games since even before they became the general-purpose compute engines that NVIDIA reshaped the market around decades ago.

Heck, NVIDIA has had multiple driver series for the same cards because of this, even before AI was a major driver.


> You can buy used nvidia tesla p40 with 24GB from 2016

Then do that instead if it fits your workload?

There's more to a video card than the amount of RAM, though.


Which games support fp8?

I'm actually happy about that. It specifically targets gamers who want to play video games.

NVIDIA offers one new consumer card with over 16GB.

And that one card is $2,500 in practice and sold out (it launched with something like 500 units of inventory across the USA). The 5090 is pure unobtainium right now unless you feel like paying scalper prices of $3k+.

> And that one card is $2,500 in practice and sold out (it launched with something like 500 units of inventory across the USA).

And the power connector melts if you look at it funny.


These cards are $550-$600

16GB is great for what these are: Gaming cards.

Even on the NVIDIA side you have to spend $2,000 (likely more) to get more RAM than that.

You could buy 3 of these for the price of a single NVIDIA 5090.


I suppose it's disappointing that the 6800 XT launched over 4 years ago at $649 with 16GB as well. Inflation can partly explain this, but still: we had been used to progress across the board each generation, at least with AMD.

On the Nvidia side every GPU released in the last ten years competes against what AMD is doing today. People who need more VRAM go with dual 3090.

For the low low price of $2.5k you can outcompete a $600 card. Wow.

They don't want to make cards that are good for AI at the price gamers will pay: that would annoy the gamers and cut into their AI chip sales.

NVIDIA created the AI chip market on the backs of gamers, but that doesn't mean that trend will continue forever. The demand is there, so they can extract greater profits and margins on AI chips.


To be fair, you can buy ~3 of these for the price Nvidia charges for 24GB/32GB models.

If people want more VRAM the 24GB 7900xtx is right there and has been there for years.

Yes, but you can't easily put 3 GPUs in one PC.

High-RAM cards are just nice for their longer lifespans, though.

Since Nvidia dominates the AI market anyway, AMD has the opportunity here to not try and help them protect it, and sell cards with more RAM. IMO that’d be a good move. They’d be keeping gamers happy by not hampering their product artificially. And, some poor grad students might use their cards to build the “next thing.”


It's a gaming card.

GPT-4.5 is insanely overpriced; it makes Anthropic look affordable!

Especially not one owned by Bezos

IMO the only good feature in it is the notification summaries; they're pretty accurate and useful.

Siri on the other hand is embarrassingly bad.


> the notification summaries, they're pretty accurate and useful.

Far from a popular opinion.

https://arstechnica.com/apple/2024/11/apple-intelligence-not...

https://www.bbc.com/news/articles/cge93de21n0o


Siri doesn’t use Apple Intelligence. It is 10+ year-old technology that does not do learning. We won’t see a truly smarter Siri until June, when Apple previews the LLM Siri.


> "We won’t seen a truly smarter Siri until June when Apple previews the LLM Siri."

Really, they should have started with Siri the moment LLMs became a thing, instead of trying to graft battery-draining AI assistants onto existing apps in often-not-very-useful ways!


Perhaps replacing Siri with an LLM within the resources of a phone is complicated and takes time to get right. I’m willing to wait to see where they go with this a while longer.

> Siri on the other hand is embarrassingly bad.

and any attempt to replace this crapware with something better is sabotaged by the App Store guardians


> Siri on the other hand is embarrassingly bad

Always has been (c.2012)


But hey, I bet the metrics are up this year!

You can no longer disable it without breaking CarPlay, and it’s started randomly activating when I press the power button on my phone while CarPlay is on.

My theory is that typing a GPS destination into my car dashboard at a stoplight is too dangerous, and typing it on a phone keyboard is also too dangerous, but unlocking the phone, frantically yelling “cancel” until you find the “STFU Siri” button on the steering wheel, dismissing random useless carplay displays and THEN typing the address into the phone is completely safe.

On a related note, they keep adding more distractions to the apple maps carplay screen. The other day, I was at an intersection, and the map rendered the sidewalks similarly to the road. It knew I was driving. Does it think using sidewalks as shortcuts is legal or something?


It should be noted here that they're talking about cold water (above freezing), not the ice baths a lot of people seem to think of, which are quite a fad in the "sports medicine" industry.


It's a meta study that potentially includes these as well as other cold water exposure:

> undergoing acute or long-term CWI exposure via cold shower, ice bath, or plunge with water temperature ≤15°C for at least 30 seconds.


Yes, but also above 7°C


Yes, they could work on the grammatical ambiguity of their 'or'. From the studies list, it appears they found no ice baths to study, perhaps due to safety concerns.


Actual is cool, but its lack of solid integration with the open banking standard (as used in Australia and other countries) makes it a bit annoying, since its partner integrations are pretty rough.


The open banking standard isn't actually all that open.

As a customer you cannot gain access, at least not easily. You need to apply for accreditation as a third-party financial services provider.


I get around 4-5 t/s with the unsloth 1.58-bit quant on my home server, which has 2x 3090s and a Ryzen 9 with 192GB of DDR5. Usable but slow.


How much context?


Just 4K. Because DeepSeek doesn't allow for the use of flash attention, you can't run a quantised KV cache (qkv).
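For scale, here is why an unquantised KV cache makes context so expensive. This is a generic-transformer estimate with placeholder layer/head numbers (DeepSeek's actual MLA attention changes the math considerably):

```python
def kv_cache_gb(layers: int, kv_heads: int, head_dim: int,
                ctx: int, bytes_per_elem: float = 2.0) -> float:
    """Size of the K and V caches for a plain transformer.
    Defaults to fp16 (2 bytes per element)."""
    return 2 * layers * kv_heads * head_dim * ctx * bytes_per_elem / 1e9

# Hypothetical 60-layer model, 8 KV heads of dim 128, 4K context:
print(kv_cache_gb(60, 8, 128, 4096))       # fp16: ~1.0 GB
print(kv_cache_gb(60, 8, 128, 4096, 0.5))  # 4-bit: ~0.25 GB
```

The cache grows linearly with context length, so being stuck at fp16 instead of a 4-bit quantised cache roughly quarters the context you can fit in the same memory budget.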


That qkv PR was mine! Small world.


Stats is great! The only issue I've had with it is that each time it updates itself, the unsigned binary gets flagged and can't be opened until you run xattr -rc /Applications/Stats.app on it.


Huh? That doesn't happen to me. Mine is signed.

