I wish US sellers would start doing this, too. Nvidia's continuing refusal to put higher amounts of VRAM into its "affordable" cards, combined with AMD's head-scratching resistance to entering the ML space, leaves average consumers who want to run bigger models locally with very few options.
> The GeForce RTX 2080 Ti 22GB has found its way outside China. eBay seller customgpu_official, an upgrade and repair store in Palo Alto, California, sells a similar MSI GeForce RTX 2080 Ti Aero with 22GB for $499. The store is touting the graphics card as a budget alternative for students and startups that want to get their feet wet in AI workloads. The graphics card is allegedly stable in Stable Diffusion, large language models (LLMs), and Llama 2. According to the merchant's website, it has sold over 500 units of the GeForce RTX 2080 Ti 22GB.
The China stuff is just background story I think.
> Nvidia's continuing refusal to put higher amounts of VRAM into its "affordable" cards [...] leaves average consumers who want to run bigger models locally with very few options.
That's market segmentation and product differentiation for you. If you want that much VRAM, you're doing ML; if you want to do ML, you can pay us ML prices.
What 'average consumer' wants to 'run bigger models locally' anyway?
If they made the 'affordable' cards with VRAM ranging from, say, 8GB to 80GB, only gamers would be buying at the bottom end and only MLers at the top. And they can charge the latter a lot more - so even if they did what you want, you'd end up with Apple-esque pricing bumps for beefier/more GDDR chips.
That's not how this traditionally worked. A few generations ago board partners were allowed to solder whatever RAM they wanted onto the GPU board.
Boards from different OEMs were different in meaningful ways and OEMs had the chance to differentiate themselves from the competition with some actual engineering. More RAM, multiple GPUs per board, AGP to PCIe chips, you name it.
Nowadays Nvidia restricts its partners from all that fun and undercuts them with their own models. No wonder EVGA quit this game.
The murder of SLI made me so angry. It was so wonderful to be able to buy a card, then a few years later buy another of the same card for a bargain price and boost your performance by ~60%.
That and increasing RAM down the road for the system itself were some of the few upgrades that made sense.
Usually by the time you could buy a high-end CPU replacement, it was a "better deal" to just replace the entire motherboard/CPU combo and get a whole new computer. But a second video card, some more RAM, additional SSDs, those made sense.
SLI was ultimately killed by the rise of TAA. Interleaving frames across GPUs breaks TAA's assumption that a consistent temporal history is available. That's why NVIDIA went down the path of building a better TAA model/upscaler instead.
TAA is also terrible and turns visuals into a smeary mess, particularly because devs keep building it into games in a way that's nearly impossible to turn off, as in Halo Infinite.
Nah, you hate bad TAA. DLSS is TAA too and it's actually great, DLAA is real nice, and the cool kids are doing some DLSS+DSR thing (bigger intermediate space?). Halo Infinite is the poster child for fucked-up TAA, same for RDR2 - you know what works better than anything else? DLAA, lol. It provides a minimum floor.
Games are just going to use temporal accumulation now. There is too much signal to be gained from re-using past samples. It makes things way cheaper and opens the door to extrapolation and non-uniform sampling etc. It's the least bad of all the options, and DLSS actually is quite good at weighting samples pretty reasonably. DLSS 2.5, 3.0, 3.5 and upwards are actually significantly better and that can be injected back into (non-anticheat) games and the bar will likely continue to be raised. It is a signal processing technique that recovers a lot of signal with very low "noise factor".
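To make "temporal accumulation" concrete, here's a toy sketch (plain NumPy, not anything from NVIDIA's actual DLSS/TAA code): each frame blends the new, noisy sample into a history buffer with an exponential moving average, so the per-pixel error drops well below what any single frame gives you. Real implementations also reproject the history with motion vectors and reject stale samples; this is just the core idea - and it's also why alternate-frame SLI hurts, since each GPU only ever sees half of the history.

    import numpy as np

    def accumulate(history, sample, alpha=0.1):
        """Exponential moving average: keep most of the history,
        blend in a little of the new noisy sample each frame."""
        return (1.0 - alpha) * history + alpha * sample

    rng = np.random.default_rng(0)
    truth = np.linspace(0.0, 1.0, 64)       # the "correct" pixel values
    history = np.zeros_like(truth)

    for frame in range(60):
        noisy = truth + rng.normal(0.0, 0.25, truth.shape)  # one jittered sample per frame
        history = accumulate(history, noisy)

    print("single-frame error:", np.abs(noisy - truth).mean())
    print("accumulated error :", np.abs(history - truth).mean())

Run it and the accumulated error comes out a few times lower than the single-frame noise - that's the "free" signal games don't want to give up.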
The only difference between the consumer RTX cards and a Quadro card is memory (and maybe power pin location, which is better on Quadro cards for rackmounted chassis). If nvidia put more memory inside their consumer cards, professionals would just buy the relatively less expensive consumer cards rather than the Quadro cards which are twice the price.
True, but they weren't doing such a range, it was like 512MB or 1GB, 2GB or 4GB, and there wasn't such ML interest in them as there is/would be today - if there had been that market then EVGA et al. would've been lapping up that premium too, I don't think it really affects the argument who's responsible for placing the chips. (And I wasn't really aware of that shift.)
Yes, and NVIDIA didn't care when it was all selling to gamers. Now it wants crypto bros (obviously a little historical there) and ML people who are using it to make money to pay a higher price point. So they do it by restricting the options available at the affordable level. See also: what happens when you want to use group administration features in Windows.
NVIDIA was already doing this back when it was all selling to gamers - although at least the Quadro cards had the decency to offer ECC memory and some other CAD-relevant bits, as well as signed, production-ready drivers. They've just transitioned from Quadro to RTX as ML became the dominant professional tier.
I think they're saying that Microsoft uses features like group admin as differentiating factors between Windows variants so that they can sell professional/enterprise-oriented variants for a higher price than Home. So, same sort of "pro features in the pricier Pro variants" logic.
The RTX 4090 is so huge and power-hungry you can barely fit two into a case, let alone any more than that.
And even if you replace your case, and your power supply (and maybe your motherboard and CPU too, gotta have enough PCIe lanes) you'll still only have 48GB of vram.
OK, so we can't use an RTX GPU. Good luck even understanding the rest of nvidia's product range. The A40? "The World's Most Powerful Data Center GPU for Visual Computing". V100? "The most advanced data center GPU ever built". H100? "Unprecedented performance, scalability, and security for every data center". A100? "Unprecedented acceleration at every scale to power the world's highest-performing elastic data centers". H200? "The world's most powerful GPU". L4? "The breakthrough universal accelerator".
Good luck figuring out who sells them in your country, or if they're in stock.
Oh, and remember to read the entire spec in excruciating detail. That $2700 L4 24GB is slower than an RTX 4090.
Think you'll launch something in the cloud? With Google Cloud half the cards are only available in some regions - and even if you're in a region that offers a given GPU, maybe there's a shortage and support tells you to just keep requesting GPUs repeatedly until you get lucky.
Yeah I went down the rabbit hole of trying to buy a GPU for local model training and execution about a year ago. My takeaway was “anything that starts with A is too expensive and often trades blows with a 3090 or 4090 anyway”. I ended up with a used 3090. 24GB of VRAM, which is more than some of the A series GPUs, and it was around $700 on eBay at the time. Now apparently they’re around a grand.
I had to upgrade to a 1000W PSU for the 3090 because the 3090s will occasionally voltage spike and trigger OCP on power supplies. TDP is supposed to be like 350W, but for a split second it might pull around 600-700W. Plenty of complaints about it on Reddit and elsewhere. It's a beast but it cooks my home office when I use it. C'est la vie.
Nvidia specifically indicates the VRAM size and whether the product is suitable, so you pick the ones with enough to satisfy your expected needs. What’s there to not understand?
* Is a given product their most advanced, or was it merely their most advanced when the marketing copy was written 5 years ago?
* Is the product actually available for purchase today, and at what price?
* In what form factors - Full height? Half height? Two slots? Three? SXM? Does it have a fan built in?
* What is the relative performance? Half the time the specs all quote different numbers. Why does one product quote 'single precision performance' and 'rt core performance' and 'tensor performance' while another quotes the 'tensor cores' and 'shader cores' and another quotes the 'FP64', 'FP32', 'FP16', 'BFLOAT16', 'TF32' and 'INT8'?
And don't imagine you're going to get away with ignoring those specs. A $2700 L4 24GB is slower than an RTX 4090, for example, because it's for power-efficient servers or something.
Most of this information is indicated on their website or online spec sheets?
Like I said, Nvidia themselves indicate whether it's suitable.
And if you're still confused, or doubt the accuracy, or don't want to spend any time reading spec sheets, I imagine the sales channel folks will be able to guarantee it in writing for a fee.
> Most of this information is indicated on their website or online spec sheets?
You'd think so, wouldn't you?
But nvidia is not that smart. They have decided that the GPUs should be split over at least three different pages, and those pages should be camouflaged.
I can only assume the marketing team are judged based on time spent on site or number of pages viewed, rather than on sales made.
When you go to the home page, point to products and click on "NVIDIA RTX / Quadro" you might expect to find the RTX 4090 and some Quadro products. You will in fact find neither - the RTX 4090 is under 'GeForce' and the 'Quadro' brand is no longer used.
Maybe on the home page you choose "NVIDIA RTX-Powered AI Workstations" - that'll give you a list of their workstation-suitable cards for ML, right? No, that page contains no products at all. The only option is to Find A Partner which links you to a bunch of partners - several of whom do not in fact sell AI Workstations.
Other partners sell AI workstations... without GPUs. As far as HP is concerned, $4000 only gets you the base model workstation, with 32GB of RAM and a 1TB hard disk. If you want GPUs with that, you'll need to call the sales team, who might perhaps deign to sell you one.
Or perhaps you're at nvidia's home page, and you're looking for data centre grade GPUs? For that product page, simply choose whether you mean the DGX, EGX, IGX, HGX, MGX or OVX platform?
But they make it very easy to find the keynote speech by His Excellency Omar Sultan Al Olama on the latest breakthroughs in AI. He does, in fairness, have a very appropriate surname.
This is such a bizarre rant - you're buying an AI GPU and are ranting because you need to understand what kind of workloads you need it for? And while also ranting that you won't pay the price of the obvious high-end models if you're too lazy to figure it out?
What, exactly, is your expectation here? If you want the simple Apple model where you don't need to look at the spec, do the Apple thing and pay up for the pricey GPU.
The Al Olama joke really made my day. I agree with what was said about the model lines and the confusing website. But perhaps only MS and IBM so far know how to do corporate sites…
Did you even finish reading my comment? The whole point of the last sentence is that you don't need to "seriously consider this problem of which GPU to buy" if you're willing to pay more for written guarantees through the sales channel.
It’s not even remotely that simple. Well, I guess it is if you have infinite money. For the rest of us, VRAM is just the tip of the iceberg. There’s what you can actually order and get shipped to you, how much that costs versus MSRP, whether to get one giant GPU or two, or to leave room in your computer/PSU budget for two down the line… it’s the Wild West. I think most of us end up with 3080s or 3090s mostly due to price/what’s available on the market.
There's a lot of chaos that stems from the crypto boom, COVID, the AI boom, the fact that NVidia is essentially a monopoly, some groups are making custom GPUs so they have more VRAM....
right. it's bizarre watching tech heads confused about price segmentation like it's some phantom force and not one of the glorious tools of monopolies and price gouging.
If the card's onboard memory is not enough, the card can use the PC's RAM over the PCIe connection - and then the PCIe bandwidth is very important. But the main reason for fewer lanes is to cut costs.
Yes, but it starts to crawl even on 16x anyway. My firsthand experience with LLMs is that there's little difference between VRAM-to-RAM spills and running on just RAM. Same for LoRA training. When I accidentally used up too much VRAM by e.g. running parallel upscales, it went from 1.3it/s to 30-40s/it and never recovered. NVidia even added a new setting in their control panel to disable CUDA RAM fallback, to work around unintentional slowdowns on low-VRAM cards, afaiu.
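If it saves anyone else that surprise: a rough sketch of checking headroom before loading a model, so it doesn't silently spill over PCIe into system RAM. torch.cuda.mem_get_info is the actual PyTorch call; the 2 GiB margin for activations/KV cache is just a number I picked, not any official guidance.

    import torch

    def fits_in_vram(model_bytes, margin_bytes=2 * 1024**3, device=0):
        """True if the weights plus a safety margin should fit in free VRAM."""
        free, total = torch.cuda.mem_get_info(device)   # both in bytes
        print(f"free: {free / 2**30:.1f} GiB of {total / 2**30:.1f} GiB")
        return model_bytes + margin_bytes <= free

    # e.g. a 7B model in fp16 is roughly 14 GB of weights
    if not fits_in_vram(14 * 1024**3):
        print("expect a crawl: part of the model will live in system RAM")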
> the modified GeForce RTX 2080 Ti will not face any Nvidia driver issues as the graphics card is seemingly supported and working without vBIOS modifications.
That seems very surprising. Bet Nvidia will push out an emergency update soon, and then you'll have bricked and unbricked cards.
Easy enough to get around - don't update your BIOS/drivers when they do. Same game that used to be played with pirated software and even iPhone updates for jailbreaking.
There aren’t really VBIOS updates for GPUs like there are for motherboards. They exist, but for the most part NVidia nerfs cards with driver updates. For a while, to mine Ethereum you just needed to use a specific GPU driver version. It really was that simple.
Aren't they already? That's still not much of a consequence. If anything happens, it'll be years from now, which is totally out of scope for quarterly profit management.
The dream of game graphics was murdered first by crypto and then by LLMs. Is there any hope for a time when the market for high-spec GPUs will actually be games? Or has that ship sailed forever? At least AI has some sort of real world utility I guess, but I still wish neither revolution had happened.
I think for the most part graphics have been "good enough" for a while, and since raytracing is the new hotness that's what games are working on improving rather than pure polygon count or resolution. The market for gamers spending insane amounts of money on hardware is kind of disappearing as well, I know so many people who just buy "upgrades" to their existing hardware to play newer games, but they certainly aren't buying 4090s.
It's a similar issue with EVs, the market they were trying to sell to didn't really exist. They wanted people with enough money to buy an expensive car, but also people who are not expecting oodles of luxury features which would normally come on a car that expensive. So now they are having to lower their EV prices in general because inflation kind of murdered any market they may have had. Add on the EV growing pains of battery tech still being susceptible to reduced range in the cold, increased tire wear, and defects...I don't know many people who will take such a risk for such a price.
Tbh, graphics is not what games lack today. Personally I was fine with what we had 7-10 years ago graphics-wise, and most modern cards hit FPS limits for those games, apart from a few pathological cases. But if you're into graphics, I agree it sucks, and probably forever now. Otoh, those who were into graphics brought us here, didn't they, so maybe it's karma ;). Imagine that the great filter is not supernovae nor nuclear war, but the fact that a species may happen to have good enough eyesight to glimpse into the Pandora's box of computing.
I really do like fancy graphics in games. Once you've seen the good ones, it's pretty off-putting when something looks like it was made 10 years ago (recent example: Starfield).
I think some games, like the modern iterations of CP2077, are just mind-blowing - all gameplay aside. But it's a shame that it costs basically north of $1000 to play it the way its developers hope you will.
Ugh. Cyberpunk I don't like, specifically for its neon darkness and those RTX-enabled monochrome colors and ambients. Same for modern young streamers who like the "pink from one side, blue from the other side" lighting. Also the speech is all f on f on f. Recently I watched an SF/CP comparison video that was supposed to show the superiority of the latter, but it left a very mixed aftertaste. Yes, CP is much more "technological" and "produced", but SF is at least not a bunch of teens who can't change a lightbulb and learned the word "fuck" yesterday.
That's always been the problem... it's hard to justify the expense of making a game few people have the hardware to run.
So instead most games just target whatever the current consoles are, which, aside from right after a new release, is usually relatively old hardware by PC standards - which then makes it hard for PC gamers to justify spending that $1000+ when few games utilize it.
Personally I spend a ton on gaming hardware so I can (ab)use mods that are horribly unoptimized, buggy, and require about as much effort to install and use as would be required to become competent in another programming language.
Minecraft and Skyrim have been my two constant companions for years now.
It depends. If it feels empty or walled off with loading screens then it doesn’t feel large just because it is big in extent. The experienced “size” of a world feels more related to density than how long it takes to traverse. Starfield vs CP2077 feels like an example here again.
Hmm, not sure what you mean. The consumer cards from AMD and Nvidia are still gaming-first, and games utilize new features like ray tracing and DLSS. Speaking as a lifelong PC gamer and current Twitch addict, the industry still seems very healthy.
Games are targeting 3090 or 4090 for decent settings and frame rates. My 2070S is tired as hell. The mid range jumped from $150 to $500 in a couple of years. The enthusiast range jumped from $250 to $1500. That’s painful.
It's happened before. Silly as it is, AMD once decided that a new 8GB card needed a 4GB variant at the last minute before launch, so they shipped the first batch of "4GB" cards with 8GB installed and half of it just disabled in the VBIOS. You could easily flash it to unlock the full 8GB - the one time you really could download more RAM.
Those were the Phenom Triple Core chips you are thinking of. What the pencil mod did was enable the 4th core on the Triple Core chips. However, the 4th Core was likely a factory rejected core for some reason or another, so enabling it could lead to instability.
> However, the 4th Core was likely a factory rejected core for some reason or another, so enabling it could lead to instability
It may have been a defective core, but when they make these cut-down SKUs they need to meet a quota of units regardless of whether that many salvageable defective chips roll off the line, so it's not uncommon for them to disable perfectly functional hardware. Especially as the process matures and yields improve.
I remember doing exactly that with one of my Athlons. I don't remember if it was an Athlon XP or Athlon 64, but they'd removed a jumper on the top and you just needed a 2B pencil to join the pads to get a free upgrade.
I did that with a Phenom II X2 555. The 3rd core was stable up to around 3.8GHz if I remember correctly, whereas the 2 enabled by default were fine up to at least 4.2GHz.
The signaling requirements are way too tight for pluggable VRAM to ever be a thing. If anything we're headed in the other direction, with CPUs losing pluggable memory in order to achieve tighter timings like GPUs do, Apple is already doing it and Intel is set to follow.
Exactly. There's a reason these chips are always surrounding the processor (since the 2000s) and why we haven't seen GDDR-based pluggable memory modules.
For this same reason (timing precision) you see that soldered DDR5 memory often reaches way higher speeds than what's available in DIMM or SODIMM form.
We're already halfway into a heterogeneous future, with chiplets[1] and mixed cores[2][3] etc. Could we expand this to memory, having some soldered (on-chip?) high-speed memory, and then slots for additional slower, yet faster than the alternatives, DIMMs?
Or would the cost of the extra complexity of the memory controller likely not be worth it ever?
> Could we expand this to memory, having some soldered (on-chip?) high-speed memory, and then slots for additional slower, yet faster than the alternatives, DIMMs?
Intel's already doing that with Xeon Max, it has both onboard HBM and an outboard DDR5 interface. It can be configured to run entirely from HBM with no DDR5 installed at all, or use the HBM as a huge cache in front of the DDR5, or to map the HBM and DDR5 into different memory regions to let software decide how to use each. I don't think there's been any indication of that approach filtering down to consumer architectures though, Intel is talking about doing RAM-on-package there but without any outboard memory interface alongside it.
Obviously high-end consumer CPUs already have about 30MB of on-chip memory, with server CPUs reaching a solid 300MB. We just prefer to call it L2 and L3 cache. If we add more memory in a chiplet format I suspect mainstream CPUs would simply expose (or rather hide) it as L3 or L4 cache.
Most software isn't even NUMA aware, and would completely fail to take advantage of a tiered memory hierarchy if it was given the option. But if we make the fast memory a big cache and let the CPU worry about it it's a "cheap" win.
Though there is the Xeon Phi, which has about 16GB of on-package memory that can either be configured as cache or as "scratchpad" memory. But of course that's not meant for general-purpose software.
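To make the "NUMA aware" point a bit more concrete: when a box exposes fast and slow memory as separate regions, they generally show up as distinct NUMA nodes, and it's on the software to notice. A rough sketch of what that looks like on Linux, just reading the standard sysfs files (the tiering interpretation is mine, nothing Intel-specific):

    from pathlib import Path

    # Each NUMA node appears as /sys/devices/system/node/nodeN on Linux.
    for node in sorted(Path("/sys/devices/system/node").glob("node[0-9]*")):
        # First line of meminfo looks like: "Node 0 MemTotal:  263673856 kB"
        first = (node / "meminfo").read_text().splitlines()[0]
        total_kb = int(first.split()[3])
        print(f"{node.name}: {total_kb / 2**20:.1f} GiB")

    # Software that wants the fast tier would then bind allocations to that node
    # (numactl --membind, libnuma, etc.) instead of letting the kernel decide.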
I'm looking forward to the performance, but not looking forward to higher capacity RAM being segmented off to overpriced "professional" SKUs like high VRAM capacity is on GPUs. Currently you can run up to 192GB RAM on a consumer CPU platform but I doubt RAM-on-package consumer parts will scale that high.
Yeah, manufacturers love this evolution because it means everyone who wants or needs a lot of memory will be forced to buy for their projected memory needs over the whole life cycle of the product on day one, and the only place they can get it is at inflated prices from the vendor.
I wonder how they will do this in the workstation and server space, I don't really see how they can do away with socketed CPUs.
I wonder if we will go back to slotted CPUs, with a SOM-style board carrying the CPU and memory being plugged into a motherboard/chassis that's really just an I/O backplane. What would multi-CPU communication look like then?
I guess we already have memory being pinned to a NUMA node and connecting to others via a vendor specific interconnect, so maybe it's not that strange and different from today.
> I wonder how they will do this in the workstation and server space, I don't really see how they can do away with socketed CPUs.
I'm guessing the endgame will be consumer parts all being RAM-on-package with no external memory interface, and workstation/server parts will take a hybrid approach like Intel is already doing with the Xeon Max chips which have 64GB HBM on the package and an external DDR5 interface supporting terabytes of slower bulk memory.
I haven't upgraded this CPU yet, as it's still too new, but my last motherboard got two new CPUs, and the one before that 2-3? (Maybe more - it was a while ago. Thanks, AMD.)
Given that AMD has been releasing AM4 CPUs since 2016, I think it's reasonable to assume that many of those who know how to build computers in the first place have upgraded their CPU. Why switch the whole motherboard/CPU combination when you can just plug in a better CPU?
Well, if you've been using Intel platforms you do because Intel obsoletes the chipsets at a rapid pace so there often isn't anything appreciably better to upgrade to on the platform.
One could imagine a two-deck PCB, where you have another PCB underneath the main one for additional close-in memory chip locations, with a high-density vertical interconnect.
EU companies selling in the US have to follow US regulations, US companies selling in the EU have to follow EU regulations. And in both cases, only for activities in the respective market.
For example, Apple would be free to sell Lightning-port iPhones in the US and USB-C iPhones in the EU. Or not make USB-C iPhones at all and not sell any iPhones in the EU, if they don't want to be "leeched". Same for Nvidia and this hypothetical regulation for swappable RAM (which is never going to happen, because it isn't technically viable).
If you do business with citizens of another country, in that country, you should expect to have to follow that country's regulations.
Your position is literally American exceptionalism. Do you think that because of NATO that US companies should be able to ignore EU consumer protection laws?
This is not a new kind of mod; it has existed for many years now and has been demonstrated on 30-series GPUs as well (see https://news.ycombinator.com/item?id=26389996 from 3+ years ago).
I am still unsure why doubling the VRAM works so seamlessly, but I wonder if you need modifications at the driver/vBIOS level to fully utilize the new capacity.
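If anyone picks one of these cards up, a quick way to sanity-check what the driver actually reports is NVIDIA's NVML bindings (the nvidia-ml-py / pynvml package; the calls below are the standard NVML ones). It won't tell you whether the extra chips are stable, just whether the full 22GB and the expected VBIOS show up:

    import pynvml  # pip install nvidia-ml-py

    pynvml.nvmlInit()
    handle = pynvml.nvmlDeviceGetHandleByIndex(0)

    name = pynvml.nvmlDeviceGetName(handle)          # str (bytes on older bindings)
    mem = pynvml.nvmlDeviceGetMemoryInfo(handle)     # .total / .free / .used in bytes
    vbios = pynvml.nvmlDeviceGetVbiosVersion(handle)

    print(f"{name}: {mem.total / 2**30:.1f} GiB VRAM, VBIOS {vbios}")
    pynvml.nvmlShutdown()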
Not sure why this comment got nuked, but I vouched for it. Perhaps automatically falsely detected as spam? Anyways the first and last link show the modification. Very cool.