A Case for Self-Managing DRAM Chips (arxiv.org)
24 points by rbanffy on Sept 22, 2022 | 18 comments



I always wondered why DRAM didn't just, after some Moore's law point, have refresh control built in. There's probably a reason I've forgotten about. Nice to see someone at least thinking about it.


You'd have to introduce a handshake between the DRAM and the memory controller because some internal operation might already be in progress when the memory controller sends a command. You'd probably also want to sync all the DRAM chips on a DIMM so the memory controller doesn't have to deal with commands being partially blocked.
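
To make that concrete, here is a minimal sketch of what such a handshake could look like, modeled in C rather than RTL. The chip struct, command names, and the `busy_until_ns` field are all hypothetical; the point is only that the controller could no longer assume the chip is always ready:

    #include <stdint.h>

    /* Hypothetical model of a self-managing DRAM chip that may be busy
     * with an internal operation (refresh, scrubbing, remapping, ...). */
    typedef struct {
        uint64_t busy_until_ns;   /* chip is unavailable until this time */
    } dram_chip;

    typedef enum { CMD_ACCEPTED, CMD_RETRY } cmd_status;

    /* The memory controller can no longer fire-and-forget: it must check
     * a ready/busy indication (the "handshake") before a command lands. */
    cmd_status dram_issue(dram_chip *chip, uint64_t now_ns)
    {
        if (now_ns < chip->busy_until_ns)
            return CMD_RETRY;      /* internal operation in progress */
        return CMD_ACCEPTED;       /* command proceeds with normal timing */
    }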


But you have that handshake anyway now, between the CPU and the memory controller, effectively at least, don't you?


No, you don't. AFAIK DRAM chips just sit idle until the memory controller gives them a command and then they perform the command immediately.


But the memory controller still has to refresh them. So at some point you have to negotiate a wait state, don't you?


Yes - everything 'upstream' of the memory controller has variable latency, partly because of the chance that a read or write request has to wait for a refresh operation.
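
As a rough back-of-the-envelope illustration (the numbers below are typical DDR4-class datasheet values, not from the paper): a read that arrives just after a refresh command is issued can stall for the whole refresh cycle time tRFC on top of the normal access latency.

    #include <stdio.h>

    int main(void)
    {
        /* Illustrative DDR4-class timings; check your part's datasheet. */
        double t_rfc_ns  = 350.0;  /* refresh cycle time, ~8 Gb die */
        double t_read_ns = 45.0;   /* rough activate + CAS read latency */

        /* Best case: no refresh in flight. Worst case: the read arrives
         * right after REF is issued and must wait out the entire tRFC. */
        printf("best case  : ~%.0f ns\n", t_read_ns);
        printf("worst case : ~%.0f ns\n", t_rfc_ns + t_read_ns);
        return 0;
    }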


They design for a very, very small edge on the signal state transition... so yes, technically, but it's meant to be negligible, I believe (not a hardware designer, but I've looked over the shoulder a few times).


They can sleep, but waking up has a large latency.


It has been a while since I wrote a DRAM controller, but it's all about managing latencies. To achieve maximum throughput, the controller needs to issue operations in a specific order, different for each workload.

DRAM has a self-refresh mode.
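
For a flavor of the kind of reordering meant here, this is a toy sketch loosely in the spirit of first-ready, first-come-first-served scheduling; the request and bank-state types are made up for illustration:

    #include <stddef.h>

    typedef struct { int bank; int row; } request;

    /* Pick the next request to issue: prefer one that hits the row already
     * open in its bank (cheap), otherwise fall back to the oldest request.
     * open_row[b] holds the currently open row of bank b, or -1 if closed. */
    size_t pick_next(const request *q, size_t n, const int *open_row)
    {
        for (size_t i = 0; i < n; i++)
            if (open_row[q[i].bank] == q[i].row)
                return i;          /* row hit: issue out of order */
        return 0;                  /* no hit: oldest request first */
    }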


PLEASE UPDATE YOUR DRAM FIRMWARE TO REMAIN SECURE.


IBM has some serial RAM they’ve developed that has controllers requiring firmware. It’s coming, but one can argue modern DRAM is storage-like given all the caching going on.


The "On The Metal" podcast had an interview with someone who had to deal with debugging DRAM drivers. But, I don't remember which one it was...


> new low-cost DRAM architecture

Provides no evidence that putting more logic into the DRAM will somehow make it cheaper.


The DRAM chips don't actually have to be cheaper. They just need to enable the system as a whole to be more cost effective. If putting a bit more logic in the DRAM makes it significantly cheaper and quicker to design and validate a processor's memory controller, and enables better performance and power efficiency from the memory, then it's very likely a net win even if the marginal cost of DRAM chips went up slightly.


While you're correct that overall system cost is what matters, most functionality is offloaded to the controller because a single controller often controls multiple DRAM chips (e.g., the chips all share a CA bus [1]), so it's cheaper to put the functionality in the controller.

That said, it gets a bit nuanced, because as we make faster and faster DRAMs, we need more complicated input/output electronics (PHY and ECC). GDDR actually has self-refresh as part of the spec and even allows DRAM vendors to put a PLL inside so you can run a half-rate clock, which technically makes it "Quadruple Data Rate" instead of "Double Data Rate". But usually with GDDR, you're running one controller per DRAM; clamshell mode is not super common in my experience.

[1] I'm not super well versed on DDR4 and all of its configurations. So someone who is can elaborate if they would like.


Putting a controller in the DRAM allows that controller to detect bad memory cells and repair data (ECC-like) or remap data to other addresses.

Both of those things would be possible while implementing the regular 'dumb' DRAM interface, but any error detection or remapping would have to operate within the latency budget of the module, since the latency is fixed. Typically that means not much remapping can be done, and DRAM yields have to be really good.

With this change, RAM becomes robust to far more silicon defects, much like NAND flash is. That's what makes it cheaper.
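
A toy sketch of the kind of row remapping an on-die controller could do transparently; the table layout and sizes here are made up for illustration:

    #include <stdint.h>

    #define SPARE_ROWS 64   /* hypothetical pool of spare rows per bank */

    /* Map rows found defective (at test or at runtime) to spare rows.
     * Because the chip manages its own timing, this lookup no longer has
     * to squeeze into a latency fixed by the external interface. */
    typedef struct {
        uint32_t bad_row[SPARE_ROWS];
        uint32_t spare_row[SPARE_ROWS];
        uint32_t used;
    } remap_table;

    uint32_t remap_row(const remap_table *t, uint32_t row)
    {
        for (uint32_t i = 0; i < t->used; i++)
            if (t->bad_row[i] == row)
                return t->spare_row[i];
        return row;                 /* not remapped */
    }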


Pseudo-SRAM is a thing. A very expensive thing compared to unmanaged DRAM.


For what it's worth, it's still way cheaper than true SRAM. And the architecture suggested is still way closer to unmanaged DRAM than to PSRAM.



