I see us not getting rid of CPU, but CPU and GPU being eventually consolidated i...

nine_k · 2026-03-04T11:36:53 1772624213

CPUs and GPUs have very different ways of scheduling instructions, requiring somewhat different interfaces and programming models. I'd venture to say that a GPU and CPU with unified memory access (like Apple's M series, and most mobile chips) is already such a consolidated system.

amelius · 2026-03-04T13:01:53 1772629313

Nvidia Jetson also has unified memory access, btw.

jagged-chisel · 2026-03-04T10:39:59 1772620799

Agreed. Much like “RISC is gonna replace everything” - it didn’t. Because the CPU makers incorporated lessons from RISC into their designs.

I can see the same happening to the CPU. It will just take on the appropriate functionality to keep all the compute in the same chip.

It’s gonna take a while because Nvidia et al like their moats.

StilesCrisis · 2026-03-04T13:27:58 1772630878

CISC only survived because CPUs now dedicate a ton of silicon to decoding the CISC stream into RISC-y microcode. RISC CPUs can avoid this completely, but it turns out backwards compatibility was important to the market, and "instruction decode" just costs some extra transistors plus like +1 pipeline stage or something.

ndiddy · 2026-03-04T19:57:07 1772654227

> CISC only survived because CPUs now dedicate a ton of silicon to decoding the CISC stream into RISC-y microcode.

For Intel CPUs, this was somewhat true starting from the Pentium Pro (1995). The Pentium M (2003) introduced a technique called "micro-op fusion" that would bind multiple micro-ops together so you'd get combined micro-ops for things like "add a value from memory to a register". From that point onward, the Intel micro-ops got less and less RISCy until by Sandy Bridge (2011) they pretty much stopped resembling a RISC instruction set altogether. Other x86 implementations like K7/K8/K10 and Zen never had micro-ops that resembled RISC instructions.

zephen · 2026-03-04T16:04:11 1772640251

> CPUs now dedicate a ton of silicon to decoding the CISC stream into RISC-y microcode.

In absolute terms, this is true. But in relative terms, you're talking less than 1% of the die area on a modern, heavily cached, heavily speculative, heavily predictive CPU.

FartyMcFarter · 2026-03-04T20:12:17 1772655137

Didn't there use to be a joke about Intel being the biggest RAM manufacturer (given the amount of physical space caches take on a CPU)?

zephen · 2026-03-04T21:43:34 1772660614

I hadn't heard that, but certainly, there must have been many times when Intel held the crown of "biggest working hunk of silicon area devoted to RAM."

zozbot234 · 2026-03-04T10:48:34 1772621314

> It will just take on the appropriate functionality to keep all the compute in the same chip.

So, an iGPU/APU? Those exist already. Regardless, the most GPU-like CPU architecture in common use today is probably SPARC, with its 8-way SMT. Add per-thread vector SIMD compute to something like that, and you end up with something that has broadly similar performance constraints to an iGPU.

junon · 2026-03-04T14:51:21 1772635881

We're getting there already with e.g. Grace-Blackwell chips.