Better support than MPS and nothing Apple is shipping today can compete with eve...

MangoToupe · 2025-11-10T18:04:51 1762797891

Presumably the second point is irrelevant if you're choosing among devices with unified memory.

bigyabai · 2025-11-10T18:35:24 1762799724

It is not. Unified memory is not a panacea, it says nothing about the compute performance of the hardware.

The Spark's GPU gets ~4x the FP16 compute performance of an M3 Ultra GPU on less than half the Mac Studio's total TDP.

MangoToupe · 2025-11-10T19:34:44 1762803284

right, but that doesn't describe a "high end consumer CUDA device". Nothing under that description has unified memory.

bigyabai · 2025-11-10T20:42:00 1762807320

Every CUDA-compatible GPU has had support for unified memory since 2014: https://developer.nvidia.com/blog/unified-memory-cuda-beginn...

Can you be a bit more specific what technology you're actually referring to? "Unified memory" is just a marketing term, you could mean unified address space, dual-use memory controllers, SOC integration or Northbridge coprocessors. All are technologies that Nvidia has shipped in consumer products at one point or another, though (Nintendo Switch, Tegra Infotainment, 200X MacBook to name a few).

nl · 2025-11-10T21:54:45 1762811685

They mean the ability to run a large model entirely on the GPU without paging it out of a separate memory system.

bigyabai · 2025-11-10T22:09:11 1762812551

They're basically describing the Jetson and Tegra lineup, then. Those were featured in several high-end consumer devices, like smart-cars and the Nintendo Switch.

nl · 2025-11-10T22:22:03 1762813323

Sure but neither had enough memory to be useful for large LLMs.

And neither were really consumer offerings.

whywhywhywhy · 2025-11-11T09:36:32 1762853792

Depends if you care how fast the result arrives. Imagery gen is a very different tool at <12 seconds an image vs nearer to 1 minute.