They mean the ability to run a large model entirely on the GPU without paging it... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		nl 35 days ago \| parent \| context \| favorite \| on: Asus Ascent GX10 They mean the ability to run a large model entirely on the GPU without paging it out of a separate memory system.

bigyabai 35 days ago [–]

They're basically describing the Jetson and Tegra lineup, then. Those were featured in several high-end consumer devices, like smart-cars and the Nintendo Switch.

nl 35 days ago | [–]

Sure but neither had enough memory to be useful for large LLMs.

And neither were really consumer offerings.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact