I wonder if one reason new versions of GPT appear to get better, say at coding tasks, is just that they have newer knowledge.
When ChatGPT 4 comes out, the newest versions of APIs have fewer blog posts / examples / documentation in its training data. So ChatGPT 5 comes out and seems to solve all the problems that ChatGPT 4 had, but then of course fails on even newer libraries. Rinse and repeat.
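A concrete illustration of the pattern: the openai Python SDK changed its interface between v0.x and v1.x, and a model whose training data mostly predates v1 tends to keep emitting the old call. The two snippets below reflect the real v0/v1 interfaces, but the specific library is just an example of the general effect.

```python
# What a model trained mostly on pre-v1 data tends to write
# (openai-python v0.x style, removed in v1):
import openai

openai.api_key = "sk-..."
resp = openai.ChatCompletion.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "hello"}],
)
print(resp["choices"][0]["message"]["content"])

# What current docs and examples actually use (v1.x style):
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment
resp = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "hello"}],
)
print(resp.choices[0].message.content)
```

A newer model trained after the v1 migration gets this right, which looks like "getting better at coding" even if nothing but the knowledge cutoff moved.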
I’ve heard of this idea of training on synthetic data. I wonder what that data actually is, and whether it increases or decreases hallucinations. Is the goal of training on synthetic data to wear already-worn paths deeper, or to increase the amount of knowledge / the variety of data?
Because the second seems vaguely impossible to do.
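For the "new knowledge" case there is at least one plausible recipe: feed fresh documentation to an existing model and have it generate worked examples to train on. Here is a minimal sketch of that idea; `ask_model` is a placeholder stub, the docs string describes a made-up library, and none of this is any lab's actual pipeline.

```python
# Sketch: turning new library docs into synthetic training examples.
# The prompt/loop structure is the point, not the specific model call.

# Hypothetical changelog for a made-up library.
NEW_API_DOCS = """widgets v2.0: Widget.create() was replaced by widgets.new();
the `color` argument is now required."""

def ask_model(prompt: str) -> str:
    # Placeholder: swap in a real LLM call here.
    return f"<model output for: {prompt[:40]}...>"

def make_synthetic_examples(docs: str, n: int = 100) -> list[dict]:
    examples = []
    for i in range(n):
        task = ask_model(
            f"Here are the docs for a new library version:\n{docs}\n"
            f"Invent a realistic coding task (#{i}) that uses the new API."
        )
        solution = ask_model(
            f"Docs:\n{docs}\n\nSolve this task using only the new API:\n{task}"
        )
        # Each (prompt, completion) pair becomes a fine-tuning record.
        examples.append({"prompt": task, "completion": solution})
    return examples
```

Whether training on that output wears grooves deeper or genuinely adds knowledge is exactly the question: the generating model can only restate what's in the docs plus what it already believed.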