sigmoid10 3 months ago | on: Mistral raises 1.7B€, partners with ASML

It was true for models up to o3, but there isn't enough public info to say much about GPT-5. Grok 4 seems to be the first major model that scaled RL compute 10x to near pre-training effort.