catigula, 12 days ago, on: AI should only run as fast as we can catch up

It seems like you don’t understand reinforcement learning. The signal is reinforced because it correlates with the intended behavior; hacking the signal itself is misalignment.