Towards a Science of AI Agency: Modelling, Measuring, and Intervening on Goal-Directed Behaviour
Explore model reasoning traces on grid tasks