Mechanistic interpretability has made major strides. A tour through Anthropic's "On the Biology of a Large Language Model."
Giving agents a more expressive way to act