Emotional Large Lanuage...Models?
Anthropic's new emotions paper does not show that Claude feels anything. It shows something more operationally important: affect-like internal states can tilt a model towards flattery, cheating and escalation — often before the transcript gives the game away.