Shipping AI at enterprise scale: multi-step agents, reliability, and organizational reality.
1 post
The interesting failure modes in agent systems come from tool surfaces that look fine to a human and confusing to a model.