AI is a Floor Raiser, not a Ceiling Raiser
Add Autonomy Last
Yes or No, Please: Building Reliable Tests for Unreliable LLMs