Build High-Quality AI Agents and Manage Them with Rigorous Evaluations