Bridging Gaps in AI Safety: New Framework for Comprehensive Evaluation
I’m excited to share our new paper on AI safety evaluation! One of the biggest challenges I’ve encountered in my work is the confusion caused when different communities use the same terms to mean very different things. For example, “AI testing” means one thing to policymakers, another to AI system/software testers, and yet another to…













