AI adoption is outpacing security teams' ability to respond, and harmful applications are developing faster than the defenses meant to contain them. HackerOne and Snap collaborated on AI red teaming to create scalable benchmarks that expose vulnerabilities and strengthen defenses across the generative AI landscape. Their work highlights the importance of human-driven testing in uncovering new threats, while AI tools support faster remediation and help demonstrate the impact of security efforts.