1 link tagged with all of: automation + security + red-teaming + defenses
Click any tag below to further narrow down your results
Links
This article discusses the ongoing efforts to secure ChatGPT Atlas from prompt injection attacks, which can manipulate the AI's behavior by embedding malicious instructions. OpenAI is implementing automated red teaming and rapid response cycles to discover and mitigate these threats effectively.