Anthropic Says Its New AI Model Is So Good at Finding Security Risks, You Can't Use It

Anthropic has restricted the public release of its new Claude Mythos AI model, citing its advanced ability to identify and exploit severe cybersecurity vulnerabilities in major software systems.

Key Points

  • Anthropic’s Claude Mythos model has identified thousands of critical security flaws across major operating systems and web browsers.
  • The company launched Project Glasswing, a consortium of over 40 organizations including Microsoft, Google, Apple, and Nvidia, to address these risks.
  • Anthropic is committing $100 million in usage credits and $4 million in donations to support open-source security efforts through the consortium.
  • Participating companies are using the model to proactively patch vulnerabilities and strengthen infrastructure before the technology can be misused.
  • CEO Dario Amodei stated the initiative aims to leverage AI capabilities to create a more secure internet rather than allowing the model to pose a public threat.

Why It Matters

This development marks a turning point: AI capabilities have surpassed human efficiency at discovering software vulnerabilities, which calls for new collaborative security frameworks. By restricting access to the model, Anthropic is trying to balance the benefit of rapid defensive patching against the significant risk of malicious exploitation.
CNET Published by Omar Gallaga