Anthropic has restricted the public release of its new Claude Mythos AI model, citing its advanced ability to identify and exploit severe cybersecurity vulnerabilities in major software systems.
Key Points
- Anthropic’s Claude Mythos model has identified thousands of critical security flaws across major operating systems and web browsers.
- The company launched Project Glasswing, a consortium of over 40 organizations including Microsoft, Google, Apple, and Nvidia, to address these risks.
- Anthropic is committing $100 million in usage credits and $4 million in donations to support open-source security efforts through the consortium.
- Participating companies are using the model to proactively patch vulnerabilities and strengthen infrastructure before the technology can be misused.
- CEO Dario Amodei stated the initiative aims to leverage AI capabilities to create a more secure internet rather than allowing the model to pose a public threat.