“Dangerous” AI models are coming no matter what

Antropic, a company that specializes in artificial intelligence (AI) technology, has taken its AI models offline after receiving a directive from the United States government to comply with new export-control regulations. The decision was made due to concerns that these models could be used for malicious purposes.

The models in question are Claude Fable 5 and Mythos 5, which were released by Anthropic earlier this year. These advanced AI models have been touted as having the ability to find software vulnerabilities and help defenders patch them, but also have the potential to be exploited by bad actors. The company itself acknowledged this double-edged sword in its initial release of these models.

Anthropic initially released a version of Mythos called “Mythos Preview” to a select group as part of a working group known as Project Glasswing. This allowed researchers and experts to test the capabilities of the model while also implementing safeguards to prevent it from being used for malicious purposes. However, with the release of Claude Fable 5 to the general public, Anthropic introduced specific blocks on its ability to give responses to questions about biology and cybersecurity.

It seems that these measures may not have been enough to satisfy the concerns of the White House, which has placed restrictions on both models. The administration’s stance is that these models pose a national security risk due to their potential for exploitation by malicious actors.

But experts are saying that this institutional clash is just delaying or masking a hard truth: advanced AI capabilities will become increasingly common in the near future. Tarah Wheeler, chief security officer of TPO Group, a specialist cybersecurity consulting firm, notes that other companies may already have similar capabilities and are waiting to see how Anthropic’s regulatory environment plays out before releasing them.

Anthropic has emphasized this point since the launch of Mythos Preview. Logan Graham, its frontier red team lead, highlighted the issue when speaking with WIRED in April: “We need to prepare now for a world where these capabilities are broadly available in 6, 12, 24 months.”

This isn’t just about Anthropic or its models; it’s about the broader trend of AI technology development. OpenAI has also released a private model with advanced cybersecurity capabilities, and researchers note that existing AI offerings can already be used for sophisticated vulnerability-hunting and exploit development.

A group of cybersecurity leaders has written an open letter to the administration arguing that their approach is misguided: “It’s not one model; it’s the general trend of technology.” Smaller, cheaper, and open-source models are becoming increasingly capable, sometimes matching or even surpassing more advanced technology.

This has left experts calling on governments to develop broader, more transparent plans for containing the risks associated with AI capabilities in sensitive areas. “The policy question is not whether a technology has risk,” says Chris Wysopal, cofounder of Veracode. “The question is whether a specific restriction meaningfully reduces that risk or whether it mainly slows down the people trying to make systems safer.”

The White House’s action may be seen as a short-term fix, but experts warn that it will not address the fundamental issue at play: the rapidly advancing capabilities of AI technology. As we move forward, it is crucial for governments and industry leaders to engage in open discussions about how to mitigate the risks associated with these advancements.

One thing is certain – the genie isn’t going back into the bottle. AI models with advanced hacking capabilities will soon be a common occurrence, regardless of regulatory measures. The real challenge lies not in preventing this progress, but in developing responsible and transparent strategies for addressing its implications.

Why it matters:

The development of advanced AI capabilities has reached a critical inflection point. With models like Claude Fable 5 and Mythos 5 already available, the potential risks associated with their misuse are becoming increasingly stark. What’s essential now is that governments and industry leaders work together to create comprehensive plans for addressing this challenge.

Governments cannot simply restrict access to these technologies in an attempt to control their development or use. As research has shown, smaller, open-source models can already rival advanced AI capabilities like Mythos/Fable. It’s time for policymakers to move beyond the idea of restricting access and instead focus on building a system of transparent governance that addresses the broader implications of this technology.

Ultimately, it is the responsibility of both governments and industry leaders to work together in fostering an environment where innovation in AI can flourish while also safeguarding against potential risks. It’s a delicate balance, but one that must be struck if we’re to fully realize the benefits of this transformative technology.

Source: Ars Technica