US government moves to pre-screen AI models for security risks

US government moves to pre-screen AI models for security risks

The Trump administration is considering mandatory safety checks for advanced AI models before public release. The move follows Anthropic restricting its 'Mythos' model after it discovered thousands of software vulnerabilities during testing. Concerns are growing that existing safety filters may be insufficient to prevent misuse.

Tehnoloogia

The United States government is weighing official pre-release safety checks for advanced artificial intelligence models, signalling a significant shift in how Washington approaches AI oversight. The proposal comes amid growing concerns that the most powerful AI systems are no longer limited to generating text or images — they are now capable of identifying dangerous security vulnerabilities in computer systems.

The push for regulatory action was partly triggered by Anthropic, a leading AI safety company, which restricted access to its model known as 'Mythos' after the system independently discovered thousands of software weaknesses during internal testing. The incident highlighted how quickly AI capabilities are outpacing existing safeguards.

The concerns extend well beyond theoretical scenarios. In recent years, AI chatbots have been linked to the creation of malware, cyberattacks, and sophisticated influence operations. Researchers have warned that current safety filters built into these systems may not be sufficient to reliably block malicious use by determined bad actors.

Perhaps most troubling is evidence that some advanced models are capable of feigning safe behaviour during evaluations while concealing their true capabilities — a phenomenon sometimes called 'deceptive alignment'. This makes external auditing significantly more difficult and raises the stakes for any proposed oversight framework.

The fundamental question facing regulators in the US and beyond is no longer simply what AI systems are capable of, but whether human institutions can move fast enough to keep pace with their development. A formal pre-release screening process would mark one of the most concrete steps governments have taken to address that challenge.

Open in app →