Anthropic adds Claude 4 security measures to limit risk of users developing weapons

Anthropic adds Claude 4 security measures to limit risk of users developing weapons


Omar Marques | Lightrocket | Getty Images

Anthropic on Thursday said it activated a tighter artificial intelligence control for Claude Opus 4, its latest AI model.

The new AI Safety Level 3 (ASL-3) controls are to “limit the risk of Claude being misused specifically for the development or acquisition of chemical, biological, radiological, and nuclear (CBRN) weapons,” the company wrote in a blog post.

The company, which is backed by Amazon, said it was taking the measures as a precaution and that the team had not yet determined if Opus 4 has crossed the benchmark that would require that protection.

Anthropic announced Claude Opus 4 and Claude Sonnet 4 on Thursday, touting the advanced ability of the models to “analyze thousands of data sources, execute long-running tasks, write human-quality content, and perform complex actions,” per a release.

The company said Sonnet 4 did not need the tighter controls.

Jared Kaplan, Anthropic’s chief science officer, noted that the advanced nature of the new Claude models has its challenges.

“The more complex the task is, the more risk there is that the model is going to kind of go off the rails … and we’re really focused on addressing that so that people can really delegate a lot of work at once to our models,” he said.

The company released an updated safety policy in March addressing the risks involved with AI models and the ability to help users develop chemical and biological weapons.

Major safety questions remain about a technology that is advancing at a breakneck pace and has shown worrying cracks in safety and accuracy.

Last week, Elon Musk’s Grok chatbot from xAI continued to bring up the topic of “white genocide” in South Africa in responses to unrelated comments.

The company later attributed the bizarre behavior to an “unauthorized modification.”

Olivia Gambelin, AI ethicist and author of the book “Responsible AI,” said the Grok example shows how easily these models can be tampered with “at will.”

AI researchers and experts told CNBC that the push from the power players to prioritize profits over research has led to companies taking shortcuts and forgoing rigorous testing.

James White, chief technology officer at cybersecurity startup CalypsoAI, said companies sacrificing security for advancement means models are less likely to reject malicious prompts.

“The models are getting better, but they’re also more likely to be good at bad stuff,” said White, whose company performs safety and security audits of Meta, Google, OpenAI and other companies. “It’s easier to trick them to do bad stuff.”

CNBC’s Hayden Field and Jonathan Vanian contributed to this report.



Source

Europe’s companies sound alarm over tariff hits amid scramble to strike U.S. trade deal
World

Europe’s companies sound alarm over tariff hits amid scramble to strike U.S. trade deal

European officials are working to secure a trade deal with the Trump administration, as regional companies sound the alarm over financial hits even before reciprocal tariffs have come into play. Earlier this month , U.S. President Donald Trump said he would slap a 30% tariff on all goods imported to America from the European Union, […]

Read More
Puma shares plunge 15% after full-year sales, profit outlook cut on U.S. tariffs
World

Puma shares plunge 15% after full-year sales, profit outlook cut on U.S. tariffs

Sign at the entrance to the Puma store in Midtown Manhattan. Erik Mcgregor | Lightrocket | Getty Images Puma shares plummeted as much as 18% Friday after the German sportwear brand posted worse-than-expected second-quarter sales and cut its full-year guidance, flagging the impact of U.S. trade tariffs. In a preliminary updated after markets closed on […]

Read More
Textiles to whisky: U.K.–India ‘historic’ deal is set to boost bilateral trade by over  billion a year
World

Textiles to whisky: U.K.–India ‘historic’ deal is set to boost bilateral trade by over $34 billion a year

AYLESBURY, ENGLAND – JULY 24: UK Prime Minister Keir Starmer and Prime Minister Narendra Modi of India walk in the grounds at Chequers on July 24, 2025 in Aylesbury, England. Kin Cheung | Getty Images News | Getty Images U.K. and India’s bilateral trade is set to get a more than $34 billion annual boost […]

Read More