How China’s new AI model DeepSeek is threatening U.S. dominance


A little-known AI lab out of China has ignited panic throughout Silicon Valley after releasing AI models that can outperform America’s best despite being built more cheaply and with less-powerful chips. 

DeepSeek, as the lab is called, unveiled a free, open-source large language model in late December that it says took only two months and less than $6 million to build, using reduced-capability chips from Nvidia called H800s.

The new developments have raised alarms over whether America’s global lead in artificial intelligence is shrinking and called into question Big Tech’s massive spending on building AI models and data centers.

In a set of third-party benchmark tests, DeepSeek’s model outperformed Meta’s Llama 3.1, OpenAI’s GPT-4o and Anthropic’s Claude Sonnet 3.5 in accuracy on tasks ranging from complex problem-solving to math and coding.

DeepSeek on Monday released R1, a reasoning model that also outperformed OpenAI’s latest o1 in many of those third-party tests.

“To see the DeepSeek new model, it’s super impressive in terms of both how they have really effectively done an open-source model that does this inference-time compute, and is super-compute efficient,” Microsoft CEO Satya Nadella said at the World Economic Forum in Davos, Switzerland, on Wednesday. “We should take the developments out of China very, very seriously.” 

DeepSeek also had to navigate the strict semiconductor restrictions that the U.S. government has imposed on China, cutting the country off from access to the most powerful chips, like Nvidia’s H100s. The latest advancements suggest DeepSeek either found a way to work around the rules, or that the export controls were not the chokehold Washington intended.

“They can take a really good, big model and use a process called distillation,” said Benchmark General Partner Chetan Puttagunta. “Basically you use a very large model to help your small model get smart at the thing you want it to get smart at. That’s actually very cost-efficient.”

Little is known about the lab and its founder, Liang Wenfeng. DeepSeek was born of a Chinese hedge fund called High-Flyer Quant that manages about $8 billion in assets, according to media reports.

But DeepSeek isn’t the only Chinese company making inroads. 

Leading AI researcher Kai-Fu Lee has said his startup 01.ai trained its model using only $3 million. TikTok parent company ByteDance on Wednesday released an update to its model that it claims outperforms OpenAI’s o1 in a key benchmark test.

“Necessity is the mother of invention,” said Perplexity CEO Aravind Srinivas. “Because they had to figure out work-arounds, they actually ended up building something a lot more efficient.”



