How China’s new AI model DeepSeek is threatening U.S. dominance

How China’s new AI model DeepSeek is threatening U.S. dominance


A little-known AI lab out of China has ignited panic throughout Silicon Valley after releasing AI models that can outperform America’s best despite being built more cheaply and with less-powerful chips. 

DeepSeek, as the lab is called, unveiled a free, open-source large-language model in late December that it says took only two months and less than $6 million to build, using reduced-capability chips from Nvidia called H800s. 

The new developments have raised alarms on whether America’s global lead in artificial intelligence is shrinking and called into question big tech’s massive spend on building AI models and data centers. 

In a set of third-party benchmark tests, DeepSeek’s model outperformed Meta‘s Llama 3.1, OpenAI’s GPT-4o and Anthropic’s Claude Sonnet 3.5 in accuracy ranging from complex problem-solving to math and coding. 

DeepSeek on Monday released r1, a reasoning model that also outperformed OpenAI’s latest o1 in many of those third-party tests.

“To see the DeepSeek new model, it’s super impressive in terms of both how they have really effectively done an open-source model that does this inference-time compute, and is super-compute efficient,” Microsoft CEO Satya Nadella said at the World Economic Forum in Davos, Switzerland, on Wednesday. “We should take the developments out of China very, very seriously.” 

DeepSeek also had to navigate the strict semiconductor restrictions that the U.S. government has imposed on China, cutting the country off from access to the most powerful chips, like Nvidia’s H100s. The latest advancements suggest DeepSeek either found a way to work around the rules, or that the export controls were not the chokehold Washington intended.

“They can take a really good, big model and use a process called distillation,” said Benchmark General Partner Chetan Puttagunta. “Basically you use a very large model to help your small model get smart at the thing you want it to get smart at. That’s actually very cost-efficient.”

Little is known about the lab and its founder, Liang WenFeng. DeepSeek was was born of a Chinese hedge fund called High-Flyer Quant that manages about $8 billion in assets, according to media reports.

But DeepSeek isn’t the only Chinese company making inroads. 

Leading AI researcher Kai-Fu Lee has said his startup 01.ai was trained using only $3 million. TikTok parent company ByteDance on Wednesday released an update to its model that claims to outperform OpenAI’s o1 in a key benchmark test. 

“Necessity is the mother of invention,” said Perplexity CEO Aravind Srinivas. “Because they had to figure out work-arounds, they actually ended up building something a lot more efficient.”

Watch this video to learn more. 



Source

Google says Fox channels to go dark on YouTube TV if agreement isn’t reached
Technology

Google says Fox channels to go dark on YouTube TV if agreement isn’t reached

Nurphoto | Nurphoto | Getty Images Google-owned YouTube on Monday said it may remove channels including Fox Broadcast Network, Fox News and Fox Sports from its TV streaming platform if it doesn’t reach an agreement with Fox Corporation. YouTube TV’s renewal date with Fox is coming on Wednesday, and while the two companies have been […]

Read More
Nvidia’s new ‘robot brain’ goes on sale for ,499 as company targets robotics for growth
Technology

Nvidia’s new ‘robot brain’ goes on sale for $3,499 as company targets robotics for growth

NVIDIA Jetson AGX Thor. Courtesy: NVIDIA Nvidia announced Monday that its latest robotics chip module, the Jetson AGX Thor, is now on sale for $3,499 as a developer kit. The company calls the chip a “robot brain.” The first kits ship next month, Nvidia said last week, and the chips will allow customers to create […]

Read More
Musk companies sue Apple, OpenAI alleging anticompetitive scheme
Technology

Musk companies sue Apple, OpenAI alleging anticompetitive scheme

Elon Musk, CEO of SpaceX and Tesla, attends the Viva Technology conference at the Porte de Versailles exhibition center in Paris on June 16, 2023. Gonzalo Fuentes | Reuters Two of Elon Musk’s companies sued Apple and OpenAI on Monday, accusing the pair of an “anticompetitive scheme” to thwart artificial intelligence rivals. The lawsuit, filed […]

Read More