How China’s new AI model DeepSeek is threatening U.S. dominance

How China’s new AI model DeepSeek is threatening U.S. dominance


A little-known AI lab out of China has ignited panic throughout Silicon Valley after releasing AI models that can outperform America’s best despite being built more cheaply and with less-powerful chips. 

DeepSeek, as the lab is called, unveiled a free, open-source large-language model in late December that it says took only two months and less than $6 million to build, using reduced-capability chips from Nvidia called H800s. 

The new developments have raised alarms on whether America’s global lead in artificial intelligence is shrinking and called into question big tech’s massive spend on building AI models and data centers. 

In a set of third-party benchmark tests, DeepSeek’s model outperformed Meta‘s Llama 3.1, OpenAI’s GPT-4o and Anthropic’s Claude Sonnet 3.5 in accuracy ranging from complex problem-solving to math and coding. 

DeepSeek on Monday released r1, a reasoning model that also outperformed OpenAI’s latest o1 in many of those third-party tests.

“To see the DeepSeek new model, it’s super impressive in terms of both how they have really effectively done an open-source model that does this inference-time compute, and is super-compute efficient,” Microsoft CEO Satya Nadella said at the World Economic Forum in Davos, Switzerland, on Wednesday. “We should take the developments out of China very, very seriously.” 

DeepSeek also had to navigate the strict semiconductor restrictions that the U.S. government has imposed on China, cutting the country off from access to the most powerful chips, like Nvidia’s H100s. The latest advancements suggest DeepSeek either found a way to work around the rules, or that the export controls were not the chokehold Washington intended.

“They can take a really good, big model and use a process called distillation,” said Benchmark General Partner Chetan Puttagunta. “Basically you use a very large model to help your small model get smart at the thing you want it to get smart at. That’s actually very cost-efficient.”

Little is known about the lab and its founder, Liang WenFeng. DeepSeek was was born of a Chinese hedge fund called High-Flyer Quant that manages about $8 billion in assets, according to media reports.

But DeepSeek isn’t the only Chinese company making inroads. 

Leading AI researcher Kai-Fu Lee has said his startup 01.ai was trained using only $3 million. TikTok parent company ByteDance on Wednesday released an update to its model that claims to outperform OpenAI’s o1 in a key benchmark test. 

“Necessity is the mother of invention,” said Perplexity CEO Aravind Srinivas. “Because they had to figure out work-arounds, they actually ended up building something a lot more efficient.”

Watch this video to learn more. 



Source

Meta to begin testing ads on Threads, its micro-blogging app
Technology

Meta to begin testing ads on Threads, its micro-blogging app

Thilina Kaluthotage | Nurphoto | Getty Images Meta will begin testing ads on its Threads microblogging service with a few companies in the U.S. and Japan, the company said in a blog post Friday. The experiment marks Meta’s first run at generating revenue from Threads. Meta launched the app in July 2023 to rival X, […]

Read More
Twilio shares pop 22% and head for biggest gain since Covid pandemic on growth forecast
Technology

Twilio shares pop 22% and head for biggest gain since Covid pandemic on growth forecast

Twilio CEO Khozema Shipchandler speaks at Twilio’s Signal event in Sao Paulo on Aug. 14, 2024. Courtesy: Twilio Twilio shares soared more than 20% on Friday and headed for their biggest gain since the early days of the Covid pandemic after the cloud communications software vendor issued an uplifting profit forecast for the coming years. The […]

Read More
Zuckerberg sets Meta’s AI targets for the year, expects to spend  billion on growth
Technology

Zuckerberg sets Meta’s AI targets for the year, expects to spend $60 billion on growth

Meta CEO Mark Zuckerberg looks on before the luncheon on the inauguration day of U.S. President Donald Trump’s second Presidential term in Washington, U.S., Jan. 20, 2025.  Evelyn Hockstein | Reuters Meta CEO Mark Zuckerberg on Friday announced the company plans to invest around $60 to $65 billion in capital expenditure in 2025 as it continues […]

Read More