DeepSeek’s hardware spend could be as high as $500 million, new report estimates

DeepSeek’s hardware spend could be as high as 0 million, new report estimates


Faisal Bashir | Lightrocket | Getty Images

China’s DeepSeek became the biggest topic in tech this week, with many in the industry and on Wall Street focused on a single number: $6 million.

In DeepSeek’s paper about its newest artificial intelligence model, the company said that its total training costs amounted to $5.576 million, based on the rental price of Nvidia’s graphics processing units. DeepSeek included a clear caveat, saying that the number included only the model’s “official training” and excluded the costs tied to “prior research and ablation experiments on architectures, algorithms, or data.”

Early in the week, DeepSeek’s AI Assistant took the coveted spot for most-downloaded free app in the U.S. on Apple’s App Store, dethroning OpenAI’s ChatGPT. Global tech stocks sold off, with chipmakers Nvidia and Broadcom losing a combined $800 billion in market cap on Monday.

A new report from SemiAnalysis, a semiconductor research and consulting firm, added more context to DeepSeek’s expenses. The firm estimated that DeepSeek’s hardware spend is “well higher than $500M over the company history,” adding that R&D costs and total cost of ownership are significant. Generating “synthetic data” for the model to train on would require “considerable amount of compute,” SemiAnalysis wrote.

The report said the Claude 3.5 Sonnet from Anthropic cost “$10s of millions to train,” but noted that Anthropic raised billions for dollars from Amazon and Google, an indication of how much more money is required to run the models and the company.

“It’s because they have to experiment, come up with new architectures, gather and clean data, pay employees, and much more,” SemiAnalysis said.

DeepSeek’s own paper does not include an estimation of its compute costs. The company didn’t immediately respond to a request for comment.

“To be clear DeepSeek is unique in that they achieved this level of cost and capabilities first,” SemiAnalysts wrote. The firm added that DeepSeek’s R1 “is a very good model” and that “catching up to the reasoning edge this quickly is objectively impressive.”

Experts and analysts this week touted the quality of DeepSeek’s model, and noted how impressive it is considering the U.S. curbed chip exports to China three times in three years. That led to concerns that the U.S. is falling behind its chief adversary in a market that’s predicted to top $1 trillion in revenue within a decade.

Big tech rushes to adopt DeepSeek R1

Bernstein analysts wrote in a note Monday that “according to the many (occasionally hysterical) hot takes we saw [over the weekend,] the implications range anywhere from ‘That’s really interesting’ to ‘This is the death-knell of the AI infrastructure complex as we know it.'”

DeepSeek was founded in 2023 by Liang Wenfeng, co-founder of High-Flyer, a quantitative hedge fund focused on AI. The AI startup reportedly grew out of the hedge fund’s AI research unit in April 2023 to focus on large language models and reaching artificial general intelligence, or AGI — a branch of AI that equals or surpasses human intellect on a wide range of tasks, and that OpenAI and others are pursuing.

DeepSeek is still wholly owned by and funded by High-Flyer, according to analysts at Jefferies.

The buzz around DeepSeek began picking up steam earlier this month, when the startup released R1, its reasoning model that rivals OpenAI’s o1. It’s open-source, meaning that any AI developer can use it.

Like other Chinese chatbots, DeepSeek’s has limitations on certain topics: When asked about some of Chinese leader Xi Jinping’s policies, for instance, DeepSeek reportedly steers the user away from similar lines of questioning.

OpenAI CEO Sam Altman has praised the model publicly, but the company has also said it believes there’s evidence that DeepSeek improperly harvested OpenAI data to build its product.

At an event in Washington, D.C., on Thursday hosted by OpenAI, Altman said DeepSeek is “clearly a great model.”

“This is a reminder of the level of competition and the need for democratic Al to win,” he said. He said it also points to the “level of interest in reasoning, the level of interest in open source.”

WATCH: Nvidia CEO Jensen Huang and President Trump meet on AI policy

Nvidia CEO Jensen Huang and President Trump to meet on AI policy, China restrictions, and DeepSeek



Source

How to help kids form healthy relationships with money: They ‘end up in much better financial shape’ as adults, psychologist says
World

How to help kids form healthy relationships with money: They ‘end up in much better financial shape’ as adults, psychologist says

As parents contend with rising costs and an overall tighter economic climate, more of them are using those challenges as an opportunity to have frank talks with their kids about money, according to a recent survey. Honest conversations — including telling your kids “no” when they ask you to buy something, and explaining why — […]

Read More
Berkshire electric utility’s court win could save it billions
World

Berkshire electric utility’s court win could save it billions

(This is the Warren Buffett Watch newsletter, news and analysis on all things Warren Buffett and Berkshire Hathaway. You can sign up here to receive it every Friday evening in your inbox.) PacifiCorp court win could reduce wildfire damages by $1B or more  An Oregon Court of Appeals ruling this week accepted PacifiCorp’s argument the judge in a 2023 trial mistakenly […]

Read More
Cathay Pacific to cut flights from mid-May to end-June as jet fuel prices surge
World

Cathay Pacific to cut flights from mid-May to end-June as jet fuel prices surge

A Cathay Pacific Airbus A350 aircraft at Kingsford Smith Airport on August 18, 2021 in Sydney, Australia. Cathay Pacific Airways Ltd., is the flag carrier of Hong Kong with its main hub being at Hong Kong International Airport. James D. Morgan | Getty Images News | Getty Images Cathay Pacific Airways said on Saturday it […]

Read More