How DeepSeek used distillation to train its artificial intelligence model, and what it means for companies such as OpenAI

Chinese artificial intelligence lab DeepSeek roiled markets in January, setting off a massive tech and semiconductor selloff after unveiling AI models that it said were cheaper and more efficient than American ones. 

But the underlying fears and breakthroughs that sparked the selling go much deeper than one AI startup. Silicon Valley is now reckoning with a technique in AI development called distillation, one that could upend the AI leaderboard. 

Distillation is the process of extracting knowledge from a larger AI model to create a smaller one. It can allow a small team with very limited resources to build an advanced model.

A leading tech company invests years and millions of dollars developing a top-tier model from scratch. Then a smaller team such as DeepSeek swoops in and trains its own, more specialized model by asking the larger “teacher” model questions. The process creates a new model that’s nearly as capable as the big company’s model but trains more quickly and efficiently. 
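The teacher-and-student loop described above can be illustrated with a toy sketch. This is not DeepSeek's method or any lab's actual pipeline: the `teacher` function, the `distill` helper, and the one-variable linear "student" are all hypothetical stand-ins chosen so the example runs in plain Python. The point it shows is that the student never sees the teacher's original training data, only the teacher's answers to a set of questions.

```python
import random

def teacher(x: float) -> float:
    # Hypothetical stand-in for a large, expensive, already-trained model.
    # The student treats it as a black box and only sees its answers.
    return 3.0 * x + 1.0

def distill(teacher_fn, questions, steps=2000, lr=0.05):
    # Step 1: ask the teacher each question and record its answer.
    labeled = [(x, teacher_fn(x)) for x in questions]

    # Step 2: fit a small student (here y = w*x + b) to imitate those
    # answers, using gradient descent on the mean squared error.
    w, b = 0.0, 0.0
    n = len(labeled)
    for _ in range(steps):
        grad_w = 0.0
        grad_b = 0.0
        for x, y in labeled:
            err = (w * x + b) - y
            grad_w += 2.0 * err * x / n
            grad_b += 2.0 * err / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b

random.seed(0)
questions = [random.uniform(-1.0, 1.0) for _ in range(50)]
w, b = distill(teacher, questions)
print(f"student learned w={w:.3f}, b={b:.3f}")
```

In practice the "answers" would be text generated by a large language model and the student would be a smaller neural network fine-tuned on that text, but the two-step shape (query the teacher, then train on its outputs) is the same.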

“This distillation technique is just so extremely powerful and so extremely cheap, and it’s just available to anyone,” said Databricks CEO Ali Ghodsi, adding that he expects to see innovation when it comes to how large language models, or LLMs, are built. “We’re going to see so much competition for LLMs. That’s what’s going to happen in this new era we’re entering.” 

Distillation is now enabling less-capitalized startups and research labs to compete at the cutting edge faster than ever before.

Using this technique, researchers at Berkeley said, they recreated OpenAI’s reasoning model for $450 in 19 hours last month. Soon after, researchers at Stanford and the University of Washington created their own reasoning model in just 26 minutes, using less than $50 in compute credits, they said. The startup Hugging Face recreated OpenAI’s newest and flashiest feature, Deep Research, as a 24-hour coding challenge. 

DeepSeek didn’t invent distillation, but it woke up the AI world to its disruptive potential. It also ushered in the rise of a new open-source order — a belief that transparency and accessibility drive innovation faster than closed-door research.

“Open source always wins in the tech industry,” said Arvind Jain, CEO of Glean, which makes an AI-powered search engine for enterprises. “You cannot beat the momentum that a successful open-source project is able to actually generate.” 

OpenAI itself has walked back its closed-source strategy in the wake of DeepSeek’s accomplishment.

“Personally I think we have been on the wrong side of history here and need to figure out a different open-source strategy,” OpenAI CEO Sam Altman wrote in a post on Reddit on Jan. 31. 

The combination of distillation’s newfound traction and open source’s rise in popularity is completely altering the competitive dynamics in AI. 
