AWS’ custom chip strategy is showing results, and cutting into Nvidia’s AI dominance

AWS’ custom chip strategy is showing results, and cutting into Nvidia’s AI dominance


AWS announces new CPU chip: Here's what to know

Amazon Web Services is set to announce an update to its Graviton4 chip that includes 600 gigabytes per second of network bandwidth, what the company calls the highest offering in the public cloud.

Ali Saidi, a distinguished engineer at AWS, likened the speed to a machine reading 100 music CDs a second.

Graviton4, a central processing unit, or CPU, is one of many chip products that come from Amazon’s Annapurna Labs in Austin, Texas. The chip is a win for the company’s custom strategy and putting it up against traditional semiconductor players like Intel and AMD.

But the real battle is with Nvidia in the artificial intelligence infrastructure space.

At AWS’s re:Invent 2024 conference last December, the company announced Project Rainier – an AI supercomputer built for startup Anthropic. AWS has put $8 billion into backing Anthropic.

AWS Senior Director for Customer and Project Engineering Gadi Hutt said Amazon is looking to reduce AI training costs and provide an alternative to Nvidia’s expensive graphics processing units, or GPUs.

Anthropic’s Claude Opus 4 AI model launched on Trainium2 GPUs, according to AWS, and Project Rainier is powered by over half a million of the chips – an order that would have traditionally gone to Nvidia.

Hutt said that while Nvidia’s Blackwell is a higher-performing chip than Trainium2, the AWS chip offers better cost performance.

“Trainium3 is coming up this year, and it’s doubling the performance of Trainium2, and it’s going to save energy by an additional 50%,” he said.

The demand for these chips is already outpacing supply, according to Rami Sinno, director of engineering at AWS’ Annapurna Labs.

“Our supply is very, very large, but every single service that we build has a customer attached to it,” he said.

With Graviton4’s upgrade on the horizon and Project Rainier’s Trainium chips, Amazon is demonstrating its broader ambition to control the entire AI infrastructure stack, from networking to training to inference.

And as more major AI models like Claude 4 prove they can train successfully on non-Nvidia hardware, the question isn’t whether AWS can compete with the chip giant — it’s how much market share it can take.

The release schedule for the Graviton4 update will be provided by the end of June, according to an AWS spokesperson.



Source

3 takeaways from Intel earnings: Cash flow, foundry progress and hardware surprise
Technology

3 takeaways from Intel earnings: Cash flow, foundry progress and hardware surprise

Intel snapped a losing streak of six straight quarterly losses and returned to profitability in the third quarter. In its first earnings report since the Trump administration acquired a 10% stake in the company, the U.S. chipmaker posted strong revenue, noting robust demand for chips that it expects to continue into 2026. Client computing revenue, […]

Read More
What Cramer expects from 10 stocks reporting earnings next week; calls two buys
Technology

What Cramer expects from 10 stocks reporting earnings next week; calls two buys

Earnings season next week goes into overdrive as more than 150 companies in the S & P 500 report their quarterly results. Most of the “Magnificent Seven” tech firms are among them. With Tesla already out and Nvidia not out until Nov. 19, that leaves Alphabet and Club names Amazon , Apple , Meta Platforms […]

Read More
OpenAI’s new Sora 2 video generation app went viral. Is it a real threat to Meta?
Technology

OpenAI’s new Sora 2 video generation app went viral. Is it a real threat to Meta?

Meta is facing new pressure from OpenAI, the juggernaut behind ChatGPT, which is now making waves in short-form video with its viral hit, Sora 2. The new app combines AI-powered video generation with a social feed that mimics TikTok and Instagram Reels. Less than five days after its Sept. 30 launch , Sora 2 racked […]

Read More