AWS’ custom chip strategy is showing results, and cutting into Nvidia’s AI dominance

AWS’ custom chip strategy is showing results, and cutting into Nvidia’s AI dominance


AWS announces new CPU chip: Here's what to know

Amazon Web Services is set to announce an update to its Graviton4 chip that includes 600 gigabytes per second of network bandwidth, what the company calls the highest offering in the public cloud.

Ali Saidi, a distinguished engineer at AWS, likened the speed to a machine reading 100 music CDs a second.

Graviton4, a central processing unit, or CPU, is one of many chip products that come from Amazon’s Annapurna Labs in Austin, Texas. The chip is a win for the company’s custom strategy and putting it up against traditional semiconductor players like Intel and AMD.

But the real battle is with Nvidia in the artificial intelligence infrastructure space.

At AWS’s re:Invent 2024 conference last December, the company announced Project Rainier – an AI supercomputer built for startup Anthropic. AWS has put $8 billion into backing Anthropic.

AWS Senior Director for Customer and Project Engineering Gadi Hutt said Amazon is looking to reduce AI training costs and provide an alternative to Nvidia’s expensive graphics processing units, or GPUs.

Anthropic’s Claude Opus 4 AI model launched on Trainium2 GPUs, according to AWS, and Project Rainier is powered by over half a million of the chips – an order that would have traditionally gone to Nvidia.

Hutt said that while Nvidia’s Blackwell is a higher-performing chip than Trainium2, the AWS chip offers better cost performance.

“Trainium3 is coming up this year, and it’s doubling the performance of Trainium2, and it’s going to save energy by an additional 50%,” he said.

The demand for these chips is already outpacing supply, according to Rami Sinno, director of engineering at AWS’ Annapurna Labs.

“Our supply is very, very large, but every single service that we build has a customer attached to it,” he said.

With Graviton4’s upgrade on the horizon and Project Rainier’s Trainium chips, Amazon is demonstrating its broader ambition to control the entire AI infrastructure stack, from networking to training to inference.

And as more major AI models like Claude 4 prove they can train successfully on non-Nvidia hardware, the question isn’t whether AWS can compete with the chip giant — it’s how much market share it can take.

The release schedule for the Graviton4 update will be provided by the end of June, according to an AWS spokesperson.



Source

OpenAI loses multiple executives in latest leadership shakeup
Technology

OpenAI loses multiple executives in latest leadership shakeup

Kevin Weil, chief product officer of OpenAI, speaks during the Hill & Valley forum at the US Capitol in Washington, DC, US, on Wednesday, April 30, 2025. Al Drago | Bloomberg | Getty Images Three OpenAI executives announced their departures from the company on Friday, the latest in a series of leadership shakeups at the […]

Read More
Jim Cramer on the market’s ‘remarkable’ rally — and what to watch in a big earnings week ahead
Technology

Jim Cramer on the market’s ‘remarkable’ rally — and what to watch in a big earnings week ahead

CNBC’s Jim Cramer on Friday laid out his game plan for the week ahead after what he called one of the most “remarkable” rallies he’s ever seen. “If you didn’t believe we could have still one more week where we’d rally 3%, you’d be right,” Cramer said. “We actually rallied 4% thanks to today’s gigantic […]

Read More
Perspective: AI demand is inflated, and only Anthropic is being realistic
Technology

Perspective: AI demand is inflated, and only Anthropic is being realistic

The main demand signal for artificial intelligence looks explosive on paper, but it may be significantly overstated. Anthropic, by pricing its tools for that reality, might be the best-positioned AI company if a correction comes. Tokens are the basic unit of AI usage: words and characters that make up both the queries users send and […]

Read More