Apple says its AI models were trained on Google’s custom chips


Sundar Pichai and Tim Cook

Source: Reuters; Apple

Apple said on Monday that the artificial intelligence models underpinning Apple Intelligence, its AI system, were pretrained on processors designed by Google, a sign that Big Tech companies are looking for alternatives to Nvidia when it comes to the training of cutting-edge AI.

Apple’s choice of Google’s homegrown Tensor Processing Unit (TPU) for training was detailed in a technical paper just published by the company. Separately, Apple released a preview version of Apple Intelligence for some devices on Monday.

Nvidia’s pricey graphics processing units (GPUs) dominate the market for high-end AI training chips, and have been in such high demand over the past couple of years that they’ve been difficult to procure in the required quantities. OpenAI, Microsoft, and Anthropic are all using Nvidia’s GPUs for their models, while other tech companies, including Google, Meta, Oracle and Tesla, are snapping them up to build out their AI systems and offerings.

Meta CEO Mark Zuckerberg and Alphabet CEO Sundar Pichai both made comments last week suggesting that their companies and others in the industry may be overinvesting in AI infrastructure, but acknowledged the business risk of doing otherwise was too high.

“The downside of being behind is that you’re out of position for like the most important technology for the next 10 to 15 years,” Zuckerberg said on a podcast with Bloomberg’s Emily Chang.

Apple doesn’t name Google or Nvidia in its 47-page paper, but it does note that its Apple Foundation Model (AFM) and AFM server are trained on “Cloud TPU clusters.” That means Apple rented servers from a cloud provider to perform the calculations.

“This system allows us to train the AFM models efficiently and scalably, including AFM-on-device, AFM-server, and larger models,” Apple said in the paper.

Representatives for Apple and Google didn’t respond to requests for comment.

Apple was later to reveal its AI plans than many of its peers, which loudly embraced generative AI soon after OpenAI’s launch of ChatGPT in late 2022. On Monday, Apple introduced Apple Intelligence. The system includes several new features, such as a refreshed look for Siri, better natural language processing and AI-generated summaries in text fields.

Over the next year, Apple plans to roll out functions based on generative AI, including image generation, emoji generation and a powered-up Siri that can access the user’s personal information and take actions inside of apps.

In Monday’s paper, Apple said that AFM on-device was trained on a single “slice” of 2048 TPU v5p chips working together. That’s the most advanced TPU, first launched in December. AFM-server was trained on 8192 TPU v4 chips that were configured to work together as eight slices through a data center network, according to the paper.

Google’s latest TPUs cost under $2 per chip per hour of use when booked three years in advance, according to Google’s website. Google first introduced its TPUs in 2015 for internal workloads, and made them available to the public in 2017. They’re now among the most mature custom chips designed for artificial intelligence.

Still, Google is one of Nvidia’s top customers. It uses both Nvidia’s GPUs and its own TPUs for training AI systems, and also sells access to Nvidia’s technology on its cloud.

Apple previously said that inferencing, which means taking a pretrained AI model and running it to generate content or make predictions, would partially happen on Apple’s own chips in its data centers.

This is the second technical paper about Apple’s AI system, after a more general version was published in June. Apple said at the time that it was using TPUs as it developed its AI models.

Apple is scheduled to report quarterly results after the close of trading on Thursday.
