Apple Intelligence Servers Expected to Start Using M4 Chips Next Year After M2 Ultra This Year - MacRumors
Skip to Content

Apple Intelligence Servers Expected to Start Using M4 Chips Next Year After M2 Ultra This Year

Apple plans to start using the M4 chip in its Apple Intelligence servers next year, according to a Nikkei Asia report this week, citing TrendForce analyst Frank Kung. Apple Intelligence servers are currently powered by the M2 Ultra chip, per previous reports.

Apple Intelligence General Feature 2
The report claims that Apple has approached its largest manufacturing partner Foxconn about building additional Apple Intelligence servers in Taiwan.

It is unclear if the new servers will be equipped with the standard M4 chip, or a higher-end variant like the M4 Pro, M4 Max, or yet-to-be-announced M4 Ultra. It is also unclear if the existing servers with the M2 Ultra will be immediately upgraded to M4 chips.

Apple's plan to use M4 chips in servers was previously revealed by Haitong analyst Jeff Pu.

While some Apple Intelligence features rely entirely on on-device processing, Apple says requests that "require more processing power" rely on Private Cloud Compute models that are stored on the Apple Intelligence servers. When using Private Cloud Compute, Apple says that a user's data is never stored or shared with the company.

iOS 18.1 was released last month with the first Apple Intelligence features on the iPhone, such as writing tools and notification summaries. iOS 18.2 will be released to the public in December with additional Apple Intelligence features, including Genmoji for custom emoji, Image Playground for image generation, ChatGPT integration for Siri, and more.

Popular Stories

Apple Intelligence General Feature 2

Apple Intelligence Accidentally Goes Live in China Before Regulatory Approval

Monday March 30, 2026 12:48 pm PDT by
Apple accidentally started rolling out Apple Intelligence features in China before receiving regulatory approval, reports Bloomberg's Mark Gurman. Some Chinese users began seeing Apple Intelligence features listed as available and active in the Settings app on their iPhones, but Apple made no formal announcement. Gurman says that Apple is not planning to launch the feature imminently, and...
iOS 27 on iPhone 17 1

Apple to Unveil iOS 27 in June With These New Features Beyond Siri

Friday April 17, 2026 8:40 am PDT by
Apple is expected to unveil iOS 27 during its WWDC 2026 keynote on June 8, and the update should be released in September. Below, we outline some of the new features that are rumored to be coming with iOS 27, beyond the long-awaited more personalized version of Siri. iOS 27 will reportedly support 5G satellite internet connectivity, although this functionality might be limited to the...
General YouTube Feature Redux

Apple Sued by Three YouTube Channels

Monday April 6, 2026 8:53 am PDT by
Three established YouTube channels have sued Apple, alleging that the company violated the U.S. Digital Millennium Copyright Act (DMCA) by unlawfully accessing and scraping millions of copyrighted videos from YouTube to train its AI models. In a class action lawsuit filed in California federal court last week, the owners of the YouTube channels h3h3Productions (plus H3 Podcast and H3 Podcast ...

Top Rated Comments

Gnattu Avatar
19 months ago
Actually this is a good sign for people want even faster Mac Studio and Mac Pro if Apple is building chips for their own AI server, which could lead to more resources being put into making those high performance chips.
Score: 17 Votes (Like | Disagree)
bradman83 Avatar
19 months ago

But seriously, the base M4 NPU outperforms the M2 Ultra. The upgrade is well deserved.
The base M4 does not outperform the M2 Ultra in NPU function, though if you're not familiar with how TOPS ratings are determined it's easy to misunderstand. The short of it is you need to know what operation the TOPS rating is measuring when comparing.

M1 - M3 Neural Engines were measured using FP16 operations, whereas the M4 chips (and A17 and A18) are measured using INT8 operations. FP16 operations handle about twice as much data per operation than INT8. They're not entirely interchangeable but 20 FP16 operations would equalize out to about 40 INT8 operations.

The M2 Ultra is rated at 31.6 TOPS in FP16, which would be equate to roughly 62-64 TOPS in INT8. The M4 is rated at 38 TOPS in INT8.

Similar confusion occurred with the M3. Apple measured the M3 Neural Engine with FP16 but the corresponding A17 Neural Engine was measured with INT8 for whatever reason, thus making it seem that the A17 had a faster NPU than the M3 when they were essentially the same. The M4 looks like a huge leap over the M3 on paper because of the TOPS figure, but it's actually only about 5-10% faster. The M2 was actually the biggest boost to NPU performance in the four generations of M chip, about 40% faster than the M1.

For the record this is not Apple being sneaky, they made the change because AMD, Intel, and other companies coming out with NPU hardware are measuring in INT8 and it's become something of the de-facto standard benchmark for NPUs. Apple, with good reason, didn't want their NPUs specs to look worse because of a reason like that.
Score: 13 Votes (Like | Disagree)
19 months ago
Given the scale of Nvidia GPU sales I'd like to hope the Apple gets back into the server game since there is a lot of money to be made on high memory inference hardware. Assume the M4 Ultra is able to come with 384 GB of memory each it will be quite competitive for the 400B+ parameter models.
Score: 13 Votes (Like | Disagree)
JPack Avatar
19 months ago
Did Apple upgrade to M4 because the base memory was increased? 😄

But seriously, the base M4 NPU outperforms the M2 Ultra. The upgrade is well deserved.
Score: 8 Votes (Like | Disagree)
JippaLippa Avatar
19 months ago
I oughta say, the M4 feels like the very first actual successor to the M1.
Score: 7 Votes (Like | Disagree)
maruyama Avatar
19 months ago
I just want to know when this guy's coming back.
Score: 6 Votes (Like | Disagree)