Skip to Content

Apple Intelligence Servers Expected to Start Using M4 Chips Next Year After M2 Ultra This Year

Apple plans to start using the M4 chip in its Apple Intelligence servers next year, according to a Nikkei Asia report this week, citing TrendForce analyst Frank Kung. Apple Intelligence servers are currently powered by the M2 Ultra chip, per previous reports.

Apple Intelligence General Feature 2
The report claims that Apple has approached its largest manufacturing partner Foxconn about building additional Apple Intelligence servers in Taiwan.

It is unclear if the new servers will be equipped with the standard M4 chip, or a higher-end variant like the M4 Pro, M4 Max, or yet-to-be-announced M4 Ultra. It is also unclear if the existing servers with the M2 Ultra will be immediately upgraded to M4 chips.

Apple's plan to use M4 chips in servers was previously revealed by Haitong analyst Jeff Pu.

While some Apple Intelligence features rely entirely on on-device processing, Apple says requests that "require more processing power" rely on Private Cloud Compute models that are stored on the Apple Intelligence servers. When using Private Cloud Compute, Apple says that a user's data is never stored or shared with the company.

iOS 18.1 was released last month with the first Apple Intelligence features on the iPhone, such as writing tools and notification summaries. iOS 18.2 will be released to the public in December with additional Apple Intelligence features, including Genmoji for custom emoji, Image Playground for image generation, ChatGPT integration for Siri, and more.

Popular Stories

Apple Event Logo

Apple Released Seven New Products Today

Wednesday March 11, 2026 7:05 am PDT by
Starting today, the seven new Apple products that were announced last week are available at Apple Stores and beginning to arrive to customers. The colorful MacBook Neo and all of the other new products are on display at most Apple Store locations around the world starting today. Apple Stores have inventory of the new products for both walk-in customers and Apple Store pickup, but...
iOS 27 Mock Quick

10+ New Features Coming in iOS 27

Friday March 13, 2026 2:13 pm PDT by
We're only three months away from Apple's WWDC 2026 event, which will see the company unveil iOS 27. With the fully revamped version of Siri possibly delayed until September, iOS 27 is shaping up to be the update we wanted iOS 26 to be. There will be new Apple Intelligence features, updates for the iPhone Fold, and more, with the latest rumors summarized below. Foldable iPhone Features...
Apple 50 Years of Thinking Different

Apple Announces 50th Anniversary Plans

Thursday March 12, 2026 6:10 am PDT by
Apple today announced that it will celebrate the company's 50th anniversary over the coming weeks, but it has yet to reveal any specific plans. Apple was founded on April 1, 1976, so the company will turn 50 on April 1, 2026. "While Apple is known for looking forward, this milestone offers a special moment to reflect on the journey that has brought the company here, to celebrate the...

Top Rated Comments

Gnattu Avatar
18 months ago
Actually this is a good sign for people want even faster Mac Studio and Mac Pro if Apple is building chips for their own AI server, which could lead to more resources being put into making those high performance chips.
Score: 17 Votes (Like | Disagree)
bradman83 Avatar
18 months ago

But seriously, the base M4 NPU outperforms the M2 Ultra. The upgrade is well deserved.
The base M4 does not outperform the M2 Ultra in NPU function, though if you're not familiar with how TOPS ratings are determined it's easy to misunderstand. The short of it is you need to know what operation the TOPS rating is measuring when comparing.

M1 - M3 Neural Engines were measured using FP16 operations, whereas the M4 chips (and A17 and A18) are measured using INT8 operations. FP16 operations handle about twice as much data per operation than INT8. They're not entirely interchangeable but 20 FP16 operations would equalize out to about 40 INT8 operations.

The M2 Ultra is rated at 31.6 TOPS in FP16, which would be equate to roughly 62-64 TOPS in INT8. The M4 is rated at 38 TOPS in INT8.

Similar confusion occurred with the M3. Apple measured the M3 Neural Engine with FP16 but the corresponding A17 Neural Engine was measured with INT8 for whatever reason, thus making it seem that the A17 had a faster NPU than the M3 when they were essentially the same. The M4 looks like a huge leap over the M3 on paper because of the TOPS figure, but it's actually only about 5-10% faster. The M2 was actually the biggest boost to NPU performance in the four generations of M chip, about 40% faster than the M1.

For the record this is not Apple being sneaky, they made the change because AMD, Intel, and other companies coming out with NPU hardware are measuring in INT8 and it's become something of the de-facto standard benchmark for NPUs. Apple, with good reason, didn't want their NPUs specs to look worse because of a reason like that.
Score: 13 Votes (Like | Disagree)
18 months ago
Given the scale of Nvidia GPU sales I'd like to hope the Apple gets back into the server game since there is a lot of money to be made on high memory inference hardware. Assume the M4 Ultra is able to come with 384 GB of memory each it will be quite competitive for the 400B+ parameter models.
Score: 13 Votes (Like | Disagree)
JPack Avatar
18 months ago
Did Apple upgrade to M4 because the base memory was increased? 😄

But seriously, the base M4 NPU outperforms the M2 Ultra. The upgrade is well deserved.
Score: 8 Votes (Like | Disagree)
JippaLippa Avatar
18 months ago
I oughta say, the M4 feels like the very first actual successor to the M1.
Score: 7 Votes (Like | Disagree)
maruyama Avatar
18 months ago
I just want to know when this guy's coming back.
Score: 6 Votes (Like | Disagree)