Apple Teams Up With NVIDIA to Speed Up AI Language Models

Apple has shared details on a collaboration with NVIDIA to greatly improve the performance of large language models (LLMs) by implementing a new text generation technique that offers substantial speed improvements for AI applications.

ml research apple
Apple earlier this year published and open-sourced Recurrent Drafter (ReDrafter), an approach that combines beam search and dynamic tree attention methods to accelerate text generation. Beam search explores multiple potential text sequences at once for better results, while tree attention organizes and removes redundant overlaps among these sequences to improve efficiency.

Apple has now integrated the technology into NVIDIA's TensorRT-LLM framework, which optimizes LLMs running on NVIDIA GPUs, where it achieved "state of the art performance," according to Apple. The integration saw the technique manage a 2.7x speed increase in tokens generated per second during testing with a production model containing tens of billions of parameters.

Apple says the improved performance not only reduces user-perceived latency but also leads to decreased GPU usage and power consumption. From Apple's Machine Learning Research blog:

"LLMs are increasingly being used to power production applications, and improving inference efficiency can both impact computational costs and reduce latency for users. With ReDrafter's novel approach to speculative decoding integrated into the NVIDIA TensorRT-LLM framework, developers can now benefit from faster token generation on NVIDIA GPUs for their production LLM applications."

Developers interested in implementing ReDrafter can find detailed information on both Apple's website and NVIDIA's developer blog.

Tag: Nvidia

Popular Stories

Apple Watch Ultra 2 Complications

Apple Watch Ultra 3 Just Weeks Away: Eight Reasons to Upgrade

Wednesday August 20, 2025 6:44 am PDT by
We're only weeks away from Apple's annual iPhone event – rumored to take place on September 9 – and along with the new iPhone 17 series, we're going to get a new version of the Apple Watch Ultra for the first time since 2023. By the time the Ultra 3 is unveiled, it will have been two years since the previous model arrived. The intervening period has left plenty of room for enhancements,...
apple wallet drivers license feature iPhone 15 pro

iPhone Driver's Licenses in Apple Wallet Now Available in 10 U.S. States

Wednesday August 20, 2025 12:00 pm PDT by
In select U.S. states, residents can add their driver's license or state ID to the Wallet app on the iPhone and Apple Watch, providing a convenient and contactless way to display proof of identity or age at select airports and businesses, and in select apps. This week, Apple announced the 10th U.S. state that has implemented the feature: Montana. Below, we have recapped key details about...
iPhone 17 Pro Dark Blue and Orange

When Is iPhone 17 Coming Out?

Wednesday August 20, 2025 5:00 am PDT by
Apple's iPhone 17 series is expected to debut in September 2025. This release follows Apple's recent trend of introducing new iPhone models annually in the fall. To unveil the iPhone 17, iPhone 17 Air, iPhone 17 Pro, and iPhone 17 Pro Max, Apple is expected to hold its annual iPhone announcement event during the week of September 8, 2025, with September 9 or 10 emerging as the most likely...
AirPods Pro 3 Heart Rate Tracking Feature

AirPods Pro 3 Expected to Launch This Year With Key New Feature

Sunday August 24, 2025 7:16 am PDT by
Bloomberg's Mark Gurman expects Apple to release new AirPods Pro this year, and he said the earbuds will have a key new feature: heart rate monitoring. From his Power On newsletter today, with emphasis added:As for Apple's other devices, there's a lot in the fall pipeline — though many of the new products are only incremental upgrades. There will be Apple Watch updates, faster Vision...
maxresdefault

The MacRumors Show: Apple Watch Series 11 and Ultra 3 or Wait for Next Year?

Friday August 22, 2025 9:15 am PDT by
On this week's episode of The MacRumors Show, we talk through what to expect from the Apple Watch SE 3, Series 11, and Ultra 3, and whether it's worth holding off on an upgrade until next year. Subscribe to The MacRumors Show YouTube channel for more videos The third-generation Apple Watch SE is rumored to feature a larger display (perhaps like the Apple Watch Series 7), the S11 chip, and...
apple design award 2025

Apple Announces 2025 Design Award Winners Ahead of WWDC 2025

Tuesday June 3, 2025 10:14 am PDT by
As we wait for WWDC to kick off next Monday, Apple today announced the winners of its annual Apple Design Awards, recognizing apps and games for their innovation, ingenuity, and technical achievement. The 2025 Apple Design Award winners are listed below, with one app and one game selected per category: Delight and Fun - CapWords (App) and Balatro (Game) Innovation - Play (App) and PBJ -...

Top Rated Comments

attohs Avatar
9 months ago
NVidia? Did hell freeze over again?
Score: 37 Votes (Like | Disagree)
vegetassj4 Avatar
9 months ago
NVIDIA and Apple??!!? Working together again?



Attachment Image
Score: 13 Votes (Like | Disagree)
Delgibbons Avatar
9 months ago
Can't wait to put a 5090 in my Ma....

oh.
Score: 12 Votes (Like | Disagree)
redbeard331 Avatar
9 months ago
Good we have to hurry this up.



Attachment Image
Score: 9 Votes (Like | Disagree)
lilkwarrior Avatar
9 months ago
What would be an even better collaboration would be Apple enabling Nvidia GPU options again—at least for the Mac Pro.

It would be AWESOME to be able to use Nvidia’s ray-tracing and tensor cores with my creative professional and AI problems with Titan-class/Prosumer/workstation GPUs (x90 and up) again without having to switch to my PC.

A Nvidia MPX GPU module as capable as a 5090 with no wires and Thunderbolt 5 support would be a nirvana-like outcome—especially if Microsoft, Apple, and/or Valve enables a way to dual boot to Windows on ARM and SteamOS.

While I love building a liquid-cooled PC, I and various prosumers would finally have a choice to stop buying PCs altogether
Score: 7 Votes (Like | Disagree)
Unregistered 4U Avatar
9 months ago

Since Apple now produces its own GPUs there is no need for hell to freeze over. Do you even remember the reason Apple and Nvidia parted ways? It was over Nvidia wanting complete access to macOS’s core. Apple said no way.
And, we’ve since had a REALLY good example (CrowdStrike) of why this would have been a baaaad idea.
Score: 4 Votes (Like | Disagree)