Apple's New Transcription APIs Blow Past Whisper in Speed Tests

Apple's new speech-to-text transcription APIs in iOS 26 and macOS Tahoe are delivering dramatically faster speeds compared to rival tools, including OpenAI's Whisper, based on beta testing conducted by MacStories' John Voorhees.

Call recording and transcription in iOS 18.1

Apple uses its own native speech frameworks to power live transcription features in apps like Notes and Voice Memos, as well as phone call transcription in iOS 18.1. To improve efficiency in iOS 26 and macOS Tahoe, Apple has introduced a new SpeechAnalyzer class and SpeechTranscriber module that handle the same kinds of transcription requests.

According to Voorhees, the new models processed a 34-minute, 7GB video file in just 45 seconds using a command line tool called Yap (developed by Voorhees' son, Finn). That is about 55% less time than MacWhisper's Large V3 Turbo model needed, which took 1 minute and 41 seconds for the same file.

Other Whisper-based tools were slower still, with VidCap taking 1:55 and MacWhisper's Large V2 model requiring 3:55 to complete the same transcription task. Voorhees also reported no noticeable difference in transcription quality across models.
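For readers who want to check the arithmetic, the short Python sketch below recomputes the comparison from the timings reported above. The helper names (`time_reduction`, `speedup`) are illustrative only and are not part of Yap or any Apple API; the sketch assumes the reported wall-clock times are directly comparable.

```python
# Timings reported in the article for the same 34-minute video, in seconds.
TIMINGS_S = {
    "Yap (SpeechAnalyzer + SpeechTranscriber)": 45,
    "MacWhisper Large V3 Turbo": 1 * 60 + 41,
    "VidCap": 1 * 60 + 55,
    "MacWhisper Large V2": 3 * 60 + 55,
}

VIDEO_LENGTH_S = 34 * 60  # length of the source video


def time_reduction(baseline_s: float, candidate_s: float) -> float:
    """Fraction of the baseline's wall-clock time that the candidate saves."""
    return (baseline_s - candidate_s) / baseline_s


def speedup(baseline_s: float, candidate_s: float) -> float:
    """How many times faster the candidate is than the baseline."""
    return baseline_s / candidate_s


apple_s = TIMINGS_S["Yap (SpeechAnalyzer + SpeechTranscriber)"]

for name, seconds in TIMINGS_S.items():
    if seconds == apple_s:
        continue  # skip the Apple-based tool itself
    print(
        f"vs {name}: {time_reduction(seconds, apple_s):.0%} less time, "
        f"{speedup(seconds, apple_s):.2f}x the speed"
    )

# Ratio of video length to processing time for the Apple-based tool.
print(f"Real-time factor: {VIDEO_LENGTH_S / apple_s:.1f}x")
```

Note that "55% less time" and "2.2 times the speed" describe the same measurement; the first is relative to the slower tool's duration, the second is a ratio of durations.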

The speed advantage comes from Apple's on-device processing approach, which avoids the network overhead that typically slows cloud-based transcription services.

While the time difference might seem modest for individual files, Voorhees notes that the performance gain increases exponentially when processing multiple videos or longer content. For anyone generating subtitles or transcribing lectures regularly, the efficiency boost could save them hours.
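As a rough illustration of how the savings add up over a batch, the Python sketch below multiplies the per-file difference from the benchmark by the number of files. It assumes, as a simplification, that every file takes the same time as the 34-minute test video on each tool; real workloads will vary. Under that assumption the total saving grows in direct proportion to the number of files. The function name `seconds_saved` is hypothetical.

```python
# Per-file times from the article's benchmark, in seconds.
APPLE_S = 45            # Yap using SpeechAnalyzer and SpeechTranscriber
WHISPER_TURBO_S = 101   # MacWhisper Large V3 Turbo (1 minute 41 seconds)


def seconds_saved(n_files: int, fast_s: int = APPLE_S, slow_s: int = WHISPER_TURBO_S) -> int:
    """Total time saved over n_files, assuming constant per-file times."""
    return n_files * (slow_s - fast_s)


for n in (1, 10, 100, 1000):
    saved = seconds_saved(n)
    print(f"{n:>5} files: {saved / 3600:.2f} hours saved")
```

With the benchmark figures, a thousand similar files would save a little over 15 hours, which is the kind of gain that matters for regular subtitle or lecture transcription.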

The Speech framework components are available across iPhone, iPad, Mac, and Vision Pro platforms in the current beta releases. Voorhees expects Apple's transcription technology to eventually replace Whisper as the go-to solution for Mac transcription apps.

Top Rated Comments

Big_D
7 weeks ago
Impressive, if it is accurate. What the story doesn't mention is how accurate each of those transcriptions was? Were they all identical? Did one or other have more mistakes? What is the accuracy percentage for each one, and how badly wrong were those mistakes?

I'm not trying to defend ChatGPT, just the speed is a single metric, which isn't very useful if the results are garbage. If the Apple one is faster and more accurate, that is incredible, faster and as accurate, impressive, faster but full of errors, not really that useful.

Hopefully it is the first one: it is faster and more accurate.
Score: 26 Votes

neuropsychguy
7 weeks ago

Big_D wrote:
"Impressive, if it is accurate. What the story doesn't mention is how accurate each of those transcriptions was? Were they all identical? Did one or other have more mistakes? What is the accuracy percentage for each one, and how badly wrong were those mistakes?

"I'm not trying to defend ChatGPT, just the speed is a single metric, which isn't very useful if the results are garbage. If the Apple one is faster and more accurate, that is incredible, faster and as accurate, impressive, faster but full of errors, not really that useful.

"Hopefully it is the first one: it is faster and more accurate."

Nothing scientific, but in the MacStories post: "What stood out above all else was Yap’s speed. By harnessing SpeechAnalyzer and SpeechTranscriber on-device, the command line tool tore through the 7GB video file a full 55% faster than MacWhisper’s Large V3 Turbo model, with no noticeable difference in transcription quality."

It would be good to see more formal comparisons with data you suggested. Also, it would be good to know what computer John was using for the test.
Score: 17 Votes

Big_D
7 weeks ago

Big_D wrote: "Impressive, if it is accurate."

OK, I read the original article, they all had similar problems with the podcast name, AppStories, writing it as two words instead of CamelCasing it, which is acceptable, and they all had similar problems with people's names. But the Apple tools weren't any less accurate, despite being much faster.
Score: 15 Votes

jmonster
7 weeks ago
Not mentioning accuracy at all implies it's not. Lots of models are faster than O3, but they're not better.

This is just silly getting sillier. Write something meaningful.

Whisper works in real time. Anything faster is irrelevant for iOS.

And saying it's because of network overhead? When you can run OpenAI's Whisper locally? ... mhm.

This is a blatant advertisement just regurgitating Apple's marketing bullets.
Score: 7 Votes

klasma
7 weeks ago
Speech-to-text is a good use case for on-device processing, but yes, accuracy is an important question, not to mention (multi-)language support.
Score: 5 Votes

Basic75
7 weeks ago

Quoting the article: "While the time difference might seem modest for individual files, Voorhees notes that the performance gain increases exponentially when processing multiple videos or longer content."

That's not how it works. Recommend maths lesson.
Score: 4 Votes