Apple's Latest Machine Learning Journal Entry Focuses on 'Hey Siri' Trigger Phrase

Apple's latest entry in its online Machine Learning Journal focuses on the personalization process that users partake in when activating "Hey Siri" features on iOS devices. Across all Apple products, "Hey Siri" invokes the company's AI assistant, and can be followed up by questions like "How is the weather?" or "Message Dad I'm on my way."

"Hey Siri" was introduced in iOS 8 on the iPhone 6, and at that time it could only be used while the iPhone was charging. Afterwards, the trigger phrase could be used at all times thanks to a low-power and always-on processor that fueled the iPhone and iPad's ability to continuously listen for "Hey Siri."

hey siri iphone x
In the new Machine Learning Journal entry, Apple's Siri team breaks down its technical approach to the development of a "speaker recognition system." The team created deep neural networks and "set the stage for improvements" in future iterations of Siri, all motivated by the goal of creating "on-device personalization" for users.

Apple's team says that "Hey Siri" as a phrase was chosen because of its "natural" phrasing, and described three scenarios where unintended activations prove troubling for "Hey Siri" functionality. These include "when the primary users says a similar phrase," "when other users say "Hey Siri"," and "when other users say a similar phrase." According to the team, the last scenario is "the most annoying false activation of all."

To lessen these accidental activations of Siri, Apple leverages techniques from the field of speaker recognition. Importantly, the Siri team says that it is focused on "who is speaking" and less on "what was spoken."

The overall goal of speaker recognition (SR) is to ascertain the identity of a person using his or her voice. We are interested in “who is speaking,” as opposed to the problem of speech recognition, which aims to ascertain “what was spoken.” SR performed using a phrase known a priori, such as “Hey Siri,” is often referred to as text-dependent SR; otherwise, the problem is known as text-independent SR.

The journal entry then goes into how users enroll in a personalized "Hey Siri" process using explicit and implicit enrollment. Explicit begins the minute that users speak the trigger phrase a few times, but implicit is "created over a period of time" and made during "real-world situations."

The Siri team says that the remaining challenges faced by speaker recognition is figuring out how to get quality performance in reverberant (large room) and noisy (car) environments. You can check out the full Machine Learning Journal entry on "Hey Siri" right here.

Since it began last summer, Apple has shared numerous entries in its Machine Learning Journal about complex topics, which have already included "Hey Siri", face detection, and more. All past entries can be seen on Apple.com.

Top Rated Comments

ThatGuyInLa Avatar
79 months ago
Apple can post all the journals and blog entries they want. Siri sucks. (When compared to others)

As I’ve said before, they need to fire the entire team and hire new. Poach if needed. They need to JOBS THIS SOB.
Score: 8 Votes (Like | Disagree)
barkomatic Avatar
79 months ago
Blah blah blah, neural networks, blah blah blah, set the stage for improvements, blah. They need to actually *make* those improvements and quickly. Either that or allow us to use Google assistant or Alexa natively on the iPhone. I'm tired of Siri telling me about web searches. It's hard to believe how Apple dropped the ball that they brought to the game.
Score: 6 Votes (Like | Disagree)
WWPD Avatar
79 months ago
I thought they did say 'Computer' often before issuing commands in Star Trek? Maybe my memory is fuzzy though.
Computer was always the trigger word in Star Trek.
Score: 3 Votes (Like | Disagree)
sundog925 Avatar
79 months ago
They're whole siri system needs an overhaul.
Its lagging so far behind others, dictation is awful, through bluetooth is worse. its just all around poor.

Even spotify's new voice control is lightyears more accurate than siri :rolleyes:
Score: 3 Votes (Like | Disagree)
TheBruno Avatar
79 months ago
I had to disable this trigger. I can't say 'serious' anywhere in the vicinity of my iPad Pro.
Score: 2 Votes (Like | Disagree)
iamgalt Avatar
79 months ago
I can't say I've experienced too many unintended activations, but I do still wish apple would allow us to change the triggering phrase to whatever we wanted. That may help with unintended activations for each person.
Score: 2 Votes (Like | Disagree)

Popular Stories

iPhone 15 Pro FineWoven

Apple Reportedly Stops Production of FineWoven Accessories

Sunday April 21, 2024 6:03 am PDT by
Apple has stopped production of FineWoven accessories, according to the Apple leaker and prototype collector known as "Kosutami." In a post on X (formerly Twitter), Kosutami explained that Apple has stopped production of FineWoven accessories due to its poor durability. The company may move to another non-leather material for its premium accessories in the future. Kosutami has revealed...
Provenance Emulator

PlayStation and SEGA Emulator for iPhone and Apple TV Coming to App Store [Updated]

Friday April 19, 2024 8:29 am PDT by
The lead developer of the multi-emulator app Provenance has told iMore that his team is working towards releasing the app on the App Store, but he did not provide a timeframe. Provenance is a frontend for many existing emulators, and it would allow iPhone and Apple TV users to emulate games released for a wide variety of classic game consoles, including the original PlayStation, GameCube, Wii,...
iOS 17 All New Features Thumb

iOS 17.5 Will Add These New Features to Your iPhone

Sunday April 21, 2024 3:00 am PDT by
The upcoming iOS 17.5 update for the iPhone includes only a few new user-facing features, but hidden code changes reveal some additional possibilities. Below, we have recapped everything new in the iOS 17.5 and iPadOS 17.5 beta so far. Web Distribution Starting with the second beta of iOS 17.5, eligible developers are able to distribute their iOS apps to iPhone users located in the EU...
maxresdefault

Apple Announces 'Let Loose' Event on May 7 Amid Rumors of New iPads

Tuesday April 23, 2024 7:11 am PDT by
Apple has announced it will be holding a special event on Tuesday, May 7 at 7 a.m. Pacific Time (10 a.m. Eastern Time), with a live stream to be available on Apple.com and on YouTube as usual. The event invitation has a tagline of "Let Loose" and shows an artistic render of an Apple Pencil, suggesting that iPads will be a focus of the event. Subscribe to the MacRumors YouTube channel for more ...
apple vision pro orange

Apple Vision Pro Customer Interest Dying Down at Some Retail Stores

Monday April 22, 2024 2:12 am PDT by
Apple Vision Pro, Apple's $3,500 spatial computing device, appears to be following a pattern familiar to the AR/VR headset industry – initial enthusiasm giving way to a significant dip in sustained interest and usage. Since its debut in the U.S. in February 2024, excitement for the Apple Vision Pro has noticeably cooled, according to Bloomberg's Mark Gurman. Writing in his latest Power On...