Apple Says 'Hey Siri' Detection Briefly Becomes Extra Sensitive If Your First Try Doesn't Work

A new entry in Apple's Machine Learning Journal provides a closer look at how hardware, software, and internet services work together to power the hands-free "Hey Siri" feature on the latest iPhone and iPad Pro models.

Specifically, a very small speech recognizer built into the embedded motion coprocessor runs all the time and listens for "Hey Siri." When just those two words are detected, Siri parses any subsequent speech as a command or query.

The detector uses a Deep Neural Network to convert the acoustic pattern of a user's voice into a probability distribution. It then uses a temporal integration process to compute a confidence score that the phrase uttered was "Hey Siri."
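
As a rough illustration of what a temporal integration step can look like, the Python sketch below scores a sequence of per-frame log-probabilities against an ordered list of phrase states, letting each frame either stay in the current state or advance to the next one. The function name, the input layout, and the simple stay-or-advance rule are assumptions made for illustration; Apple's article describes its own formulation, and this is not a reproduction of it.

```python
import math


def phrase_score(frame_log_probs):
    """Return the average log-probability of the best left-to-right path.

    frame_log_probs[t][i] is the (hypothetical) log-probability that audio
    frame t belongs to phrase state i, as a neural network might output.
    The path must start in state 0 and end in the last state.
    """
    n_states = len(frame_log_probs[0])
    neg_inf = float("-inf")

    # best[i] = best accumulated score of any path ending in state i so far
    best = [neg_inf] * n_states
    best[0] = frame_log_probs[0][0]

    for frame in frame_log_probs[1:]:
        prev = best
        best = [neg_inf] * n_states
        for i in range(n_states):
            stay = prev[i]                                # remain in state i
            advance = prev[i - 1] if i > 0 else neg_inf   # move on from state i-1
            best[i] = max(stay, advance) + frame[i]

    # Normalize by length so long and short utterances are comparable
    return best[-1] / len(frame_log_probs)


# Toy input: two states, three frames
matching = [[math.log(0.9), math.log(0.1)],
            [math.log(0.9), math.log(0.1)],
            [math.log(0.1), math.log(0.9)]]
mismatched = [[math.log(0.1), math.log(0.9)]] * 3
```

With the toy input above, `phrase_score(matching)` comes out higher than `phrase_score(mismatched)`, which is the property a detector would then compare against its thresholds.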

If the score is high enough, Siri wakes up and proceeds to complete the command or answer the query automatically.

If the score exceeds Apple's lower threshold but not the upper threshold, however, the device enters a more sensitive state for a few seconds, so that Siri is much more likely to be invoked if the user repeats the phrase—even without more effort.
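
The two-threshold behavior can be sketched in a few lines of Python. The class name, the threshold values (0.6 and 0.8), and the three-second window are all hypothetical; Apple only says the sensitive state lasts "a few seconds" and does not publish the numbers used here.

```python
class SecondChanceTrigger:
    """Toy two-threshold detector with a short extra-sensitive window."""

    def __init__(self, upper=0.8, lower=0.6, window_seconds=3.0):
        self.upper = upper              # score needed in the normal state
        self.lower = lower              # score needed in the sensitive state
        self.window = window_seconds    # how long the sensitive state lasts
        self.sensitive_until = float("-inf")

    def observe(self, score, now):
        """Return True if the assistant should wake for this score.

        `now` is a timestamp in seconds supplied by the caller.
        """
        in_sensitive_state = now < self.sensitive_until
        threshold = self.lower if in_sensitive_state else self.upper

        if score >= threshold:
            # Triggered: leave the sensitive state
            self.sensitive_until = float("-inf")
            return True

        if score >= self.lower:
            # Near miss: lower the bar for the next few seconds
            self.sensitive_until = now + self.window
        return False


trigger = SecondChanceTrigger()
first_try = trigger.observe(0.7, now=0.0)    # near miss, opens the window
second_try = trigger.observe(0.7, now=1.0)   # same score, inside the window
```

In this sketch the first attempt does not wake the assistant, while an identical score one second later does, because the near miss temporarily lowered the bar.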

"This second-chance mechanism improves the usability of the system significantly, without increasing the false alarm rate too much because it is only in this extra-sensitive state for a short time," said Apple.

To reduce false triggers from strangers, Apple invites users to complete a short enrollment session in which they say five phrases that each begin with "Hey Siri." The examples are saved on the device.

"We compare the distances to the reference patterns created during enrollment with another threshold to decide whether the sound that triggered the detector is likely to be 'Hey Siri' spoken by the enrolled user," Apple explains.

This process not only reduces the probability that "Hey Siri" spoken by another person will trigger the iPhone, but also reduces the rate at which other, similar-sounding phrases trigger Siri.
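
A minimal Python sketch of that comparison follows. It assumes each utterance has already been reduced to a fixed-length list of numbers, and it uses the average Euclidean distance to the enrolled examples; the function name, the vector representation, the distance metric, and the `max_distance` value are illustrative assumptions rather than details taken from Apple's article.

```python
import math


def is_enrolled_speaker(candidate, references, max_distance=1.0):
    """Accept the candidate if it is close enough to the enrolled examples.

    candidate:  feature vector for the utterance that fired the detector
    references: feature vectors saved during enrollment
    """
    distances = [math.dist(candidate, ref) for ref in references]
    average = sum(distances) / len(distances)
    return average <= max_distance


# Toy two-dimensional "voice prints" saved during enrollment
enrolled = [[0.0, 0.0], [0.0, 2.0]]
same_user = is_enrolled_speaker([0.0, 1.0], enrolled)   # average distance 1.0
stranger = is_enrolled_speaker([3.0, 4.0], enrolled)    # much farther away
```

A tighter `max_distance` rejects more strangers and similar-sounding phrases at the cost of occasionally rejecting the enrolled user, which is the trade-off the threshold controls.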

Apple also says it made "Hey Siri" recordings at both close and far distances in various environments, such as a kitchen, car, bedroom, and restaurant, using native speakers of many languages around the world.

For many more technical details about how "Hey Siri" works, be sure to read Apple's full article on its Machine Learning Journal.

Top Rated Comments

TurboPGT!
73 months ago
joshwenke wrote: "There's a lot going on behind the scenes with Siri. I don't think we give her enough credit."
My biggest problem with Siri is not any of the silly one-off bugs that get memed to death on the internet.

My biggest problem was described beautifully in a recent article I read somewhere. About how a voice assistant with 10 possible working commands is great. And one with unlimited working commands is great. But one with hundreds of working commands is terrible, because the user will never know what all those commands are. They will just use the core few that they know. And if they try a command and it isn't one of those hundreds, it causes confusion and doubt.
Score: 23 Votes
seatton
73 months ago
Soon enough Siri is going to be smarter than Apple executives realizing that the MacMini needs an update ASAP.
Score: 15 Votes
joshwenke
73 months ago
There's a lot going on behind the scenes with Siri. I don't think we give her enough credit.
Score: 12 Votes
macTW
73 months ago
Everything about Siri is amazing, and far beyond any competition.

Except her ability to do stuff.
Score: 8 Votes
840quadra
73 months ago
There has to be something new going on with regard to its understanding. After initiating Hey Siri in the car, her comprehension through the Bluetooth system has been notably better for me during the past year. I was wondering if the 7 was doing some machine learning locally, or if the cloud was parsing my questions better with this device. My 5S still struggles with my voice, though it could also be the mics on that device.

TurboPGT! wrote: "My biggest problem with Siri is not any of the silly one-off bugs that get memed to death on the internet.

My biggest problem was described beautifully in a recent article I read somewhere. About how a voice assistant with 10 possible working commands is great. And one with unlimited working commands is great. But one with hundreds of working commands is terrible, because the user will never know what all those commands are. They will just use the core few that they know. And if they try a command and it isn't one of those hundreds, it causes confusion and doubt."
Apple would help Siri’s reputation a lot by keeping an active Wiki going for the service and the commands it will respond to. Many people have tried to use Siri once for a specific task, found it didn’t work, and given up since. It is really hard for a general user to discover new tricks.

Places like iMore do a decent job of documenting, but it would be AWESOME if the source had a good manual for it.
Score: 7 Votes
Schizoid
73 months ago
I just had a quick check on the Mac to see if anything's changed...

Me: "How many files have I opened today?"
Siri: "on the internet I found 'how big is Allah.'"

maybe next year
Score: 7 Votes