Apple's Greg Joswiak: Siri Wasn't Engineered to Be Trivial Pursuit

iOS 9 SiriIn iOS 11, Apple's AI-based personal assistant Siri has a much more natural voice that goes a long way towards making Siri sound human like. Siri speaks with a faster, smoother cadence with elongated syllables and pitch variation, a noticeable departure from the more machine like sound in iOS 10.

The team behind Siri, including Siri senior director Alex Acero, has worked for years to improve the way Siri speaks, according to a new interview Acero did alongside Apple VP of marketing Greg Joswiak with Wired. While Siri's voice recognition capabilities were powered by a third-party company early on in Siri's life, Acero's team took over Siri development a few years back, leading to several improvements to the personal assistant since then.

Siri is powered by deep learning and AI, technology that has much improved her speech recognition capabilities. According to Wired, Siri's raw voice recognition capabilities are now able to correctly identify 95 percent of users' speech, on par with rivals like Alexa and Cortana.

Apple is still working to overcome negative perceptions about Siri, and blames many of the early issues on the aforementioned third-party partnership.

"It was like running a race and, you know, somebody else was holding us back," says Greg Joswiak, Apple's VP of product marketing. Joswiak says Apple always had big plans for Siri, "this idea of an assistant you could talk to on your phone, and have it do these things for you in a more easy way," but the tech just wasn't good enough. "You know, garbage in, garbage out," he says.

Joswiak says Apple's aim from the beginning has been to make Siri a "get-s**t-done" machine. "We didn't engineer this thing to be Trivial Pursuit!" he told Wired. Apple wants Siri to serve as an automated friend that can help people do more.

siriwaveform
One unique Siri attribute is its ability to work in multiple languages. Siri supports English, French, Dutch, Mandarin, Cantonese, Finnish, Hebrew, Malay, Arabic, Italian, and Spanish, and more, including dialect variants (like English in the UK and Australia) and accents. The Siri team combines pre-existing databases of local speech with local voice talent and on-device dictation, transcribing and dissecting the content to find all of the individual sounds in a given language and all of the ways those sounds are pronounced.

In areas where Apple offers spoken dictation but no Siri support, it's gathering data for future Siri support, and in places where Siri is already available, spoken interactions between user and device (gathered anonymously) are used to improve algorithms and train the company's neural network.

Creating the right voice for Siri in a given language hinges on the proper voice talent, and Apple uses an "epic search" with hundreds of people to find someone who sounds helpful, friendly, spunky, and happy without overdoing it. Once the right person is found, Apple records them for weeks at a time to create the right sound. So far, Apple has repeated this process for all 21 languages Siri supports.

Ultimately, Acero and his Siri team are aiming to make Siri sound more like a trusted person than a robot, creating an attachment to the AI that will "make Siri great" even when Siri fails to answer a query properly. Apple also wants to make people more aware of what Siri can and can't do and that it exists in the first place, which is why iOS 11 includes Siri-centric features like cross-device syncing and a better understanding of user interests and preferences.

Wired's full piece, which goes into much more detail on how Siri recognizes various aspects of speech and how Apple chooses voice talent can be read over on the site.

Top Rated Comments

duffman9000 Avatar
87 months ago
Yeah, it’s been garbage in and garbage out for years and now Apple wants us to forget that?
Score: 49 Votes (Like | Disagree)
falainber Avatar
87 months ago
Improving Siri's voice is a good thing but it's much lower on the priority list than making it smarter.
Score: 49 Votes (Like | Disagree)
Menopause Avatar
87 months ago
I would be glad if a dog barked at me in place of Siri's "natural voice" if it were 10% more intelligent than what Siri is today. The most pathetic piece of crap ever maintained by Apple.



Score: 39 Votes (Like | Disagree)
nope7308 Avatar
87 months ago
Siri is the result of believing your own hype. I'll just leave it at that.
Score: 23 Votes (Like | Disagree)
desmond2046 Avatar
87 months ago
Whenever my friends tell me how scary AI is and how it is going to destroy the human beings, I tell them to try Siri.
Score: 23 Votes (Like | Disagree)
AngerDanger Avatar
87 months ago
Apple wants Siri to serve as an automated friend that can help people do more.
Mission accomplished. I can't count how many times I've been chillin' with my friends and having a personal discussion, which tends to go a bit like:

"I get the strangest feeling Steve is just using Sharon as a sort of ego boost. You know what I mean?"

"I don't know what 'I get the strangest feeling Steve is just using Sharon as a sort of ego boost. You know what I mean?' means, but I can search the web for it."

Ahaha, we're such nutters!
Score: 19 Votes (Like | Disagree)

Popular Stories

maxresdefault

Apple Announces 'Let Loose' Event on May 7 Amid Rumors of New iPads

Tuesday April 23, 2024 7:11 am PDT by
Apple has announced it will be holding a special event on Tuesday, May 7 at 7 a.m. Pacific Time (10 a.m. Eastern Time), with a live stream to be available on Apple.com and on YouTube as usual. The event invitation has a tagline of "Let Loose" and shows an artistic render of an Apple Pencil, suggesting that iPads will be a focus of the event. Subscribe to the MacRumors YouTube channel for more ...
Apple Silicon AI Optimized Feature Siri

Apple Releases Open Source AI Models That Run On-Device

Wednesday April 24, 2024 3:39 pm PDT by
Apple today released several open source large language models (LLMs) that are designed to run on-device rather than through cloud servers. Called OpenELM (Open-source Efficient Language Models), the LLMs are available on the Hugging Face Hub, a community for sharing AI code. As outlined in a white paper [PDF], there are eight total OpenELM models, four of which were pre-trained using the...
Apple Vision Pro Dual Loop Band Orange Feature 2

Apple Cuts Vision Pro Shipments as Demand Falls 'Sharply Beyond Expectations'

Tuesday April 23, 2024 9:44 am PDT by
Apple has dropped the number of Vision Pro units that it plans to ship in 2024, going from an expected 700 to 800k units to just 400k to 450k units, according to Apple analyst Ming-Chi Kuo. Orders have been scaled back before the Vision Pro has launched in markets outside of the United States, which Kuo says is a sign that demand in the U.S. has "fallen sharply beyond expectations." As a...
iOS 18 Siri Integrated Feature

iOS 18 Rumored to Add These 10 New Features to Your iPhone

Wednesday April 24, 2024 2:05 pm PDT by
Apple is set to unveil iOS 18 during its WWDC keynote on June 10, so the software update is a little over six weeks away from being announced. Below, we recap rumored features and changes planned for the iPhone with iOS 18. iOS 18 will reportedly be the "biggest" update in the iPhone's history, with new ChatGPT-inspired generative AI features, a more customizable Home Screen, and much more....
iPad And Calculator App Feature

Apple Finally Plans to Release a Calculator App for iPad Later This Year

Tuesday April 23, 2024 9:08 am PDT by
Apple is finally planning a Calculator app for the iPad, over 14 years after launching the device, according to a source familiar with the matter. iPadOS 18 will include a built-in Calculator app for all iPad models that are compatible with the software update, which is expected to be unveiled during the opening keynote of Apple's annual developers conference WWDC on June 10. AppleInsider...