Got a tip for us? Share it...

Beyond Multi-Touch: Voice, Gaze, Facial Expression Recognition

Apple's patent applications provide a glimpse into what might lie ahead in our future. For example, we saw an explosion of multi-touch patents in the year before the iPhone's release. And while many of these concepts may never come to market, they provide an interesting look into the direction of Apple's research.

One of the most recent patent applications from Apple is entitled Multitouch Data Fusion and is authored by Wayne Westerman and John Elias (formerly of Fingerworks). Westerman and Elias have been prolific publishers of multi-touch patent applications and likely helped establish the multi-touch technology behind Apple's iPhone. While many have since hoped to see some sort of advanced multi-touch interface for Apple's Macs, only limited multi-touch support has been included into Apple's notebooks.

In Multitouch Data Fusion, however, Westerman and Elias are already exploring the use other inputs to help improve and augment multi-touch interfaces. These include:

- Voice recognition
- Finger identification
- Gaze vector
- Facial expression
- Handheld device movement
- Biometrics (body temp, heart rate, skin impedance, pupil size)

The patent application gives examples of how each of these could be used in conjunction with multi-touch to provide a better user experience. A few highlights are provided here:

Voice - Some tasks are described as being better suited for either voice recognition vs multi-touch. For example, if a user's task is to resize, rotate and change the color of an object on the screen, multi-touch would be best suited for the resize/rotation tasks but changing of color (or inserting text) may be better suited to a voice command.

Finger identification - the article suggests that using built in cameras (such as the iSight) with a swing mirror could provide an over the keyboard view of multi-touch gestures. This video information could be used to better distinguish which fingers are being used in which position. In conjunction with the touch input, this could be used to create more specific and accurate gestures.


Gaze - tracking where the user is looking could help pick windows or objects on the screen. Rather than resorting to moving a mouse pointer to the proper window, a user could simply direct his gaze at the particular window and then invoke a touch gesture.

Facial expression - detecting frustration on the face of a user could provide help prompts or even alter input behavior. The example given is if a user is incorrectly trying to scroll a window using 3 fingers, instead of 2, the computer may be able to detect the frustration and either accept the faulty input or prompt the user.

Many of these technologies are likely years away from the market, but continue to provide an interesting peek into Apple's future.

Top Rated Comments

(View all)

45 months ago
I certainly think Finger identification for security could arrive soon.
Rating: 0 Positives / 0 Negatives
45 months ago
I can see millions of possible uses for this one. For example, the software can detect vasocongestion in a certain extremity, and subsequent tissue swelling. It can then minimize all immediate windows, and open new ones of the user's favorite porn sites. This could usher in a new era of adult-themed convenience! Although they would need to work out some kinks, as I'm sure this would lead to more than a few complications in the workplace.

:apple::apple:Let's hope this doesn't fall through!!!!!:apple::apple:
Rating: 0 Positives / 0 Negatives
45 months ago

Facial expression - detecting frustration on the face of a user could provide help prompts or even alter input behavior.


Cue ClipIt. "It looks like you're angry. Would you like to..."
Rating: 0 Positives / 0 Negatives
45 months ago

Cue ClipIt. "It looks like you're angry. Would you like to..."


God, I hated that thing. Made me wanna bend it out into a traight line and then bend parts up and down until the metal weakends and it just breaks off.
Rating: 0 Positives / 0 Negatives
45 months ago
The What's the story morning glory app?

Whilst some isn't ready for prime time, a few of these are already inthe labs - Microsoft's work on voice, gaze recognition is established, but not so much via a webcam.

Finger recognition as in fingerprints has already been around, but actually working out which finger via a multitouch pad has been mentioned in previous patents. Working out which finger via a webcam is more difficult. Same with facial recognition. However, handheld device movement is already nearly to market in some guises.

I think the tech demos of webcams being able to identify objects, then use them as user interfaces (e.g. noticing a colored plastic wheel, and then being able to use it as a steering wheel for a racing game etc) already show there is a lot more out there, but it's not had OS integration really.

Interestingly, this patent is all about how to fuse multi-touch data - and then how secondary data can be combined with touch data - either refine the touch data, be interpreted in accordance to the touch data, or combine together to be a new command. So it's moving on from just multi-touch. Would Apple release some of the gestures prior to next summer's OS update, to give time to get acquainted with them?

Interesting to see they're patenting the future possiiblities and what they are.
Rating: 0 Positives / 0 Negatives
45 months ago
Why can't Apple just concentrate on fixing its OS, and providing quality made hardware before doing something new. :(

なぜなら!

Apple is now Micopple

I want money (that's what I want)
Rating: 0 Positives / 0 Negatives
45 months ago

Why can't Apple just concentrate on fixing its OS, and providing quality made hardware before doing something new. :(

なぜなら!

Apple is now Micopple

I want money (that's what I want)


Snow Leopard, fool!
Rating: 0 Positives / 0 Negatives
45 months ago
This is again a good excamle of the flaws in the patentability of inventions in US. Where's the invention in the notion that voice, facial expressions, gestures, gaze, body movement, flip-flops can or could be used to prompt actions from a computer? The invention lies in figuring out a technological solution to make it happen and that is also what should be patentable.

And no, I didn't read the actual patent application so I may be somewhat of here :D
Rating: 0 Positives / 0 Negatives
45 months ago

Why can't Apple just concentrate on fixing its OS, and providing quality made hardware before doing something new. :(

なぜなら!

Apple is now Micopple

I want money (that's what I want)


I think that Apple has more than a couple of employees, so everyone doesn't work on just one thing.
Rating: 0 Positives / 0 Negatives
45 months ago
Everything sounds cool except for facial recognition. That idea is flawed. Why, how many people smile, frown, or squint when they are reading something... or even have a fustrated look when reading what someone wrote in an email that went to an unintended audience.

I can just see a lot of mis-readings on this.

I think shock detection could be good. I mean, people tend to slam their desks, shove their keyboard, flick the screen, throw their mouse when ticked off at the computer. Heck MS needs that for the fustration. Everyday my virtual PC's on my Windows work computers lock up or run slow.

Voice recognition, I am not so sure about. It may have some a long way since I tried it - but it had a hard time with some words and if the software crashed and you had to re-install it; you had to spend hours retraining it. My nephew (while he was in high school) had a comuter project he had to do to graduate. His demonstration was to have his XP machine (yes XP has been around that long - he is now 22) being totally operated by vioce command, using Dragon Naturally speaking. He said it was a fun project, but was fustrating as it took hours to train the thing. We are a long way from the old star trek episodes ("computer, 2 to beam up". "Computer, .....").

Right now I would like to see more work done on touch. On a laptop/tablet I can see not needing a mouse/trackpad anymore. A keyboard is still needed for long typing.
Rating: 0 Positives / 0 Negatives

[ Read All Comments ]