IEEE Spectrum: Lip-reading is a difficult task. Spoken language involves some 50 different sounds, called phonemes, yet the face presents just 10 or so different configurations, called visemes. Even for people with normal hearing, most speech is best understood through the use of both aural and visual cues. Now, Helen Bear and Richard Harvey of the University of East Anglia have improved computer lip-reading software by introducing a two-step algorithm. First, the researchers had the computer map a given viseme to the multiple phonemes it can represent; then, an analysis of video recordings of humans speaking different phonemes allowed the program to zero in on the minute visual clues that differentiate one word from another. Lip-reading software could have many uses, from helping hearing-impaired people to aiding in criminal investigations that involve recorded film footage.
For the UNESCO section chief, “striking a balance between global coherence and respect for national ownership and cultural diversity is both essential and complex.”
May 13, 2026 01:46 PM
Get PT newsletters in your inbox
PT The Week in Physics
A collection of PT's content from the previous week delivered every Monday.
One email per week
PT New Issue Alert
Be notified about the new issue with links to highlights and the full TOC.
One email per month
PT Webinars & White Papers
The latest webinars, white papers and other informational resources.