Researchers create software for Google Glass that provides captions for hard-of-hearing users

*********************************
There is now a CONTENT FREEZE for Mercury while we switch to a new platform. It began on Friday, March 10 at 6pm and will end on Wednesday, March 15 at noon. No new content can be created during this time, but all material in the system as of the beginning of the freeze will be migrated to the new platform, including users and groups. Functionally the new site is identical to the old one. webteam@gatech.edu
*********************************

Contact

Jason Maderer
National Media Relations
maderer@gatech.edu
404-385-2966

Sidebar Content
No sidebar content submitted.
Summaries

Summary Sentence:

New software allows hard-of-hearing users to see captions in Google Glass.

Full Summary:

A team of Georgia Institute of Technology researchers has created speech-to-text software for Google Glass that helps hard-of-hearing users with everyday conversations. A hard-of-hearing person wears Glass while a second person speaks directly into a smartphone. The speech is converted to text, sent to Glass and displayed on its heads-up display. 

Media
  • Captioning on Glass Demo Captioning on Glass Demo
    (YouTube Video)
  • Captioning on Glass phone display Captioning on Glass phone display
    (image/png)
  • Captioning on Glass user display Captioning on Glass user display
    (image/png)
  • Jim Foley Jim Foley
    (image/jpeg)
  • Jay Zuerndorfer Jay Zuerndorfer
    (image/jpeg)

A team of Georgia Institute of Technology researchers has created speech-to-text software for Google Glass that helps hard-of-hearing users with everyday conversations. A hard-of-hearing person wears Glass while a second person speaks directly into a smartphone. The speech is converted to text, sent to Glass and displayed on its heads-up display.

A group in Georgia Tech’s College of Computing created the Glassware when one of its own said he was having trouble hearing and thought Glass could help him.

“This system allows wearers like me to focus on the speaker’s lips and facial gestures," said School of Interactive Computing Professor Jim Foley. “If hard-of-hearing people understand the speech, the conversation can continue immediately without waiting for the caption. However, if I miss a word, I can glance at the transcription, get the word or two I need and get back into the conversation.”

Foley’s colleague, Professor Thad Starner, leads the Contextual Computing Group working on the project. He says using a smartphone with Glass has several benefits as compared to using Glass by itself.

“Glass has its own microphone, but it’s designed for the wearer,” said Starner, who is also a technical lead for Glass. “The mobile phone puts a microphone directly next to the speaker’s mouth, reducing background noise and helping to eliminate errors.”

Starner says the phone-to-Glass system is helpful because speakers are more likely to construct their sentences more clearly, avoiding “uhs” and “ums.” However, if captioning errors are sent to Glass, the smartphone software also allows the speaker to edit the mistakes, sending the changes to the person wearing the device.

"The smartphone uses the Android transcription API to convert the audio to text," said Jay Zuerndorfer, the Georgia Tech Computer Science graduate student who developed the software. "The text is then streamed to Glass in real time."

Captioning on Glass is currently available to install from MyGlass. More information and support can be found at the project website here.

Foley and the students are working with the Association of Late Deafened Adults in Atlanta to improve the program.

The same group is also working on a second project, Translation on Glass, that uses the same smartphone-Glass Bluetooth connection process to capture sentences spoken into the smartphone, translate them to another language and send them to Glass. The only difference is that the person wearing Glass, after reading the translation, can reply. The response is translated back to the original language on the smartphone. Two-way translations are currently available for English, Spanish, French, Russian, Korean and Japanese.

"For both uses, the person wearing Glass has to hand their smartphone to someone else to begin a conversation,” said Starner. “It’s not ideal for strangers, but we designed the program to be used among friends, trusted acquaintances or while making purchases.”

The group is working to get Translation on Glass ready for the public.

Related Links

Additional Information

Groups

College of Computing

Categories
No categories were selected.
Related Core Research Areas
People and Technology
Newsroom Topics
Science and Technology
Keywords
captioning on glass, College of Computing, Google Glass, Press Release
Status
  • Created By: Jason Maderer
  • Workflow Status: Archived
  • Created On: Oct 2, 2014 - 8:52am
  • Last Updated: Oct 7, 2016 - 11:07pm