Navigation Links
Research aims to improve speech recognition software
Date:8/11/2010

BINGHAMTON, NY Anyone who has used an automated airline reservation system has experienced the promise and the frustration inherent in today's automatic speech recognition technology. When it works, the computer "understands" that you want to book a flight to Austin rather than Boston, for example. Research conducted by Binghamton University's Stephen Zahorian aims to improve the accuracy of such programs.

Zahorian, a professor of electrical and computer engineering, recently received a grant of nearly half a million dollars from the Air Force Office of Scientific Research. The funds will support the two-year development of a multi-language, multi-speaker audio database that will be available for spoken-language processing research. Zahorian and his team plan to gather and annotate recordings of several hundred speakers each in English, Spanish and Mandarin Chinese.

"The challenge," he said, "is to get speech recognition working better in real-life situations."

That's why the samples in the new database will come from publicly available sources such as YouTube.

Zahorian's team will annotate each sample, creating a more detailed version of closed captioning, including time stamps and descriptions of background sounds. Once the human listener has finished with the transcription, automatic speech recognition algorithms will be used to align the recording with the captions. Next, software will be developed to verify and correct errors in the time alignment.

"Speech-recognition algorithms begin by mimicking what your ear does," Zahorian said. "But we want the algorithms to extract just the most useful characteristics of the speech, not all of the possible data. That's because more detail can actually hurt performance, past a certain point."

The field of automatic speech recognition has a long history, dating back to projects at Bell Labs before the computer age. These days, much of the technology relies on algorithms that convert sounds into numbers.

In Zahorian's research, he represents speech as a picture in a time-frequency plane. He then uses image-processing techniques to extract features of the speech, which has led him to focus more on time than on frequency.

When researchers are ready to test an algorithm, they rely on a common set of databases held by the Linguistic Data Consortium. Zahorian's unusual image-based approach has given his team some of the best results ever reported for automatic speech recognition experiments using two of the consortium's best-known databases.

The database Zahorian develops with the new funding will join these others, offering researchers around the world a new way to test their theories with samples of real-life speech.

Some mistakes are inevitable, given the variations in pitch, tone and pronunciation from person to person.

Still, the field does have a clear standard, Zahorian said: "In order to be useful, a system should have a word-error rate of no more than 10 percent."

Zahorian is interested in language modeling if someone has said these three words, what's the fourth word likely to be? as well as conversation modeling that is, predicting when the speakers will switch. He's also intrigued by the potential to make advances by using established methods from other fields, including the neural networks developed by researchers working in artificial intelligence.

He sees a future in which automatic speech recognition will enable technology to extract the meaning of speech as well as the words.

"The dream," Zahorian said, "is that someday travelers will be able to speak into a little gadget that will translate what they've said into another language instantly and accurately."


'/>"/>

Contact: Gail Glover
gglover@binghamton.edu
607-777-2174
Binghamton University
Source:Eurekalert  

Related medicine news :

1. Embedded Mobile & M2M Device revenues to Rise to Almost $19 Billion Globally by 2014, Says Juniper Research
2. 2010 HSR Impact Award recognizes surgical safety research
3. MSU launches first anti-counterfeiting research program
4. Researchers map all the fragile sites of the yeast Saccharomyces cerevisiaes genome
5. UH Case Medical Center researchers publish promising findings for advanced cervical cancer
6. Researchers discover new way to kill pediatric brain tumors
7. Family Research Council: Planned Parenthood Report Oversexualizes Ten-Year-Olds, Undermines Parental Authority
8. Michael J. Fox Foundation Awards $1 Million to Drive Critical New Research Tools and Technologies in Parkinsons Drug Development
9. Luth Researchs IndicatorEDG(TM) Study Finds Americans Hopes of Achieving Their Dreams Are Fading
10. International Diabetes Federation awards $2 million to 9 global diabetes research projects
11. Gladstones Robert Mahley to receive Research!America advocacy award
Post Your Comments:
*Name:
*Comment:
*Email:
Related Image:
Research aims to improve speech recognition software
(Date:6/25/2016)... ... ... of Bruton Memorial Library on June 21 due to a possible lice infestation, as reported ... head lice: the parasite’s ability to live away from a human host, and to infest ... in the event that lice have simply gotten out of control. , As lice are ...
(Date:6/25/2016)... ... ... On Friday, June 10, Van Mitchell, Secretary of the Maryland Department of Health ... of their exemplary accomplishments in worksite health promotion. , The Wellness at Work Awards ... at the BWI Marriott in Linthicum Heights. iHire was one of 42 businesses to ...
(Date:6/24/2016)... ... ... crisis. Her son James, eight, was out of control. Prone to extreme mood shifts and ... him, he couldn’t control his emotions,” remembers Marcy. “If there was a knife on ... say he was going to kill them. If we were driving on the freeway, ...
(Date:6/24/2016)... (PRWEB) , ... June 24, 2016 , ... Topical BioMedics, Inc, makers of Topricin and ... that call for a minimum wage raise to $12 an hour by 2020 and then ... will restore the lost value of the minimum wage, assure the wage floor does not ...
(Date:6/24/2016)... Frederick, Maryland (PRWEB) , ... June 24, 2016 ... ... Mid-Atlantic Angels is actively feeding the Frederick area economy by obtaining investment capital ... support over the past 2½ years that have already resulted in more than ...
Breaking Medicine News(10 mins):
(Date:6/23/2016)... 2016 Roche (SIX: RO, ROG; OTCQX: RHHBY) ... Elecsys BRAHMS PCT (procalcitonin) assay as a dedicated testing ... With this clearance, Roche is the first IVD company ... for sepsis risk assessment and management. PCT ... PCT levels in blood can aid clinicians in assessing ...
(Date:6/23/2016)... June 23, 2016 Bracket , a leading ... next generation clinical outcomes platform, Bracket eCOA (SM) 6.0, ... June 26 – 30, 2016 in Philadelphia ... Clinical Outcome Assessment product of its kind to fully integrate ... Bracket eCOA 6.0 is a flexible platform for electronic ...
(Date:6/23/2016)... Revolutionary technology includes multi-speaker listening ... industry leaders in advanced audiology and hearing aid technology, ... ™, the world,s first internet connected hearing aid that ...      (Photo: http://photos.prnewswire.com/prnh/20160622/382240 ) , ... ,world firsts,: , TwinLink™ - the first ...
Breaking Medicine Technology: