Fri, Apr 19, 2024 | Shawwal 10, 1445

00:00:00

KT APPDOWNLOAD

Your phone will finally understand what you say

Top Stories

Embracing the bright side: A guide on how to cultivate optimism

The power of solitude: Why you should go on a solo trip

How to get your kids to read: Essential tips for parents

Your phone will finally understand what you say

Microsoft develops first human-like speech recognition system.

By IANS

Published: Wed 19 Oct 2016, 5:06 PM

Last updated: Wed 19 Oct 2016, 7:55 PM

In a major breakthrough in the field of speech recognition, Microsoft researchers have created a technology that accurately recognises the words in a conversation like humans do.

The team from Microsoft Artificial Intelligence and Research reported a speech recognition system that makes the same or fewer errors than professional transcriptionists.

The researchers reported a word error rate (WER) of 5.9 per cent, down from the 6.3 per cent WER the team reported just last month.

The 5.9 per cent error rate is about equal to that of people who were asked to transcribe the same conversation, and it's the lowest ever recorded against the industry standard "Switchboard" speech recognition task.

"We've reached human parity. This is an historic achievement," said Xuedong Huang, the company's chief speech scientist in a Microsoft blog post.

The milestone means that, for the first time, a computer can recognise the words in a conversation as well as a person would.

In doing so, the team has beat a goal they set less than a year ago - and greatly exceeded everyone else's expectations as well.

"Even five years ago, I wouldn't have thought we could have achieved this. I just wouldn't have thought it would be possible," said Harry Shum, executive vice president who heads the Microsoft Artificial Intelligence and Research group.

The research milestone comes after decades of research in speech recognition, beginning in the early 1970s with DARPA, the US agency tasked with making technology breakthroughs in the interest of national security.

"This accomplishment is the culmination of over 20 years of effort," said Geoffrey Zweig, who manages the Speech & Dialog research group.

The milestone will have broad implications for consumer and business products that can be significantly augmented by speech recognition. That includes consumer entertainment devices like the Xbox, accessibility tools such as instant speech-to-text transcription and personal digital assistants such as Cortana.

"This will make Cortana (Microsoft personal assistant) more powerful, making a truly intelligent assistant possible," Shum said.

To reach the human parity milestone, the team used Microsoft's Computational Network Toolkit (CNTK), a home-grown system for deep learning that the research team has made available on GitHub via an open source license.

CNTK's ability to quickly process deep learning algorithms across multiple computers running a specialised chip called a graphics processing unit vastly improved the speed at which the team was able to do research and, ultimately, reach human parity.

Moving forward, the researchers are working on ways to make sure that speech recognition works well in more real-life settings.

That includes places where there is a lot of background noise, such as at a party or while driving on the highway.

In the longer term, researchers will focus on ways to teach computers not just to transcribe the acoustic signals that come out of people's mouths, but instead to understand the words they are saying.

"The next frontier is to move from recognition to understanding," Zweig said.

Mobiles

More news from

women and money

'I started saving at 6': Dubai expat on learning to value money as a child

Natasha Abbas is a British civil engineer who co-founded North 51, a project management consultancy in Dubai

women and money

parenting

Why using ChatGPT to write the college admission essay is a bad idea

People who are assessing your qualifications can quickly tell the difference between an authentic life narrative and a third-party account

parenting

lifestyle

UAE-based public speaker on talking your way to the top

Arab-Canadian public speaking coach and author of The Million Dollar Speaker Maher Elusini on how to make your speech command value for time and money

lifestyle

arts

'It is important to reveal the full value and richness of Arab culture to the whole world': Famous music conductor Teodor Currentzis

The legendary Greek-Russian conductor Teodor Currentzis, who is all set to perform at the Dubai Opera next week, on music transcending cultural and language barriers

arts

lifestyle

Can children become great entrepreneurs?

Not every kid is meant to go to university or college. So, it makes sense to teach them entrepreneurial skills early

lifestyle

emergencies

‘1 room for Dh8,000 a night’: Some UAE hotels hike prices as floods leave residents, tourists stranded

There are also increasing accounts, on social media and online forums, of tourists and residents across the city having to pay inflated prices for taxis

emergencies