Patent · US Active

Speech recognition assisted evaluation on text-to-speech pronunciation issue detection

US9293129B2 · kind B2 · utility

Cited by	3
References	8
Claims	20
Family size	0

Assignee

MICROSOFT TECHNOLOGY LICENSING, LLC · US

Inventors

Pei Zhao · Beijing, CN
Bo Yan · Suzhou, CN
Lei He · Moraga, US
Zhe Geng · Beijing, CN
Yiu-Ming Leung · Kirkland, US

Key dates

Filing date	Mar 5, 2013
Grant date	Mar 22, 2016
Priority date	—
Expiry date	Aug 3, 2033

Classification

Technology area (CPC G)	Physics
CPC primary	G10L13/08
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

Pronunciation issues for synthesized speech are automatically detected using human recordings as a reference within a Speech Recognition Assisted Evaluation (SRAE) framework including a Text-To-Speech (TTS) flow and a Speech Recognition (SR) flow. A pronunciation issue detector evaluates results obtained at multiple levels of the TTS flow and the SR flow (e.g., phone, word, and signal level) by using the corresponding human recordings as the reference for the synthesized speech, and outputs possible pronunciation issues. A signal-level comparison may be used to determine similarities and differences between the recordings and the TTS output. A model level checker may provide results to the pronunciation issue detector to check the similarity of the TTS and SR phone sets, including their mapping relations. Results from a comparison of the SR output and the recordings may also be evaluated by the pronunciation issue detector. The pronunciation issue detector outputs a list of potential pronunciation issue candidates.
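
The phone-level comparison described in the abstract can be illustrated with a minimal sketch. It is not the patented method: it assumes the recognized phone sequences for the human recording and for the synthesized speech are already available per word, and it swaps in a generic sequence alignment (Python's `difflib.SequenceMatcher`) for whatever comparison the detector actually performs. The function names and the `ref_phones` / `tts_phones` fields are hypothetical.

```python
from difflib import SequenceMatcher


def phone_mismatches(reference_phones, synthesized_phones):
    """Return (operation, reference_span, synthesized_span) for every
    stretch where the two phone sequences differ.

    Assumption: both arguments are lists of phone labels that already
    share one phone set (any TTS-to-SR phone mapping was applied earlier).
    """
    matcher = SequenceMatcher(
        a=reference_phones, b=synthesized_phones, autojunk=False
    )
    return [
        (tag, reference_phones[i1:i2], synthesized_phones[j1:j2])
        for tag, i1, i2, j1, j2 in matcher.get_opcodes()
        if tag != "equal"
    ]


def detect_pronunciation_issues(words):
    """Build a list of candidate pronunciation issues.

    `words` is a hypothetical input format: a list of dicts with keys
    'word', 'ref_phones' (from the human recording) and 'tts_phones'
    (from the synthesized speech). A word becomes a candidate when its
    two phone sequences are not identical.
    """
    candidates = []
    for entry in words:
        diffs = phone_mismatches(entry["ref_phones"], entry["tts_phones"])
        if diffs:
            candidates.append({"word": entry["word"], "diffs": diffs})
    return candidates
```

Called with one word whose reference phones are `r eh d` and whose synthesized phones are `r iy d`, the sketch reports that word as a candidate with a single `replace` difference. A word whose two sequences match is left out of the list.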

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.