Patent · US Active

Data-driven and rule-based speech recognition output enhancement

US11257484B2 · kind B2 · utility

0Cited by

4References

19Claims

0Family size

Assignee

MICROSOFT TECHNOLOGY LICENSING, LLC · US

Inventors

Dimitrios Dimitriadis · Rutherford, US
Xie Chen · Lianyuan, CN
Nanshan Zeng · Bellevue, US
Yu Shi · Beijing, CN
Liyang Lu · Beijing, CN

Key dates

Filing date	Aug 21, 2019
Grant date	Feb 22, 2022
Priority date	—
Expiry date	Apr 23, 2040

Classification

Technology area (CPC G)Physics
CPC primaryG10L2015/223
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

According to some embodiments, a multi-layer speech recognition transcript post processing system may include a data-driven, statistical layer associated with a trained automatic speech recognition model that selects an initial transcript. A rule-based layer may receive the initial transcript from the data-driven, statistical layer and execute at least one pre-determined rule to generate a first modified transcript. A machine learning approach layer may receive the first modified transcript from the rule-based layer and perform a neural model inference to create a second modified transcript. A human editor layer may receive the second modified transcript from the machine learning approach layer along with an adjustment from at least one human editor. The adjustment may create, in some embodiments, a final transcript that may be used to fine-tune the data-driven, statistical layer.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.