Patent · US Active

Training speech recognition systems using word sequences

US10388272B1 · kind B1 · utility

271Cited by

114References

20Claims

0Family size

Assignee

Sorenson IP Holdings, LLC · US

Inventors

David Thomson · Bountiful, US
Jadie Adams · Salt Lake City, US
Kenneth Boehme · South Jordan, US

Key dates

Filing date	Dec 4, 2018
Grant date	Aug 20, 2019
Priority date	—
Expiry date	Dec 4, 2038

Classification

Technology area (CPC G)Physics
CPC primaryG10L15/30
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A method may include obtaining first audio data of a communication session between a first device and a second device, obtaining a text string that is a transcription of the first audio data, and selecting a contiguous sequence of words from the text string as a first word sequence. The method may further include comparing the first word sequence to multiple word sequences obtained before the communication session and in response to the first word sequence corresponding to one of the multiple word sequences, incrementing a counter of multiple counters associated with the one of the multiple word sequences. The method may also include deleting the text string and the first word sequence and training and after deleting the text string and the first word sequence, training a language model of an automatic transcription system using the multiple word sequences and the multiple counters.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.