Patent · US Expired

Segment-based similarity method for low complexity speech recognizer

US6230129A · kind A · utility

29Cited by

8References

11Claims

0Family size

Assignee

Sumitomo Electric Industries, Ltd. · JP

Inventors

Philippe Morin · San Jose, US
Ted Applebaum · Lompoc, US

Key dates

Filing date	Nov 25, 1998
Grant date	May 8, 2001
Priority date	—
Expiry date	Nov 25, 2018

Classification

Technology area (CPC G)Physics
CPC primaryG10L2015/025
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A digital word prototype is constructed using one or more speech utterance for a given spoken word or phrase. First, a phone model is used to derive phoneme similarity time series for each of a plurality of phonemes which represent the degree of similarity between the speech utterance and a set of standard phonemes contained in the phone model. Next, the phoneme similarity data is normalized in relation to a non-speech part of the input speech signal. The normalized phoneme similarity data is divided into segments, such that the sum of all normalized phoneme similarity values in a segment are equal for each segment. Next, a word model is constructed from the phoneme similarity data. To do so, within each segment, a summation value is determined by summing over speech frames each of the normalized phoneme similarity values associated with a particular phoneme. In this way, the word model is represented by a vector of summation values that compactly correlate to the normalized phoneme similarity data. Lastly, the results of the individually processed utterances for a given spoken word (i.e., the individual word models) are combined to produce a digital word prototype that electronica…

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.