Patent · US Expired

Vocabulary and/or language model training

US6430551B1 · kind B1 · utility

214Cited by
11References
14Claims
0Family size

Assignee

Inventors

Key dates

Filing dateOct 6, 1998
Grant dateAug 6, 2002
Priority date
Expiry dateOct 6, 2018

Classification

  • Technology area (CPC Y)Emerging Cross-Sectional Technologies
  • CPC primaryY10S707/99936
  • WIPO fieldControl
  • WIPO sectorInstruments

Abstract

A system includes means for creating a vocabulary and/or statistical language model from a textual training corpus. The vocabulary and/or language model are used in a pattern recognition system, such as a speech recognition system or a handwriting recognition system, for recognizing a time-sequential input pattern. The system includes means for determining at least one context identifier and means for deriving at least one search criterion, such as a keyword, from the context identifier. The system further includes means for selecting documents from a set of documents based on the search criterion. Advantageously, an Internet search engine is used for selecting the documents. Means are used for composing the training corpus from the selected documents.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.