Vocabulary and/or language model training
US6430551B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Oct 6, 1998 |
| Grant date | Aug 6, 2002 |
| Priority date | — |
| Expiry date | Oct 6, 2018 |
Classification
- Technology area (CPC Y)Emerging Cross-Sectional Technologies
- CPC primaryY10S707/99936
- WIPO fieldControl
- WIPO sectorInstruments
Abstract
A system includes means for creating a vocabulary and/or statistical language model from a textual training corpus. The vocabulary and/or language model are used in a pattern recognition system, such as a speech recognition system or a handwriting recognition system, for recognizing a time-sequential input pattern. The system includes means for determining at least one context identifier and means for deriving at least one search criterion, such as a keyword, from the context identifier. The system further includes means for selecting documents from a set of documents based on the search criterion. Advantageously, an Internet search engine is used for selecting the documents. Means are used for composing the training corpus from the selected documents.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.