Semiotic class normalization
US9852123B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | May 26, 2016 |
| Grant date | Dec 26, 2017 |
| Priority date | — |
| Expiry date | May 26, 2036 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L13/00
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A language processing system for text normalization of an input string of a semiotic class. In an aspect, a method includes receiving an input string; accessing, for a semiotic class of non-standard words, a language universal covering grammar for a plurality of languages that generates, for each language of the plurality of languages, one or more sequences of word-level components for each instance of the semiotic class in the language; for each of the plurality of languages, accessing a lexical map specific to the language and that maps each sequence of word-level components for each instance of the semiotic class in the language to verbalizations in the language; generating, from the language universal covering grammar and the lexical maps, a lattice of possible verbalizations of the input string; and selecting one of the possible verbalizations as a selected verbalization for the input string.
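The steps named in the abstract (covering grammar, per-language lexical map, lattice, selection) can be illustrated with a toy Python sketch. This is not the patented implementation: the semiotic class (cardinal numbers), the `COVERING_GRAMMAR` and `LEXICAL_MAPS` tables, the function names, and the "fewest components wins" selection rule are all assumptions made for illustration. The abstract does not say how the selection is made, and the sketch ignores orthographic details such as hyphenation.

```python
from typing import Dict, List, Tuple

Components = Tuple[str, ...]

# Hypothetical stand-in for a language-universal covering grammar: for one
# instance of the semiotic class (the cardinal "97") it lists every sequence
# of word-level components that some language might use.
COVERING_GRAMMAR: Dict[str, List[Components]] = {
    "97": [("90", "7"), ("80", "17"), ("4", "20", "17")],
}

# Hypothetical language-specific lexical maps from components to words.
LEXICAL_MAPS: Dict[str, Dict[str, str]] = {
    "en": {"90": "ninety", "7": "seven", "4": "four",
           "20": "twenty", "17": "seventeen"},
    "fr": {"4": "quatre", "20": "vingt", "17": "dix-sept"},
}


def build_lattice(input_string: str, language: str) -> List[Tuple[Components, str]]:
    """Return every component sequence the language's lexical map can verbalize.

    A flat list of candidates stands in for the lattice of possible
    verbalizations described in the abstract.
    """
    lexicon = LEXICAL_MAPS[language]
    lattice = []
    for components in COVERING_GRAMMAR.get(input_string, []):
        if all(c in lexicon for c in components):
            words = " ".join(lexicon[c] for c in components)
            lattice.append((components, words))
    return lattice


def select_verbalization(input_string: str, language: str) -> str:
    """Pick one candidate; the shortest-sequence rule is an assumption."""
    lattice = build_lattice(input_string, language)
    if not lattice:
        raise ValueError(f"no verbalization for {input_string!r} in {language!r}")
    return min(lattice, key=lambda entry: len(entry[0]))[1]


if __name__ == "__main__":
    print(build_lattice("97", "en"))
    print(select_verbalization("97", "en"))
    print(select_verbalization("97", "fr"))
```

In this toy setup the English lattice holds two candidates ("ninety seven" and the over-generated "four twenty seventeen"), so the selection step has real work to do, while the French lexical map admits only the 4 × 20 + 17 factorization.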
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.