Patent · US Active

Training a probabilistic spelling checker from structured data

US9558179B1 · kind B1 · utility

2Cited by
21References
17Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 5, 2013
Grant dateJan 31, 2017
Priority date
Expiry dateFeb 6, 2035

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N20/00
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A spelling system derives a language model for a particular domain of structured data, the language model enabling determinations of alternative spellings of queries or other strings of text from that domain. More specifically, the spelling system calculates (a) probabilities that the various query entity types—such as STREET, CITY, or STATE for queries in the geographical domain—are arranged in each of the various possible orders, and (b) probabilities that an arbitrary query references given particular ones of the entities, such as the street “El Camino Real.” Based on the calculated probabilities, the spelling system generates a language model that has associated scores (e.g., probabilities) for each of a set of probable entity name orderings, where the total number of entity name orderings is substantially less than the number of all possible orderings. The language model can be applied to determine probabilities of arbitrary queries, and thus to suggest alternative queries more likely to represent what a user intended.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.