Patent · US Active

Method for systematic mass normalization of titles

US9342592B2 · kind B2 · utility

10Cited by
6References
19Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJul 29, 2013
Grant dateMay 17, 2016
Priority date
Expiry dateFeb 24, 2034

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06Q50/01
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method for normalizing raw titles to canonical titles is described. The method includes designating a set of canonical titles, generating a set of n-grams for each canonical title, assigning a set of attributes to each n-gram, assigning a set of labels to each of the attributes, and storing the labeled canonical title and labeled n-grams in a database. In some examples, a new title may be mapped to an existing canonical title in the database by generating a set of n-grams for the new title, looking up the n-grams in the database of canonical titles, retrieving the set of labels assigned to n-grams in the database that match n-grams from the new title, and assigning those labels to the corresponding attributes of the new title. The new title may then be mapped to a canonical title on the basis of similarly labeled attributes.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.