Patent · US Active

Method and apparatus for automatically discovering features in free form heterogeneous data

US8108413B2 · kind B2 · utility

Cited by: 11
References: 9
Claims: 35
Family size: 0

Assignee

Inventors

Key dates

Filing date: Feb 15, 2007
Grant date: Jan 31, 2012
Priority date
Expiry date: Oct 23, 2027

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F16/35
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Techniques are provided for automatically discovering one or more features in free form heterogeneous data. In one aspect of the invention, the techniques include obtaining free form heterogeneous data, wherein the data comprises one or more data items, applying a label to each data item, using the labeled data to build a language model, wherein a word distribution associated with each label can be derived from the model, and using the word distribution associated with each label to discover one or more features in the data, wherein discovering one or more features in the data facilitates one or more operations that use at least a portion of the labeled data.
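The pipeline in the abstract (labeled items, a language model, a word distribution per label, then feature discovery) can be illustrated with a small sketch. This is not the patented method. It assumes a unigram model with add-one smoothing and ranks words by the log-ratio of a label's word probability to the average over the other labels. The abstract specifies neither choice. The names `word_distributions` and `discover_features`, the parameters `alpha` and `top_k`, and the sample data are all hypothetical.

```python
import math
from collections import Counter, defaultdict


def word_distributions(labeled_items, alpha=1.0):
    """Build one smoothed unigram word distribution per label.

    Assumption: a unigram model with additive smoothing stands in for
    the "language model" named in the abstract.
    """
    counts = defaultdict(Counter)
    vocab = set()
    for label, text in labeled_items:
        tokens = text.lower().split()
        counts[label].update(tokens)
        vocab.update(tokens)

    dists = {}
    for label, c in counts.items():
        total = sum(c.values()) + alpha * len(vocab)
        dists[label] = {w: (c[w] + alpha) / total for w in vocab}
    return dists


def discover_features(dists, top_k=3):
    """Pick, for each label, the words most over-represented in it.

    Assumption: the score is log(p_label / mean p_other_labels); ties
    are broken alphabetically so the output is deterministic.
    """
    if len(dists) < 2:
        raise ValueError("need at least two labels to compare distributions")

    features = {}
    for label, dist in dists.items():
        others = [d for other, d in dists.items() if other != label]
        scores = {}
        for word, p in dist.items():
            background = sum(d[word] for d in others) / len(others)
            scores[word] = math.log(p / background)
        ranked = sorted(scores, key=lambda w: (-scores[w], w))
        features[label] = ranked[:top_k]
    return features


# Hypothetical free-form data items, each already carrying a label.
items = [
    ("network", "router dropped packets after firmware update"),
    ("network", "packets lost on router uplink"),
    ("storage", "disk array reported failed disk"),
    ("storage", "disk rebuild slow on array"),
]

dists = word_distributions(items)
features = discover_features(dists, top_k=2)
print(features)
```

In this toy data, the words ranked highest for a label are those frequent under that label and rare elsewhere, which is one plausible reading of "using the word distribution associated with each label to discover one or more features".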

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.