Method and apparatus for automatically discovering features in free form heterogeneous data
US8108413B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Feb 15, 2007 |
| Grant date | Jan 31, 2012 |
| Priority date | — |
| Expiry date | Oct 23, 2027 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/35
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Techniques are provided for automatically discovering one or more features in free form heterogeneous data. In one aspect of the invention, the techniques include obtaining free form heterogeneous data, wherein the data comprises one or more data items, applying a label to each data item, using the labeled data to build a language model, wherein a word distribution associated with each label can be derived from the model, and using the word distribution associated with each label to discover one or more features in the data, wherein discovering one or more features in the data facilitates one or more operations that use at least a portion of the labeled data.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.