Discovering topical structures of databases
US7818323B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Feb 21, 2008 |
| Grant date | Oct 19, 2010 |
| Priority date | — |
| Expiry date | Apr 15, 2029 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/26
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A system and method for automatically discovering topical structures of databases includes a model builder adapted to compute various kinds of representations for the database based on schema information and data values of the database. A plurality of base clusterers is also provided, one for each representation. Each base clusterer is adapted to perform, for the representation, preliminary topical clustering of tables within the database to produce a plurality of clusters, such that each of the clusters corresponds to a set of tables on the same topic. A meta-clusterer aggregates results of the clusterers into a final clustering, such that the final clustering comprises a plurality of the clusters. A representative finder identifies representative tables from the clusters in the final clustering. The representative finder identifies at least one representative table for each of the clusters in the final clustering. The representative finder also arranges the representative tables by topic as a topical directory and outputs the topical directory.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.