Patent · US Active

Method and apparatus for organizing data sources

US7529740B2 · kind B2 · utility

8Cited by
3References
1Claims
0Family size

Assignee

Inventors

Key dates

Filing dateAug 14, 2006
Grant dateMay 5, 2009
Priority date
Expiry dateJan 28, 2027

Classification

  • Technology area (CPC Y)Emerging Cross-Sectional Technologies
  • CPC primaryY10S707/99953
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method for organizing deep Web services is provided. In one aspect, the method obtains a collection of sources and their associated attributes and/or input modes, for instance, using a crawling algorithm. The method uses this information to organize the sources into communities. A mining algorithm such as the hyperclique mining algorithm is used to obtain cliques of highly correlated attributes. A clustering algorithm such as the hierarchical agglomerative clustering algorithm is used to further cluster the cliques of attributes into larger cliques, which in the present disclosure is referred to as signatures. The sources that are associated with each signature form a community and a graph representation of the communities is constructed, where the vertices are communities and the edges are the shared attributes.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.