System and method for web mining and clustering
US8521773B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | May 25, 2010 |
| Grant date | Aug 27, 2013 |
| Priority date | — |
| Expiry date | Dec 31, 2031 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F18/2323
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method and system for web mining and clustering is described. The method includes receiving and dividing input data into a plurality of primitive datasets. Additionally, one or more combinations of the plurality of primitive datasets may be created. Further, a model for each primitive dataset in the plurality of primitive datasets and each of the one or more combinations of the plurality of primitive datasets may be generated. Subsequently, a cost associated with a model corresponding to each primitive dataset in the plurality of primitive datasets, and each of the one or more combinations of the plurality of primitive datasets may be computed. Further, a sum of the costs associated with the models corresponding to each primitive dataset in the plurality of primitive datasets may be compared with the cost associated with each model corresponding to each of the one or more combinations of the plurality of primitive datasets. Finally, the plurality of primitive datasets may be partitioned into one or more clusters based on the comparison of the costs such that each primitive dataset is a part of a cluster in the one or more clusters or a stand-alone primitive dataset.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.