Patent · US Active

Data mining platform for knowledge discovery from heterogeneous data types and/or heterogeneous data sources

US7921068B2 · kind B2 · utility

68Cited by
17References
28Claims
0Family size

Assignee

Inventors

Key dates

Filing dateOct 30, 2007
Grant dateApr 5, 2011
Priority date
Expiry dateAug 9, 2029

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG16B50/00
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

The data mining platform comprises a plurality of system modules, each formed from a plurality of components. Each module has an input data component, a data analysis engine for processing the input data, an output data component for outputting the results of the data analysis, and a web server to access and monitor the other modules within the unit and to provide communication to other units. Each module processes a different type of data, for example, a first module processes microarray (gene expression) data while a second module processes biomedical literature on the Internet for information supporting relationships between genes and diseases and gene functionality. In the preferred embodiment, the data analysis engine is a kernel-based learning machine, and in particular, one or more support vector machines (SVMs). The data analysis engine includes a pre-processing function for feature selection, for reducing the amount of data to be processed by selecting the optimum number of attributes, or “features”, relevant to the information to be discovered.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.