Input data structure for data mining
US8250105B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Feb 6, 2007 |
| Grant date | Aug 21, 2012 |
| Priority date | — |
| Expiry date | May 21, 2029 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F2216/03
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Methods and apparatus, including computer program products, implementing and using techniques for compressing data included in several transactions. Each transaction has at least one item. A unique identifier is assigned to each different item and, if taxonomy is defined, to each different taxonomy parent. Sets of transactions are formed from the several transactions. The sets of transactions are stored using a computer data structure including: a list of identifiers of different items in the set of transactions, information indicating number of identifiers in the list, and bit field information indicating presence of the different items in the set of transactions, said bit field information being organized in accordance with the list for facilitating evaluation of patterns with respect to the set of transactions. A data structure for compressing data included in a set of transactions is also provided.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.