Identifying high value segments in categorical data
US10929438B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jun 14, 2018 |
| Grant date | Feb 23, 2021 |
| Priority date | — |
| Expiry date | Nov 24, 2038 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F2218/08
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Systems and techniques for identifying segments in categorical data include receiving multiple transaction ID (TID) lists with univariate values that satisfy a thresholding metric with each TID list representing an occurrence of a single attribute in a set of transactions. The TID lists are stored with the univariate values that satisfy the thresholding metric in a data structure. In a loop, candidate itemsets to form from combinations of TID lists are determined using only the combinations of TID lists that satisfy categorical constraints. In the loop, for the candidate itemsets that satisfy categorical constraints, both the thresholding metric and a similarity metric are applied to the candidate itemsets. Final itemsets are formed from only the candidate itemsets that satisfy both the thresholding metric and the similarity metric.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.