Monitoring information processing systems utilizing co-clustering of strings in different sets of data records
US11625438B2 · kind B2 · utility
Assignee
Inventor
Key dates
| Filing date | Mar 23, 2020 |
| Grant date | Apr 11, 2023 |
| Priority date | — |
| Expiry date | Mar 31, 2041 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06Q30/0205
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
An apparatus includes a processing device configured to obtain first and second sets of data records, each data record comprising a string associated with an attribute. The processing device is also configured to generate a similarity matrix, wherein entries of the similarity matrix comprise values characterizing similarity between respective pairs of the strings comprising a first string from a data record in the first set and a second string from a data record in the second set. The processing device is further configured to construct a graph network based on the similarity matrix comprising edges connecting pairs of the data records based on values of entries in the similarity matrix, perform a clustering operation on the graph network to identify clusters, and to initiate remedial action responsive to identifying a given cluster comprising at least one data record from each of the first and second sets of data records.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.