Patent · US Active

Methods and systems for generating a reference data structure for anonymization of text data

US11301639B2 · kind B2 · utility

Cited by	0
References	1
Claims	20
Family size	0

Assignee

Huawei Technologies Co., Ltd. · CN

Inventors

Roozbeh Jalali · Toronto, CA
Haolin Guo · Markham, CA
Wen Qing Chen · Markham, CA
Michael Chih Hung LI · Markham, CA
Zanqing ZHANG · Markham, CA

Key dates

Filing date	Jun 26, 2020
Grant date	Apr 12, 2022
Priority date	—
Expiry date	Jun 26, 2040

Classification

Technology area (CPC G)	Physics
CPC primary	G06N20/00
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

A method and a system of using machine learning to automatically generate a reference data structure for a K-anonymity model. A vector space is generated from reference text data, where the vector space is defined by numerical vectors representative of semantic meanings of the reference text words. Input text words are converted into numerical vectors using the vector space. Word clusters are formed according to semantic similarity between the input text words, where the semantic similarity between pairs of input text words is represented by metric values determined from pairs of numerical vectors. The word clusters define nodes of the reference data structure. A text label is applied to each node of the reference data structure, where the text label is representative of the semantic meaning shared by elements of the word cluster.
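
The pipeline in the abstract (vector space, word vectors, similarity-based clusters, labelled nodes) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration, not the patented method: the hand-written `TOY_VECTORS` stand in for a vector space learned from reference text, cosine distance is used as the similarity metric, a simple centroid-linkage merge with a hypothetical `threshold` forms the clusters, and each node is labelled with the member word closest to the cluster centroid. The abstract does not name an embedding model, a clustering algorithm, or a labelling rule.

```python
import math
from itertools import combinations

# Hypothetical toy embeddings; a real system would learn these from reference text.
TOY_VECTORS = {
    "nurse":    (0.90, 0.10, 0.00),
    "doctor":   (0.80, 0.20, 0.00),
    "surgeon":  (0.85, 0.15, 0.05),
    "teacher":  (0.10, 0.90, 0.00),
    "lecturer": (0.20, 0.80, 0.10),
    "plumber":  (0.00, 0.10, 0.90),
}


def cosine_distance(u, v):
    """Metric value for a pair of vectors: 0 means same direction."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def centroid(words, vectors):
    """Component-wise mean of the vectors of the given words."""
    dims = len(vectors[words[0]])
    return tuple(sum(vectors[w][i] for w in words) / len(words) for i in range(dims))


def cluster_words(words, vectors, threshold):
    """Centroid-linkage agglomerative clustering (an assumed choice).

    Repeatedly merges the two clusters whose centroids are closest,
    stopping once the closest pair is farther apart than `threshold`.
    """
    clusters = [[w] for w in words]
    while len(clusters) > 1:
        best = None
        for i, j in combinations(range(len(clusters)), 2):
            d = cosine_distance(centroid(clusters[i], vectors),
                                centroid(clusters[j], vectors))
            if best is None or d < best[0]:
                best = (d, i, j)
        d, i, j = best
        if d > threshold:
            break
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]  # j > i, so index i is unaffected
    return clusters


def label_cluster(cluster, vectors):
    """Assumed labelling rule: the member word nearest the cluster centroid."""
    c = centroid(cluster, vectors)
    return min(cluster, key=lambda w: cosine_distance(vectors[w], c))


def build_reference_structure(words, vectors, threshold=0.05):
    """Map each node label to the sorted words that the node covers."""
    return {
        label_cluster(cluster, vectors): sorted(cluster)
        for cluster in cluster_words(words, vectors, threshold)
    }


structure = build_reference_structure(list(TOY_VECTORS), TOY_VECTORS)
for label, members in structure.items():
    print(label, "->", members)
```

In a K-anonymity setting, a structure like this would let a specific word in a record be replaced by the label of its node, so that several records share the same generalized value. A single flat layer of nodes is shown here only to keep the sketch short.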

Source: USPTO / EPO open patent data (bibliographic data and citation counts).