Patent · US Active

Semantic duplicate normalization and standardization

US12050873B2 · kind B2 · utility

Cited by: 0
References: 0
Claims: 20
Family size: 0

Assignee

Inventor

Key dates

Filing date: Oct 28, 2021
Grant date: Jul 30, 2024
Priority date
Expiry date: Jan 27, 2043

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06N3/08
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Systems, methods, and computer-readable media are disclosed for list attribute normalization and standardization for creation of a controlled vocabulary. A vocabulary set comprising a plurality of vocabulary terms may be received. For each vocabulary term, semantic duplicates may be identified. The semantic duplicates may be identified by analyzing semantics, syntactics, or phonetics of the vocabulary terms. Semantic chains may be formed from each vocabulary term and the corresponding semantic duplicates. The terms in each semantic chain may be ranked to determine a most probable vocabulary term. The most probable vocabulary term may then replace the semantic chain. The most probable vocabulary terms across all semantic chains from the vocabulary set may form the controlled vocabulary.
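
The pipeline described in the abstract (identify duplicates, form chains, rank, replace) can be illustrated with a minimal Python sketch. This is not the patented method: the abstract does not specify the similarity measure or the ranking criterion, so the sketch assumes a purely syntactic comparison (character similarity from the standard-library `difflib`) and ranks by raw frequency in the input. The function names, the 0.85 threshold, and the sample terms are all hypothetical.

```python
from collections import Counter
from difflib import SequenceMatcher


def normalize(term):
    """Lowercase and collapse whitespace before comparison."""
    return " ".join(term.lower().split())


def are_duplicates(a, b, threshold=0.85):
    """Assumed duplicate test: syntactic similarity only.
    The abstract also mentions semantics and phonetics, which are not modeled here."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio() >= threshold


def build_chains(terms, threshold=0.85):
    """Group terms into chains: connected components of the duplicate relation."""
    unique = list(dict.fromkeys(terms))  # distinct terms, input order preserved
    parent = {t: t for t in unique}

    def find(t):
        # Union-find lookup with path halving.
        while parent[t] != t:
            parent[t] = parent[parent[t]]
            t = parent[t]
        return t

    for i, a in enumerate(unique):
        for b in unique[i + 1:]:
            if are_duplicates(a, b, threshold):
                parent[find(b)] = find(a)

    chains = {}
    for t in unique:
        chains.setdefault(find(t), []).append(t)
    return list(chains.values())


def controlled_vocabulary(terms, threshold=0.85):
    """Replace each chain with its top-ranked term.
    Assumed ranking: frequency in the input; ties go to the earliest term."""
    counts = Counter(terms)
    return [max(chain, key=lambda t: counts[t])
            for chain in build_chains(terms, threshold)]


if __name__ == "__main__":
    sample = ["New York", "new york", "New York",
              "Boston", "Bostn", "Boston", "Chicago"]
    print(controlled_vocabulary(sample))
```

The pairwise comparison is quadratic in the number of distinct terms, which is acceptable for a sketch but would likely need blocking or indexing for large vocabularies. Swapping `are_duplicates` for an embedding-based or phonetic comparison would leave the chain-building and ranking steps unchanged.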

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.