Patent · US Active

Using unsupervised machine learning for automatic entity resolution of natural language records

US11783130B2 · kind B2 · utility

2Cited by
3References
14Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 6, 2019
Grant dateOct 10, 2023
Priority date
Expiry dateMay 6, 2039

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N5/022
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A computer process for entity resolution of natural language records including training a semantic embedding function on a corpus of unlabeled training materials. The semantic embedding function can take a word and represent it as a vector, where the vector represents the word as it relates to the semantic information of the corpus of unlabeled training materials. The process may transform a list of normalized descriptions using the semantic embedding function into a list of vector representations of the descriptions. The process may transform words from a natural language record to a vector representation of the natural language record using the semantic embedding function, and may use a named entity recognizer. The process may find a best match description from the list of normalized descriptions using the list of vector representations of the descriptions and the vector representation of the natural language record, and may include using word mover distance.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.