Using unsupervised machine learning for automatic entity resolution of natural language records
US11783130B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | May 6, 2019 |
| Grant date | Oct 10, 2023 |
| Priority date | — |
| Expiry date | May 6, 2039 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06N5/022
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A computer process for entity resolution of natural language records including training a semantic embedding function on a corpus of unlabeled training materials. The semantic embedding function can take a word and represent it as a vector, where the vector represents the word as it relates to the semantic information of the corpus of unlabeled training materials. The process may transform a list of normalized descriptions using the semantic embedding function into a list of vector representations of the descriptions. The process may transform words from a natural language record to a vector representation of the natural language record using the semantic embedding function, and may use a named entity recognizer. The process may find a best match description from the list of normalized descriptions using the list of vector representations of the descriptions and the vector representation of the natural language record, and may include using word mover distance.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.