Event matching by analysis of text characteristics (e-match)
US10108697B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Jun 17, 2013 |
| Grant date | Oct 23, 2018 |
| Priority date | — |
| Expiry date | Feb 1, 2034 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/38
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A system and method for event matching by analysis of text characteristics are presented. A document collection comprising documents is acquired. One or more document subsets of the document collection each comprising one or more documents potentially describing identical events are identified based on certain structured metadata fields of the documents. Salient text features are extracted from the documents in the document collection. An event similarity score for pairs of documents in the document collection is generated by comparing the text features extracted from the documents. A common event document list comprising sets of documents in the document collection whose event similarity scores with each other are above a similarity threshold is generated.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.