Comparing similarity between documents for filtering unwanted documents
US8874663B2 · kind B2 · utility
Assignee
Inventor
Key dates
| Filing date | Aug 28, 2009 |
| Grant date | Oct 28, 2014 |
| Priority date | — |
| Expiry date | Oct 8, 2032 |
Classification
- Technology area (CPC H)Electricity
- CPC primaryH04L51/212
- WIPO fieldDigital communication
- WIPO sectorElectrical engineering
Abstract
A mechanism for efficiently determining similarity between documents. A set of reference data items is generated by processing a reference document. A similarity index representing similarity between a candidate document and the reference documents is obtained by counting segments of the candidate document matching the reference data items. The candidate document is a message transmitted in a communication system where the message is compared against one or more reference documents representing unwanted messages to filter and block unwanted messages from being transmittal or propagated.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.