Method and apparatus for determining semantic similarity of character strings
US10089301B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Nov 10, 2016 |
| Grant date | Oct 2, 2018 |
| Priority date | — |
| Expiry date | Nov 10, 2036 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F40/205
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method and device for determining semantic similarity between two character strings are disclosed. The two character strings are segmented into sequences of words or phrases which represent the correlation between the characters. Edit distance from the first sequence to the second sequence is calculated based on a predetermined algorithm. A minimum semantic distance is then determined from the edit distance by considering the word/phrase pairs appearing in both sequences and the relationship between the cost of the various operations performed to convert the first sequence into the second sequence. The semantic similarity between the two character strings is then determined and normalized from the minimum semantic distance.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.