Method and system for converting document sets to term-association vector spaces on demand
US9201864B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Mar 15, 2013 |
| Grant date | Dec 1, 2015 |
| Priority date | — |
| Expiry date | Jan 14, 2034 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F40/30
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Disclosed herein is a method and system for producing a term association vector space on demand for a client given a document set in electronic form. The method extracts terms from the document set, stripping out words that do not convey meaning and adding important phrases within the context of the document set to the terms. Associations between terms are calculated, subjected to further analytical processes, and collected in a matrix, whose rows are vectors defining the vector space. Additional associational data can be added by matrix arithmetic, and documents can be rendered as further vectors in the space.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.