Computerized cross-language document retrieval using latent semantic indexing
US5301109A · kind A · utility
Assignee
Inventors
Key dates
| Filing date | Jul 17, 1991 |
| Grant date | Apr 5, 1994 |
| Priority date | — |
| Expiry date | Jul 17, 2011 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F40/253
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A methodology for retrieving textual data objects in a multiplicity of languages is disclosed. The data objects are treated in the statistical domain by presuming that there is an underlying, latent semantic structure in the usage of words in each language under consideration. Estimates to this latent structure are utilized to represent and retrieve objects. A user query is recouched in the new statistical domain and then processed in the computer system to extract the underlying meaning to respond to the query.
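The retrieval idea in the abstract can be illustrated with a minimal sketch of latent semantic indexing. This is not the patent's implementation. It assumes a toy training set in which each document joins an English passage with its French translation (the abstract does not specify how training data is built), uses a truncated singular value decomposition as the estimate of the latent structure, and maps queries and monolingual documents into that space with the standard LSI fold-in formula. All data, names (`training_docs`, `term_vector`, `fold_in`, `rank`), and the choice of `K = 2` are hypothetical.

```python
import numpy as np

# Hypothetical toy training set: each document is an English passage joined
# with its French translation, so co-occurrence links the two vocabularies.
training_docs = [
    "dog barks loud chien aboie fort",
    "cat sleeps sofa chat dort canape",
    "dog chases cat chien poursuit chat",
    "stock market falls bourse marche chute",
    "bank raises rates banque augmente taux",
    "bank market profit banque marche profit",  # "profit" is spelled alike in both languages
]

vocab = sorted({word for doc in training_docs for word in doc.split()})
term_index = {word: i for i, word in enumerate(vocab)}


def term_vector(text):
    """Raw term counts over the training vocabulary; unknown words are ignored."""
    vec = np.zeros(len(vocab))
    for word in text.split():
        if word in term_index:
            vec[term_index[word]] += 1.0
    return vec


# Term-by-document matrix (rows: terms, columns: training documents).
A = np.column_stack([term_vector(doc) for doc in training_docs])

# Truncated SVD: the K largest singular triplets estimate the latent structure.
K = 2  # assumed number of latent dimensions for this toy example
U, s, _ = np.linalg.svd(A, full_matrices=False)
U_k, s_k = U[:, :K], s[:K]


def fold_in(text):
    """Map a query or a new document into the K-dimensional latent space."""
    return term_vector(text) @ U_k / s_k


def cosine(a, b):
    """Cosine similarity, defined as 0.0 when either vector is all zeros."""
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0


def rank(query, docs):
    """Return (score, document) pairs sorted from most to least similar."""
    q = fold_in(query)
    return sorted(((cosine(q, fold_in(d)), d) for d in docs), reverse=True)


if __name__ == "__main__":
    # English query against French-only documents that were not in the training set.
    for score, doc in rank("dog cat", ["banque bourse", "chien chat"]):
        print(f"{score:.3f}  {doc}")
```

In this toy setup the French document about animals should rank above the one about finance, even though it shares no words with the English query. The match comes entirely from the latent dimensions learned from the paired training documents.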
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.