Patent · US Active

Long-query retrieval

US8326820B2 · kind B2 · utility

Cited by: 3
References: 10
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Sep 30, 2009
Grant date: Dec 4, 2012
Priority date
Expiry date: Jul 23, 2030

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F16/24534
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Described herein is a technology that facilitates efficient large-scale similarity-based retrieval. In several embodiments, documents, images, and/or other multimedia files are compactly represented and efficiently indexed to enable robust search using a long query in a large-scale corpus. As described herein, these techniques include performing decomposition of a file, e.g., a document or document-like representation. The techniques use dimension reduction to obtain three parts: topic-related words (major semantics), document-specific words (minor semantics), and background words. The major semantics are represented in a feature vector and the minor semantics as keywords. Using the techniques described, file vectors are matched in a topic model and the results are ranked based on the keywords.
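
The two-stage flow in the abstract (decompose, match on the feature vector, then rank on keywords) can be illustrated with a minimal Python sketch. It is not the patented method: the abstract does not specify the topic model or the dimension-reduction step, so the sketch substitutes a hand-written topic lexicon for a learned model. The names BACKGROUND, TOPIC_WORDS, decompose, and search, the cosine and Jaccard scoring, and the shortlist parameter are all assumptions made for illustration.

```python
import math
from collections import Counter

# Hypothetical toy inputs: a real system would learn these from a corpus.
BACKGROUND = {"the", "a", "of", "and", "in", "is", "to"}
TOPIC_WORDS = {
    "sports": {"game", "team", "score", "season"},
    "finance": {"market", "stock", "price", "bank"},
}
TOPICS = sorted(TOPIC_WORDS)  # fixed order: ["finance", "sports"]


def decompose(text):
    """Split a text into (unit-length topic vector, document-specific keywords)."""
    counts = Counter(text.lower().split())
    vector = [0.0] * len(TOPICS)
    keywords = set()
    for word, n in counts.items():
        if word in BACKGROUND:
            continue  # background words are discarded
        hit = False
        for i, topic in enumerate(TOPICS):
            if word in TOPIC_WORDS[topic]:
                vector[i] += n  # major semantics: count toward the topic
                hit = True
        if not hit:
            keywords.add(word)  # minor semantics: keep as a keyword
    norm = math.sqrt(sum(v * v for v in vector))
    if norm > 0:
        vector = [v / norm for v in vector]
    return vector, keywords


def cosine(u, v):
    # Vectors from decompose() are unit-length or all zero,
    # so the dot product equals the cosine similarity.
    return sum(a * b for a, b in zip(u, v))


def jaccard(a, b):
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def search(query, corpus, shortlist=2):
    """Shortlist by topic-vector similarity, then rerank by keyword overlap."""
    qv, qk = decompose(query)
    indexed = {doc_id: decompose(text) for doc_id, text in corpus.items()}
    # Stage 1: match on major semantics (topic vectors).
    candidates = sorted(
        indexed, key=lambda d: cosine(qv, indexed[d][0]), reverse=True
    )[:shortlist]
    # Stage 2: rank the shortlist on minor semantics (keywords).
    return sorted(candidates, key=lambda d: jaccard(qk, indexed[d][1]), reverse=True)
```

In this sketch the topic vector narrows a large corpus to a few candidates cheaply, and the keyword comparison then separates documents that share a topic but differ in specifics. A production system would presumably replace the linear scan in stage 1 with an index over the vectors.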

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.