Patent · US Active

Search results ranking using editing distance and document information

US8812493B2 · kind B2 · utility

3Cited by
201References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateApr 11, 2008
Grant dateAug 19, 2014
Priority date
Expiry dateJan 23, 2029

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/951
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Architecture for extracting document information from documents received as search results based on a query string, and computing an edit distance between the data string and the query string. The edit distance is employed in determining relevance of the document as part of result ranking by detecting near-matches of a whole query or part of the query. The edit distance evaluates how close the query string is to a given data stream that includes document information such as TAUC (title, anchor text, URL, clicks) information, etc. The architecture includes the index-time splitting of compound terms in the URL to allow the more effective discovery of query terms. Additionally, index-time filtering of anchor text is utilized to find the top N anchors of one or more of the document results. The TAUC information can be input to a neural network (e.g., 2-layer) to improve relevance metrics for ranking the search results.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.