Patent · US Active

Extraction of attributes and values from natural language documents

US7970767B2 · kind B2 · utility

15Cited by
11References
9Claims
0Family size

Assignee

Inventors

Key dates

Filing dateApr 30, 2007
Grant dateJun 28, 2011
Priority date
Expiry dateNov 24, 2029

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/169
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

One or more classification algorithms are applied to at least one natural language document in order to extract both attributes and values of a given product. Supervised classification algorithms, semi-supervised classification algorithms, unsupervised classification algorithms or combinations of such classification algorithms may be employed for this purpose. The at least one natural language document may be obtained via a public communication network. Two or more attributes (or two or more values) thus identified may be merged to form one or more attribute phrases or value phrases. Once attributes and values have been extracted in this manner, association or linking operations may be performed to establish attribute-value pairs that are descriptive of the product. In a presently preferred embodiment, an (unsupervised) algorithm is used to generate seed attributes and values which can then support a supervised or semi-supervised classification algorithm.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.