Patent · US Active

Extracting structured data from weblogs

US10180986B2 · kind B2 · utility

1Cited by
162References
21Claims
0Family size

Assignee

Inventor

Key dates

Filing dateOct 12, 2015
Grant dateJan 15, 2019
Priority date
Expiry dateMar 10, 2037

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/205
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Methods and apparatus for extracting structured data from weblogs are disclosed. In some examples, the methods and apparatus include retrieving a feed referenced on a webpage of the weblog and, in response to determining that the feed does not contain a first portion of a weblog post, creating, via a processor, a representation of the weblog post based on a second portion of the weblog post included in the feed, searching, via the processor, the weblog for the second portion of the weblog post, when the second portion of the weblog post is found in the weblog, identifying, via the processor, a node associated with the second portion in the webpage, and modifying, via the processor, the representation based on information from within the node to reconstruct the weblog post.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.