Patent · US Active

Extraction and analysis of user-generated content

US8458584B1 · kind B1 · utility

8Cited by
21References
25Claims
0Family size

Assignee

Inventors

Key dates

Filing dateNov 18, 2010
Grant dateJun 4, 2013
Priority date
Expiry dateApr 30, 2031

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/951
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A page splitter may be configured to split a first page of a site into first boilerplate and first posts, and a second page of the site into second boilerplate and second posts. An aggregator may be configured to associate the first page with the second page, based on a similarity of the first boilerplate and the second boilerplate, and configured to associate at least one of the first posts and at least one of the second posts with a first post-type, and at least one of the second posts with a second post-type. A merger may be configured to merge the first boilerplate and the second boilerplate into a boilerplate template, posts of the first post-type from the first page and from the second page into a first post-type template, and posts of the second post-type from the second page into a second post-type template, and further configured to merge the boilerplate template, the first post-type template, and the second post-type template into a site template associated with the site.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.