Extraction and analysis of user-generated content
US8458584B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Nov 18, 2010 |
| Grant date | Jun 4, 2013 |
| Priority date | — |
| Expiry date | Apr 30, 2031 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/951
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A page splitter may be configured to split a first page of a site into first boilerplate and first posts, and a second page of the site into second boilerplate and second posts. An aggregator may be configured to associate the first page with the second page, based on a similarity of the first boilerplate and the second boilerplate, and configured to associate at least one of the first posts and at least one of the second posts with a first post-type, and at least one of the second posts with a second post-type. A merger may be configured to merge the first boilerplate and the second boilerplate into a boilerplate template, posts of the first post-type from the first page and from the second page into a first post-type template, and posts of the second post-type from the second page into a second post-type template, and further configured to merge the boilerplate template, the first post-type template, and the second post-type template into a site template associated with the site.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.