Unsupervised learning tool for feature correction
US7483903B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Oct 17, 2005 |
| Grant date | Jan 27, 2009 |
| Priority date | — |
| Expiry date | Jun 7, 2026 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/95
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Techniques for correcting miscategorized features excerpted from web pages are provided. For each of several categories and several pages on a particular web site, a separate feature may be excerpted from that page and associated with that page in relation to that category. Often, many of the “high confidence” features that have been associated with the same category are found to be associated with similar characteristics regardless of the pages from which those features were excerpted. Thus, a set of category characteristics, which are often found associated with the “high confidence” features in a particular category, may be determined. For each page, a candidate feature that is associated with the set of category characteristics may be identified in that page. If, in relation to the particular category, a feature other than the candidate feature is associated with that page, then that other feature may be replaced by the candidate feature.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.