Uniform resource locator canonicalization
US9081861B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 21, 2008 |
| Grant date | Jul 14, 2015 |
| Priority date | — |
| Expiry date | Jan 3, 2033 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/951
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A computer-implemented method includes receiving a plurality of uniform resource locators (URLs), where the URLs identify content files, and where the URLs include at least one parameter. Fingerprints of the content files are identified. A first entropy of values of the fingerprints conditional on values of a first parameter is determined, and a second entropy of values of the first parameter conditional on values of the fingerprints is determined. It is determined that the first parameter is irrelevant to the identification of a unique content file by the URLs based, at least in part, on the first and second entropy values.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.