Efficient string search
US8086441B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 27, 2007 |
| Grant date | Dec 27, 2011 |
| Priority date | — |
| Expiry date | Oct 17, 2030 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F40/40
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Some embodiments of an efficient string search have been presented. In one embodiment, a string of bytes representing content written in a non-delimited language is received, wherein the content has been classified into a predetermined category. In a single pass through the string of bytes, a set of N-grams is searched for simultaneously. Statistical information on occurrences of the N-grams, if any, in the string of bytes is collected. In some embodiments, a model is generated based on the statistical information, where the model is usable by a content filter to classify content.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.