Patent · US Active

Systems, methods, and media for outputting a dataset based upon anomaly detection

US8381299B2 · kind B2 · utility

Cited by: 284
References: 5
Claims: 93
Family size: 0

Assignee

Inventors

Key dates

Filing date: Feb 28, 2007
Grant date: Feb 19, 2013
Priority date
Expiry date: Jan 22, 2030

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F2221/034
  • WIPO field: Digital communication
  • WIPO sector: Electrical engineering

Abstract

Systems, methods, and media for outputting a dataset based upon anomaly detection are provided. In some embodiments, methods for outputting a dataset based upon anomaly detection: receive a training dataset having a plurality of n-grams, which plurality includes a first plurality of distinct training n-grams each being a first size; compute a first plurality of appearance frequencies, each for a corresponding one of the first plurality of distinct training n-grams; receive an input dataset including first input n-grams each being the first size; define a first window in the input dataset; identify as being first matching n-grams, the first input n-grams in the first window that correspond to the first plurality of distinct training n-grams; compute a first anomaly detection score for the input dataset using the first matching n-grams and the first plurality of appearance frequencies; and output the input dataset based on the first anomaly detection score.
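
The steps named in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the abstract does not say how the score is computed from the matching n-grams and their frequencies, so the averaging rule below is an assumption, as are the byte-level n-grams, the function names, and the `threshold` parameter.

```python
from collections import Counter
from typing import Dict, List, Optional


def ngrams(data: bytes, n: int) -> List[bytes]:
    """Return every contiguous n-byte slice of data, in order."""
    return [data[i:i + n] for i in range(len(data) - n + 1)]


def appearance_frequencies(training: bytes, n: int) -> Dict[bytes, float]:
    """Relative frequency of each distinct n-gram in the training dataset."""
    counts = Counter(ngrams(training, n))
    total = sum(counts.values())
    return {gram: count / total for gram, count in counts.items()}


def window_score(freqs: Dict[bytes, float], data: bytes, n: int,
                 start: int, size: int) -> float:
    """Score one window of the input dataset.

    Assumed scoring rule (not taken from the patent): the mean training
    frequency over all n-grams in the window, where n-grams that were never
    seen in training contribute zero. Higher means closer to the training data.
    """
    window = data[start:start + size]
    grams = ngrams(window, n)
    if not grams:
        return 0.0
    # Matching n-grams: window n-grams that also occur in the training set.
    matching = [g for g in grams if g in freqs]
    return sum(freqs[g] for g in matching) / len(grams)


def output_if_normal(freqs: Dict[bytes, float], data: bytes, n: int,
                     size: int, threshold: float) -> Optional[bytes]:
    """Output the input dataset only if its first window scores high enough.

    The threshold is a hypothetical parameter chosen by the caller.
    """
    score = window_score(freqs, data, n, 0, size)
    return data if score >= threshold else None
```

With frequencies learned from `b"abababab"` and `n = 2`, an input such as `b"abab"` consists entirely of matching n-grams and scores well above zero, while `b"xyzx"` has no matching n-grams and scores 0.0, so `output_if_normal` returns `None` for it at any positive threshold.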

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.