Patent · US Active

Detecting poisoning attacks on neural networks by activation clustering

US11188789B2 · kind B2 · utility

5Cited by
3References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateAug 7, 2018
Grant dateNov 30, 2021
Priority date
Expiry dateOct 1, 2040

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V30/10
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

One embodiment provides a method comprising receiving a training set comprising a plurality of data points, where a neural network is trained as a classifier based on the training set. The method further comprises, for each data point of the training set, classifying the data point with one of a plurality of classification labels using the trained neural network, and recording neuronal activations of a portion of the trained neural network in response to the data point. The method further comprises, for each classification label that a portion of the training set has been classified with, clustering a portion of all recorded neuronal activations that are in response to the portion of the training set, and detecting one or more poisonous data points in the portion of the training set based on the clustering.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.