Patent · US Active

Post-training detection and identification of human-imperceptible backdoor-poisoning attacks

US11514297B2 · kind B2 · utility

1Cited by
0References
18Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 27, 2020
Grant dateNov 29, 2022
Priority date
Expiry dateJun 17, 2041

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F2221/033
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

This patent concerns novel technology for detecting backdoors of neural network, particularly deep neural network (DNN), classifiers. The backdoors are planted by suitably poisoning the training dataset, i.e., a data-poisoning attack. Once added to input samples from a source class (or source classes), the backdoor pattern causes the decision of the neural network to change to a target class. The backdoors under consideration are small in norm so as to be imperceptible to a human, but this does not limit their location, support or manner of incorporation. There may not be components (edges, nodes) of the DNN which are dedicated to achieving the backdoor function. Moreover, the training dataset used to learn the classifier may not be available. In one embodiment of the present invention which addresses such challenges, if the classifier is poisoned then the backdoor pattern is determined through a feasible optimization process, followed by an inference process, so that both the backdoor pattern itself and the associated source class(es) and target class are determined based only on the classifier parameters and a set of clean (unpoisoned attacked) samples from the different classes …

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.