Patent · US Active

Security for generative models using attention analysis

US12292915B1 · kind B1 · utility

0Cited by
0References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 4, 2023
Grant dateMay 6, 2025
Priority date
Expiry dateDec 4, 2043

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/40
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Devices and techniques are generally described for security threat mitigation for generative machine learning models. In some examples, first prompt data including first data associated with a first natural language input and a first span may be determined. An LLM may determine first plan data using the first prompt data. The first plan data may include a call to the first API. A first classifier model may determine a first trust score for the first span. A first attention score may be determined for the first span and the first action plan. Second plan data may be generated based on at least one of the first trust score and the first attention score or the second trust score and the second attention score.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.