Systems and methods for flexible regularized distillation of natural language processing models to facilitate interpretation
US12288029B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | May 14, 2024 |
| Grant date | Apr 29, 2025 |
| Priority date | — |
| Expiry date | May 14, 2044 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06N20/00
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Systems, apparatuses, methods, and computer program products are disclosed for distillation of a natural language processing (NLP) model. An example method includes receiving, by communications circuitry, a set of text data comprising a set of observations and predicting, by processing circuitry and using the NLP model, a classification for each observation in the text data. The example method further includes generating, by a model training engine, a balanced sampled data structure based on the predicted classification for each observation in the text data and training, by the model training engine, a surrogate model using the balanced sampled data structure. The example method further includes identifying, by an interpreter and from the surrogate model, a set of most-influential tokens in the text data.
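The steps named in the abstract (predict, balance, train a surrogate, read off influential tokens) can be illustrated with a minimal sketch. The abstract does not say what kind of surrogate is used, how the sample is balanced, or how token influence is measured, so everything here is an assumption for illustration: `black_box_predict` is a hypothetical keyword rule standing in for the NLP model, balancing is done by downsampling each predicted class to the size of the smallest one, the surrogate is an L2-regularized bag-of-words logistic regression, and influence is taken to be the absolute weight of a token.

```python
import math
import random
from collections import Counter, defaultdict

def black_box_predict(text):
    """Hypothetical stand-in for the NLP model being distilled."""
    words = text.lower().split()
    return 1 if ("refund" in words or "broken" in words) else 0

def balanced_sample(texts, labels, seed=0):
    """Assumed balancing scheme: downsample every predicted class
    to the size of the smallest class."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for text, label in zip(texts, labels):
        by_class[label].append(text)
    n = min(len(group) for group in by_class.values())
    sample = []
    for label in sorted(by_class):
        sample += [(t, label) for t in rng.sample(by_class[label], n)]
    rng.shuffle(sample)
    return sample

def train_surrogate(sample, epochs=200, lr=0.5, l2=0.01):
    """Assumed surrogate: bag-of-words logistic regression fitted by
    stochastic gradient descent with an L2 penalty on token weights."""
    weights = defaultdict(float)
    bias = 0.0
    for _ in range(epochs):
        for text, label in sample:
            counts = Counter(text.lower().split())
            z = bias + sum(weights[t] * c for t, c in counts.items())
            z = max(-30.0, min(30.0, z))  # keep exp() in a safe range
            p = 1.0 / (1.0 + math.exp(-z))
            err = p - label
            bias -= lr * err
            for t, c in counts.items():
                weights[t] -= lr * (err * c + l2 * weights[t])
    return dict(weights), bias

def most_influential_tokens(weights, k=3):
    """Assumed influence measure: largest absolute surrogate weight."""
    return sorted(weights, key=lambda t: abs(weights[t]), reverse=True)[:k]

# Toy observations (invented for this sketch, not from the patent).
texts = [
    "please refund my order",
    "the item arrived broken",
    "great service thank you",
    "love the fast delivery",
    "thank you for the help",
    "delivery was quick and great",
    "I want a refund now",
    "fast and friendly service",
]

labels = [black_box_predict(t) for t in texts]  # step 1: predict
sample = balanced_sample(texts, labels)         # step 2: balance
weights, bias = train_surrogate(sample)         # step 3: distill
top_tokens = most_influential_tokens(weights)   # step 4: interpret
print(top_tokens)
```

Because the surrogate is linear, each token's weight can be read directly as its push toward one predicted class or the other, which is what makes the distilled model easier to inspect than the original. Swapping in a different surrogate or penalty would change only `train_surrogate`.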
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.