Training data generation to facilitate fine-tuning embedding models
US12423530B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | May 9, 2023 |
| Grant date | Sep 23, 2025 |
| Priority date | — |
| Expiry date | Dec 25, 2043 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F40/263
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Techniques are provided for generating training data to facilitate fine-tuning embedding models. Training data including anchor utterances is obtained. Positive utterances and negative utterances are generated from the anchor utterances. Tuples including the anchor utterances, the positive utterances, and the negative utterances are formed. Embeddings for the tuples are generated and a pre-trained embedding model is fine-tuned based on the embeddings. The fine-tuned model can be deployed to a system.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.