Patent · US Active

Pretraining a language machine-learning model

US12236206B1 · kind B1 · utility

Cited by	0

References	8

Claims	20

Family size	0

Assignee

Meta Platforms, Inc. · US

Inventors

Michael W. Lewis · Tucson, US
Marjan Ghazvini Nejad · Seattle, US
Gargi Ghosh · Bellevue, US
Armen Aghajanyan · Bellevue, US
Sida Wang · Hercules, US
Luke Zettlemoyer · Seattle, US

Key dates

Filing date	May 26, 2021
Grant date	Feb 25, 2025
Priority date	—
Expiry date	Aug 28, 2043

Classification

Technology area (CPC G)	Physics
CPC primary	G06N3/084
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

In one embodiment, a method includes accessing a first document, accessing a plurality of second documents, calculating a relevance score for each of the plurality of second documents indicating a degree of relevance of the second document to the first document using an encoder of a machine-learning model, selecting a subset of the second documents based on their corresponding relevance scores, generating a target document by using the machine-learning model to process the subset of second documents and their corresponding relevance scores, and updating parameters of the machine-learning model based on a comparison between the first document and the generated target document.
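The steps named in the abstract (score, select, generate, compare, update) can be illustrated with a small sketch. This is not the patented implementation: the abstract does not specify the encoder, the generator, or the loss, so everything below is a hypothetical stand-in. The encoder is a weighted bag-of-words vector, the relevance score is cosine similarity, the "generated target" is a relevance-weighted mixture of word distributions, the comparison is a cross-entropy, and the update uses a finite-difference gradient. All function names are invented for illustration.

```python
import math
from collections import Counter

def bow(doc, vocab):
    """Word counts of doc over a fixed vocabulary."""
    counts = Counter(doc.lower().split())
    return [float(counts[w]) for w in vocab]

def encode(doc, vocab, weights):
    """Toy encoder (assumption): per-word learnable weights on word counts."""
    return [w * x for w, x in zip(weights, bow(doc, vocab))]

def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def relevance_scores(first, seconds, vocab, weights):
    """Relevance of each second document to the first document."""
    target_vec = encode(first, vocab, weights)
    return [cosine(target_vec, encode(d, vocab, weights)) for d in seconds]

def select_top_k(scores, k):
    """Indices of the k highest-scoring second documents."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return order[:k]

def generate(seconds, scores, selected, vocab):
    """Toy generator (assumption): softmax-weighted mix of word distributions."""
    exps = [math.exp(scores[i]) for i in selected]
    z = sum(exps)
    mix = [0.0] * len(vocab)
    for e, i in zip(exps, selected):
        counts = bow(seconds[i], vocab)
        total = sum(counts) or 1.0
        for j, c in enumerate(counts):
            mix[j] += (e / z) * (c / total)
    return mix

def loss(first, seconds, vocab, weights, k):
    """Cross-entropy between the first document and the generated target."""
    scores = relevance_scores(first, seconds, vocab, weights)
    selected = select_top_k(scores, k)
    predicted = generate(seconds, scores, selected, vocab)
    target = bow(first, vocab)
    total = sum(target)
    return -sum((t / total) * math.log(p + 1e-6)
                for t, p in zip(target, predicted) if t)

def update(first, seconds, vocab, weights, k, lr=0.1, eps=1e-4):
    """One finite-difference gradient step on the encoder weights."""
    base = loss(first, seconds, vocab, weights, k)
    grads = []
    for j in range(len(weights)):
        bumped = list(weights)
        bumped[j] += eps
        grads.append((loss(first, seconds, vocab, bumped, k) - base) / eps)
    return [w - lr * g for w, g in zip(weights, grads)]

# Made-up example documents, for illustration only.
first = "the cat sat on the mat"
seconds = [
    "a cat sat on a mat",
    "stock prices fell sharply today",
    "the dog sat on the rug",
]
vocab = sorted({w for d in [first] + seconds for w in d.lower().split()})
weights = [1.0] * len(vocab)

scores = relevance_scores(first, seconds, vocab, weights)
selected = select_top_k(scores, k=2)
current_loss = loss(first, seconds, vocab, weights, k=2)
new_weights = update(first, seconds, vocab, weights, k=2)
print(selected, current_loss)
```

In this toy setup the relevance scores feed into both the selection and the generation step, so the comparison with the first document sends a training signal back to the encoder weights. A real system would presumably replace each stand-in with a neural encoder and decoder trained by backpropagation, which is consistent with the G06N3/084 classification but is not spelled out in the abstract.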

Source: USPTO / EPO open patent data. Objective bibliographic data and citation counts.