Patent · US Active

Retrieval-augmented language model pre-training and fine-tuning

US11003865B1 · kind B1 · utility

2 Cited by

1 References

20 Claims

0 Family size

Assignee

Google LLC · US

Inventors

Kenton Chiu Tsun Lee · Mountain View, US
Kelvin Gu · Mountain View, US
Zora Tung · Mountain View, US
Panupong Pasupat · Mountain View, US
Ming Chang · Beijing, CN

Key dates

Filing date	May 20, 2020
Grant date	May 11, 2021
Priority date	—
Expiry date	May 20, 2040

Classification

Technology area (CPC G)	Physics
CPC primary	G06N5/025
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

Systems and methods for pre-training and fine-tuning of neural-network-based language models are disclosed in which a neural-network-based textual knowledge retriever is trained along with the language model. In some examples, the knowledge retriever obtains documents from an unlabeled pre-training corpus, generates its own training tasks, and learns to retrieve documents relevant to those tasks. In some examples, the knowledge retriever is further refined using supervised open-QA questions. The framework of the present technology provides models that can intelligently retrieve helpful information from a large unlabeled corpus, rather than requiring all potentially relevant information to be stored implicitly in the parameters of the neural network. This framework may thus reduce the storage space and complexity of the neural network, and also enable the model to more effectively handle new tasks that may be different than those on which it was pre-trained.
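
Illustrative sketch (not part of the patent record): the abstract describes a knowledge retriever that selects helpful documents from a corpus instead of relying only on what is stored in the model's parameters. The Python below is a minimal sketch of that retrieval step under stated assumptions. It is not the claimed method. The function names `embed`, `relevance`, `retrieval_distribution`, and `retrieve` are hypothetical, and the bag-of-words embedding is a stand-in for the trained neural encoder that the abstract refers to. Scoring by inner product and normalizing with a softmax are also assumptions made for illustration.

```python
import math
from collections import Counter


def embed(text):
    """Toy bag-of-words embedding, L2-normalized.

    Assumption: a stand-in for a learned neural encoder.
    """
    counts = Counter(text.lower().split())
    norm = math.sqrt(sum(c * c for c in counts.values())) or 1.0
    return {token: c / norm for token, c in counts.items()}


def relevance(query_vec, doc_vec):
    """Inner product between two sparse embeddings (assumed scoring rule)."""
    return sum(w * doc_vec.get(token, 0.0) for token, w in query_vec.items())


def retrieval_distribution(query, docs):
    """Softmax over relevance scores: one probability per candidate document."""
    q = embed(query)
    scores = [relevance(q, embed(d)) for d in docs]
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def retrieve(query, docs, k=1):
    """Return the k documents with the highest retrieval probability."""
    probs = retrieval_distribution(query, docs)
    ranked = sorted(range(len(docs)), key=lambda i: probs[i], reverse=True)
    return [docs[i] for i in ranked[:k]]


if __name__ == "__main__":
    corpus = [
        "the eiffel tower is in paris",
        "python is a programming language",
        "the pacific is the largest ocean",
    ]
    print(retrieve("where is the eiffel tower", corpus, k=1))
```

In a trained system of the kind the abstract outlines, the encoder would be learned jointly with the language model so that documents which help the downstream task receive higher scores. Here the scores depend only on word overlap.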

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.