Patent · US Active

Automated selection of large language models in cloud computing environments

US12236193B1 · kind B1 · utility

Cited by: 0
References: 3
Claims: 18
Family size: 0

Assignee

Inventors

Key dates

Filing date: Apr 19, 2024
Grant date: Feb 25, 2025
Priority date
Expiry date: Apr 19, 2044

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G10L2015/225
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Systems or methods for the selection of large language models (LLMs). A system receives a request from a service that hosts an application. The request is configured to be processed by an LLM to generate a response. The system applies a classification model to the request to determine the class of the request. The classification model is a language model trained to receive text and classify the text into a plurality of classes. The system selects an LLM from a plurality of candidate LLMs based in part on the determined class of the request and recommends the selected LLM to the application.
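
The flow in the abstract (classify the request, select an LLM from the candidates based in part on that class, recommend it to the application) can be sketched roughly as follows. This is an illustration only, not the patented implementation: the candidate names, class labels, cost figures, and the lowest-cost tie-break are all hypothetical, and a keyword matcher stands in for the trained classification language model the abstract describes.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CandidateLLM:
    """A candidate model; every field value below is a made-up example."""
    name: str
    classes: frozenset          # request classes this candidate is assumed to handle
    cost_per_1k_tokens: float   # hypothetical secondary selection factor


# Hypothetical candidate pool (not taken from the patent).
CANDIDATES = [
    CandidateLLM("small-general", frozenset({"chat", "summarization"}), 0.2),
    CandidateLLM("code-tuned", frozenset({"code"}), 0.6),
    CandidateLLM("large-general", frozenset({"chat", "summarization", "code"}), 1.5),
]

# Stand-in for the classification model: the abstract describes a trained
# language model; simple keyword matching is used here only to keep the
# sketch self-contained.
KEYWORDS = {
    "code": ("function", "bug", "compile", "python"),
    "summarization": ("summarize", "summary", "tl;dr"),
}


def classify_request(text: str) -> str:
    """Return the first class whose keywords appear in the text, else 'chat'."""
    lowered = text.lower()
    for label, words in KEYWORDS.items():
        if any(word in lowered for word in words):
            return label
    return "chat"


def select_llm(text: str, candidates=CANDIDATES) -> CandidateLLM:
    """Select a candidate based in part on the request class.

    The class filters the pool; the cheapest eligible candidate is then
    chosen (the cost tie-break is an assumption of this sketch).
    """
    request_class = classify_request(text)
    eligible = [c for c in candidates if request_class in c.classes]
    if not eligible:
        raise LookupError(f"no candidate LLM handles class {request_class!r}")
    return min(eligible, key=lambda c: c.cost_per_1k_tokens)


def recommend(text: str) -> dict:
    """Build the recommendation returned to the requesting application."""
    chosen = select_llm(text)
    return {"class": classify_request(text), "recommended_llm": chosen.name}
```

For example, `recommend("Fix the bug in this function")` classifies the request as `code` and recommends the hypothetical `code-tuned` candidate, since it is the cheapest model in the pool that handles that class.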

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.