Automated selection of large language models in cloud computing environments
US12236193B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Apr 19, 2024 |
| Grant date | Feb 25, 2025 |
| Priority date | — |
| Expiry date | Apr 19, 2044 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2015/225
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Systems or methods for the selection of large language models (LLMs). A system receives a request from a service that hosts an application. The request is configured to be processed by an LLM to generate a response. The system applies a classification model to the request to determine the class of the request. The classification model is a language model trained to receive text and classify the text into a plurality of classes. The system selects an LLM from a plurality of candidate LLMs based in part on the determined class of the request and recommends the selected LLM to the application.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.