Distributed approximate nearest neighbor service architecture for retrieving items in an embedding space
US12314286B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Feb 28, 2022 |
| Grant date | May 27, 2025 |
| Priority date | — |
| Expiry date | Apr 11, 2042 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/285
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
An online system maintains item embeddings for items. As a number of items maintained by the online system increases, maintaining a single index of the item embeddings is increasingly difficult. To increase scalability, the online system partitions item embeddings into multiple indices, with each index corresponding to a value of a specific attribute maintained by the online system for items. For example, an online system generates indices that each correspond to a different warehouse offering items. To expedite retrieval of item embeddings, the online system allocates each index to one of a number of shards. When the online system receives a query, the online system determines an embedding for the query and retrieves an index from a shard based on metadata received with the query. Based on distances between the query for the embedding and the item embeddings in the retrieved index, the online system selects one or more items.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.