System for determining document portions that correspond to queries
US11947915B1 · kind B1 · utility
Inventors
Key dates
| Filing date | Apr 23, 2021 |
| Grant date | Apr 2, 2024 |
| Priority date | — |
| Expiry date | Feb 4, 2042 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F40/279
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A document is divided into sections based on a characteristic of the text in the document. Characteristics may include specific characters such as paragraph breaks or selected punctuation, the topics or categories of the text, or a quantity of text in each section. Each section of the document may be represented by an embedding vector in a semantic embedding space. Values are determined based on the text in each section and the semantic characteristics of each section, such as word order, capitalization, punctuation, and word meaning. When a query is received, a vector value representing the query is determined based on the text and semantic characteristics of the query. Based on the similarity between the values determined for the query and those determined for the sections of a document, the specific section of a potentially large document that most closely matches the query is determined and included in a response.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.