Constructing content based on multi-sentence compression of source content
US10949452B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 26, 2017 |
| Grant date | Mar 16, 2021 |
| Priority date | — |
| Expiry date | Sep 13, 2038 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F40/284
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Embodiments of the present invention provide systems, methods, and computer storage media directed to facilitating corpus-based content generation, in particular, using graph-based multi-sentence compression to generate a final content output. In one embodiment, pre-existing source content is identified and retrieved from a corpus. The source content is then parsed into sentence tokens, mapped and weighted. The sentence tokens are further parsed into word tokens and weighted. The mapped word tokens are then compressed into candidate sentences to be used in a final content. The final content is assembled using ranked candidate sentences, such that the final content is organized to reduce information redundancy and optimize content cohesion.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.