Patent · US Active

Constructing content based on multi-sentence compression of source content

US10949452B2 · kind B2 · utility

Cited by: 2
References: 0
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Dec 26, 2017
Grant date: Mar 16, 2021
Priority date
Expiry date: Sep 13, 2038

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F40/284
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Embodiments of the present invention provide systems, methods, and computer storage media directed to facilitating corpus-based content generation, in particular, using graph-based multi-sentence compression to generate a final content output. In one embodiment, pre-existing source content is identified and retrieved from a corpus. The source content is then parsed into sentence tokens, mapped and weighted. The sentence tokens are further parsed into word tokens and weighted. The mapped word tokens are then compressed into candidate sentences to be used in a final content. The final content is assembled using ranked candidate sentences, such that the final content is organized to reduce information redundancy and optimize content cohesion.
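
The compression step named in the abstract can be illustrated with a minimal sketch of generic word-graph sentence compression. This is not the patented method: the abstract does not specify how tokens are mapped or weighted, so the sketch assumes that identical lowercase words share one graph node and that an edge costs the inverse of how often two words appear next to each other. All function names (`split_sentences`, `tokenize`, `build_word_graph`, `compress`) are hypothetical.

```python
import heapq
import re
from collections import defaultdict

START, END = "<s>", "</s>"  # sentinel nodes marking sentence boundaries


def split_sentences(text):
    """Naive sentence tokenizer: split after '.', '!' or '?'."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(sentence):
    """Lowercase word tokens; punctuation is dropped."""
    return re.findall(r"[a-z0-9']+", sentence.lower())


def build_word_graph(sentences):
    """Map every sentence onto one shared graph.

    edges[u][v] counts how often word v directly follows word u.
    Assumption: identical lowercase words are merged into one node.
    """
    edges = defaultdict(lambda: defaultdict(int))
    for sentence in sentences:
        path = [START] + tokenize(sentence) + [END]
        for u, v in zip(path, path[1:]):
            edges[u][v] += 1
    return edges


def compress(edges, min_words=3, max_words=30):
    """Return the cheapest START-to-END word path as a candidate sentence.

    Assumption: edge cost is 1 / count, so frequently shared word
    transitions are preferred. Uniform-cost search over simple paths.
    """
    heap = [(0.0, [START])]
    while heap:
        cost, path = heapq.heappop(heap)
        node = path[-1]
        if node == END:
            words = path[1:-1]
            if len(words) >= min_words:
                return words
            continue  # too short, keep searching
        if len(path) > max_words + 1:
            continue  # too long, abandon this path
        for nxt, count in edges.get(node, {}).items():
            if nxt in path:
                continue  # never revisit a word, which rules out cycles
            heapq.heappush(heap, (cost + 1.0 / count, path + [nxt]))
    return []


if __name__ == "__main__":
    source = (
        "The parser splits documents into tokens. "
        "The parser splits text into sentences. "
        "Our parser splits documents into sentences."
    )
    graph = build_word_graph(split_sentences(source))
    print(" ".join(compress(graph)))
```

With the three sample sentences, the cheapest path follows the most frequently shared transitions and produces a sentence that matches none of the inputs exactly. Ranking several candidate sentences and ordering them to reduce redundancy, which the abstract also mentions, are left out of this sketch.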

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.