Patent · US Active

Text segmentation and label assignment with user interaction by means of topic specific language models and topic-specific label statistics

US8332221B2 · kind B2 · utility

Cited by: 11
References: 14
Claims: 30
Family size: 0

Assignee

Inventors

Key dates

Filing date: Aug 15, 2011
Grant date: Dec 11, 2012
Priority date
Expiry date: Aug 15, 2031

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G10L15/26
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

The invention relates to a method, a computer program product, a segmentation system and a user interface for structuring an unstructured text by making use of statistical models trained on annotated training data. The method segments the text into sections and assigns a label to each section as its heading. The resulting segmentation and label assignment are presented to a user for review. Alternative segmentations and label assignments are also presented, so that the user can select an alternative segmentation or label, or enter a user-defined segmentation and a user-defined label. In response to the modifications introduced by the user, a plurality of different actions are initiated, including the re-segmentation and re-labeling of successive parts of the document or of the entire document.
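
The flow described in the abstract can be illustrated with a small sketch. This is not the patented method; it is a toy approximation under stated assumptions: each topic gets an add-one-smoothed unigram model, a Viterbi pass with a fixed topic-switch penalty picks one topic per sentence, and a user correction is modelled as a hard constraint that triggers re-decoding. The topic names, training sentences, and the helpers `train_unigram`, `segment`, and `to_sections` are all hypothetical, and the topic name is used directly as the section label (no separate label statistics).

```python
import math
from collections import Counter

def train_unigram(texts, vocab, alpha=1.0):
    """Add-alpha smoothed unigram log-probabilities for one topic (toy model)."""
    counts = Counter(w for t in texts for w in t.lower().split())
    total = sum(counts.values()) + alpha * len(vocab)
    return {w: math.log((counts[w] + alpha) / total) for w in vocab}

def segment(sentences, models, switch_penalty=1.0, fixed=None):
    """Viterbi decoding of one topic per sentence.

    `fixed` maps a sentence index to a topic forced by the user (assumption:
    a user correction is treated as a hard constraint on that sentence).
    """
    fixed = fixed or {}
    topics = sorted(models)
    oov = math.log(1e-6)  # same score in every topic for out-of-vocabulary words

    def emit(i, t):
        if i in fixed and fixed[i] != t:
            return float("-inf")
        return sum(models[t].get(w, oov) for w in sentences[i].lower().split())

    def cost(p, t):
        return switch_penalty if p != t else 0.0

    score = {t: emit(0, t) for t in topics}
    back = []
    for i in range(1, len(sentences)):
        new_score, pointers = {}, {}
        for t in topics:
            prev = max(topics, key=lambda p: score[p] - cost(p, t))
            new_score[t] = score[prev] - cost(prev, t) + emit(i, t)
            pointers[t] = prev
        score = new_score
        back.append(pointers)

    # Follow the back-pointers from the best final topic.
    t = max(topics, key=lambda k: score[k])
    path = [t]
    for pointers in reversed(back):
        t = pointers[t]
        path.append(t)
    path.reverse()
    return path

def to_sections(sentences, path):
    """Group consecutive sentences sharing a topic into labeled sections."""
    sections = []
    for s, t in zip(sentences, path):
        if sections and sections[-1][0] == t:
            sections[-1][1].append(s)
        else:
            sections.append((t, [s]))
    return sections

# Hypothetical annotated training data: topic label -> example sentences.
training = {
    "History": ["patient reports chest pain for two days",
                "patient has a history of hypertension"],
    "Medication": ["patient takes aspirin daily",
                   "prescribed lisinopril ten milligrams daily"],
    "Plan": ["follow up in two weeks",
             "schedule stress test next week"],
}
vocab = {w for texts in training.values() for t in texts for w in t.split()}
models = {label: train_unigram(texts, vocab) for label, texts in training.items()}

sentences = ["patient reports chest pain", "history of hypertension",
             "takes aspirin daily", "continue as before"]

# Initial automatic proposal shown to the user for review.
path = segment(sentences, models)

# The user relabels sentence 2 as "Plan"; the document is decoded again
# and the following, ambiguous sentence follows the corrected label.
revised = segment(sentences, models, fixed={2: "Plan"})
for label, body in to_sections(sentences, revised):
    print(label, "->", body)
```

In this toy run the last sentence contains only out-of-vocabulary words, so its topic is decided entirely by its neighbour. It is first grouped with the medication sentence, and moves to the "Plan" section once the user corrects the preceding sentence, which mirrors the re-labeling of successive parts of the document mentioned in the abstract.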

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.