Patent · US Active

Text segmentation and label assignment with user interaction by means of topic specific language models and topic-specific label statistics

US8200487B2 · kind B2 · utility

39Cited by
13References
42Claims
0Family size

Assignee

Inventors

Key dates

Filing dateNov 12, 2004
Grant dateJun 12, 2012
Priority date
Expiry dateSep 22, 2028

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L15/26
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

The invention relates to a method, a computer program product, a segmentation system and a user interface for structuring an unstructured text by making use of statistical models trained on annotated training data. The method performs text segmentation into text sections and assigns labels to text sections as section headings. The performed segmentation and assignment is provided to a user for general review. Additionally, alternative segmentations and label assignments are provided to the user being capable to select alternative segmentations and alternative labels as well as to enter a user defined segmentation and user defined label. In response to the modifications introduced by the user, a plurality of different actions are initiated incorporating the re-segmentation and re-labelling of successive parts of the document or the entire document. Furthermore the method comprises a learning functionality, logging and analyzing user introduced modifications for adaptation of user's preferences and for further training of the statistical models.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.