Patent · US Active

Document content analysis based on topic modeling

US10558657B1 · kind B1 · utility

8Cited by
3References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateSep 19, 2016
Grant dateFeb 11, 2020
Priority date
Expiry dateMay 31, 2037

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/93
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A mechanism for progressive topic modeling is disclosed to facilitate document content analysis. Input documents can be sorted and divided into multiple groups. Topic modeling is performed for each group, where the topic modeling for one group is based on the generated topic model from a previous group, if available. The vocabulary used in the topic modeling process can also be updated for each group of documents. The generated topics can be presented in a user interface to facilitate a user in analyzing the documents. The topic modeling mechanism can also be utilized to enhance a document search experience by generating topics from documents contained in search results and presenting topic words to a user as suggested search terms.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.