Patent · US Active

Automatic document classification via content analysis at storage time

US9239876B2 · kind B2 · utility

11Cited by
2References
22Claims
0Family size

Assignee

Inventor

Key dates

Filing dateDec 3, 2012
Grant dateJan 19, 2016
Priority date
Expiry dateDec 14, 2033

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/182
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Techniques are disclosed for efficiently and automatically classifying textual documents or files. In some embodiments, the classification process is integrated into or otherwise made part of the storage function, such that when the user initiates a save process for a given file, the file is processed through a classifier prior to (or contemporaneously with) completing the save function. In some such embodiments, textual content of the file is analyzed using natural language processing to identify a main or substantial concept discussed in the file, and one or more corresponding tags are then assigned to that file. Subsequently, the user can access that file based on the one or more tags, for instance, through a user interface that allows the user to select one or more content categories associated with the assigned tags. The files can be text-based, but may include other content as well, such as images, video, and audio.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.