Automatic document classification via content analysis at storage time
US9239876B2 · kind B2 · utility
Assignee
Inventor
Key dates
| Filing date | Dec 3, 2012 |
| Grant date | Jan 19, 2016 |
| Priority date | — |
| Expiry date | Dec 14, 2033 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/182
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Techniques are disclosed for efficiently and automatically classifying textual documents or files. In some embodiments, the classification process is integrated into or otherwise made part of the storage function, such that when the user initiates a save process for a given file, the file is processed through a classifier prior to (or contemporaneously with) completing the save function. In some such embodiments, textual content of the file is analyzed using natural language processing to identify a main or substantial concept discussed in the file, and one or more corresponding tags are then assigned to that file. Subsequently, the user can access that file based on the one or more tags, for instance, through a user interface that allows the user to select one or more content categories associated with the assigned tags. The files can be text-based, but may include other content as well, such as images, video, and audio.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.