Automatic document classification using text and images
US7039856B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Sep 30, 1998 |
| Grant date | May 2, 2006 |
| Priority date | — |
| Expiry date | May 14, 2019 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/353
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method and apparatus for automatic document classification using text and images. The present invention provides a method and apparatus for automatic document classification based on text and image. A new document is analyzed based on textual content as well as visual appearance. The new document is automatically stored in one or more mirror directories in which the new document would most likely be stored by the user of the device if the new document were placed manually. Determination of the most likely directories is based on an analysis of multiple documents stored by the user in various directories. The mirror directories are components of a mirror directory structure, which is a copy of a pre-existing directory structure, such as the user's hard drive. By storing the new document automatically, the user is relieved of the duty of manually selecting a directory for the new document.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.