Textual data classification method and apparatus
US6507829B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Jan 17, 2000 |
| Grant date | Jan 14, 2003 |
| Priority date | — |
| Expiry date | Jan 17, 2020 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F40/289
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method and apparatus for classifying textual data is provided. The invention is adapted to automatically classify text. In particular, the invention utilizes a sparse vector framework to evaluate natural language text and to accurately and automatically assign that text to a predetermined classification. This can be done even where the disclosed system has not seen an example of the exact text before. The disclosed method and apparatus are particularly well-suited for coding adverse event reports, commonly referred to as “verbatims,” generated during clinical trials of pharmaceuticals, The invention also provides a method and apparatus that can be used to translate verbatims that have already been classified according to one coding scheme to be translated to another coding scheme in a highly automated process.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.