Adaptive multi-modal detection and fusion in videos via classification-based-learning
US8965115B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 9, 2013 |
| Grant date | Feb 24, 2015 |
| Priority date | — |
| Expiry date | Dec 9, 2033 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06V10/806
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Described is a system for object detection using classification-based learning. A fusion method is selected, then a video sequence is processed to generate detections for each frame, wherein a detection is a representation of an object candidate. The detections are fused to generate a set of fused detections for each frame. The classification module generates a classification score labeling each fused detection based on a predetermined classification threshold. Otherwise, a token indicating that the classification module has abstained from generating a classification score is generated. The scoring module produces a confidence score for each fused detection based on a set of learned parameters from the learning module and the set of fused detections. The set of fused detections are filtered by the accept-reject module based on one of the classification score or the confidence score. Finally, a set of final detections representing an object is output.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.