Audio analysis learning using video data
US10204625B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jan 4, 2018 |
| Grant date | Feb 12, 2019 |
| Priority date | — |
| Expiry date | Jan 4, 2038 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L2015/223
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Audio analysis learning is performed using video data. Video data is obtained, on a first computing device, wherein the video data includes images of one or more people. Audio data is obtained, on a second computing device, which corresponds to the video data. A face is identified within the video data. A first voice, from the audio data, is associated with the face within the video data. The face within the video data is analyzed for cognitive content. Audio features are extracted corresponding to the cognitive content of the video data. The audio data is segmented to correspond to an analyzed cognitive state. An audio classifier is learned, on a third computing device, based on the analyzing of the face within the video data. Further audio data is analyzed using the audio classifier.
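The flow in the abstract (analyze the face, segment the audio to match the analyzed cognitive state, then learn an audio classifier from those pairs) can be illustrated with a minimal Python sketch. It is an illustration under stated assumptions, not the patented method: the `FaceLabel` type, the function names, the two-value feature vectors (loosely "energy" and "pitch"), and the state names are all hypothetical, and a simple nearest-centroid classifier stands in for whatever learner an actual system would use. Face identification, voice-to-face association, and audio feature extraction are assumed to have already happened upstream.

```python
from collections import defaultdict
from dataclasses import dataclass
from math import dist
from typing import Dict, List, Tuple

Frame = Tuple[float, List[float]]  # (timestamp in seconds, audio feature vector)


@dataclass
class FaceLabel:
    """Hypothetical result of facial analysis for one time span."""
    start: float  # seconds, inclusive
    end: float    # seconds, exclusive
    state: str    # cognitive state inferred from the face


def segment_audio(frames: List[Frame],
                  labels: List[FaceLabel]) -> List[Tuple[List[float], str]]:
    """Pair each audio frame with the face-derived state active at its timestamp.

    Frames that fall outside every labeled span are dropped.
    """
    pairs = []
    for timestamp, features in frames:
        for label in labels:
            if label.start <= timestamp < label.end:
                pairs.append((features, label.state))
                break
    return pairs


def learn_audio_classifier(
        pairs: List[Tuple[List[float], str]]) -> Dict[str, List[float]]:
    """Learn a nearest-centroid model: one mean feature vector per state."""
    grouped = defaultdict(list)
    for features, state in pairs:
        grouped[state].append(features)
    return {
        state: [sum(column) / len(column) for column in zip(*vectors)]
        for state, vectors in grouped.items()
    }


def classify(model: Dict[str, List[float]], features: List[float]) -> str:
    """Label further audio with the state whose centroid is closest."""
    return min(model, key=lambda state: dist(model[state], features))


# Made-up example data: labels come from the video, features from the audio.
face_labels = [FaceLabel(0.0, 2.0, "calm"), FaceLabel(2.0, 4.0, "excited")]
audio_frames = [
    (0.5, [0.2, 110.0]),
    (1.5, [0.3, 120.0]),
    (2.5, [0.8, 210.0]),
    (3.5, [0.9, 220.0]),
]

model = learn_audio_classifier(segment_audio(audio_frames, face_labels))
print(classify(model, [0.85, 200.0]))  # prints "excited"
```

The point of the sketch is the direction of supervision: the labels used for training come only from the video analysis, while the trained model consumes only audio features, so it can be applied to further audio that has no accompanying video.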
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.