Audio analysis learning with video data
US10573313B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Feb 11, 2019 |
| Grant date | Feb 25, 2020 |
| Priority date | — |
| Expiry date | Feb 11, 2039 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2015/223
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Audio analysis learning is performed using video data. Video data is obtained, on a first computing device, wherein the video data includes images of one or more people. Audio data is obtained, on a second computing device, which corresponds to the video data. A face within the video data is identified. A first voice, from the audio data, is associated with the face within the video data. The face within the video data is analyzed for cognitive content. Audio features corresponding to the cognitive content of the video data are extracted. The audio data is segmented to correspond to an analyzed cognitive state. An audio classifier is learned, on a third computing device, based on the analyzing of the face within the video data. Further audio data is analyzed using the audio classifier.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.