Method and system for speech emotion recognition
US11133025B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Nov 7, 2019 |
| Grant date | Sep 28, 2021 |
| Priority date | — |
| Expiry date | Apr 1, 2040 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2015/221
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method for speech emotion recognition for enriching speech to text communications between users in speech chat sessions including: implementing a speech emotion recognition model to enable converting observed emotions in speech samples to enrich text with visual emotion content by: generating a data set of speech samples with labels of a plurality of emotion classes; extracting a set of acoustic features from each of the emotion classes; generating a machine learning (ML) model based on the acoustic features and data set; training the ML model from acoustic features from speech samples during speech chat sessions; predicting emotion content based on a trained ML model in the observed speech; generating enriched text based on predicted emotion content of the trained ML model; and presenting the enriched text in speech to text communications between users in the chat session for visual notice of an observed emotion in the speech sample.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.