Processing apparatus for determining which person in a group is speaking
US7117157B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Mar 22, 2000 |
| Grant date | Oct 3, 2006 |
| Priority date | — |
| Expiry date | Mar 22, 2020 |
Classification
- Technology area (CPC H)Electricity
- CPC primaryH04N7/18
- WIPO fieldAudio-visual technology
- WIPO sectorElectrical engineering
Abstract
Image data from cameras showing movement of a number of people, and sound data, is archived and processed to determine the position and orientation of each person's head and to determine at whom each person is looking. The speaker is determined by determining at which person most people are looking. Alternatively, the sound data is processed to determine the direction from which the sound came, and it is determined who is speaking by determining which person's head is in a position corresponding to the direction from which the sound came. The personal speech recognition parameters for the speaker are selected and used to convert the sound data to text data. Image data to be archived is chosen by selecting the camera which best shows the speaker and the participant to whom he is speaking. Data is stored in a meeting archive database.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.