Patent · US Active

Systems and methods for manipulating electronic content based on speech recognition

US9311395B2 · kind B2 · utility

Cited by	3
References	19
Claims	20
Family size	0

Assignee

AOL · US

Inventors

Peter F. Kocks · San Francisco, US
Guoning Hu · Fremont, US
Ping Wu · Saratoga, US

Key dates

Filing date	Jun 9, 2011
Grant date	Apr 12, 2016
Priority date	—
Expiry date	Aug 6, 2033

Classification

Technology area (CPC H)	Electricity
CPC primary	H04N21/4668
WIPO field	Audio-visual technology
WIPO sector	Electrical engineering

Abstract

Systems and methods are disclosed for displaying electronic multimedia content to a user. One computer-implemented method for manipulating electronic multimedia content includes generating, using a processor, a speech model and at least one speaker model of an individual speaker. The method further includes receiving electronic media content over a network; extracting an audio track from the electronic media content; and detecting speech segments within the electronic media content based on the speech model. The method further includes detecting a speaker segment within the electronic media content and calculating a probability of the detected speaker segment involving the individual speaker based on the at least one speaker model.
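
The detection and scoring steps named in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not say what form the models take, so this example assumes one-dimensional Gaussian models over a single per-frame audio feature, and it assumes the speaker probability is a Bayes posterior against a background model. The names `Gaussian`, `detect_speech_segments`, `speaker_probability`, and the `prior` parameter are all hypothetical. Receiving the media over a network and extracting the audio track are left out.

```python
import math
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Gaussian:
    """One-dimensional Gaussian standing in for a trained model (assumption)."""
    mean: float
    var: float

    def log_pdf(self, x: float) -> float:
        # Log density of x under N(mean, var).
        return -0.5 * (math.log(2 * math.pi * self.var)
                       + (x - self.mean) ** 2 / self.var)


def detect_speech_segments(frames: List[float],
                           speech: Gaussian,
                           non_speech: Gaussian) -> List[Tuple[int, int]]:
    """Return [start, end) frame ranges where the speech model scores
    higher than the non-speech model."""
    segments: List[Tuple[int, int]] = []
    start = None
    for i, x in enumerate(frames):
        is_speech = speech.log_pdf(x) > non_speech.log_pdf(x)
        if is_speech and start is None:
            start = i                      # a speech run begins
        elif not is_speech and start is not None:
            segments.append((start, i))    # the run ended at frame i
            start = None
    if start is not None:                  # run extends to the last frame
        segments.append((start, len(frames)))
    return segments


def speaker_probability(segment: List[float],
                        speaker: Gaussian,
                        background: Gaussian,
                        prior: float = 0.5) -> float:
    """Posterior probability that the segment involves the speaker,
    treating frames as independent (a simplifying assumption)."""
    log_s = sum(speaker.log_pdf(x) for x in segment) + math.log(prior)
    log_b = sum(background.log_pdf(x) for x in segment) + math.log(1 - prior)
    m = max(log_s, log_b)                  # subtract the max for stability
    num = math.exp(log_s - m)
    return num / (num + math.exp(log_b - m))


if __name__ == "__main__":
    frames = [0.1, 0.2, 5.0, 5.2, 4.9, 0.1]   # made-up feature values
    for start, end in detect_speech_segments(
            frames, Gaussian(5.0, 1.0), Gaussian(0.0, 1.0)):
        p = speaker_probability(
            frames[start:end], Gaussian(5.0, 0.5), Gaussian(3.0, 0.5))
        print(f"frames {start}-{end}: speaker probability {p:.3f}")
```

In a fuller system the single feature would likely be replaced by a vector of spectral features and the Gaussians by richer models; the control flow of detecting segments first and scoring each one against a speaker model would stay the same.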

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.