Patent · US Active

Systems and methods for manipulating electronic content based on speech recognition

US9311395B2 · kind B2 · utility

3Cited by
19References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJun 9, 2011
Grant dateApr 12, 2016
Priority date
Expiry dateAug 6, 2033

Classification

  • Technology area (CPC H)Electricity
  • CPC primaryH04N21/4668
  • WIPO fieldAudio-visual technology
  • WIPO sectorElectrical engineering

Abstract

Systems and methods are disclosed for displaying electronic multimedia content to a user. One computer-implemented method for manipulating electronic multimedia content includes generating, using a processor, a speech model and at least one speaker model of an individual speaker. The method further includes receiving electronic media content over a network; extracting an audio track from the electronic media content; and detecting speech segments within the electronic media content based on the speech model. The method further includes detecting a speaker segment within the electronic media content and calculating a probability of the detected speaker segment involving the individual speaker based on the at least one speaker model.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.