Patent · US Active

Visual and audio multimodal searching system

US12346386B2 · kind B2 · utility

0Cited by
5References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateApr 25, 2023
Grant dateJul 1, 2025
Priority date
Expiry dateJun 22, 2043

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/00
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A multimodal search system is described. The system can receive image data captured by a camera of a user device. Additionally, the system can receive audio data associated with the image data. The audio data can be captured by a microphone of the user device. Moreover, the system can process the image data to generate visual features. Furthermore, the system can process the audio data to generate a plurality of words. The system can generate a plurality of search terms based on the plurality of words and the visual features. Subsequently, the system can determine one or more search results associated with the plurality of search terms and provide the one or more search results as an output.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.