Patent · US Active

Multimodal fine-grained mixing method and system, device, and storage medium

US11436451B2 · kind B2 · utility

Cited by: 0
References: 0
Claims: 12
Family size: 0

Assignees

Inventors

Key dates

Filing date: Jan 17, 2022
Grant date: Sep 6, 2022
Priority date
Expiry date: Jan 17, 2042

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06V2201/10
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

The present disclosure provides a multimodal fine-grained mixing method and system, a device, and a storage medium. The method includes: extracting data features from multimodal graphic and textual data, and obtaining each composition of the data features, the data features including a visual regional feature and a text word feature; performing fine-grained classification on the modal information of each composition of the data features, to obtain classification results; and performing inter-modal and intra-modal information fusion on each composition according to the classification results, to obtain a fusion feature. The method enables a multimodal model to exploit the complementary characteristics of the multimodal data without being influenced by irrelevant information.
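The three steps in the abstract (feature extraction, fine-grained classification, classification-guided fusion) can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration and is not taken from the patent: the function names `classify`, `attend`, and `fuse`, the cosine-similarity threshold used as the classification rule, and the dot-product attention used for fusion are all hypothetical. Feature extraction is assumed to have already produced one vector per visual region and one per text word.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cosine(a, b):
    na, nb = math.sqrt(dot(a, a)), math.sqrt(dot(b, b))
    return dot(a, b) / (na * nb) if na and nb else 0.0

def attend(query, keys):
    """Softmax-weighted average of keys, scored by dot product with query."""
    if not keys:
        return [0.0] * len(query)
    scores = [dot(query, k) for k in keys]
    m = max(scores)  # subtract the max for numerical stability
    weights = [math.exp(s - m) for s in scores]
    total = sum(weights)
    return [sum(w * k[i] for w, k in zip(weights, keys)) / total
            for i in range(len(query))]

def classify(components, other, threshold=0.5):
    """Hypothetical classification rule (an assumption, not the patent's):
    a component is 'relevant' (True) if its cosine similarity to at least
    one component of the other modality reaches the threshold."""
    return [max((cosine(c, o) for o in other), default=0.0) >= threshold
            for c in components]

def fuse(regions, words, threshold=0.5):
    """Fuse visual regional features and text word features.
    Intra-modal: each component attends over its own modality.
    Inter-modal: each component attends only over the components of the
    other modality that were classified as relevant."""
    region_ok = classify(regions, words, threshold)
    word_ok = classify(words, regions, threshold)
    kept_regions = [r for r, ok in zip(regions, region_ok) if ok]
    kept_words = [w for w, ok in zip(words, word_ok) if ok]
    fused = []
    for own, kept_other in ((regions, kept_words), (words, kept_regions)):
        for c in own:
            intra = attend(c, own)
            inter = attend(c, kept_other)
            fused.append([a + b for a, b in zip(intra, inter)])
    return fused, region_ok, word_ok

# Toy input: two visual regions and one word, as 2-dimensional vectors.
regions = [[1, 0], [0, 1]]
words = [[1, 0]]
fused, region_ok, word_ok = fuse(regions, words)
print(region_ok)  # [True, False]: the second region matches no word
```

In this toy run the second region is classified as irrelevant, so the word's inter-modal fusion ignores it. A real system would presumably learn both the classifier and the fusion weights rather than use a fixed threshold.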

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.