Patent · US Active

Emoji sanitization for natural language model processing

US12387056B2 · kind B2 · utility

0Cited by
0References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 7, 2022
Grant dateAug 12, 2025
Priority date
Expiry dateOct 31, 2043

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/30
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

In some implementations, a device may obtain a natural language input including an emoji. The device may identify one or more appearance modifiers associated with the emoji. The device may generate a token associated with the emoji that removes the one or more appearance modifiers, wherein the token is associated with multiple emojis including the emoji, and wherein the token is a modified code associated with the emoji or is associated with a cluster that is associated with the multiple emojis. The device may provide, to a natural language processing (NLP) model, the token associated with the emoji. The device may obtain, from the NLP model, an output that indicates an interpretation of the natural language input based on providing the token to the NLP model.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.