Patent · US Active

Machine learning of written Latin-alphabet based languages via super-character

US10192148B1 · kind B1 · utility

1Cited by
7References
15Claims
0Family size

Assignee

Inventors

Key dates

Filing dateSep 18, 2018
Grant dateJan 29, 2019
Priority date
Expiry dateSep 18, 2038

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V30/10
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A string of Latin-alphabet based language texts is received and formed a multi-layer 2-D symbol in a computing system. The received string contains at least one word with each word containing at least one letter of the Latin-alphabet based language. 2-D symbol comprises a matrix of N×N pixels of data representing a super-character. The matrix is divided into M×M sub-matrices. Each sub-matrix represents one ideogram formed from the at least one letter contained in a corresponding word in the received string. Ideogram has a square format with a dimension EL letters by EL letters (i.e., row and column). EL is determined from the total number of letters (LL) contained in the corresponding word. EL, LL, N and M are positive integers. Super-character represents a meaning formed from a specific combination of at least one ideogram. Meaning of the super-character is learned with image classification of the 2-D symbol.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.