Natural language processing via a two-dimensional symbol having multiple ideograms contained therein
US10102453B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Sep 1, 2017 |
| Grant date | Oct 16, 2018 |
| Priority date | — |
| Expiry date | Sep 1, 2037 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06V30/293
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A string of natural language texts is received and formed a multi-layer 2-D symbol in a first computing system. The 2-D symbol comprises a matrix of N×N pixels of data representing a “super-character”. The matrix is divided into M×M sub-matrices with each sub-matrix containing (N/M)×(N/M) pixels. N and M are positive integers, and N is preferably a multiple of M. Each sub-matrix represents one ideogram defined in an ideogram collection set. “Super-character” represents a meaning formed from a specific combination of a plurality of ideograms. The meaning of the “super-character” is learned in a second computing system by using an image processing technique to classify the 2-D symbol, which is formed in the first computing system and transmitted to the second computing system. Image process technique includes predefining a set of categories and determining a probability for associating each of the predefined categories with the meaning of the “super-character”.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.