Artificial intelligence based image caption creation systems and methods thereof
US10713830B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | May 13, 2019 |
| Grant date | Jul 14, 2020 |
| Priority date | — |
| Expiry date | May 13, 2039 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06V20/70
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
An image and the maximum number of tokens for a to-be-created image caption are received in a computing system. Font size of graphical image of the token is calculated from the maximum number of tokens and the dimension of desired input image for prediction-style image classification technique. Desired input image is divided into first and second portions. A 2-D symbol is formed by placing a resized image derived from the received image with substantially similar contents in the first portion and by initializing the second portion with blank images. Next token of the image caption is predicted by classifying the 2-D symbol using the prediction-style image classification technique. 2-D symbol is modified by appending the graphical image of just-predicted token to the existing image caption in the second portion, if termination condition for image caption creation is false. Next token is repeatedly predicted until termination condition becomes true.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.