Image captioning with weak supervision
US9811765B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jan 13, 2016 |
| Grant date | Nov 7, 2017 |
| Priority date | — |
| Expiry date | Jan 13, 2036 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06V20/10
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Techniques for image captioning with weak supervision are described herein. In implementations, weak supervision data regarding a target image is obtained and utilized to provide detail information that supplements global image concepts derived for image captioning. Weak supervision data refers to noisy data that is not closely curated and may include errors. Given a target image, weak supervision data for visually similar images may be collected from sources of weakly annotated images, such as online social networks. Generally, images posted online include “weak” annotations in the form of tags, titles, labels, and short descriptions added by users. Weak supervision data for the target image is generated by extracting keywords for visually similar images discovered in the different sources. The keywords included in the weak supervision data are then employed to modulate weights applied for probabilistic classifications during image captioning analysis.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.