Patent · US Active

Method and apparatus for data structuring of text

US12033413B2 · kind B2 · utility

Cited by: 0
References: 1
Claims: 13
Family size: 0

Assignee

Inventors

Key dates

Filing date: Oct 14, 2021
Grant date: Jul 9, 2024
Priority date
Expiry date: Jul 12, 2042

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06V30/416
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Provided are a method and an apparatus for data structuring of text. The apparatus includes: a data extraction unit configured to extract text and location information of the text from an image using an optical character recognition (OCR) technique; a data processing unit configured to generate text units based on the text and the location information; a form classification unit configured to classify the form of the image based on the text; a labeling unit configured to label each text unit, based on the classified form, as first text, second text, or third text, corresponding respectively to an item name, an item value, or other content; a relationship identification unit configured to map each second text to its corresponding first text and structure the result; and a misrecognition correction unit configured to determine whether the first text has been misrecognized and to correct any first text determined to be misrecognized.
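
The stages named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `TextUnit` type, the `FORM_ITEM_NAMES` vocabularies, the fuzzy-match cutoff, and the same-row pairing rule are all assumptions made for illustration, and the OCR step is replaced by hand-written input that stands in for extracted text and coordinates.

```python
from dataclasses import dataclass
from difflib import get_close_matches


@dataclass
class TextUnit:
    """A piece of OCR text with the position of its top-left corner."""
    text: str
    x: float
    y: float


# Hypothetical item-name vocabularies per form type (assumption, not from the patent).
FORM_ITEM_NAMES = {
    "invoice": ["Invoice No", "Date", "Total"],
    "receipt": ["Store", "Date", "Amount"],
}
ROW_TOLERANCE = 10.0   # max vertical offset for two units to share a row (assumption)
MATCH_CUTOFF = 0.8     # similarity needed to treat text as a known item name (assumption)


def match_item_name(text, form):
    """Return the known item name closest to `text`, or None.

    This doubles as a toy misrecognition correction: "lnvoice No" is
    replaced by the vocabulary entry "Invoice No".
    """
    hits = get_close_matches(text, FORM_ITEM_NAMES[form], n=1, cutoff=MATCH_CUTOFF)
    return hits[0] if hits else None


def classify_form(units):
    """Pick the form whose item names match the most text units."""
    def score(form):
        return sum(1 for u in units if match_item_name(u.text, form))
    return max(FORM_ITEM_NAMES, key=score)


def structure(units):
    """Label units as item names, item values, or others, and pair names with values."""
    form = classify_form(units)
    names, rest = [], []
    for u in units:
        fixed = match_item_name(u.text, form)
        if fixed:
            names.append(TextUnit(fixed, u.x, u.y))  # first text (item name)
        else:
            rest.append(u)
    fields = {}
    for name in names:
        # Candidate values: units on the same row, to the right of the name.
        same_row = [v for v in rest
                    if abs(v.y - name.y) < ROW_TOLERANCE and v.x > name.x]
        if same_row:
            value = min(same_row, key=lambda v: v.x - name.x)
            fields[name.text] = value.text           # second text (item value)
            rest.remove(value)
    others = [u.text for u in rest]                  # third text (others)
    return form, fields, others


# Hand-written stand-in for OCR output; "lnvoice No" simulates a misread "I".
sample = [
    TextUnit("lnvoice No", 10, 10), TextUnit("A-1001", 120, 11),
    TextUnit("Total", 10, 40), TextUnit("$250.00", 120, 41),
    TextUnit("Thank you", 10, 80),
]
form, fields, others = structure(sample)
print(form, fields, others)
```

In this sketch, labeling and correction share one fuzzy match against a per-form vocabulary, which keeps the example short. A real system would likely treat them as separate steps, as the abstract's separate units suggest.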

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.