Patent · US Active

Full-text fuzzy search method for similar-form Chinese characters in ciphertext domain

US11537626B2 · kind B2 · utility

1Cited by
1References
5Claims
0Family size

Assignee

Inventors

Key dates

Filing dateNov 9, 2018
Grant dateDec 27, 2022
Priority date
Expiry dateMar 3, 2039

Classification

  • Technology area (CPC H)Electricity
  • CPC primaryH04L9/0894
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

The invention discloses a full-text fuzzy search method for similar-form Chinese characters in a ciphertext domain. The method realises a fuzzy search in the Chinese ciphertext domain based on a symmetric searchable encryption scheme and an inverted index structure, supports a fuzzy search on Chinese characters having similar glyphs in ciphertext status, ensures that searching results are ordered, and supports a multi-keyword logical connection fuzzy search. The present invention uses a distributed search engine Lucene and a Chinese word segmentator IKAnalyzer to perform full-text word segmentation on a document and constructs a plaintext inverted index comprising similar-form Chinese characters by means of the established similar-form character library of 3,755 commonly used Chinese characters. Considering the security of the inverted index structure, each keyword in the plaintext inverted index and its corresponding document number are constructed in an encrypted chain form, and a B+ tree structure is used to speed up the search. The invention realizes a fuzzy search in a Chinese full-text ciphertext domain in a semi-trusted cloud server without false detection and missed detecti…

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.