Problems of Chinese text lines in real examination paper text.soft erasure(a,b),nonuniform word size(b),hard erasure(c,d),noised background(b,f),diverse text length(a,c,d are short text,b,e,f are long text),dense long texts(e,f)

Problems of Chinese text lines in real examination paper text.soft erasure(a,b),nonuniform word size(b),hard erasure(c,d),noised background(b,f),diverse text length(a,c,d are short text,b,e,f are long text),dense long texts(e,f)

Source publication
Preprint
Full-text available
It happens in the examination paper that text lines include inconsistent nonuniform word size, character erasure, diverse text length and dense long texts. This paper proposes an improved method for ViT to enhance its capability in recognizing text lines in handwritten Chinese examination papers. First, this method employs a segmentation method sui...

Context in source publication

Context 1
... writing styles, challenges associated with character segmentation, large character sets, and complex semantics. In examination paper text, handwritten text recognition faces more complex situations, such as nonuniform word size, character erasure, noisy background, diverse text length, and dense long texts. These problems are visually depicted in Fig. ...

Similar publications

Preprint
Full-text available
Adapting pre-trained models to open classes is a challenging problem in machine learning. Vision-language models fully explore the knowledge of text modality, demonstrating strong zero-shot recognition performance, which is naturally suited for various open-set problems. More recently, some research focuses on fine-tuning such models to downstream...