Handwritten Text Generation with Character-specific Encoding for Style Imitation

Aug 1, 2023·
Jan Zdenek
Jan Zdenek
,
Hideki Nakayama
· 0 min read
Abstract
We propose a novel method for handwritten text generation that uses a style encoder based on a vision transformer network. The style encoder encodes handwriting style from reference images and allows the generator to imitate it. The encoder learns to disentangle style information from the content by learning to recognize who wrote the text. The self-attention mechanism in the encoder allows us to produce character-specific encodings by using characters in the target sequence as queries. Our method can generate handwritten text images in random styles by sampling random latent vectors. In comparison with existing methods, our method outperforms them for handwritten text generation in terms of the quality of generated images and their fidelity with respect to the distribution of real images. Our method also achieves significantly better performance at imitating handwriting styles defined by reference images. Furthermore, the model generalizes well to unseen data and can generate handwritten images of words and character sequences as well as imitate handwriting styles not included in the training data.
Type
Publication
17th International Conference on Document Analysis and Recognition (ICDAR 2023)
publications
Jan Zdenek
Authors
Research Scientist
Jan is a research scientist at CyberAgent, where he works on artificial intelligence and computer vision with a focus on image generation and editing. He received his PhD in Information Science and Technology from the University of Tokyo, where his research centered on image generation. Prior to that, he received his Master’s degree in Creative Informatics from the University of Tokyo, and his Bachelor’s degree in Computer and Information Science from the Czech Technical University in Prague. Born and raised in the Czech Republic, he currently works in Japan.