JOURNAL ARTICLE

Multi-digit image synthesis using recurrent conditional variational autoencoder

Abstract

In the field of deep neural networks, several generative methods have been proposed to address the challenges from generative and discriminative tasks, e.g., natural language process, image caption and image generation. In this paper, a conditional recurrent variational autoencoder is proposed for multi-digit image synthesis. This model is capable of generating multi-digit images from the given number sequences and retaining the generalisation ability to recover different types of background. Our method is evaluated on SVHN dataset and the experimental results show it succeeds to generate multi-digit images with various styles according to the given sequential inputs. The generated images can also be easily identified by both human beings and convolutional neural networks for digit classification.

Keywords:
Autoencoder Numerical digit Discriminative model Computer science Artificial intelligence Image (mathematics) Digit recognition Generative grammar Pattern recognition (psychology) Convolutional neural network Conditional random field Artificial neural network Process (computing) Deep learning Generative model Field (mathematics) Mathematics Arithmetic

Metrics

5
Cited By
0.33
FWCI (Field Weighted Citation Impact)
39
Refs
0.70
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Generative Adversarial Networks and Image Synthesis
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.