A Deep Learning Approach to Automatic Caption Generation for News Images

Vishwash Batra, Yulan He and George Vogiatzis

Aston Publications Explorer (Aston University) · 2019

Abstract

Automatic caption generation for images has gained significant interest, as it gives rise to many interesting image-related applications. For example, it could help in image/video retrieval and in managing the vast amount of multimedia data available on the Internet. It could also help in the development of tools that aid visually impaired individuals in accessing multimedia content. In this paper, we focus on news images and propose a methodology for automatically generating captions for newspaper articles consisting of a text paragraph and an image. We propose several deep neural network architectures built upon Recurrent Neural Networks. Results on a BBC News dataset show that our proposed approach outperforms a traditional method based on Latent Dirichlet Allocation in both automatic evaluation based on BLEU scores and human evaluation.

Citation

Vishwash Batra, Yulan He and George Vogiatzis. “A Deep Learning Approach to Automatic Caption Generation for News Images.” Aston Publications Explorer (Aston University). 2019.

BibTeX
@article{batra2019,
  title     = {A Deep Learning Approach to Automatic Caption Generation for News Images},
  author    = {Vishwash Batra and Yulan He and George Vogiatzis},
  journal   = {Aston Publications Explorer (Aston University)},
  year      = {2019},
}
