Development of an Image caption generator using AI and Image Processing

Divya Pathrabe*
Periodicity:October - December'2025

Abstract

Image caption generation is a challenging task at   the intersection of computer vision and natural language processing that aims to generate meaningful captions for a given image. This paper proposes an image caption generator that will accept an image as an input and generate an English sentence as output by labeling the image’s content . The system takes the pre-trained deep learning Convolutional Neural Network (CNN) architecture model that extracts high-level visual features from input images, which are then processed by the LSTM to generate coherent and contextually relevant captions. he model is trained on the Flickr8K dataset, ensuring diverse and comprehensive caption generation . Evaluation of model is done using standard metrics such as BLEU and METEOR scores to assess the accuracy and fluency of generated captions.

Keywords

Deep Learning, CNN, RNN, LSTM, Image Captioning, Flikr8k Dataset, BLEU Score

How to Cite this Article?

References

If you have access to this article please login to view the article or kindly login to purchase the article

Purchase Instant Access

Single Article

North Americas,UK,
Middle East,Europe
India Rest of world
USD EUR INR USD-ROW
Pdf 35 35 200 20
Online 15 15 200 15
Pdf & Online 35 35 400 25

Options for accessing this content:
  • If you would like institutional access to this content, please recommend the title to your librarian.
    Library Recommendation Form
  • If you already have i-manager's user account: Login above and proceed to purchase the article.
  • New Users: Please register, then proceed to purchase the article.