Skip navigation


Please use this identifier to cite or link to this item: http://localhost:8080/xmlui/handle/123456789/3399
Title: Image Captioning in Real Time
Authors: Ankit, Patil
Karishma, Saudagar
Atul, Maharnawar
Tejas, Rangatwan
Priyadarshini, I.
Keywords: CNN
LSTM (Long Short Term Memory)
RNN
Xception
Issue Date: 12-Dec-2021
Abstract: The current development in Deep Learning based Machine Translation and Computer Vision have led to incredible Image Captioning models using advanced techniques like Deep Learning. Even if these models are very accurate, they often rely on the use of exorbitant computation hardware making it problematic to apply these models in real-time scenarios, where their actual uses can be noticed. This model uses a hybrid CN-NRNN model, where the CNN part of the model system uses the Xception model for transfer learning, and RNNs are widely used in language modeling. The Flickr8k dataset is used for real-time training and testing. RNN’s LSTM model is used to avoid problems with extinction or gradient explosion during the training phase.
URI: http://192.168.3.232:8080/jspui/handle/123456789/3399
Appears in Collections:Computer

Files in This Item:
File Description SizeFormat 
PID 16 Real Time Image Caption Generator -Formatted Paper.pdf441.75 kBUnknownView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.