
Please use this identifier to cite or link to this item:
http://localhost:8080/xmlui/handle/123456789/3399| Title: | Image Captioning in Real Time |
| Authors: | Ankit, Patil Karishma, Saudagar Atul, Maharnawar Tejas, Rangatwan Priyadarshini, I. |
| Keywords: | CNN LSTM (Long Short Term Memory) RNN Xception |
| Issue Date: | 12-Dec-2021 |
| Abstract: | The current development in Deep Learning based Machine Translation and Computer Vision have led to incredible Image Captioning models using advanced techniques like Deep Learning. Even if these models are very accurate, they often rely on the use of exorbitant computation hardware making it problematic to apply these models in real-time scenarios, where their actual uses can be noticed. This model uses a hybrid CN-NRNN model, where the CNN part of the model system uses the Xception model for transfer learning, and RNNs are widely used in language modeling. The Flickr8k dataset is used for real-time training and testing. RNN’s LSTM model is used to avoid problems with extinction or gradient explosion during the training phase. |
| URI: | http://192.168.3.232:8080/jspui/handle/123456789/3399 |
| Appears in Collections: | Computer |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| PID 16 Real Time Image Caption Generator -Formatted Paper.pdf | 441.75 kB | Unknown | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.