DESIGN AND TRAIN CAPTION FROM IMAGES BYUSING DEEP LEARNING APPROACH
Abstract
Abstract: Picture subscription is an enterprise which involves both visual and linguistic understanding. To build words inhuman languages, image models have to interpret visual content. The focus approach has been used extensive in imagesubtitles because it can give deeper sequential model training with more accurate picture information. It is a crucial andtough challenge to use natural languages to describe the image content automatically. It has several possibilities. Forinstance, it might help understand the substance of the image. It can also provide higher precision and succinct pictureinformation in cases such as picture sharing in social network platforms. In this study, deep neural networks are employed toachieve this goal. A convolutionary neural network (CNN) is utilized to extract vectors from real time video (image frames)and an LSTM network for generating substitutes from these vectors is employed. The Flickr 8K dataset is the most often useddataset to assess the model. It consist of over 8k photographs. The methodology creates picture captions which are generatedby gathering information from pairs of images and captions.Keywords: Convolutional Neural Networks (CNN); Long Short-Term Memory networks (LSTM); Recurrent neural networks(RNN); Deep neural networks (DNNs).
Full Text PDF
IMPORTANT DATES
Submit paper at ijasret@gmail.com
Paper Submission Open For |
October 2024 |
UGC indexed in (Old UGC) |
2017 |
Last date for paper submission |
30th October, 2024 |
Deadline |
Submit Paper any time |
Publication of Paper |
Within 15-30 Days after completing all the formalities |
Publication Fees |
Rs.6000 (UG student) |
Publication Fees |
Rs.8000 (PG student)
|
|