Image Captioning using CNN ,LSTM and Attention

Last update: Dec 16, 2021

Related tags

Deep Learning imagecaptioningproject

Overview

Image Captioning using CNN ,LSTM and Attention

This is a deeplearning model which tries to summarize an image into a text .

Installation

Install this project with pip3. Use python version 3.7

  pip3 install -R requirements.txt
  python3 app.py

these commands are applicable if you want to try the website in localhost.

you can also install docker and build an image from the docker file and run it.

  docker build -f Dockerfile -t imagecaptioning:api .
  docker run -p 8080:8080 -ti imagecaptioning

Deployment

To deploy this project in google cloud app engine . First create an project in app engine. Install google SDK to push ptojects into your local machine then run the following commands.

  gcloud init
  gcloud app deploy

choose the right project and then push the application to the cloud. This is an monolithic application so a single docker image is complied on the app engine.

Demo

link to demo-https://lucky-dahlia-333406.el.r.appspot.com/index

FAQ

why is this project implimented in tensorflow ?

Tensorflow is actively maintained by google and is very convenient to deploy on a server .It automatically switches to gpu while training if it finds one.

what is BELU score ?

BLEU, or the Bilingual Evaluation Understudy, is a score for comparing a candidate translation of text to one or more reference translations.Although developed for translation, it can be used to evaluate text generated for a suite of natural language processing tasks.

In this project, you will discover the BLEU score for evaluating and scoring candidate text using the NLTK library in Python.

Authors

License

MIT

Image Captioning using CNN ,LSTM and Attention

Related tags

Overview

Image Captioning using CNN ,LSTM and Attention

Installation

Deployment

Demo

FAQ

why is this project implimented in tensorflow ?

what is BELU score ?

Authors

License

Owner

ASUTOSH GHANTO

Codes for building and training the neural network model described in Domain-informed neural networks for interaction localization within astroparticle experiments.

PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INTERSPEECH 2020)

Image-retrieval-baseline - MUGE Multimodal Retrieval Baseline

The Self-Supervised Learner can be used to train a classifier with fewer labeled examples needed using self-supervised learning.

3ds-Ghidra-Scripts - Ghidra scripts to help with 3ds reverse engineering

Natural Posterior Network: Deep Bayesian Predictive Uncertainty for Exponential Family Distributions

Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)

Second Order Optimization and Curvature Estimation with K-FAC in JAX.

AI Virtual Calculator: This is a simple virtual calculator based on Artificial intelligence.

Codes for [NeurIPS'21] You are caught stealing my winning lottery ticket! Making a lottery ticket claim its ownership.

BackgroundRemover lets you Remove Background from images and video with a simple command line interface

This is the repo for our work "Towards Persona-Based Empathetic Conversational Models" (EMNLP 2020)

This is a package for LiDARTag, described in paper: LiDARTag: A Real-Time Fiducial Tag System for Point Clouds

Provably Rare Gem Miner.

Code release for "Self-Tuning for Data-Efficient Deep Learning" (ICML 2021)

Source code of the paper "Deep Learning of Latent Variable Models for Industrial Process Monitoring".

This is the official pytorch implementation for the paper: Instance Similarity Learning for Unsupervised Feature Representation.

Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021

Codebase of deep learning models for inferring stability of mRNA molecules

This repository contains the implementation of the HealthGen model, a generative model to synthesize realistic EHR time series data with missingness