A web API for automatic background removal using deep learning. The app is built with Flask and deployed on Heroku.

Last update: Oct 29, 2022

Overview

Automatic_Background_Remover

Here is a quick look at it.

Model Details:

CNN architecture - U-Net with residual connections
Parameters - 8.9M
Trained on - 64,115 images
Validated on - 2,693 images
batch_size = 32
img_size = (256, 256)
Trained for - 13 epochs
Training time - 80 min/epoch on GPUs provided by Google Colab
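
As a quick sanity check on the figures above, the number of batches per epoch and the approximate total training time can be worked out directly. This is only arithmetic on the listed numbers, not part of the project code; it assumes the final, smaller batch of each epoch is kept rather than dropped, which is not stated above.

```python
import math

# Figures taken from the model details above.
train_images = 64_115
val_images = 2_693
batch_size = 32
epochs = 13
minutes_per_epoch = 80

# Assumption: the last, smaller batch of each epoch is kept (not dropped).
train_steps = math.ceil(train_images / batch_size)
val_steps = math.ceil(val_images / batch_size)

# Total wall-clock time if every epoch takes roughly the same 80 minutes.
total_minutes = epochs * minutes_per_epoch
hours, minutes = divmod(total_minutes, 60)

print(f"training batches per epoch:   {train_steps}")
print(f"validation batches per epoch: {val_steps}")
print(f"approximate total training time: {hours} h {minutes} min")
```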

Datasets used for training:

COCO - https://cocodataset.org/#home

The model is a modified version of the U-Net architecture (https://arxiv.org/abs/1505.04597), first presented by Olaf Ronneberger, Philipp Fischer, and Thomas Brox in 2015. I added residual skip connections to the U-Net model, which makes it more robust.

I can't include the model architecture diagram here because of its huge size. View it here.
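
The post-processing step is not described here, so the following is only a minimal sketch of how a predicted mask might be turned into a background-removed image. It assumes the model outputs one foreground probability per pixel; the function name `remove_background` and the 0.5 threshold are hypothetical choices for illustration. Plain nested lists keep the example dependency-free, whereas a real pipeline would more likely use array operations.

```python
from typing import List, Tuple

RGB = Tuple[int, int, int]
RGBA = Tuple[int, int, int, int]


def remove_background(
    image: List[List[RGB]],
    mask: List[List[float]],
    threshold: float = 0.5,  # hypothetical cut-off, not taken from the project
) -> List[List[RGBA]]:
    """Return an RGBA copy of `image` with background pixels made transparent.

    `mask` holds one foreground probability per pixel (assumed model output).
    """
    if len(image) != len(mask) or any(
        len(pixel_row) != len(mask_row) for pixel_row, mask_row in zip(image, mask)
    ):
        raise ValueError("image and mask must have the same height and width")

    result = []
    for pixel_row, mask_row in zip(image, mask):
        out_row = []
        for (r, g, b), prob in zip(pixel_row, mask_row):
            # Foreground pixels stay opaque; background pixels become transparent.
            alpha = 255 if prob >= threshold else 0
            out_row.append((r, g, b, alpha))
        result.append(out_row)
    return result


# Tiny 2x2 example with a made-up mask.
image = [[(10, 20, 30), (40, 50, 60)],
         [(70, 80, 90), (100, 110, 120)]]
mask = [[0.9, 0.2],
        [0.4, 0.7]]
cutout = remove_background(image, mask)
```

In this toy example the top-left and bottom-right pixels are kept and the other two become fully transparent.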

Result:

Training loss - 0.038
Validation loss - 0.056

Training accuracy - 0.935
Validation accuracy - 0.907
Training mean IoU - 0.817
Validation mean IoU - 0.787
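
For readers unfamiliar with the metric, mean IoU (intersection over union) can be sketched as below. How the reported values were computed is not stated, so this assumes the common definition: IoU is calculated separately for the background and foreground classes and then averaged. The function `mean_iou` is illustrative, not the project's code.

```python
from typing import Sequence


def mean_iou(y_true: Sequence[int], y_pred: Sequence[int], num_classes: int = 2) -> float:
    """Average per-class IoU over flattened label masks.

    Classes absent from both masks are skipped so they do not distort the mean.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("masks must have the same number of pixels")

    ious = []
    for cls in range(num_classes):
        # Pixels where both masks agree on this class.
        intersection = sum(1 for t, p in zip(y_true, y_pred) if t == cls and p == cls)
        # Pixels where at least one mask assigns this class.
        union = sum(1 for t, p in zip(y_true, y_pred) if t == cls or p == cls)
        if union > 0:
            ious.append(intersection / union)
    return sum(ious) / len(ious) if ious else 0.0


# Flattened 2x3 masks: 1 = foreground, 0 = background.
truth = [1, 1, 0, 0, 1, 0]
pred = [1, 0, 0, 0, 1, 1]
score = mean_iou(truth, pred)
```

Unlike pixel accuracy, this metric penalizes a model that labels nearly everything as the dominant class, which is one reason it is often reported alongside accuracy for segmentation.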

Owner

Gaurav
