Auto-Lama combines object detection and image inpainting to automate object removals

Last update: Dec 09, 2022

Related tags

Overview

Auto-Lama

Auto-Lama combines object detection and image inpainting to automate object removals. It is build on top of DE:TR from Facebook Research and Lama from Samsung Research. The entire process is extremely simple:

Objects are detected using the detector.
Masks are generated based on the bounding boxes drawn by the detector.
The original image is sent to the inpainter along with the masks.

Demo

Masking

There are currently a few ways of generating masks:

Masking objects with specified indices.
Masking one main object at a time.
Masking all other objects other than the main object.

Future Goals

Use a more precise segmentation method other than bounding boxes
Implementing a detector that has more

Environment Setup

Prerequisites

docker
make
conda

Building Environment

make build-conda-env
conda activate auto-lama
make build-env

Cleaning Directory

make clean

Detect and Inpaint

Setup

The default config for the detector is

PARAMETERS = {
    "model_name": "facebook/detr-resnet-50",
    "threshold": 0.9,
    "max_items": 10,
    "save_destination": "./test_images",
    "output_destination": "./output_images",
    "max_width": 2000,
    "max_height": 2000,
    "resize": True,
    "resize_scale": 0.75,
    "excluded_objects": [91],
    "image_format": "PNG",
    "mask_target_items": [],
}

Please reference here for the target items that you want to mask, as the default DE:TR uses the COCO Dataset,

Run

make detect_and_inpaint IMAGE_PATH=path/to/image or make detect_and_inpaint IMAGE_PATH={image_url}

Auto-Lama combines object detection and image inpainting to automate object removals

Related tags

Overview

Auto-Lama

Demo

Masking

Future Goals

Environment Setup

Prerequisites

Building Environment

Cleaning Directory

Detect and Inpaint

Setup

Run

Owner

Moment-DETR code and QVHighlights dataset

Geometric Deep Learning Extension Library for PyTorch

OCR Streamlit App is used to extract text from images using python's easyocr, pytorch and streamlit packages

To prepare an image processing model to classify the type of disaster based on the image dataset

[IEEE Transactions on Computational Imaging] Self-Gated Memory Recurrent Network for Efficient Scalable HDR Deghosting

Synthesize photos from PhotoDNA using machine learning 🌱

Source code for CAST - Crisis Domain Adaptation Using Sequence-to-sequence Transformers (Accepted to ISCRAM 2021, CorePaper).

Finding all things on-prem Microsoft for password spraying and enumeration.

Jittor Medical Segmentation Lib -- The assignment of Pattern Recognition course (2021 Spring) in Tsinghua University

A Python framework for developing parallelized Computational Fluid Dynamics software to solve the hyperbolic 2D Euler equations on distributed, multi-block structured grids.

Repository of the paper Compressing Sensor Data for Remote Assistance of Autonomous Vehicles using Deep Generative Models at ML4AD @ NeurIPS 2021.

A PyTorch implementation of "Multi-Scale Contrastive Siamese Networks for Self-Supervised Graph Representation Learning", IJCAI-21

Classification Modeling: Probability of Default

Graph Neural Networks with Keras and Tensorflow 2.

The official repo of the CVPR2021 oral paper: Representative Batch Normalization with Feature Calibration

BEGAN in PyTorch

[EMNLP 2021] Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-Training

MixText: Linguistically-Informed Interpolation of Hidden Space for Semi-Supervised Text Classification

[ICCV 2021] Focal Frequency Loss for Image Reconstruction and Synthesis

Split Variational AutoEncoder