Auto-Lama combines object detection and image inpainting to automate object removals

Last update: Dec 09, 2022

Related tags

Overview

Auto-Lama

Auto-Lama combines object detection and image inpainting to automate object removals. It is build on top of DE:TR from Facebook Research and Lama from Samsung Research. The entire process is extremely simple:

Objects are detected using the detector.
Masks are generated based on the bounding boxes drawn by the detector.
The original image is sent to the inpainter along with the masks.

Demo

Masking

There are currently a few ways of generating masks:

Masking objects with specified indices.
Masking one main object at a time.
Masking all other objects other than the main object.

Future Goals

Use a more precise segmentation method other than bounding boxes
Implementing a detector that has more

Environment Setup

Prerequisites

docker
make
conda

Building Environment

make build-conda-env
conda activate auto-lama
make build-env

Cleaning Directory

make clean

Detect and Inpaint

Setup

The default config for the detector is

PARAMETERS = {
    "model_name": "facebook/detr-resnet-50",
    "threshold": 0.9,
    "max_items": 10,
    "save_destination": "./test_images",
    "output_destination": "./output_images",
    "max_width": 2000,
    "max_height": 2000,
    "resize": True,
    "resize_scale": 0.75,
    "excluded_objects": [91],
    "image_format": "PNG",
    "mask_target_items": [],
}

Please reference here for the target items that you want to mask, as the default DE:TR uses the COCO Dataset,

Run

make detect_and_inpaint IMAGE_PATH=path/to/image or make detect_and_inpaint IMAGE_PATH={image_url}

Auto-Lama combines object detection and image inpainting to automate object removals

Related tags

Overview

Auto-Lama

Demo

Masking

Future Goals

Environment Setup

Prerequisites

Building Environment

Cleaning Directory

Detect and Inpaint

Setup

Run

Owner

LEDNet: A Lightweight Encoder-Decoder Network for Real-time Semantic Segmentation

Supplementary code for SIGGRAPH 2021 paper: Discovering Diverse Athletic Jumping Strategies

Distributional Sliced-Wasserstein distance code

Official Tensorflow implementation of U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation (ICLR 2020)

Adaptable tools to make reinforcement learning and evolutionary computation algorithms.

Code for ECCV 2020 paper "Contacts and Human Dynamics from Monocular Video".

Systemic Evolutionary Chemical Space Exploration for Drug Discovery

A Jupyter notebook to play with NVIDIA's StyleGAN3 and OpenAI's CLIP for a text-based guided image generation.

This is the accompanying toolbox for the paper "A Survey on GANs for Anomaly Detection"

Automated image registration. Registrationimation was too much of a mouthful.

Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)

An essential implementation of BYOL in PyTorch + PyTorch Lightning

A motion tracking system for any arbitaray points in a video frame.

Power Core Simulator!

The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to provide participants with baseline systems for speech recognition and speaker diarization in conference scenario.

Long Expressive Memory (LEM)

NeoPlay is the project dedicated to ESport events.

Repositório para arquivos sobre o Módulo 1 do curso Top Coders da Let's Code + Safra

[EMNLP 2020] Keep CALM and Explore: Language Models for Action Generation in Text-based Games

Distributed Evolutionary Algorithms in Python