This repository contains a CBIR system that uses swin transformer to extract image's feature.

Last update: Nov 17, 2022

Related tags

Overview

Swin-transformer based CBIR

This repository contains a CBIR(content-based image retrieval) system. Here we use Swin-transformer to extract query image's feature, and retrieve similar ones from image database. Notably, our program achieves intelligent user interaction, including selecting an image by opening explorer dialog and cropping interested region by drafting mouse.

Structure

SWIN_CBIR/
|-- checkpoints/
|
|-- database/
|   |-- data/
|   |   |-- 1.jpg
|   |   |-- 2.jpg
|   |  
|   |-- DB.npz
|   |-- index.txt
|
|-- models/
|   |-- __init__.py
|   |-- build.py
|   |-- swin_transformer.py
|
|-- scripts/
|   |-- generate_DB.sh
|
|-- test/
|
|-- config.py
|-- database.py
|-- generate_DB.py
|-- main.py
|-- requirements.txt
|-- README

Getting Started

Prepare images database

Just find out some images and put them into database/data/.
run ./script/generate_DB.sh in linux machine to extract features of all images and package them into DB.npz.
run main.py, open an image and select interested region, then program will find similar images in database automatically!

Results

Here we show two image retrieval results. Two images in the first row are original image and cropped image respectively while the others are retrieval results (have been sorted by similarity).

Note: all images are resize to square for visual requirement, so there would be distorted in some of the images.

Acknowledgments

Part of code in this repository are copied from Swin-transformer, thank the authors for their exquiste code.

This repository contains a CBIR system that uses swin transformer to extract image's feature.

Related tags

Overview

Swin-transformer based CBIR

Structure

Getting Started

Results

Acknowledgments

Owner

JsHou

Code for "Unsupervised Layered Image Decomposition into Object Prototypes" paper

An Unpaired Sketch-to-Photo Translation Model

Face Mask Detection on Image and Video using tensorflow and keras

Official code release for "Learned Spatial Representations for Few-shot Talking-Head Synthesis" ICCV 2021

Hand tracking demo for DIY Smart Glasses with a remote computer doing the work

Generate vibrant and detailed images using only text.

Code and project page for ICCV 2021 paper "DisUnknown: Distilling Unknown Factors for Disentanglement Learning"

FS-Mol: A Few-Shot Learning Dataset of Molecules

U-Net for GBM

Meta graph convolutional neural network-assisted resilient swarm communications

A library for implementing Decentralized Graph Neural Network algorithms.

PyTorch implementation for 3D human pose estimation

PyTorch implementation of DCT fast weight RNNs

[ICCV 2021] Encoder-decoder with Multi-level Attention for 3D Human Shape and Pose Estimation

🎯 A comprehensive gradient-free optimization framework written in Python

Code for "Diversity can be Transferred: Output Diversification for White- and Black-box Attacks"

Toolchain to build Yoshi's Island from source code

A texturizer that I just made. Nothing special here.

RIFE: Real-Time Intermediate Flow Estimation for Video Frame Interpolation

fcn by tensorflow