H&M Fashion Image similarity search with Weaviate and DocArray

This example shows how to do image similarity search using DocArray, with Weaviate as the Document Store.

How to use the notebook

This repository includes sample data, but you can download the full dataset from Kaggle (see below).

Before you start using the notebook, you need to start a Weaviate instance by running docker compose up. Weaviate will then be running on http://localhost:8080. Alternatively, you can start a Weaviate instance for free with WCS (Weaviate Cloud Service). Make sure you adapt the Weaviate server address in the notebook accordingly.
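Before opening the notebook, it can help to confirm that the instance is actually reachable. The following is a minimal sketch, not part of this repository: it polls Weaviate's readiness endpoint (/v1/.well-known/ready) using only the standard library. The helper names ready_url and weaviate_is_ready are hypothetical, and the default address assumes the docker compose setup described above.

```python
import urllib.request


def ready_url(base_url: str) -> str:
    """Build the readiness-probe URL for a Weaviate base URL."""
    return base_url.rstrip("/") + "/v1/.well-known/ready"


def weaviate_is_ready(base_url: str = "http://localhost:8080",
                      timeout: float = 2.0) -> bool:
    """Return True if the instance answers the readiness probe with HTTP 200."""
    try:
        with urllib.request.urlopen(ready_url(base_url), timeout=timeout) as response:
            return response.status == 200
    except OSError:
        # Covers connection refused, timeouts, DNS failures and HTTP error
        # statuses (urllib's URLError and HTTPError both derive from OSError).
        return False
```

Calling weaviate_is_ready() with no arguments checks the local instance; for a WCS instance, pass that instance's URL instead.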

How to download data (optional - sample data included in this repository)

You need to download the fashion image data from the H&M dataset on Kaggle. You can download it and put it in the right folder using the Kaggle CLI:

$ mkdir data
$ cd data
$ kaggle competitions download -c h-and-m-personalized-fashion-recommendations
$ unzip h-and-m-personalized-fashion-recommendations.zip

Optional: you can use resize_image.py to downscale the images before using them in the notebook.
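To illustrate what downscaling involves, here is a minimal sketch that is not the contents of resize_image.py. It assumes a hypothetical target of 224 pixels on the longest side and uses Pillow for the actual resize; both the function names and the target size are illustrative choices, so check the script itself for the real behaviour.

```python
def downscaled_size(width: int, height: int, max_side: int = 224) -> tuple:
    """Return (width, height) scaled so the longest side is at most max_side."""
    longest = max(width, height)
    if longest <= max_side:
        # Already small enough: never upscale.
        return (width, height)
    new_width = max(1, round(width * max_side / longest))
    new_height = max(1, round(height * max_side / longest))
    return (new_width, new_height)


def resize_file(src_path: str, dst_path: str, max_side: int = 224) -> None:
    """Downscale one image file, preserving its aspect ratio (requires Pillow)."""
    # Imported here so the size helper above works without Pillow installed.
    from PIL import Image

    with Image.open(src_path) as img:
        resized = img.resize(downscaled_size(img.width, img.height, max_side))
        resized.save(dst_path)
```

Smaller images load and embed faster, at the cost of some detail in the resulting embeddings.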

Make sure to adapt the file location in the notebook.

Install requirements

The requirements will be installed in the first cell of the notebook. Alternatively, you can run pip install -r requirements.txt.

Embed, store and query

You can run the Jupyter notebook to embed, store and query fashion image data using ResNet50, DocArray and Weaviate.
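Conceptually, the query step ranks stored image embeddings by their similarity to the embedding of a query image. The sketch below shows that idea in plain Python with cosine similarity. It is a stand-in for illustration only: it does not use the DocArray or Weaviate APIs, and the three-dimensional toy vectors and item names are made up (real ResNet50 embeddings have far more dimensions).

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def most_similar(query, store, limit=2):
    """Return the names of the `limit` stored vectors closest to `query`."""
    ranked = sorted(
        store,
        key=lambda name: cosine_similarity(query, store[name]),
        reverse=True,
    )
    return ranked[:limit]


# Hypothetical toy embeddings standing in for the stored image vectors.
TOY_STORE = {
    "shirt_a": [1.0, 0.0, 0.0],
    "shirt_b": [0.9, 0.1, 0.0],
    "shoe": [0.0, 0.0, 1.0],
}
TOY_QUERY = [1.0, 0.05, 0.0]

print(most_similar(TOY_QUERY, TOY_STORE))
```

In the notebook, this ranking is delegated to Weaviate, which uses a vector index instead of comparing the query against every stored vector.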

Owner

Laura Ham
