A criticism of a recent paper on buggy image downsampling methods in popular image processing and deep learning libraries.

Last update: Jul 12, 2022

Related tags

Deep Learning buggy-resizing-critique

Overview

A Criticism of the Paper On Buggy Resizing Libraries

This repository contains:

a Jupyter notebook for reproducing the aliased image downsampling fenomenon, as demonstrated in the On Buggy Resizing Libraries paper, which argues that the image downsampling methods of the OpenCV, Tensorflow and PyTorch libraries are "buggy", with only PIL being correct.
simple solutions for antialiasing in every framework, which solves the issue in all cases using the same functions, simply by setting parameters appropriately:
- OpenCV: change the interpolation from bilinear to area (from cv2.INTER_LINEAR to cv2.INTER_AREA)
- Tensorflow: set the antialias flag to True
- PyTorch: change the interpolation mode from bilinear to area, or simply use torchvision.transforms.Resize() instead of torch.nn.functional.interpolate()

Try it out in a Colab Notebook:

My opinion:

neither of the used image downsampling methods is "buggy", not applying antialiasing by default is an understandable design decision for both image and tensor operations.
the main figure of the paper is misleading, and it only illustrates the issues of aliasing for image resizing.
the aliasing issue with downsampling can be solved in all frameworks by simply setting a few parameters correctly. My criticism is that this is not mentioned in the paper.
torchvision.transforms.Resize() is claimed to only be a "a wrapper around the PIL library" in a note in Section 3.2 of the paper. This is true for PIL image inputs, but is incorrect for torch.Tensors, which are resized using torchvision interpolation operations.
the remaining parts of the paper provide valuable insights into the effects of interpolation methods, quantization and compression on the FID score of generative models.

Update: Just found out that there is another, very thorough investigation of the same issue. Highly recommend checking the blogpost out. They also implement an OpenCV-compatible Pillow-equivalent resizing that provides proper antialiasing for all interpolations.

Bilinear downsampling results with and without aliasing:

The main figure (Figure 1) of the paper:

A criticism of a recent paper on buggy image downsampling methods in popular image processing and deep learning libraries.

Related tags

Overview

A Criticism of the Paper On Buggy Resizing Libraries

Owner

Object tracking using YOLO and a tracker(KCF, MOSSE, CSRT) in openCV

Step by Step on how to create an vision recognition model using LOBE.ai, export the model and run the model in an Azure Function

This git repo contains the implementation of my ML project on Heart Disease Prediction

Pytorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.

Sentiment analysis translations of the Bhagavad Gita

Experiments for Neural Flows paper

Official repo for BMVC2021 paper ASFormer: Transformer for Action Segmentation

shufflev2-yolov5：lighter, faster and easier to deploy

Exploiting a Zoo of Checkpoints for Unseen Tasks

[SDM 2022] Towards Similarity-Aware Time-Series Classification

Fast and robust clustering of point clouds generated with a Velodyne sensor.

Deep Learning and Logical Reasoning from Data and Knowledge

Official Repsoitory for "Activate or Not: Learning Customized Activation." [CVPR 2021]

Trustworthy AI related projects

Code for Deterministic Neural Networks with Appropriate Inductive Biases Capture Epistemic and Aleatoric Uncertainty

Source code for the paper: Variance-Aware Machine Translation Test Sets (NeurIPS 2021 Datasets and Benchmarks Track)

PyTorch implementation for Partially View-aligned Representation Learning with Noise-robust Contrastive Loss (CVPR 2021)

Nest Protect integration for Home Assistant. This will allow you to integrate your smoke, heat, co and occupancy status real-time in HA.

AirLoop: Lifelong Loop Closure Detection