The Curious Layperson: Fine-Grained Image Recognition without Expert Labels (BMVC 2021)

Last update: Dec 27, 2022

Related tags

Deep Learning bmvc2021

Overview

The Curious Layperson: Fine-Grained Image Recognition without Expert Labels

Subhabrata Choudhury, Iro Laina, Christian Rupprecht, Andrea Vedaldi

Code will be relased soon.

Abstract:

^{Most of us are not experts in specific fields, such as ornithology. Nonetheless, we do have general image and language understanding capabilities that we use to match what we see to expert resources. This allows us to expand our knowledge and perform novel tasks without ad-hoc external supervision. On the contrary, machines have a much harder time consulting expert-curated knowledge bases unless trained specifically with that knowledge in mind. Thus, in this paper we consider a new problem: fine-grained image recognition without expert annotations, which we address by leveraging the vast knowledge available in web encyclopedias. First, we learn a model to describe the visual appearance of objects using non-expert image descriptions. We then train a fine- grained textual similarity model that matches image descriptions with documents on a sentence-level basis. We evaluate the method on two datasets and compare with several strong baselines and the state of the art in cross-modal retrieval.}

Citation

@inproceedings{choudhury2021curious,
author = {Choudhury, Subhabrata and Laina, Iro and Rupprecht, Christian and Vedaldi, Andrea},
booktitle = {British Machine Vision Conference}
title = {The Curious Layperson: Fine-Grained Image Recognition without Expert Labels}
volume = {32},
year = {2021}
}

The Curious Layperson: Fine-Grained Image Recognition without Expert Labels (BMVC 2021)

Related tags

Overview

The Curious Layperson: Fine-Grained Image Recognition without Expert Labels

Subhabrata Choudhury, Iro Laina, Christian Rupprecht, Andrea Vedaldi

Code will be relased soon.

Abstract:

Citation

Owner

Subhabrata Choudhury

The code for SAG-DTA: Prediction of Drug–Target Affinity Using Self-Attention Graph Network.

PyTorch implementation of NeurIPS 2021 paper: "CoFiNet: Reliable Coarse-to-fine Correspondences for Robust Point Cloud Registration"

[NeurIPS 2021] PyTorch Code for Accelerating Robotic Reinforcement Learning with Parameterized Action Primitives

METER: Multimodal End-to-end TransformER

Official implementation of NeurIPS'2021 paper TransformerFusion

Submission to Twitter's algorithmic bias bounty challenge

Script for getting information in discord

Fast, general, and tested differentiable structured prediction in PyTorch

A PyTorch implementation for our paper "Dual Contrastive Learning: Text Classification via Label-Aware Data Augmentation".

AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition

Image Matching Evaluation

Near-Duplicate Video Retrieval with Deep Metric Learning

TorchMD-Net provides state-of-the-art graph neural networks and equivariant transformer neural networks potentials for learning molecular potentials

Put blind watermark into a text with python

Project of 'TBEFN: A Two-branch Exposure-fusion Network for Low-light Image Enhancement '

This project demonstrates the use of neural networks and computer vision to create a classifier that interprets the Brazilian Sign Language.

A supplementary code for Editable Neural Networks, an ICLR 2020 submission.

Seach Losses of our paper 'Loss Function Discovery for Object Detection via Convergence-Simulation Driven Search', accepted by ICLR 2021.

High-Fidelity Pluralistic Image Completion with Transformers (ICCV 2021)

Pytorch implementation of FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks