Self-Supervised Learning with Kernel Dependence Maximization

Last update: Dec 29, 2022

Related tags

Overview

Self-Supervised Learning with Kernel Dependence Maximization

This is the code for SSL-HSIC, a self-supervised learning loss proposed in the paper Self-Supervised Learning with Kernel Dependence Maximization (https://arxiv.org/abs/2106.08320).

Using this implementation should achieve a top-1 accuracy on Imagenet around 74.8% using 128 Cloud TPU v2/3.

Installation

To set up a Python3 virtual environment with the required dependencies, run:

python3 -m venv ssl_hsic_env
source ssl_hsic_env/bin/activate
pip install --upgrade pip
pip install -r ssl_hsic/requirements.txt

Usage

Pre-training

For pre-training on ImageNet with SSL-HSIC loss:

mkdir /tmp/ssl_hsic
python3 -m ssl_hsic.experiment \
--config=ssl_hsic/config.py:default \
--jaxline_mode=train

This is going to pre-train for 1000 epochs. Change config to config.py:test for testing purpose. See jaxline documentation for more information on jaxline_mode.

If save_dir is provided in config.py, the last checkpoint is saved and can be used for evaluation.

Linear Evaluation

For linear evaluation with the saved checkpoint:

mkdir /tmp/ssl_hsic
python3 -m ssl_hsic.eval_experiment \
--config=ssl_hsic/eval_config.py:default \
--jaxline_mode=train

This is going to train a linear layer for 90 epochs. Change config to eval_config.py:test for testing.

Citing this work

If you use this code in your work, please consider referencing our work:

@inproceedings{
  li2021selfsupervised,
  title={Self-Supervised Learning with Kernel Dependence Maximization},
  author={Yazhe Li and Roman Pogodin and Danica J. Sutherland and Arthur Gretton},
  booktitle={Thirty-Fifth Conference on Neural Information Processing Systems},
  year={2021},
  url={https://openreview.net/forum?id=0HW7A5YZjq7}
}

Disclaimer

This is not an official Google product.

Self-Supervised Learning with Kernel Dependence Maximization

Related tags

Overview

Self-Supervised Learning with Kernel Dependence Maximization

Installation

Usage

Pre-training

Linear Evaluation

Citing this work

Disclaimer

Owner

DeepMind

nnFormer: Interleaved Transformer for Volumetric Segmentation Code for paper "nnFormer: Interleaved Transformer for Volumetric Segmentation "

Hardware accelerated, batchable and differentiable optimizers in JAX.

E-Ink Magic Calendar that automatically syncs to Google Calendar and runs off a battery powered Raspberry Pi Zero

Code for KiloNeRF: Speeding up Neural Radiance Fields with Thousands of Tiny MLPs

[ICCV2021] 3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds

CS550 Machine Learning course project on CNN Detection.

Fake videos detection by tracing the source using video hashing retrieval.

A repository with exploration into using transformers to predict DNA ↔ transcription factor binding

Efficient Sparse Attacks on Videos using Reinforcement Learning

This is a beginner-friendly repo to make a collection of some unique and awesome projects. Everyone in the community can benefit & get inspired by the amazing projects present over here.

Repository containing detailed experiments related to the paper "Memotion Analysis through the Lens of Joint Embedding".

TDN: Temporal Difference Networks for Efficient Action Recognition

Action Recognition for Self-Driving Cars

Self-Supervised Monocular DepthEstimation with Internal Feature Fusion(arXiv), BMVC2021

A collection of Jupyter notebooks to play with NVIDIA's StyleGAN3 and OpenAI's CLIP for a text-based guided image generation.

Single-step adversarial training (AT) has received wide attention as it proved to be both efficient and robust.

NAS-FCOS: Fast Neural Architecture Search for Object Detection (CVPR 2020)

ProFuzzBench - A Benchmark for Stateful Protocol Fuzzing

bespoke tooling for offensive security's Windows Usermode Exploit Dev course (OSED)

Code for "Training Neural Networks with Fixed Sparse Masks" (NeurIPS 2021).