https://arxiv.org/abs/2102.11005

Last update: Dec 19, 2022

Related tags

Overview

LogME

LogME: Practical Assessment of Pre-trained Models for Transfer Learning

How to use

Just feed the features f and labels y to the function, and you can get a nice score which well correlates with the transfer learning performance.

from LogME import LogME
score = LogME(f, y)

Then you can use the score to quickly select a good pre-trained model. The larger the score is, the better transfer performance you get.

Experimental results

We extensively validate the generality and superior performance of LogME on 14 pre-trained models and 17 downstream tasks, covering various pre-trained models (supervised pre-trained and unsupervised pre-trained), downstream tasks (classification and regression), and modalities (vision and language). Check the paper for all the results.

Computer vision

9 datasets and 10 pre-trained models. LogME is a reasonably good indicator for transfer performance.

NLP

7 tasks and 4 pre-trained models. LogME is a good indicator for transfer performance.

Speedup

LogME provides a dramatic speedup for assessing pre-trained models. The speedup comes from two aspects:

LogME does not need hyper-parameter tuning whereas vanilla fine-tuning requires extensive hyper-parameter tuning.
We designed a fast algorithm to further speedup the computation of LogME.

Citation

If you find it useful, please cite the following paper:

@article{you_logme:_2021,
	title = {LogME: Practical Assessment of Pre-trained Models for Transfer Learning},
	author = {You, Kaichao and Liu, Yong and Long, Mingsheng and Wang, Jianmin},
	journal = {arxiv},
	volume = {abs/2102.11005},
	year = {2021},
	url = {https://arxiv.org/abs/2102.11005},
}

Contact

If you have any question or want to use the code, please contact [email protected] .

https://arxiv.org/abs/2102.11005

Related tags

Overview

LogME

How to use

Experimental results

Computer vision

NLP

Speedup

Citation

Contact

Owner

THUML: Machine Learning Group @ THSS

This repository contains the reference implementation for our proposed Convolutional CRFs.

Empowering journalists and whistleblowers

A Multi-modal Perception Tracker (MPT) for speaker tracking using both audio and visual modalities

Isaac Gym Reinforcement Learning Environments

Reproducing Results from A Hybrid Approach to Targeting Social Assistance

Text2Art is an AI art generator powered with VQGAN + CLIP and CLIPDrawer models

Repository for the paper titled: "When is BERT Multilingual? Isolating Crucial Ingredients for Cross-lingual Transfer"

image scene graph generation benchmark

MixText: Linguistically-Informed Interpolation of Hidden Space for Semi-Supervised Text Classification

Libtorch yolov3 deepsort

This repository contains the code and models for the following paper.

A framework for using LSTMs to detect anomalies in multivariate time series data. Includes spacecraft anomaly data and experiments from the Mars Science Laboratory and SMAP missions.

Inferring Lexicographically-Ordered Rewards from Preferences

Official Pytorch implementation of Meta Internal Learning

Generating synthetic mobility data for a realistic population with RNNs to improve utility and privacy

You Only 👀 One Sequence

Driller: augmenting AFL with symbolic execution!

A modular, research-friendly framework for high-performance and inference of sequence models at many scales

PyTorch implementation for MINE: Continuous-Depth MPI with Neural Radiance Fields

In-place Parallel Super Scalar Samplesort (IPS⁴o)