Information Gain Filtration (IGF) is a method for filtering domain-specific data during language model finetuning. IGF shows significant improvements over baseline fine-tuning without data filtration.

Last update: Jul 28, 2022

Related tags

Overview

Information Gain Filtration

Information Gain Filtration (IGF) is a method for filtering domain-specific data during language model finetuning. IGF shows significant improvements over baseline fine-tuning without data filtration. The provided Jupyter Notebook gives a simple demostration into the use of IGF during language model finetuning. Data for this demonstration is available on Figshare here.

If you use this method in your published work, please cite the ACL paper that describes this method here.

Owner

GitHub Repository

exponential adaptive pooling for PyTorch

AdaPool: Exponential Adaptive Pooling for Information-Retaining Downsampling Abstract Pooling layers are essential building blocks of Convolutional Ne

55 Jan 04, 2023

Official implementation for (Refine Myself by Teaching Myself : Feature Refinement via Self-Knowledge Distillation, CVPR-2021)

FRSKD Official implementation for Refine Myself by Teaching Myself : Feature Refinement via Self-Knowledge Distillation (CVPR-2021) Requirements Pytho

75 Dec 28, 2022

BC3407-Group-5-Project - BC3407 Group Project With Python

BC3407-Group-5-Project As the world struggles to contain the ever-changing varia

1 Jan 26, 2022

Nvdiffrast - Modular Primitives for High-Performance Differentiable Rendering

Nvdiffrast – Modular Primitives for High-Performance Differentiable Rendering Modular Primitives for High-Performance Differentiable Rendering Samuli

675 Jan 06, 2023

THIS IS THE OLD PYMC PROJECT. PLEASE USE PYMC3 INSTEAD:

Introduction Version: 2.3.8 Authors: Chris Fonnesbeck Anand Patil David Huard John Salvatier Web site: https://github.com/pymc-devs/pymc Documentation

7.2k Jan 07, 2023

Codes for [NeurIPS'21] You are caught stealing my winning lottery ticket! Making a lottery ticket claim its ownership.

You are caught stealing my winning lottery ticket! Making a lottery ticket claim its ownership Codes for [NeurIPS'21] You are caught stealing my winni

8 Nov 01, 2022

Deep learning image registration library for PyTorch

TorchIR: Pytorch Image Registration TorchIR is a image registration library for deep learning image registration (DLIR). I have integrated several ide

40 Dec 16, 2022

BirdCLEF 2021 - Birdcall Identification 4th place solution

BirdCLEF 2021 - Birdcall Identification 4th place solution My solution detail kaggle discussion Inference Notebook (best submission) Environment Use K

42 Jan 02, 2023

Towards Calibrated Model for Long-Tailed Visual Recognition from Prior Perspective

Towards Calibrated Model for Long-Tailed Visual Recognition from Prior Perspective Zhengzhuo Xu, Zenghao Chai, Chun Yuan This is the PyTorch implement

16 Dec 15, 2022

Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification

143 Jan 01, 2023

A PyTorch Implementation of the paper - Choi, Woosung, et al. "Investigating u-nets with various intermediate blocks for spectrogram-based singing voice separation." 21th International Society for Music Information Retrieval Conference, ISMIR. 2020.

Investigating U-NETS With Various Intermediate Blocks For Spectrogram-based Singing Voice Separation A Pytorch Implementation of the paper "Investigat

63 Nov 14, 2022

Information Gain Filtration (IGF) is a method for filtering domain-specific data during language model finetuning. IGF shows significant improvements over baseline fine-tuning without data filtration.

Related tags

Overview

Information Gain Filtration

Owner

exponential adaptive pooling for PyTorch

Official implementation for (Refine Myself by Teaching Myself : Feature Refinement via Self-Knowledge Distillation, CVPR-2021)

BC3407-Group-5-Project - BC3407 Group Project With Python

Nvdiffrast - Modular Primitives for High-Performance Differentiable Rendering

THIS IS THE OLD PYMC PROJECT. PLEASE USE PYMC3 INSTEAD:

Codes for [NeurIPS'21] You are caught stealing my winning lottery ticket! Making a lottery ticket claim its ownership.

Deep learning image registration library for PyTorch

BirdCLEF 2021 - Birdcall Identification 4th place solution

Towards Calibrated Model for Long-Tailed Visual Recognition from Prior Perspective

Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification

A PyTorch Implementation of the paper - Choi, Woosung, et al. "Investigating u-nets with various intermediate blocks for spectrogram-based singing voice separation." 21th International Society for Music Information Retrieval Conference, ISMIR. 2020.

RARA: Zero-shot Sim2Real Visual Navigation with Following Foreground Cues

Adversarial Adaptation with Distillation for BERT Unsupervised Domain Adaptation

DecoupledNet is semantic segmentation system which using heterogeneous annotations

ICLR 2021: Pre-Training for Context Representation in Conversational Semantic Parsing

A concise but complete implementation of CLIP with various experimental improvements from recent papers

git《Tangent Space Backpropogation for 3D Transformation Groups》(CVPR 2021) GitHub:1]

This is an easy python software which allows to sort images with faces by gender and after by age.

Implementation detail for paper "Multi-level colonoscopy malignant tissue detection with adversarial CAC-UNet"

Sequential GCN for Active Learning

Information Gain Filtration (IGF) is a method for filtering domain-specific data during language model finetuning. IGF shows significant improvements over baseline fine-tuning without data filtration.

Related tags

Overview

Information Gain Filtration

Owner

exponential adaptive pooling for PyTorch

Official implementation for (Refine Myself by Teaching Myself : Feature Refinement via Self-Knowledge Distillation, CVPR-2021)

BC3407-Group-5-Project - BC3407 Group Project With Python

Nvdiffrast - Modular Primitives for High-Performance Differentiable Rendering

THIS IS THE **OLD** PYMC PROJECT. PLEASE USE PYMC3 INSTEAD:

Codes for [NeurIPS'21] You are caught stealing my winning lottery ticket! Making a lottery ticket claim its ownership.

Deep learning image registration library for PyTorch

BirdCLEF 2021 - Birdcall Identification 4th place solution

Towards Calibrated Model for Long-Tailed Visual Recognition from Prior Perspective

Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification

A PyTorch Implementation of the paper - Choi, Woosung, et al. "Investigating u-nets with various intermediate blocks for spectrogram-based singing voice separation." 21th International Society for Music Information Retrieval Conference, ISMIR. 2020.

RARA: Zero-shot Sim2Real Visual Navigation with Following Foreground Cues

Adversarial Adaptation with Distillation for BERT Unsupervised Domain Adaptation

DecoupledNet is semantic segmentation system which using heterogeneous annotations

ICLR 2021: Pre-Training for Context Representation in Conversational Semantic Parsing

A concise but complete implementation of CLIP with various experimental improvements from recent papers

git《Tangent Space Backpropogation for 3D Transformation Groups》(CVPR 2021) GitHub:1]

This is an easy python software which allows to sort images with faces by gender and after by age.

Implementation detail for paper "Multi-level colonoscopy malignant tissue detection with adversarial CAC-UNet"

Sequential GCN for Active Learning

THIS IS THE OLD PYMC PROJECT. PLEASE USE PYMC3 INSTEAD: