Overcoming-Catastrophic-forgetting-in-Neural-Networks

Elastic weight consolidation technique for incremental learning.

About

Use this API if you dont want your neural network to forget previously learnt tasks while doing transfer learning or domain adaption!

Results

The experiment is done as follow:

Train a 2 layer feed forward neural network on MNIST for 4 epochs
Train the same network later on Fashion-MNIST for 4 epochs This is done once with EWC and then without EWC and results are calculated on test data for both data on same model. Constant learning rate of 1e-4 is used throughout with Adam Optimizer. Importance multiplier is kept at 10e5 and sampling is done with half data before moving to next dataset

EWC	MNIST	Fashion-MNIST
Yes	70.27	81.88
No	48.43	86.69

Usage

from elastic_weight_consolidation import ElasticWeightConsolidation
# Build a neural network of your choice and pytorch dataset for it
# Define a criterion class for new task and pass it as shown below
ewc = ElasticWeightConsolidation(model, crit, lr=0.01, weight=0.1)
# Training procedure
for input, target in dataloader:
  ewc.forward_backward_update(input, target)
ewc.register_ewc_params(dataset, batch_size, num_batches_to_run_for_sampling)
# Repeat this for each new task and it's corresponding dataset

Reference

Paper

Elastic weight consolidation technique for incremental learning.

Related tags

Overview

Overcoming-Catastrophic-forgetting-in-Neural-Networks

About

Results

Usage

Reference

Owner

Shivam Saboo

Car Parking Tracker Using OpenCv

Scaling and Benchmarking Self-Supervised Visual Representation Learning

[CVPR 2021] Pytorch implementation of Hijack-GAN: Unintended-Use of Pretrained, Black-Box GANs

End-To-End Memory Network using Tensorflow

Find the Heart simple Python Game

LowRankModels.jl is a julia package for modeling and fitting generalized low rank models.

Minimal implementation and experiments of "No-Transaction Band Network: A Neural Network Architecture for Efficient Deep Hedging".

Detector for Log4Shell exploitation attempts

The official repo of the CVPR2021 oral paper: Representative Batch Normalization with Feature Calibration

Implementation of PersonaGPT Dialog Model

A custom DeepStack model that has been trained detecting ONLY the USPS logo

Propose a principled and practically effective framework for unsupervised accuracy estimation and error detection tasks with theoretical analysis and state-of-the-art performance.

Makes patches from huge resolution .svs slide files using openslide

This repository contains the implementation of the paper Contrastive Instance Association for 4D Panoptic Segmentation using Sequences of 3D LiDAR Scans

Code release for "Transferable Semantic Augmentation for Domain Adaptation" (CVPR 2021)

Liecasadi - liecasadi implements Lie groups operation written in CasADi

Forecasting for knowable future events using Bayesian informative priors (forecasting with judgmental-adjustment).

Yolov5 deepsort inference，使用YOLOv5+Deepsort实现车辆行人追踪和计数，代码封装成一个Detector类，更容易嵌入到自己的项目中

Official PaddlePaddle implementation of Paint Transformer

Static-test - A playground to play with ideas related to testing the comparability of the code