Implements Gradient Centralization and allows it to use as a Python package in TensorFlow

Last update: Nov 01, 2022

Overview

Gradient Centralization TensorFlow

This Python package implements Gradient Centralization in TensorFlow, a simple and effective optimization technique for Deep Neural Networks as suggested by Yong et al. in the paper Gradient Centralization: A New Optimization Technique for Deep Neural Networks. It can both speedup training process and improve the final generalization performance of DNNs.

Installation

Run the following to install:

pip install gradient-centralization-tf

Usage

`gctf.centralized_gradients_for_optimizer`

Create a centralized gradients functions for a specified optimizer.

Arguments:

optimizer: a tf.keras.optimizers.Optimizer object. The optimizer you are using.

Example:

>>> opt = tf.keras.optimizers.Adam(learning_rate=0.1)
>>> optimizer.get_gradients = gctf.centralized_gradients_for_optimizer(opt)
>>> model.compile(optimizer = opt, ...)

`gctf.get_centralized_gradients`

Computes the centralized gradients.

This function is ideally not meant to be used directly unless you are building a custom optimizer, in which case you could point get_gradients to this function. This is a modified version of tf.keras.optimizers.Optimizer.get_gradients.

Arguments:

optimizer: a tf.keras.optimizers.Optimizer object. The optimizer you are using.
loss: Scalar tensor to minimize.
params: List of variables.

Returns:

A gradients tensor.

`gctf.optimizers`

Pre built updated optimizers implementing GC.

This module is speciially built for testing out GC and in most cases you would be using gctf.centralized_gradients_for_optimizer though this module implements gctf.centralized_gradients_for_optimizer. You can directly use all optimizers with tf.keras.optimizers updated for GC.

Example:

>>> model.compile(optimizer = gctf.optimizers.adam(learning_rate = 0.01), ...)
>>> model.compile(optimizer = gctf.optimizers.rmsprop(learning_rate = 0.01, rho = 0.91), ...)
>>> model.compile(optimizer = gctf.optimizers.sgd(), ...)

Returns:

A tf.keras.optimizers.Optimizer object.

Developing `gctf`

To install gradient-centralization-tf, along with tools you need to develop and test, run the following in your virtualenv:

git clone [email protected]:Rishit-dagli/Gradient-Centralization-TensorFlow
# or clone your own fork

pip install -e .[dev]

License

Copyright 2020 Rishit Dagli

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

Comments

On windows Tensorflow 2.5 it gives error

On windows 10 with miniconda enviroment tensorflow 2.5 gives error on centralized_gradients.py file.

the solution is change import keras.backend as K with import tensorflow.keras.backend as K
bug

opened by mgezer 5

The results in the mnist example are wrong/misleading

Describe the bug The results in your colab ipython notebook are misleading: https://colab.research.google.com/github/Rishit-dagli/Gradient-Centralization-TensorFlow/blob/main/examples/gctf_mnist.ipynb

In this example, the model is first trained with a normal Adam optimizer:

model.compile(optimizer = tf.keras.optimizers.Adam(),
              loss = 'sparse_categorical_crossentropy',
              metrics = ['accuracy'])

history_no_gctf = model.fit(training_images, training_labels, epochs=5, callbacks = [time_callback_no_gctf])

And afterwards the same model is recompiled with the gctf.optimizers.adam(). However, recompiling a keras model does not reset the weights. This means that in the first fit call the model is trained and then in the second fit call with the new optimizer the same model is used and of course then the results are better.

This can be fixed, by recreating the model for the second run, by just adding these few lines:

import gctf #import gctf

time_callback_gctf = TimeHistory()

# Model architecture
model = tf.keras.models.Sequential([
                                    tf.keras.layers.Flatten(), 
                                    tf.keras.layers.Dense(512, activation=tf.nn.relu),
                                    tf.keras.layers.Dense(256, activation=tf.nn.relu),
                                    tf.keras.layers.Dense(64, activation=tf.nn.relu),
                                    tf.keras.layers.Dense(512, activation=tf.nn.relu),
                                    tf.keras.layers.Dense(256, activation=tf.nn.relu),
                                    tf.keras.layers.Dense(64, activation=tf.nn.relu), 
                                    tf.keras.layers.Dense(10, activation=tf.nn.softmax)])

model.compile(optimizer = gctf.optimizers.adam(),
              loss = 'sparse_categorical_crossentropy',
              metrics=['accuracy'])

history_gctf = model.fit(training_images, training_labels, epochs=5, callbacks=[time_callback_gctf])

However, then the results are not better than without gctf:

Type                   Execution time    Accuracy      Loss
-------------------  ----------------  ----------  --------
Model without gctf:           24.7659    0.88825   0.305801
Model with gctf               24.7881    0.889567  0.30812

Could you please clarify what happens here. I tried this gctf.optimizers.adam() optimizer in my own research and it didn't change the results at all and now after seeing it doesn't work in the example which was constructed here. Makes me question the results of this paper.

To Reproduce Execute the colab file given in the repository: https://colab.research.google.com/github/Rishit-dagli/Gradient-Centralization-TensorFlow/blob/main/examples/gctf_mnist.ipynb

Expected behavior The right comparison would be if both models start from a random initialization, not that the second model can start with the already pre-trained weights.

Looking forward to a fast a swift explanation.

Best, Max

question

opened by themasterlink 2

Wider dependency requirements

The package as of now to be installed requires tensorflow ~= 2.4.0 and keras ~= 2.4.0. It turns out that this is sometimes problematic for folks who have custom installations of TensorFlow and a winder requirement could be set up.
enhancement

opened by Rishit-dagli 1
Release 0.0.3
This release includes some fixes and improvements

✅ Bug Fixes / Improvements

Allow wider versions for TensorFlow and Keras while installing the package (#14 )

Fixed incorrect usage example in docstrings and description for centralized_gradients_for_optimizer (#13 )

Add clear aims for each of the examples of using gctf (#15 )

Updates PyPi classifiers to clearly show the aims of this project. This should have no changes in the way you use this package (#18 )

Add clear instructions for using this with custom optimizers i.e. directly use get_centralized_gradients however a complete example has not been pushed due to the reasons mentioned in the issue (#16 )
opened by Rishit-dagli 0
Add an "About The Examples" section

Add an "About The Examples" section which contains a summary of the usage example notebooks and links to run it on Binder and Colab.

Close #15

opened by Rishit-dagli 0
Update relevant pypi classifiers
Add PyPI classifiers for:

Development status

Intended Audience

Topic

Further also added the Programming Language :: Python :: 3 :: Only classifer

Closes #18
opened by Rishit-dagli 0
Update pypi classifiers
I am specifically thinking of adding three more categories of pypi classifiers:

Development status

Intended Audience

Topic

Apart from this I also think it would be great to add the Programming Language :: Python :: 3 :: Only to make sure the audience to know that this package is intended for Python 3 only.
opened by Rishit-dagli 0
Add an "About the examples" section

It would be great to write an "About the example" section which could demonstrate in short what the example notebooks aim to achieve and show.
documentation

opened by Rishit-dagli 0
Error in usage example for gctf.centralized_gradients_for_optimizer

I noticed that the docstrings for gctf.centralized_gradients_for_optimizer have an error in the example usage section. The example creates an Adam optimizer instance and saves it to opt however the centralized_gradients_for_optimizer is applied on optimizer which ideally does not exist and running the example would result in an error.
documentation

opened by Rishit-dagli 0
[ImgBot] Optimize images

Beep boop. Your images are optimized!

Your image file size has been reduced by 19% 🎉

Details

| File | Before | After | Percent reduction | |:--|:--|:--|:--| | /images/gctf.png | 120.77kb | 98.16kb | 18.72% |

Black Lives Matter | 💰 donate | 🎓 learn | ✍🏾 sign

📝 docs | :octocat: repo | 🙋🏾 issues | 🏅 swag | 🏪 marketplace

opened by imgbot[bot] 0
[ImgBot] Optimize images

Beep boop. Your images are optimized!

Your image file size has been reduced by 19% 🎉

Details

| File | Before | After | Percent reduction | |:--|:--|:--|:--| | /images/gctf.png | 105.85kb | 86.11kb | 18.65% |

Black Lives Matter | 💰 donate | 🎓 learn | ✍🏾 sign

📝 docs | :octocat: repo | 🙋🏾 issues | 🏅 swag | 🏪 marketplace

opened by imgbot[bot] 0

Releases(v0.0.3)

v0.0.3(Mar 11, 2021)
This release includes some fixes and improvements

✅ Bug Fixes / Improvements

Allow wider versions for TensorFlow and Keras while installing the package (#14 )

Fixed incorrect usage example in docstrings and description for centralized_gradients_for_optimizer (#13 )

Add clear aims for each of the examples of using gctf (#15 )

Updates PyPi classifiers to clearly show the aims of this project. This should have no changes in the way you use this package (#18 )

Add clear instructions for using this with custom optimizers i.e. directly use get_centralized_gradients however a complete example has not been pushed due to the reasons mentioned in the issue (#16 )

Source code(tar.gz)
Source code(zip)
v0.0.2(Feb 21, 2021)
This release includes some fixes and improvements

✅ Bug Fixes / Improvements

Fix the issue of supporting multiple modules

Fix multiple typos.

Source code(tar.gz)
Source code(zip)
v0.0.1(Feb 20, 2021)
This is the initial version of the Gradient-Centralization-TensorFlow package.

Features:

Implement Gradient centralization for optimizers using tf.keras.optimizer.Optimizers base class

Supports custom optimizers

Pre-built optimizers implementing GC for testing purposes.

Thanks, @ialimustufa for his contributions to this package.
Source code(tar.gz)
Source code(zip)
gradient_centralization_tf-0.0.1-py3-none-any.whl(7.12 KB)

Owner

Rishit Dagli

High School, Ted-X, Ted-Ed speaker|Mentor, TFUG Mumbai|International Speaker|Microsoft Student Ambassador|#ExploreML Facilitator

GitHub Repository

A naive ROS interface for visualDet3D.

YOLO3D ROS Node This repo contains a Monocular 3D detection Ros node. Base on https://github.com/Owen-Liuyuxuan/visualDet3D All parameters are exposed

19 Oct 08, 2022

Feature board for ERPNext

ERPNext Feature Board Feature board for ERPNext Development Prerequisites k3d kubectl helm bench Install K3d Cluster # export K3D_FIX_CGROUPV2=1 # use

16 Nov 09, 2022

Addition of pseudotorsion caclulation eta, theta, eta', and theta' to barnaba package

Addition to Original Barnaba Code: This is modified version of Barnaba package to calculate RNA pseudotorsion angles eta, theta, eta', and theta'. Ple

1 Jan 11, 2022

Code for EMNLP2021 paper "Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training"

VoCapXLM Code for EMNLP2021 paper Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training Environment DockerFile: dancingso

15 Jul 28, 2022

A Pytorch reproduction of Range Loss, which is proposed in paper 《Range Loss for Deep Face Recognition with Long-Tailed Training Data》

RangeLoss Pytorch This is a Pytorch reproduction of Range Loss, which is proposed in paper 《Range Loss for Deep Face Recognition with Long-Tailed Trai

7 Nov 27, 2021

《Towards High Fidelity Face Relighting with Realistic Shadows》(CVPR 2021)

Towards High Fidelity Face-Relighting with Realistic Shadows Andrew Hou, Ze Zhang, Michel Sarkis, Ning Bi, Yiying Tong, Xiaoming Liu. In CVPR, 2021. T

114 Dec 10, 2022

Marine debris detection with commercial satellite imagery and deep learning.

Marine debris detection with commercial satellite imagery and deep learning. Floating marine debris is a global pollution problem which threatens mari

56 Dec 16, 2022

A general, feasible, and extensible framework for classification tasks.

Pytorch Classification A general, feasible and extensible framework for 2D image classification. Features Easy to configure (model, hyperparameters) T

26 Nov 22, 2022

Artificial Intelligence playing minesweeper 🤖

AI playing Minesweeper ✨ Minesweeper is a single-player puzzle video game. The objective of the game is to clear a rectangular board containing hidden

8 Oct 17, 2022

Global Pooling, More than Meets the Eye: Position Information is Encoded Channel-Wise in CNNs, ICCV 2021

Global Pooling, More than Meets the Eye: Position Information is Encoded Channel-Wise in CNNs, ICCV 2021 Global Pooling, More than Meets the Eye: Posi

32 Apr 24, 2022

Generative Adversarial Networks(GANs)

Generative Adversarial Networks(GANs) Vanilla GAN ClusterGAN Vanilla GAN Model Structure Final Generator Structure A MLP with 2 hidden layers of hidde

2 Nov 05, 2021

Compute FID scores with PyTorch.

FID score for PyTorch This is a port of the official implementation of Fréchet Inception Distance to PyTorch. See https://github.com/bioinf-jku/TTUR f

2.1k Jan 06, 2023

Generate Cartoon Images using Generative Adversarial Network

AvatarGAN ✨ Generate Cartoon Images using DC-GAN Deep Convolutional GAN is a generative adversarial network architecture. It uses a couple of guidelin

50 Dec 29, 2022

Implementation of Sequence Generative Adversarial Nets with Policy Gradient

SeqGAN Requirements: Tensorflow r1.0.1 Python 2.7 CUDA 7.5+ (For GPU) Introduction Apply Generative Adversarial Nets to generating sequences of discre

2k Dec 29, 2022

Face recognition with trained classifiers for detecting objects using OpenCV

Face_Detector Face recognition with trained classifiers for detecting objects using OpenCV Libraries required to be installed using pip Command: cv2 n

0 Oct 31, 2021

Rethinking Transformer-based Set Prediction for Object Detection

Rethinking Transformer-based Set Prediction for Object Detection Here are the code for the ICCV paper. The code is adapted from Detectron2 and AdelaiD

62 Dec 03, 2022

Reading Group @mila-iqia on Computational Optimal Transport for Machine Learning Applications

Computational Optimal Transport for Machine Learning Reading Group Over the last few years, optimal transport (OT) has quickly become a central topic

11 Aug 26, 2022

GraphLily: A Graph Linear Algebra Overlay on HBM-Equipped FPGAs

GraphLily: A Graph Linear Algebra Overlay on HBM-Equipped FPGAs GraphLily is the first FPGA overlay for graph processing. GraphLily supports a rich se

39 Dec 13, 2022

Code for the AAAI 2022 paper "Zero-Shot Cross-Lingual Machine Reading Comprehension via Inter-Sentence Dependency Graph".

multilingual-mrc-isdg Code for the AAAI 2022 paper "Zero-Shot Cross-Lingual Machine Reading Comprehension via Inter-Sentence Dependency Graph". This r

5 Dec 07, 2022

Official PyTorch code for Hierarchical Conditional Flow: A Unified Framework for Image Super-Resolution and Image Rescaling (HCFlow, ICCV2021)

Hierarchical Conditional Flow: A Unified Framework for Image Super-Resolution and Image Rescaling (HCFlow, ICCV2021) This repository is the official P

159 Dec 30, 2022

Implements Gradient Centralization and allows it to use as a Python package in TensorFlow

Related tags

Overview

Gradient Centralization TensorFlow

Installation

Usage

Arguments:

Example:

Arguments:

Returns:

Example:

Returns:

Developing gctf

License

Comments

✅ Bug Fixes / Improvements

Beep boop. Your images are optimized!

Beep boop. Your images are optimized!

Releases(v0.0.3)

v0.0.3(Mar 11, 2021)

✅ Bug Fixes / Improvements

v0.0.2(Feb 21, 2021)

v0.0.1(Feb 20, 2021)

Owner

Rishit Dagli

A naive ROS interface for visualDet3D.

Feature board for ERPNext

Addition of pseudotorsion caclulation eta, theta, eta', and theta' to barnaba package

Code for EMNLP2021 paper "Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training"

A Pytorch reproduction of Range Loss, which is proposed in paper 《Range Loss for Deep Face Recognition with Long-Tailed Training Data》

《Towards High Fidelity Face Relighting with Realistic Shadows》(CVPR 2021)

Marine debris detection with commercial satellite imagery and deep learning.

A general, feasible, and extensible framework for classification tasks.

Artificial Intelligence playing minesweeper 🤖

Global Pooling, More than Meets the Eye: Position Information is Encoded Channel-Wise in CNNs, ICCV 2021

Generative Adversarial Networks(GANs)

Compute FID scores with PyTorch.

Generate Cartoon Images using Generative Adversarial Network

Implementation of Sequence Generative Adversarial Nets with Policy Gradient

Face recognition with trained classifiers for detecting objects using OpenCV

Rethinking Transformer-based Set Prediction for Object Detection

Reading Group @mila-iqia on Computational Optimal Transport for Machine Learning Applications

GraphLily: A Graph Linear Algebra Overlay on HBM-Equipped FPGAs

Code for the AAAI 2022 paper "Zero-Shot Cross-Lingual Machine Reading Comprehension via Inter-Sentence Dependency Graph".

Official PyTorch code for Hierarchical Conditional Flow: A Unified Framework for Image Super-Resolution and Image Rescaling (HCFlow, ICCV2021)

Developing `gctf`