Target Propagation through Layer Inverses

The present code implements an ideal formulation of Target Propagation using regularized inverses computed analytically rather than using some reverse layer optimized to approximate the inverse.

The code focuses on Recurrent Neural Networks for which vanishing/exploding gradients phenomena are known to impede the performance of a classical gradient back-propagation formula. The experiments demonstrate that TP may be beneficial for optimizing long sequences with RNNs.

The main part of the code consisted in modifying the structure of classical RNNs by adding a target_prop function in the definition of the RNN, see src/model/rnn.py. The optimization is done in src/optim/run_optimizer.py. The following code is provided to reproduce the plots in the paper.

Setup

Create a conda environment and activate it
conda create -n target_prop python=3.8
conda activate target_prop

Install dependencies
conda install seaborn matplotlib pandas

For PyTorch, the installation depends on your OS. For Mac for example, use
conda install pytorch torchvision -c pytorch

Experiments

To reproduce the plots presented in the paper run from the folder exp
python paper_conv_plots.py python heatmap_reg_stepsize.py python heatmap_perf.py python sensitivity_analysis.py python grad_behavior.py

The file exp_rnn.py illustrates a simple pipeline for an experiment on RNNs. The code is composed of data generation in src/data/get_data.py, model definition in src/model/make_model.py and optimization in src/optim /run_optimizer.py. They are wrapped in the function exp/exp_neck.py that is further used with pipeline tools presented in the folder utils_pipeline

Contact

You can report issues and ask questions in the repository's issues page. If you choose to send an email instead, please direct it to Vincent Roulet at vroulet@uw.edu and include [tpri] in the subject line.

Paper

Target Propagation through Layer Inverses
Vincent Roulet, Zaid Harchaoui.
arXiv preprint

Reference

@article{roulet2023target,
  title={Target Propagation via Regularized Inversion},
  author={Roulet, Vincent and Harchaoui, Zaid},
  journal={Transactions on Machine Learning Research},
  year={2023}
}

License

This code has a GPLv3 license.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
exp		exp
pipeline		pipeline
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

exp

exp

pipeline

pipeline

src

src

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

init.py

init.py

Repository files navigation

Target Propagation through Layer Inverses

Setup

Experiments

Contact

Paper

License

About

Releases

Packages

Languages

License

vroulet/tpri

Folders and files

Latest commit

History

Repository files navigation

Target Propagation through Layer Inverses

Setup

Experiments

Contact

Paper

License

About

Resources

License

Stars

Watchers

Forks

Languages