Re-implementation of the Noise Contrastive Estimation algorithm for PyTorch, following "Noise-contrastive estimation: A new estimation principle for unnormalized statistical models" (Gutmann and Hyvärinen, AISTATS 2010).

Last update: Nov 24, 2022

Noise Contrastive Estimation for PyTorch

Overview

This repository contains a PyTorch re-implementation of the Noise Contrastive Estimation (NCE) algorithm. The implementation is fully functional, but it is not very efficient.

Implementation details

As provided, the implementation assumes that its input data follows a Zipfian distribution, which makes it particularly suitable for training language models or word embeddings. If the built-in (Zipfian) sampler is used to draw the distractor (noise) items, the indices representing the data classes must be sorted in order of descending frequency, i.e. index 0 should correspond to the most frequent word in the input data.
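To illustrate why the index ordering matters, here is a minimal sketch of a log-uniform (approximately Zipfian) sampler. It assumes the same distribution that TensorFlow documents for its log-uniform candidate sampler, P(class) = (log(class + 2) - log(class + 1)) / log(range_max + 1); the sampler shipped in this repository may differ in its details. The function names `log_uniform_prob` and `sample_log_uniform` are hypothetical and only used for this example, which relies on the Python standard library alone.

```python
import math
import random
from typing import List


def log_uniform_prob(class_id: int, num_classes: int) -> float:
    """Probability of `class_id` under the assumed log-uniform distribution.

    Lower indices receive higher probability, which is why index 0 has to
    be the most frequent class.
    """
    return (math.log(class_id + 2) - math.log(class_id + 1)) / math.log(num_classes + 1)


def sample_log_uniform(num_samples: int, num_classes: int, rng: random.Random) -> List[int]:
    """Draw class indices by inverse-transform sampling (hypothetical helper)."""
    samples = []
    for _ in range(num_samples):
        u = rng.random()  # uniform in [0, 1)
        # exp(u * log(N + 1)) lies in [1, N + 1); flooring and shifting by one
        # gives an index in [0, N - 1] with the probabilities defined above.
        idx = int(math.exp(u * math.log(num_classes + 1))) - 1
        samples.append(min(idx, num_classes - 1))  # guard against rounding
    return samples


if __name__ == "__main__":
    rng = random.Random(0)
    noise_ids = sample_log_uniform(num_samples=10, num_classes=1000, rng=rng)
    print(noise_ids)
    print(log_uniform_prob(0, 1000), log_uniform_prob(999, 1000))
```

Because the probability mass is concentrated on small indices, a vocabulary that is not sorted by descending frequency would cause rare words to be drawn as distractors far more often than their actual frequency suggests.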

Acknowledgement

The provided code closely follows the TensorFlow NCE-loss implementation. As such, this project should be seen as an attempt to adapt the TF code for use with PyTorch.
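For orientation, the following is a rough sketch of the kind of computation the TensorFlow NCE loss performs for a single training example, as far as I understand it: each raw score is corrected by the log of the expected sample count of its class under the noise sampler, and the corrected logits are fed to a sigmoid cross-entropy with label 1 for the true class and label 0 for each distractor. The names `sigmoid_cross_entropy` and `nce_loss_single` are hypothetical and do not correspond to functions in this repository; a real implementation would operate on batched tensors rather than Python floats.

```python
import math
from typing import Sequence


def sigmoid_cross_entropy(logit: float, label: float) -> float:
    """Numerically stable binary cross-entropy computed on a raw logit."""
    return max(logit, 0.0) - logit * label + math.log1p(math.exp(-abs(logit)))


def nce_loss_single(
    true_score: float,
    true_expected_count: float,
    noise_scores: Sequence[float],
    noise_expected_counts: Sequence[float],
) -> float:
    """NCE loss for one example (illustrative sketch, not the repository's API).

    `*_expected_count` is the expected number of times the sampler draws the
    corresponding class; it is assumed to be strictly positive.
    """
    # True class: corrected logit, target label 1.
    loss = sigmoid_cross_entropy(true_score - math.log(true_expected_count), 1.0)
    # Distractor classes: corrected logits, target label 0.
    for score, count in zip(noise_scores, noise_expected_counts):
        loss += sigmoid_cross_entropy(score - math.log(count), 0.0)
    return loss


if __name__ == "__main__":
    print(nce_loss_single(2.0, 0.5, [0.1, -1.3], [0.5, 0.2]))
```

The loss falls as the model scores the true class higher and the distractors lower, which is what turns the estimation problem into a binary classification between data and noise.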

Note

This re-implementation was completed with personal use in mind and is therefore not actively maintained. You are, however, very welcome to extend or adjust it to your own needs, should you find it useful. Happy coding :)

Owner

Denis Emelin
