Simple torch.nn.module implementation of Alias-Free-GAN style filter and resample

Last update: Dec 22, 2022

Overview

Alias-Free-Torch

Simple torch module implementation of Alias-Free GAN.

This repository including

Alias-Free GAN style lowpass sinc filter @filter.py
Alias-Free GAN style up/downsample @resample.py
Alias-Free activation @act.py
and test codes @./test

Note: Since this repository is unofficial, filter and upsample could be different with official implementation.

Note: 2d lowpass filter is applying sinc instead of jinc (first order Bessel function of the first kind) in paper

Requirements

Due to torch.kaiser_window and torch.i0 are implemeted after 1.7.0, our repository need torch>=1.7.0.

Pytorch>=1.7.0

TODO

2d sinc filter
2d resample
devide 1d and 2d modules
pip packaging

Test results 1d

Filter sine	Filter noise

upsample	downsample

Test results 2d

Filter L1 norm sine	Filter noise

upsample	downsample

Activation

References

Alias-Free GAN
adefossez/julius
A. V. Oppenheim and R. W. Schafer. Discrete-Time Signal Processing. Pearson, International Edition, 3rd edition, 2010

Acknowledgement

This work is done at MINDsLab Inc.

Thanks to teammates at MINDsLab Inc.

Comments

Batched resampling for the new implementation

Hi, thank you very much for the contribution.

I think the new implementation of resample.Upsample1d and resample.Downsample1d breaks batched resampling when using groups=C without expanding the filter to match the shape. Perhaps the implementation should be like the below (maybe similar goes to 2d):

Upsample1d.forward()

    # x: [B,C,T]
    def forward(self, x):
        B, C, T = x.shape
        x = F.pad(x, (self.pad, self.pad), mode='reflect')
        # TConv with filter expanded to C with C groups for depthwise op
        x = self.ratio * F.conv_transpose1d(
            x, self.filter.expand(C, -1, -1), stride=self.stride, groups=C)
        pad_left = self.pad * self.stride + (self.kernel_size -
                                             self.stride) // 2
        pad_right = self.pad * self.stride + (self.kernel_size - self.stride +
                                              1) // 2
        x = x[..., pad_left:-pad_right]

LowPassFilter1d.forward()

    #input [B,C,T]
    def forward(self, x):
        B, C, T = x.shape
        if self.padding:
            x = F.pad(x, (self.left_pad, self.right_pad),
                      mode=self.padding_mode)
        # Conv with filter expanded to C with C groups for depthwise op
        out = F.conv1d(x, self.filter.expand(C, -1, -1), stride=self.stride, groups=C) # typo 'groupds' btw
        return out

Could you check the correctness? Thanks again for the implementation!

opened by L0SG 2

torch.speical.i1 typo

https://github.com/junjun3518/alias-free-torch/blob/f1fddd52fdd068ee475e82ae60c92e1bc24ffe02/src/alias_free_torch/filter.py#L22

At this line I believe you wanted torch.special.i1.

opened by torridgristle 2
"if self.pad / self.padding" in LowPassFilter2d

https://github.com/junjun3518/alias-free-torch/blob/258551410ff7bf02e06ece7c597466dc970fe5c7/src/alias_free_torch/filter.py#L165 https://github.com/junjun3518/alias-free-torch/blob/258551410ff7bf02e06ece7c597466dc970fe5c7/src/alias_free_torch/filter.py#L173

In LowPassFilter2d it looks like if self.pad: should change to if self.padding:, or self.padding = padding should change to self.pad = padding to match LowPassFilter1d.

opened by torridgristle 1
Padding Bool typo

https://github.com/junjun3518/alias-free-torch/blob/258551410ff7bf02e06ece7c597466dc970fe5c7/src/alias_free_torch/filter.py#L73

padding: bool: True, should be padding: bool = True,

I'm not sure if this causes an error with every version of PyTorch, but it does with PyTorch 1.12.0+cu113 on Python 3.7.13

opened by torridgristle 1
2D Filter Jinc appears to be wrong

Here is a plot of the generated 1D sinc filter kernel.

Here is a plot of the generated 2D jinc filter kernel.

I'd expect it to look more like a series of rings or ripples, rather than a donut or torus.

The FFT output for randn noise put through the 2D filter doesn't look right either.

Changing filter_ = 2 * cutoff * window * jinc(2 * cutoff * time) to filter_ = 2 * cutoff * window * sinc(2 * cutoff * time) in kaiser_jinc_filter2d makes a more familiar kernel.

And the FFT output for randn noise put through this 2D filter looks about how I'd expect.

opened by torridgristle 3

Releases(v0.0.6)

v0.0.6(Jul 26, 2022)

https://pypi.org/project/alias-free-torch/0.0.6/

Tested version
Source code(tar.gz)
Source code(zip)
v0.0.3(Jul 18, 2022)

https://pypi.org/project/alias-free-torch/0.0.3/

Bug fix for torch.special / remove print / split pad from conv_transpose
Source code(tar.gz)
Source code(zip)
v0.0.2(Jun 22, 2022)

https://pypi.org/project/alias-free-torch/0.0.2/

Rewrite upsample, jinc applied
Source code(tar.gz)
Source code(zip)
v0.0.1(Nov 2, 2021)

v0.0.1 released https://pypi.org/project/alias-free-torch/
Source code(tar.gz)
Source code(zip)

Owner

이준혁(Junhyeok Lee)

Audio/Speech Deep Learning Researcher @mindslab-ai

GitHub Repository

MoViNets PyTorch implementation: Mobile Video Networks for Efficient Video Recognition;

MoViNet-pytorch Pytorch unofficial implementation of MoViNets: Mobile Video Networks for Efficient Video Recognition. Authors: Dan Kondratyuk, Liangzh

189 Dec 20, 2022

A lightweight library to compare different PyTorch implementations of the same network architecture.

TorchBug is a lightweight library designed to compare two PyTorch implementations of the same network architecture. It allows you to count, and compar

5 Jan 02, 2023

PyTorch implementation for 3D human pose estimation

Towards 3D Human Pose Estimation in the Wild: a Weakly-supervised Approach This repository is the PyTorch implementation for the network presented in:

579 Dec 22, 2022

Malware Env for OpenAI Gym

Malware Env for OpenAI Gym Citing If you use this code in a publication please cite the following paper: Hyrum S. Anderson, Anant Kharkar, Bobby Fila

563 Dec 29, 2022

AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages

AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages This repository contains the code for the pa

40 Nov 24, 2022

Group-Free 3D Object Detection via Transformers

Group-Free 3D Object Detection via Transformers By Ze Liu, Zheng Zhang, Yue Cao, Han Hu, Xin Tong. This repo is the official implementation of "Group-

213 Dec 07, 2022

Official Implementation of DAFormer: Improving Network Architectures and Training Strategies for Domain-Adaptive Semantic Segmentation

DAFormer: Improving Network Architectures and Training Strategies for Domain-Adaptive Semantic Segmentation [Arxiv] [Paper] As acquiring pixel-wise an

305 Dec 29, 2022

Blender Add-On for slicing meshes with planes

MeshSlicer Blender Add-On for slicing meshes with multiple overlapping planes at once. This is a simple Blender addon to slice a silmple mesh with mul

52 Dec 12, 2022

FinGAT: A Financial Graph Attention Networkto Recommend Top-K Profitable Stocks

FinGAT: A Financial Graph Attention Networkto Recommend Top-K Profitable Stocks This is our implementation for the paper: FinGAT: A Financial Graph At

64 Dec 13, 2022

Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations

TopClus The source code used for Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations, published in WWW 2022. Requ

63 Dec 18, 2022

Pytorch Implementation of Continual Learning With Filter Atom Swapping (ICLR'22 Spolight) Paper

Continual Learning With Filter Atom Swapping Pytorch Implementation of Continual Learning With Filter Atom Swapping (ICLR'22 Spolight) Paper If find t

11 Aug 29, 2022

naked is a Python tool which allows you to strip a model and only keep what matters for making predictions.

naked is a Python tool which allows you to strip a model and only keep what matters for making predictions. The result is a pure Python function with no third-party dependencies that you can simply c

24 Dec 20, 2022

RoadMap and preparation material for Machine Learning and Data Science - From beginner to expert.

ML-and-DataScience-preparation This repository has the goal to create a learning and preparation roadMap for Machine Learning Engineers and Data Scien

33 Dec 29, 2022

Indices Matter: Learning to Index for Deep Image Matting

IndexNet Matting This repository includes the official implementation of IndexNet Matting for deep image matting, presented in our paper: Indices Matt

357 Nov 26, 2022

Codes to pre-train T5 (Text-to-Text Transfer Transformer) models pre-trained on Japanese web texts

t5-japanese Codes to pre-train T5 (Text-to-Text Transfer Transformer) models pre-trained on Japanese web texts. The following is a list of models that

1 Dec 13, 2021

A Low Complexity Speech Enhancement Framework for Full-Band Audio (48kHz) based on Deep Filtering.

DeepFilterNet A Low Complexity Speech Enhancement Framework for Full-Band Audio (48kHz) based on Deep Filtering. libDF contains Rust code used for dat

292 Dec 25, 2022

Data for "Driving the Herd: Search Engines as Content Influencers" paper

herding_data Data for "Driving the Herd: Search Engines as Content Influencers" paper Dataset description The collection contains 2250 documents, 30 i

0 Aug 17, 2021

A deep learning network built with TensorFlow and Keras to classify gender and estimate age.

Convolutional Neural Network (CNN). This repository contains a source code of a deep learning network built with TensorFlow and Keras to classify gend

1 Dec 19, 2021

Seq2seq - Sequence to Sequence Learning with Keras

Seq2seq Sequence to Sequence Learning with Keras Hi! You have just found Seq2Seq. Seq2Seq is a sequence to sequence learning add-on for the python dee

3.1k Dec 18, 2022

This is the code for ACL2021 paper A Unified Generative Framework for Aspect-Based Sentiment Analysis

This is the code for ACL2021 paper A Unified Generative Framework for Aspect-Based Sentiment Analysis Install the package in the requirements.txt, the

108 Dec 23, 2022