Simple torch.nn.Module implementation of Alias-Free GAN style filtering and resampling

Last update: Dec 22, 2022

Overview

Alias-Free-Torch

Simple torch module implementation of Alias-Free GAN.

This repository includes:

Alias-Free GAN style lowpass sinc filter @filter.py
Alias-Free GAN style up/downsample @resample.py
Alias-Free activation @act.py
Test code @./test

Note: Since this repository is unofficial, the filter and upsampling may differ from the official implementation.

Note: The 2d lowpass filter applies sinc instead of the jinc function used in the paper (jinc is built on the first-order Bessel function of the first kind).
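For orientation, a Kaiser-windowed sinc lowpass kernel of the kind described above can be sketched as follows. This is a minimal illustration and not the code in filter.py: the helper name windowed_sinc_lowpass, the beta default, and the unit-sum normalization are assumptions made for the example.

```python
import math
import torch


def sinc(x: torch.Tensor) -> torch.Tensor:
    """Normalized sinc, sin(pi*x) / (pi*x), with sinc(0) = 1."""
    at_zero = x == 0
    safe_x = torch.where(at_zero, torch.ones_like(x), x)
    value = torch.sin(math.pi * safe_x) / (math.pi * safe_x)
    return torch.where(at_zero, torch.ones_like(x), value)


def windowed_sinc_lowpass(cutoff: float, kernel_size: int, beta: float = 8.0) -> torch.Tensor:
    """Hypothetical helper: 1D lowpass kernel, cutoff in cycles per sample (0 < cutoff <= 0.5)."""
    # Sample positions centered on zero; even kernel sizes land on half-sample offsets.
    time = torch.arange(kernel_size, dtype=torch.float32) - (kernel_size - 1) / 2
    window = torch.kaiser_window(kernel_size, periodic=False, beta=beta)
    kernel = 2 * cutoff * window * sinc(2 * cutoff * time)
    # Assumed convention: normalize so a constant signal passes through unchanged.
    return kernel / kernel.sum()


kernel = windowed_sinc_lowpass(cutoff=0.25, kernel_size=13)
```

The kernel is symmetric around its center tap, so applying it with a matching amount of padding on both sides should not shift the signal.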

Requirements

Because torch.kaiser_window and torch.i0 are available from 1.7.0 onward, this repository needs torch>=1.7.0.

PyTorch>=1.7.0
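A quick way to check the requirement at runtime is to compare the installed version string against the minimum. The helper below is a hypothetical sketch (meets_minimum is not part of this repository) and works on any version string, such as the value of torch.__version__.

```python
import re


def meets_minimum(version: str, minimum=(1, 7, 0)) -> bool:
    """Hypothetical helper: compare a version string such as '1.12.0+cu113' to a minimum."""
    # Keep only the leading numeric release part, dropping suffixes like '+cu113' or 'a0'.
    match = re.match(r"(\d+)\.(\d+)(?:\.(\d+))?", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    release = tuple(int(part) if part is not None else 0 for part in match.groups())
    return release >= tuple(minimum)
```

In practice this would be called as meets_minimum(torch.__version__) before importing the modules.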

TODO

2d sinc filter
2d resample
divide 1d and 2d modules
pip packaging

Test results 1d

Figure panels: Filter sine | Filter noise | upsample | downsample

Test results 2d

Figure panels: Filter L1 norm sine | Filter noise | upsample | downsample | Activation

References

Alias-Free GAN
adefossez/julius
A. V. Oppenheim and R. W. Schafer. Discrete-Time Signal Processing. Pearson, International Edition, 3rd edition, 2010

Acknowledgement

This work was done at MINDsLab Inc.

Thanks to teammates at MINDsLab Inc.

Comments

Batched resampling for the new implementation

Hi, thank you very much for the contribution.

I think the new implementation of resample.Upsample1d and resample.Downsample1d breaks batched resampling, because it uses groups=C without expanding the filter to match that shape. Perhaps the implementation should look like the code below (something similar probably applies to 2d):

Upsample1d.forward()

    # x: [B,C,T]
    def forward(self, x):
        B, C, T = x.shape
        x = F.pad(x, (self.pad, self.pad), mode='reflect')
        # TConv with filter expanded to C with C groups for depthwise op
        x = self.ratio * F.conv_transpose1d(
            x, self.filter.expand(C, -1, -1), stride=self.stride, groups=C)
        pad_left = self.pad * self.stride + (self.kernel_size -
                                             self.stride) // 2
        pad_right = self.pad * self.stride + (self.kernel_size - self.stride +
                                              1) // 2
        x = x[..., pad_left:-pad_right]
        return x
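The cropping arithmetic in this forward pass can be checked without PyTorch. The sketch below assumes conv_transpose1d is called with no padding, output_padding, or dilation, as in the snippet above; upsampled_length is a hypothetical helper written only for this check.

```python
def upsampled_length(length: int, pad: int, stride: int, kernel_size: int) -> int:
    """Hypothetical helper: output length of pad -> transposed conv -> crop."""
    padded = length + 2 * pad
    # Output length of conv_transpose1d with no padding, output_padding, or dilation.
    transposed = (padded - 1) * stride + kernel_size
    pad_left = pad * stride + (kernel_size - stride) // 2
    pad_right = pad * stride + (kernel_size - stride + 1) // 2
    return transposed - pad_left - pad_right
```

Since pad_left + pad_right equals 2 * pad * stride + kernel_size - stride, the result simplifies to length * stride. Note that the slice pad_left:-pad_right only works when pad_right is positive; with pad_right equal to 0 the slice would be empty.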

LowPassFilter1d.forward()

    #input [B,C,T]
    def forward(self, x):
        B, C, T = x.shape
        if self.padding:
            x = F.pad(x, (self.left_pad, self.right_pad),
                      mode=self.padding_mode)
        # Conv with filter expanded to C with C groups for depthwise op
        out = F.conv1d(x, self.filter.expand(C, -1, -1), stride=self.stride, groups=C) # typo 'groupds' btw
        return out

Could you check the correctness? Thanks again for the implementation!
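One way to check the expanded-filter approach is to compare it against filtering each channel separately. The sketch below is a sanity check under assumptions: the random kernel is a stand-in for a [1, 1, K] filter buffer, and the shapes are arbitrary.

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
B, C, T, K = 2, 3, 32, 5
x = torch.randn(B, C, T)
kernel = torch.randn(1, 1, K)  # stand-in for a [1, 1, K] lowpass filter buffer

# Depthwise convolution: one view of the kernel per channel via expand + groups=C.
batched = F.conv1d(x, kernel.expand(C, -1, -1), groups=C)

# Reference: fold channels into the batch dimension and filter each one separately.
reference = F.conv1d(x.reshape(B * C, 1, T), kernel).reshape(B, C, -1)

matches = torch.allclose(batched, reference, atol=1e-6)
```

If matches is True, the depthwise call applies the same filter to every channel of every batch element. expand creates a view rather than a copy, so no extra filter memory is allocated per channel.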

opened by L0SG 2

torch.speical.i1 typo

https://github.com/junjun3518/alias-free-torch/blob/f1fddd52fdd068ee475e82ae60c92e1bc24ffe02/src/alias_free_torch/filter.py#L22

At this line I believe you wanted torch.special.i1.

opened by torridgristle 2

"if self.pad / self.padding" in LowPassFilter2d

https://github.com/junjun3518/alias-free-torch/blob/258551410ff7bf02e06ece7c597466dc970fe5c7/src/alias_free_torch/filter.py#L165 https://github.com/junjun3518/alias-free-torch/blob/258551410ff7bf02e06ece7c597466dc970fe5c7/src/alias_free_torch/filter.py#L173

In LowPassFilter2d it looks like if self.pad: should change to if self.padding:, or self.padding = padding should change to self.pad = padding to match LowPassFilter1d.

opened by torridgristle 1

Padding Bool typo

https://github.com/junjun3518/alias-free-torch/blob/258551410ff7bf02e06ece7c597466dc970fe5c7/src/alias_free_torch/filter.py#L73

padding: bool: True, should be padding: bool = True,

I'm not sure if this causes an error with every version of PyTorch, but it does with PyTorch 1.12.0+cu113 on Python 3.7.13
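The annotation with a second colon is rejected by the Python parser itself, so the failure is likely independent of the PyTorch version. A small sketch that demonstrates this with the built-in compile function (the helper name is_valid_signature is made up for the example):

```python
def is_valid_signature(source: str) -> bool:
    """Return True if the given function definition compiles as Python source."""
    try:
        compile(source, "<signature>", "exec")
    except SyntaxError:
        return False
    return True


broken = "def __init__(self, padding: bool: True): pass"
fixed = "def __init__(self, padding: bool = True): pass"
```

Here is_valid_signature(broken) is False and is_valid_signature(fixed) is True.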

opened by torridgristle 1

2D Filter Jinc appears to be wrong

Here is a plot of the generated 1D sinc filter kernel.

Here is a plot of the generated 2D jinc filter kernel.

I'd expect it to look more like a series of rings or ripples, rather than a donut or torus.

The FFT output for randn noise put through the 2D filter doesn't look right either.

Changing filter_ = 2 * cutoff * window * jinc(2 * cutoff * time) to filter_ = 2 * cutoff * window * sinc(2 * cutoff * time) in kaiser_jinc_filter2d makes a more familiar kernel.

And the FFT output for randn noise put through this 2D filter looks about how I'd expect.
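The sinc variant suggested above can be sketched as a standalone 2D kernel. This is an illustration only and not the repository's kaiser_jinc_filter2d: the helper name radial_sinc_kernel2d, the outer-product Kaiser window, the beta default, and the unit-sum normalization are all assumptions.

```python
import math
import torch
import torch.nn.functional as F


def radial_sinc_kernel2d(cutoff: float, kernel_size: int, beta: float = 8.0) -> torch.Tensor:
    """Hypothetical helper: 2D lowpass kernel from sinc of the radial distance."""
    axis = torch.arange(kernel_size, dtype=torch.float32) - (kernel_size - 1) / 2
    # Radial distance of every tap from the kernel center.
    radius = torch.sqrt(axis[:, None] ** 2 + axis[None, :] ** 2)
    # Assumed window: outer product of two 1D Kaiser windows.
    window_1d = torch.kaiser_window(kernel_size, periodic=False, beta=beta)
    window = window_1d[:, None] * window_1d[None, :]
    # Normalized sinc of 2 * cutoff * radius, with the value at the center set to 1.
    arg = 2 * cutoff * radius
    safe = torch.where(arg == 0, torch.ones_like(arg), arg)
    value = torch.sin(math.pi * safe) / (math.pi * safe)
    sinc = torch.where(arg == 0, torch.ones_like(arg), value)
    kernel = 2 * cutoff * window * sinc
    return kernel / kernel.sum()


torch.manual_seed(0)
kernel = radial_sinc_kernel2d(cutoff=0.25, kernel_size=13)
noise = torch.randn(1, 1, 64, 64)
# Zero-pad by half the kernel size so the output keeps the input resolution.
filtered = F.conv2d(noise, kernel[None, None], padding=kernel.shape[-1] // 2)
```

Taking the 2D FFT of filtered and comparing it with that of noise is one way to inspect how much energy remains above the cutoff.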

opened by torridgristle 3

Releases(v0.0.6)

v0.0.6(Jul 26, 2022)

https://pypi.org/project/alias-free-torch/0.0.6/

Tested version
v0.0.3(Jul 18, 2022)

https://pypi.org/project/alias-free-torch/0.0.3/

Bug fix for torch.special / remove print / split pad from conv_transpose
v0.0.2(Jun 22, 2022)

https://pypi.org/project/alias-free-torch/0.0.2/

Rewrite upsample, jinc applied
v0.0.1(Nov 2, 2021)

v0.0.1 released https://pypi.org/project/alias-free-torch/

Owner

이준혁(Junhyeok Lee)

Audio/Speech Deep Learning Researcher @mindslab-ai
