Spatial Transformer Nets in TensorFlow/ TensorLayer

Last update: Nov 23, 2022

Overview

MOVED TO HERE

Spatial Transformer Networks

Spatial Transformer Networks (STN) is a dynamic mechanism that produces transformations of input images (or feature maps)including scaling, cropping, rotations, as well as non-rigid deformations. This enables the network to not only select regions of an image that are most relevant (attention), but also to transform those regions to simplify recognition in the following layers.

Video for different transformation click me.

In this repositary, we implemented a STN for 2D Affine Transformation on MNIST dataset. We generated images with size of 40x40 from the original MNIST dataset, and distorted the images by random rotation, shifting, shearing and zoom in/out. The STN was able to learn to automatically apply transformations on distorted images via classification task.

Fig 1：Transformation

Fig 2：Network

Fig 3：Formula

Result

After classification task, the STN is able to transform the distorted image from Fig 4 back to Fig 5.

Fig 4: Input

Fig 5: Output

Comments

Export graph

Hello,

I'm trying to export this model and its weights into a frozen graph. So far I did this in the "save images" part of the training loop:

saver = tf.train.Saver() saver.save(sess, 'my_stn_model_' + str(epoch)) tf.train.write_graph(sess.graph_def, ".", "test.pb", False) #proto

Then I have my pb and the weights. But I am unable to generate a frozen graph from this because I cannot guess the output node in the graph (which is a really complex one it seems).

I attached the pb which I am able to visualize using Netron:

test.zip

Thanks in advance.

opened by hvico 0
Unexpected Output

I'm trying to train the STN. The training dataset which I've provided contains MNIST dataset images which are rotated 90 degree and the testing dataset contains simple MNIST dataset images. The output I'm getting is straight MNIST images. I was expecting the output to contain characters which are rotated 90 degrees. Can you please guide me on how this works?

opened by nabeel3133 4

Spatial Transformer Nets in TensorFlow/ TensorLayer

Related tags

Overview

MOVED TO HERE

Spatial Transformer Networks

Result

You might also like...

Group Activity Recognition with Clustered Spatial Temporal Transformer

VSR-Transformer - This paper proposes a new Transformer for video super-resolution (called VSR-Transformer).

Python implementation of cover trees, near-drop-in replacement for scipy.spatial.kdtree

data/code repository of "C2F-FWN: Coarse-to-Fine Flow Warping Network for Spatial-Temporal Consistent Motion Transfer"

Unofficial implementation of "TTNet: Real-time temporal and spatial video analysis of table tennis" (CVPR 2020)

Official PyTorch implementation of Spatial Dependency Networks.

Spatial Intention Maps for Multi-Agent Mobile Manipulation (ICRA 2021)

The open source code of SA-UNet: Spatial Attention U-Net for Retinal Vessel Segmentation.

Spatial Action Maps for Mobile Manipulation (RSS 2020)

Comments

Export graph

Unexpected Output

Releases(0.0.1)

0.0.1(Jun 26, 2017)

Owner

Hao

Anomaly detection related books, papers, videos, and toolboxes

In this project, we'll be making our own screen recorder in Python using some libraries.

[SIGGRAPH Asia 2021] DeepVecFont: Synthesizing High-quality Vector Fonts via Dual-modality Learning.

Source code for paper "Document-Level Relation Extraction with Adaptive Thresholding and Localized Context Pooling", AAAI 2021

FairyTailor: Multimodal Generative Framework for Storytelling

TorchGRL is the source code for our paper Graph Convolution-Based Deep Reinforcement Learning for Multi-Agent Decision-Making in Mixed Traffic Environments for IV 2022.

RRL: Resnet as representation for Reinforcement Learning

Automate issue discovery for your projects against Lightning nightly and releases.

You Only Look Once for Panopitic Driving Perception

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

Simple Tensorflow implementation of Toward Spatially Unbiased Generative Models (ICCV 2021)

Explaining Deep Neural Networks - A comparison of different CAM methods based on an insect data set

Hepsiburada - Hepsiburada Urun Bilgisi Cekme

PyTorch implementation of Memory-based semantic segmentation for off-road unstructured natural environments.

Black-Box-Tuning - Black-Box Tuning for Language-Model-as-a-Service

A simple python module to generate anchor (aka default/prior) boxes for object detection tasks.

PyTorch code for JEREX: Joint Entity-Level Relation Extractor

PyTorch implementation for 3D human pose estimation

A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"

[Official] Exploring Temporal Coherence for More General Video Face Forgery Detection(ICCV 2021)