Simple, efficient and flexible vision toolbox for mxnet framework.

Last update: Oct 19, 2019

Overview

MXbox: Simple, efficient and flexible vision toolbox for mxnet framework.

MXbox is a toolbox aiming to provide a general and simple interface for vision tasks. This project is greatly inspired by PyTorch and torchvision. Detailed copyright files are on the way. Improvements and suggestions are welcome.

Installation

MXbox is available on PyPI.

pip install mxbox

Features

Define preprocessing as a flow

from mxbox import transforms

# Each transform receives the output of the previous one.
transform = transforms.Compose([
    transforms.RandomSizedCrop(224),
    transforms.RandomHorizontalFlip(),
    transforms.mx.ToNdArray(),
    transforms.mx.Normalize(mean = [ 0.485, 0.456, 0.406 ],
                            std  = [ 0.229, 0.224, 0.225 ]),
])

Note: By default, mxbox uses PIL to read and transform images, but it also supports other backends such as accimage and skimage.

More usage examples can be found in the documentation and examples.
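To illustrate what "preprocessing as a flow" means, here is a minimal pure-Python sketch of the idea. It is not mxbox's implementation: the `Compose` and `Normalize` classes below are simplified stand-ins written for this example, `to_unit_range` is a hypothetical helper, and images are modelled as channels-first nested lists rather than NDArrays. Only the mean and std values come from the snippet above.

```python
class Compose:
    """Chain callables so the output of one becomes the input of the next."""

    def __init__(self, transforms):
        self.transforms = list(transforms)

    def __call__(self, sample):
        for transform in self.transforms:
            sample = transform(sample)
        return sample


class Normalize:
    """Apply (x - mean) / std per channel to a channels-first nested list."""

    def __init__(self, mean, std):
        self.mean = mean
        self.std = std

    def __call__(self, channels):
        return [
            [(value - m) / s for value in channel]
            for channel, m, s in zip(channels, self.mean, self.std)
        ]


def to_unit_range(channels):
    """Hypothetical helper: scale 8-bit pixel values to the range [0, 1]."""
    return [[value / 255.0 for value in channel] for channel in channels]


pipeline = Compose([
    to_unit_range,
    Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])

pixel = [[255], [0], [128]]  # a single RGB pixel, channels first
result = pipeline(pixel)
```

Because every step is just a callable, reordering or swapping steps only requires editing the list passed to `Compose`.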

Build a multi-threaded DataLoader in a few lines

Common datasets such as CIFAR-10, CIFAR-100, SVHN, and MNIST are available out of the box. You can load them directly from mxbox.datasets.

import mxnet as mx  # needed for mx.io.DataDesc below
from mxbox import transforms, datasets, DataLoader
trans = transforms.Compose([
        transforms.mx.ToNdArray(), 
        transforms.mx.Normalize(mean = [ 0.485, 0.456, 0.406 ],
                                std  = [ 0.229, 0.224, 0.225 ]),
])
dataset = datasets.CIFAR10('~/.mxbox/cifar10', transform=trans, download=True)

batch_size = 32
feedin_shapes = {
    'batch_size': batch_size,
    'data': [mx.io.DataDesc(name='data', shape=(batch_size, 3, 32, 32), layout='NCHW')],
    'label': [mx.io.DataDesc(name='softmax_label', shape=(batch_size, ), layout='N')]
}
loader = DataLoader(dataset, feedin_shapes, threads=8, shuffle=True)
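The general idea behind a threaded loader can be sketched with the standard library alone. The code below is an illustration under stated assumptions, not how mxbox's DataLoader is implemented: `iter_batches` and the toy `Squares` dataset are hypothetical names, and batches are returned as plain lists instead of MXNet data batches.

```python
import random
from concurrent.futures import ThreadPoolExecutor


def iter_batches(dataset, batch_size, threads=4, shuffle=False, seed=None):
    """Yield lists of samples, fetching the items of each batch in worker threads."""
    indices = list(range(len(dataset)))
    if shuffle:
        random.Random(seed).shuffle(indices)
    with ThreadPoolExecutor(max_workers=threads) as pool:
        for start in range(0, len(indices), batch_size):
            chunk = indices[start:start + batch_size]
            # pool.map returns results in the same order as `chunk`.
            yield list(pool.map(dataset.__getitem__, chunk))


class Squares:
    """Toy dataset: item i is i squared."""

    def __init__(self, n):
        self.n = n

    def __len__(self):
        return self.n

    def __getitem__(self, index):
        return index * index


batches = list(iter_batches(Squares(10), batch_size=4, threads=2))
```

Threads help most when fetching an item is dominated by I/O such as reading and decoding image files; the last batch may be smaller than `batch_size` when the dataset size is not a multiple of it.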

You can also create your own dataset, which only requires implementing __getitem__ and __len__.

import os.path as osp

import mxbox

class TooYoungScape(mxbox.Dataset):
    def __init__(self, root, lst, transform=None):
        # Expand '~' so that open() receives a real path.
        self.root = osp.expanduser(root)
        # Each line of the list file is tab-separated; column 0 is the image path.
        with open(osp.join(self.root, lst), 'r') as fp:
            self.lst = [line.strip().split('\t') for line in fp]
        self.transform = transform

    def __getitem__(self, index):
        img = self.pil_loader(osp.join(self.root, self.lst[index][0]))
        if self.transform is not None:
            img = self.transform(img)
        # This example returns the image as both the input and the label.
        return {'data': img, 'softmax_label': img}

    def __len__(self):
        return len(self.lst)

dataset = TooYoungScape('~/.mxbox/TooYoungScape', "train.lst", transform=trans)
loader = DataLoader(dataset, feedin_shapes, threads=8, shuffle=True)
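For a supervised task, the label would usually come from the list file rather than from the image itself. The sketch below shows one way this could look. It assumes, hypothetically, that the second tab-separated column holds an integer class label, and it uses a stand-in `loader` callable and an in-memory file so that it runs without mxbox or any image files; `ListFileDataset` is not part of mxbox.

```python
import io


class ListFileDataset:
    """Dataset over 'path<TAB>label' lines (the label column is an assumption)."""

    def __init__(self, fp, loader, transform=None):
        # Skip blank lines; keep the tab-separated columns of every other line.
        self.entries = [line.rstrip('\n').split('\t') for line in fp if line.strip()]
        self.loader = loader
        self.transform = transform

    def __getitem__(self, index):
        path, label = self.entries[index][0], self.entries[index][1]
        sample = self.loader(path)
        if self.transform is not None:
            sample = self.transform(sample)
        return {'data': sample, 'softmax_label': int(label)}

    def __len__(self):
        return len(self.entries)


listing = io.StringIO("a.png\t0\nb.png\t1\n")
# The loader here only upper-cases the path; a real one would decode the image.
toy = ListFileDataset(listing, loader=lambda path: path.upper())
second = toy[1]
```

The dictionary keys match the names used in `feedin_shapes` above ('data' and 'softmax_label').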

Load popular models with pretrained weights

Note: This feature is currently under construction. Many models lack pretrained weights, and some of their definition files are missing.

vgg = mxbox.models.vgg(num_classes=10, pretrained=True)
resnet = mxbox.models.resnet152(num_classes=10, pretrained=True)

TODO list

FLAG options?
Efficient prefetch.
Preparation of common models.
Friendlier error logging.


Owner

Ligeng Zhu
