profile tools for pytorch nn models

Last update: Jul 09, 2022

Related tags

Text Data & NLP nnprof

Overview

nnprof

Introduction

nnprof is a profile tool for pytorch neural networks.

Features

multi profile mode: nnprof support 4 profile mode: Layer level, Operation level, Mixed level, Layer Tree level. Please check below for detail usage.
time and memory profile: nnprof support both time and memory profile now. But since memory profile is first supported in pytorch 1.6, please use torch version >= 1.6 for memory profile.
support sorted by given key and show profile percent: user could print table with percentage and sorted profile info using a given key, which is really helpful for optimiziing neural network.

Requirements

Python >= 3.6
PyTorch
Numpy

Get Started

install nnprof

pip install:

pip install nnprof

from source:

python -m pip install 'git+https://github.com/FateScript/nnprof.git'

# or install after clone this repo
git clone https://github.com/FateScript/nnprof.git
pip install -e nnprof

use nnprf

from nnprof import profile, ProfileMode
import torch
import torchvision

model = torchvision.models.alexnet(pretrained=False)
x = torch.rand([1, 3, 224, 224])

# mode could be anyone in LAYER, OP, MIXED, LAYER_TREE
mode = ProfileMode.LAYER

with profile(model, mode=mode) as prof:
    y = model(x)

print(prof.table(average=False, sorted_by="cpu_time"))
# table could be sorted by presented header.

Part of presented table looks like table below, Note that they are sorted by cpu_time.

╒══════════════════════╤═══════════════════╤═══════════════════╤════════╕
│ name                 │ self_cpu_time     │ cpu_time          │   hits │
╞══════════════════════╪═══════════════════╪═══════════════════╪════════╡
│ AlexNet.features.0   │ 19.114ms (34.77%) │ 76.383ms (45.65%) │      1 │
├──────────────────────┼───────────────────┼───────────────────┼────────┤
│ AlexNet.features.3   │ 5.148ms (9.37%)   │ 20.576ms (12.30%) │      1 │
├──────────────────────┼───────────────────┼───────────────────┼────────┤
│ AlexNet.features.8   │ 4.839ms (8.80%)   │ 19.336ms (11.56%) │      1 │
├──────────────────────┼───────────────────┼───────────────────┼────────┤
│ AlexNet.features.6   │ 4.162ms (7.57%)   │ 16.632ms (9.94%)  │      1 │
├──────────────────────┼───────────────────┼───────────────────┼────────┤
│ AlexNet.features.10  │ 2.705ms (4.92%)   │ 10.713ms (6.40%)  │      1 │
├──────────────────────┼───────────────────┼───────────────────┼────────┤

You are welcomed to try diffierent profile mode and more table format.

Contribution

Any issues and pull requests are welcomed.

Acknowledgement

Some thoughts of nnprof are inspired by torchprof and torch.autograd.profile . Many thanks to the authors.

profile tools for pytorch nn models

Related tags

Overview

nnprof

Introduction

Features

Requirements

Get Started

install nnprof

use nnprf

Contribution

Acknowledgement

Owner

Feng Wang

Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

Built for cleaning purposes in military institutions

Awesome Treasure of Transformers Models Collection

Autoregressive Entity Retrieval

A Telegram bot to add notes to Flomo.

GCRC: A Gaokao Chinese Reading Comprehension dataset for interpretable Evaluation

Lattice methods in TensorFlow

Continuously update some NLP practice based on different tasks.

Lyrics generation with GPT2-based Transformer

Grading tools for Advanced NLP (11-711)Grading tools for Advanced NLP (11-711)

An A-SOUL Text Generator Based on CPM-Distill.

Voice Assistant inspired by Google Assistant, Cortana, Alexa, Siri, ...

Repositório da disciplina no semestre 2021-2

This project uses word frequency and Term Frequency-Inverse Document Frequency to summarize a text.

Telegram bot to auto post messages of one channel in another channel as soon as it is posted, without the forwarded tag.

pyMorfologik MorfologikpyMorfologik - Python binding for Morfologik.

RuCLIP tiny (Russian Contrastive Language–Image Pretraining) is a neural network trained to work with different pairs (images, texts).

A python script to prefab your scripts/text files, and re create them with ease and not have to open your browser to copy code or write code yourself

Python powered crossword generator with database with 20k+ polish words