View model summaries in PyTorch!

Overview

torchinfo

(formerly torch-summary)

Torchinfo provides information complementary to what is provided by print(your_model) in PyTorch, similar to TensorFlow's model.summary() API, giving a visualization of the model that is helpful while debugging your network. In this project, we implement similar functionality in PyTorch and create a clean, simple interface to use in your projects.

This is a completely rewritten version of the original torchsummary and torchsummaryX projects by @sksq96 and @nmhkahn. This project addresses all of the issues and pull requests left on the original projects by introducing a completely new API.

Usage

pip install torchinfo

Alternatively, via conda:

conda install -c conda-forge torchinfo

How To Use

from torchinfo import summary

model = SingleInputNet()
batch_size = 7
summary(model, input_size=(batch_size, 1, 28, 28))
================================================================================================================
Layer (type:depth-idx)          Input Shape          Output Shape         Param #            Mult-Adds
================================================================================================================
SingleInputNet                  --                   --                   --                  --
├─Conv2d: 1-1                   [7, 1, 28, 28]       [7, 10, 24, 24]      260                1,048,320
├─Conv2d: 1-2                   [7, 10, 12, 12]      [7, 20, 8, 8]        5,020              2,248,960
├─Dropout2d: 1-3                [7, 20, 8, 8]        [7, 20, 8, 8]        --                 --
├─Linear: 1-4                   [7, 320]             [7, 50]              16,050             112,350
├─Linear: 1-5                   [7, 50]              [7, 10]              510                3,570
================================================================================================================
Total params: 21,840
Trainable params: 21,840
Non-trainable params: 0
Total mult-adds (M): 3.41
================================================================================================================
Input size (MB): 0.02
Forward/backward pass size (MB): 0.40
Params size (MB): 0.09
Estimated Total Size (MB): 0.51
================================================================================================================

Note: if you are using a Jupyter Notebook or Google Colab, summary(model, ...) must be the returned value of the cell. If it is not, you should wrap the summary in a print(), e.g. print(summary(model, ...)). See tests/jupyter_test.ipynb for examples.

This version now supports:

  • RNNs, LSTMs, and other recursive layers
  • Branching output to explore model layers at specified depths
  • Returns ModelStatistics object containing all summary data fields
  • Configurable rows/columns
  • Jupyter Notebook / Google Colab

Other new features:

  • Verbose mode to show weights and bias layers
  • Accepts either input data or simply the input shape!
  • Customizable line widths and batch dimension
  • Comprehensive unit/output testing, linting, and code coverage testing

Community Contributions:

  • Sequentials & ModuleLists (thanks to @roym899)
  • Improved Mult-Add calculations (thanks to @TE-StefanUhlich, @zmzhang2000)
  • Dict/Misc input data (thanks to @e-dorigatti)
  • Pruned layer support (thanks to @MajorCarrot)

Documentation

def summary(
    model: nn.Module,
    input_size: Optional[INPUT_SIZE_TYPE] = None,
    input_data: Optional[INPUT_DATA_TYPE] = None,
    batch_dim: Optional[int] = None,
    cache_forward_pass: Optional[bool] = None,
    col_names: Optional[Iterable[str]] = None,
    col_width: int = 25,
    depth: int = 3,
    device: Optional[torch.device] = None,
    dtypes: Optional[List[torch.dtype]] = None,
    row_settings: Optional[Iterable[str]] = None,
    verbose: int = 1,
    **kwargs: Any,
) -> ModelStatistics:
"""
Summarize the given PyTorch model. Summarized information includes:
    1) Layer names,
    2) input/output shapes,
    3) kernel shape,
    4) # of parameters,
    5) # of operations (Mult-Adds)

NOTE: If neither input_data nor input_size is provided, no forward pass through the
network is performed, and the provided model information is limited to layer names.

Args:
    model (nn.Module):
            PyTorch model to summarize. The model should be fully in either train()
            or eval() mode. If layers are not all in the same mode, running summary
            may have side effects on batchnorm or dropout statistics. If you
            encounter an issue with this, please open a GitHub issue.

    input_size (Sequence of Sizes):
            Shape of input data as a List/Tuple/torch.Size
            (dtypes must match model input, default is FloatTensors).
            You should include batch size in the tuple.
            Default: None

    input_data (Sequence of Tensors):
            Arguments for the model's forward pass (dtypes inferred).
            If the forward() function takes several parameters, pass in a list of
            args or a dict of kwargs (if your forward() function takes in a dict
            as its only argument, wrap it in a list).
            Default: None

    batch_dim (int):
            Batch dimension of input data. If batch_dim is None, assume
            input_data / input_size contains the batch dimension, which is used
            in all calculations. Else, expand all tensors to contain the batch_dim.
            Specifying batch_dim can be a runtime optimization, since if batch_dim
            is specified, torchinfo uses a batch size of 1 for the forward pass.
            Default: None

    cache_forward_pass (bool):
            If True, cache the run of the forward() function using the model
            class name as the key. If the forward pass is an expensive operation,
            this can make it easier to modify the formatting of your model
            summary, e.g. changing the depth or enabled column types, especially
            in Jupyter Notebooks.
            WARNING: Modifying the model architecture or input data/input size when
            this feature is enabled does not invalidate the cache or re-run the
            forward pass, and can cause incorrect summaries as a result.
            Default: False

    col_names (Iterable[str]):
            Specify which columns to show in the output. Currently supported: (
                "input_size",
                "output_size",
                "num_params",
                "kernel_size",
                "mult_adds",
            )
            Default: ("output_size", "num_params")
            If input_data / input_size are not provided, only "num_params" is used.

    col_width (int):
            Width of each column.
            Default: 25

    depth (int):
            Depth of nested layers to display (e.g. Sequentials).
            Nested layers below this depth will not be displayed in the summary.
            Default: 3

    device (torch.device):
            Uses this torch device for model and input_data.
            If not specified, uses result of torch.cuda.is_available().
            Default: None

    dtypes (List[torch.dtype]):
            If you use input_size, torchinfo assumes your input uses FloatTensors.
            If your model uses a different data type, specify that dtype.
            For multiple inputs, specify the sizes of all inputs, and
            also specify the types of each input here.
            Default: None

    row_settings (Iterable[str]):
            Specify which features to show in a row. Currently supported: (
                "ascii_only",
                "depth",
                "var_names",
            )
            Default: ("depth",)

    verbose (int):
            0 (quiet): No output
            1 (default): Print model summary
            2 (verbose): Show weight and bias layers in full detail
            Default: 1
            If using a Jupyter Notebook or Google Colab, the default is 0.

    **kwargs:
            Other arguments used in `model.forward` function. Passing *args is no
            longer supported.

Return:
    ModelStatistics object
            See torchinfo/model_statistics.py for more information.
"""

Examples

Get Model Summary as String

from torchinfo import summary

model_stats = summary(your_model, (1, 3, 28, 28), verbose=0)
summary_str = str(model_stats)
# summary_str contains the string representation of the summary!
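
The ModelStatistics object also exposes the totals directly. A small sketch, assuming the field names found in torchinfo/model_statistics.py (e.g. total_params, trainable_params, total_mult_adds):

print(model_stats.total_params)      # total number of parameters
print(model_stats.trainable_params)  # number of trainable parameters
print(model_stats.total_mult_adds)   # total mult-adds from the forward pass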

Explore Different Configurations

import torch
import torch.nn as nn

class LSTMNet(nn.Module):
    def __init__(self, vocab_size=20, embed_dim=300, hidden_dim=512, num_layers=2):
        super().__init__()
        self.hidden_dim = hidden_dim
        self.embedding = nn.Embedding(vocab_size, embed_dim)
        self.encoder = nn.LSTM(embed_dim, hidden_dim, num_layers=num_layers, batch_first=True)
        self.decoder = nn.Linear(hidden_dim, vocab_size)

    def forward(self, x):
        embed = self.embedding(x)
        out, hidden = self.encoder(embed)
        out = self.decoder(out)
        out = out.view(-1, out.size(2))
        return out, hidden

summary(
    LSTMNet(),
    (1, 100),
    dtypes=[torch.long],
    verbose=2,
    col_width=16,
    col_names=["kernel_size", "output_size", "num_params", "mult_adds"],
    row_settings=["var_names"],
)
========================================================================================================================
Layer (type (var_name))                  Kernel Shape         Output Shape         Param #              Mult-Adds
========================================================================================================================
LSTMNet                                  --                   --                   --                   --
├─Embedding (embedding)                  [300, 20]            [1, 100, 300]        6,000                6,000
│    └─weight                            [300, 20]                                 └─6,000
├─LSTM (encoder)                         --                   [1, 100, 512]        3,768,320            376,832,000
│    └─weight_ih_l0                      [2048, 300]                               ├─614,400
│    └─weight_hh_l0                      [2048, 512]                               ├─1,048,576
│    └─bias_ih_l0                        [2048]                                    ├─2,048
│    └─bias_hh_l0                        [2048]                                    ├─2,048
│    └─weight_ih_l1                      [2048, 512]                               ├─1,048,576
│    └─weight_hh_l1                      [2048, 512]                               ├─1,048,576
│    └─bias_ih_l1                        [2048]                                    ├─2,048
│    └─bias_hh_l1                        [2048]                                    └─2,048
├─Linear (decoder)                       [512, 20]            [1, 100, 20]         10,260               10,260
│    └─weight                            [512, 20]                                 ├─10,240
│    └─bias                              [20]                                      └─20
========================================================================================================================
Total params: 3,784,580
Trainable params: 3,784,580
Non-trainable params: 0
Total mult-adds (M): 376.85
========================================================================================================================
Input size (MB): 0.00
Forward/backward pass size (MB): 0.67
Params size (MB): 15.14
Estimated Total Size (MB): 15.80
========================================================================================================================

ResNet

import torchvision

model = torchvision.models.resnet152()
summary(model, (1, 3, 224, 224), depth=3)
==========================================================================================
Layer (type:depth-idx)                   Output Shape              Param #
==========================================================================================
ResNet                                   --                        --
├─Conv2d: 1-1                            [1, 64, 112, 112]         9,408
├─BatchNorm2d: 1-2                       [1, 64, 112, 112]         128
├─ReLU: 1-3                              [1, 64, 112, 112]         --
├─MaxPool2d: 1-4                         [1, 64, 56, 56]           --
├─Sequential: 1-5                        [1, 256, 56, 56]          --
│    └─Bottleneck: 2-1                   [1, 256, 56, 56]          --
│    │    └─Conv2d: 3-1                  [1, 64, 56, 56]           4,096
│    │    └─BatchNorm2d: 3-2             [1, 64, 56, 56]           128
│    │    └─ReLU: 3-3                    [1, 64, 56, 56]           --
│    │    └─Conv2d: 3-4                  [1, 64, 56, 56]           36,864
│    │    └─BatchNorm2d: 3-5             [1, 64, 56, 56]           128
│    │    └─ReLU: 3-6                    [1, 64, 56, 56]           --
│    │    └─Conv2d: 3-7                  [1, 256, 56, 56]          16,384
│    │    └─BatchNorm2d: 3-8             [1, 256, 56, 56]          512
│    │    └─Sequential: 3-9              [1, 256, 56, 56]          16,896
│    │    └─ReLU: 3-10                   [1, 256, 56, 56]          --
│    └─Bottleneck: 2-2                   [1, 256, 56, 56]          --

  ...
  ...
  ...

├─AdaptiveAvgPool2d: 1-9                 [1, 2048, 1, 1]           --
├─Linear: 1-10                           [1, 1000]                 2,049,000
==========================================================================================
Total params: 60,192,808
Trainable params: 60,192,808
Non-trainable params: 0
Total mult-adds (G): 11.51
==========================================================================================
Input size (MB): 0.60
Forward/backward pass size (MB): 360.87
Params size (MB): 240.77
Estimated Total Size (MB): 602.25
==========================================================================================

Multiple Inputs w/ Different Data Types

import torch
import torch.nn as nn
import torch.nn.functional as F

class MultipleInputNetDifferentDtypes(nn.Module):
    def __init__(self):
        super().__init__()
        self.fc1a = nn.Linear(300, 50)
        self.fc1b = nn.Linear(50, 10)

        self.fc2a = nn.Linear(300, 50)
        self.fc2b = nn.Linear(50, 10)

    def forward(self, x1, x2):
        x1 = F.relu(self.fc1a(x1))
        x1 = self.fc1b(x1)
        x2 = x2.type(torch.float)
        x2 = F.relu(self.fc2a(x2))
        x2 = self.fc2b(x2)
        x = torch.cat((x1, x2), 0)
        return F.log_softmax(x, dim=1)

model = MultipleInputNetDifferentDtypes()
summary(model, [(1, 300), (1, 300)], dtypes=[torch.float, torch.long])

Alternatively, you can also pass in the input_data itself, and torchinfo will automatically infer the data types.

input_data = torch.randn(1, 300)
other_input_data = torch.randn(1, 300).long()
model = MultipleInputNetDifferentDtypes()

summary(model, input_data=[input_data, other_input_data, ...])
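
Per the docstring above, keyword arguments to forward() can also be passed as a dict of kwargs. A sketch using the parameter names (x1, x2) of MultipleInputNetDifferentDtypes.forward:

summary(model, input_data={"x1": input_data, "x2": other_input_data})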

Sequentials & ModuleLists

import torch.nn as nn

class ContainerModule(nn.Module):

    def __init__(self):
        super().__init__()
        self._layers = nn.ModuleList()
        self._layers.append(nn.Linear(5, 5))
        self._layers.append(ContainerChildModule())
        self._layers.append(nn.Linear(5, 5))

    def forward(self, x):
        for layer in self._layers:
            x = layer(x)
        return x


class ContainerChildModule(nn.Module):

    def __init__(self):
        super().__init__()
        self._sequential = nn.Sequential(nn.Linear(5, 5), nn.Linear(5, 5))
        self._between = nn.Linear(5, 5)

    def forward(self, x):
        out = self._sequential(x)
        out = self._between(out)
        for l in self._sequential:
            out = l(out)

        out = self._sequential(x)
        for l in self._sequential:
            out = l(out)
        return out

summary(ContainerModule(), (1, 5))
==========================================================================================
Layer (type:depth-idx)                   Output Shape              Param #
==========================================================================================
ContainerModule                          --                        --
├─ModuleList: 1-1                        --                        --
│    └─Linear: 2-1                       [1, 5]                    30
│    └─ContainerChildModule: 2-2         [1, 5]                    --
│    │    └─Sequential: 3-1              [1, 5]                    --
│    │    │    └─Linear: 4-1             [1, 5]                    30
│    │    │    └─Linear: 4-2             [1, 5]                    30
│    │    └─Linear: 3-2                  [1, 5]                    30
│    │    └─Sequential: 3                --                        --
│    │    │    └─Linear: 4-3             [1, 5]                    (recursive)
│    │    │    └─Linear: 4-4             [1, 5]                    (recursive)
│    │    └─Sequential: 3-3              [1, 5]                    (recursive)
│    │    │    └─Linear: 4-5             [1, 5]                    (recursive)
│    │    │    └─Linear: 4-6             [1, 5]                    (recursive)
│    │    │    └─Linear: 4-7             [1, 5]                    (recursive)
│    │    │    └─Linear: 4-8             [1, 5]                    (recursive)
│    └─Linear: 2-3                       [1, 5]                    30
==========================================================================================
Total params: 150
Trainable params: 150
Non-trainable params: 0
Total mult-adds (M): 0.00
==========================================================================================
Input size (MB): 0.00
Forward/backward pass size (MB): 0.00
Params size (MB): 0.00
Estimated Total Size (MB): 0.00
==========================================================================================

Contributing

All issues and pull requests are much appreciated! If you are wondering how to build the project:

  • torchinfo is actively developed using the latest version of Python.
    • Changes should be backward compatible with Python 3.7, and will follow Python's end-of-life guidance for old versions.
    • Run pip install -r requirements-dev.txt. We use the latest versions of all dev packages.
    • Run pre-commit install.
    • To use auto-formatting tools, use pre-commit run -a.
    • To run unit tests, run pytest.
    • To update the expected output files, run pytest --overwrite.
    • To skip output file tests, use pytest --no-output.

References

  • Thanks to @sksq96, @nmhkahn, and @sangyx for providing the inspiration for this project.
  • For Model Size Estimation @jacobkimmel (details here)
Comments
  • Params and MACs Unit Specifier

    It would be very useful to have a way to specify the units (MB, GB, etc.) in which the number of parameters and MACs are reported. This could help quickly compare different architectures.

    I think of something like adding arguments params_units and macs_units to the summary() function with a default value 'auto' to respect the current behavior.
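
    As a possible workaround until such arguments exist, the totals can be scaled manually. A sketch, assuming the ModelStatistics fields total_params and total_mult_adds and some model in scope:

    from torchinfo import summary

    stats = summary(model, input_size=(1, 3, 224, 224), verbose=0)
    print(f"Params:    {stats.total_params / 1e6:.2f} M")
    print(f"Mult-adds: {stats.total_mult_adds / 1e9:.2f} G")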

    opened by richardtml 15
  • Support half-precision dtypes when calculating model size

    @TylerYep, 2 tests from torchinfo_xl_test.py are failing for me. Can you check if they work for you? Those 2 tests don't work for me on master either. Hope this fixes the issue. Do check out the code and let me know if I have to make any changes. Thank you.

    opened by notjedi 14
  • nn.Parameter is omitted (with a case)

    Describe the bug: nn.Parameter is omitted in the summary when there are other PyTorch predefined layers in the network. Details are as follows:

    To Reproduce

    import torch
    import torch.nn as nn
    from torchinfo import summary
    
    class FCNets(nn.Module):
        def __init__(self, input_dim, hidden_dim, output_dim):
            # 2 layer fully connected networks
            super().__init__()
            # layer1 with nn.Parameter
            self.weight = nn.Parameter(torch.randn(input_dim, hidden_dim))
            self.bias = nn.Parameter(torch.randn(hidden_dim))
            # layer2 with nn.Linear
            self.fc2  = nn.Linear(hidden_dim, output_dim)
            # activation
            self.activation = nn.ReLU()
        
        def forward(self, x):
            # x.shape = [batch_size, input_dim]
            # layer1
            h = torch.mm(x, self.weight) + self.bias
            # activation
            h = self.activation(h)
            # layer2
            out = self.fc2(h)
            return out
    
    # device = torch.device("cuda:0")
    device = torch.device("cpu")
    x = torch.randn(3, 128).to(device)
    fc = FCNets(128, 64, 32).to(device)
    summary(fc, input_data=x)
    

    It seems that nn.Parameter is not compatible with other layers (nn.Module class).

    ==========================================================================================
    Layer (type:depth-idx)                   Output Shape              Param #
    ==========================================================================================
    FCNets                                   --                        --
    ├─ReLU: 1-1                              [3, 64]                   --
    ├─Linear: 1-2                            [3, 32]                   2,080
    ==========================================================================================
    Total params: 2,080
    Trainable params: 2,080
    Non-trainable params: 0
    Total mult-adds (M): 0.01
    ==========================================================================================
    Input size (MB): 0.00
    Forward/backward pass size (MB): 0.00
    Params size (MB): 0.01
    Estimated Total Size (MB): 0.01
    ==========================================================================================
    

    However, if we remove self.fc2, the output will be fine.

    PyTorch version: 1.7.1 (GPU); torchinfo version: 1.5.3

    opened by zezhishao 12
  • Compute MACs for full input/output tensor

    This changes the value that is returned by summary. Up to now, this value was assuming a batch-size of 1 and, thus, ignored the batch size in the MAC computations. However, this does not work with recurrent NNs as these, e.g., share fully connected layers over many timesteps.
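
    A back-of-the-envelope check of the seq_length = 100 case below (a sketch, assuming the 2-layer LSTM from the README example with embed_dim=300 and hidden_dim=512):

    # Per-timestep weight mult-adds of the LSTM: ih_l0, hh_l0, ih_l1, hh_l1
    weight_macs = 2048 * 300 + 2048 * 512 + 2048 * 512 + 2048 * 512
    seq_length = 100
    print(weight_macs * seq_length)  # 376012800, matching the 376,012,800 printed below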

    With this change, the correct numbers are printed:

    seq_length = 100:
    ========================================================================================================
    Layer (type:depth-idx)                   Kernel Shape     Output Shape     Param #          Mult-Adds
    ========================================================================================================
    ├─Embedding: 1-1                         [300, 20]        [1, 100, 300]    6,000            6,000
    ├─LSTM: 1-2                              --               [1, 100, 512]    3,768,320        376,012,800
    |    └─weight_ih_l0                      [2048, 300]
    |    └─weight_hh_l0                      [2048, 512]
    |    └─weight_ih_l1                      [2048, 512]
    |    └─weight_hh_l1                      [2048, 512]
    ├─Linear: 1-3                            [512, 20]        [1, 100, 20]     10,260           10,240
    ========================================================================================================
    Total params: 3,784,580
    Trainable params: 3,784,580
    Non-trainable params: 0
    Total mult-adds (M): 376.03
    ========================================================================================================
    Input size (MB): 0.00
    Forward/backward pass size (MB): 0.67
    Params size (MB): 15.14
    Estimated Total Size (MB): 15.80
    ========================================================================================================
    
    seq_length=10:
    ========================================================================================================
    Layer (type:depth-idx)                   Kernel Shape     Output Shape     Param #          Mult-Adds
    ========================================================================================================
    ├─Embedding: 1-1                         [300, 20]        [1, 10, 300]     6,000            6,000
    ├─LSTM: 1-2                              --               [1, 10, 512]     3,768,320        37,601,280
    |    └─weight_ih_l0                      [2048, 300]
    |    └─weight_hh_l0                      [2048, 512]
    |    └─weight_ih_l1                      [2048, 512]
    |    └─weight_hh_l1                      [2048, 512]
    ├─Linear: 1-3                            [512, 20]        [1, 10, 20]      10,260           10,240
    ========================================================================================================
    Total params: 3,784,580
    Trainable params: 3,784,580
    Non-trainable params: 0
    Total mult-adds (M): 37.62
    ========================================================================================================
    Input size (MB): 0.00
    Forward/backward pass size (MB): 0.07
    Params size (MB): 15.14
    Estimated Total Size (MB): 15.20
    ========================================================================================================
    

    Fixes #32

    opened by TE-StefanUhlich 12
  • Error when using nn.UninitializedParameter

    Describe the bug: A ValueError is raised when trying to use the unavailable operation nelement() on an UninitializedParameter.

    The summary method goes over all the modules in the model and tries to get the number of parameters, but that is not possible with an UninitializedParameter.

    To Reproduce

    class Net(nn.Module):
        def __init__(self):
            super().__init__()
            self.param = nn.UninitializedParameter()
        
        def init_param(self):
            self.param = nn.Parameter(torch.zeros(1))
        
        def forward(self, x):
            self.init_param()
            return x
    
    net = Net()
    torchinfo.summary(net, input_size=(1, 1))
    

    Output (first part of the stack trace):

    ---------------------------------------------------------------------------
    ValueError                                Traceback (most recent call last)
    ~\miniconda3\envs\cudalab\lib\site-packages\torchinfo\torchinfo.py in forward_pass(model, x, batch_dim, cache_forward_pass, device, **kwargs)
        260             if isinstance(x, (list, tuple)):
    --> 261                 _ = model.to(device)(*x, **kwargs)
        262             elif isinstance(x, dict):
    
    ~\miniconda3\envs\cudalab\lib\site-packages\torch\nn\modules\module.py in _call_impl(self, *input, **kwargs)
       1108             for hook in (*_global_forward_pre_hooks.values(), *self._forward_pre_hooks.values()):
    -> 1109                 result = hook(self, input)
       1110                 if result is not None:
    
    ~\miniconda3\envs\cudalab\lib\site-packages\torchinfo\torchinfo.py in pre_hook(***failed resolving arguments***)
        457         info = LayerInfo(var_name, module, curr_depth, idx[curr_depth], parent_info)
    --> 458         info.calculate_num_params()
        459         info.check_recursive(summary_list)
    
    ~\miniconda3\envs\cudalab\lib\site-packages\torchinfo\layer_info.py in calculate_num_params(self)
        125         for name, param in self.module.named_parameters():
    --> 126             self.num_params += param.nelement()
        127             if param.requires_grad:
    
    ~\miniconda3\envs\cudalab\lib\site-packages\torch\nn\parameter.py in __torch_function__(cls, func, types, args, kwargs)
        120             return super().__torch_function__(func, types, args, kwargs)
    --> 121         raise ValueError(
        122             'Attempted to use an uninitialized parameter in {}. '
    
    ValueError: Attempted to use an uninitialized parameter in <method 'numel' of 'torch._C._TensorBase' objects>. This error happens when you are using a `LazyModule` or explicitly manipulating `torch.nn.parameter.UninitializedParameter` objects. When using LazyModules Call `forward` with a dummy batch to initialize the parameters before calling torch functions
    

    Expected behavior: check whether a parameter is an instance of UninitializedParameter and skip calls to unavailable operations.

    It would still be nice to show it somehow in the printed summary table (maybe as 'uninitialized'?).
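
    A minimal sketch of the suggested guard, written as a standalone function (torchinfo's actual counting lives in layer_info.py; placement here is hypothetical):

    import torch

    def count_initialized_params(module: torch.nn.Module) -> int:
        total = 0
        for _, param in module.named_parameters():
            if isinstance(param, torch.nn.parameter.UninitializedParameter):
                continue  # nelement() is unavailable before initialization
            total += param.nelement()
        return total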

    Context:

    • OS: Windows 10
    • Python 3.9.7
    • pytorch 1.10.1 (py3.9_cpu_0)
    • torchinfo 1.6.0
    opened by vladvrabie 11
  • MACS calculation error when the model structure is nested

    Hi, thanks for the tool you provided, very useful. But I also found a bug when I tried to calculate each layer's Mult-Adds for a nested model. I got something like this (screenshot of the summary output omitted):

    For most of the layers like TMVANet( (encoder): TMVANet_Encoder( (rd_encoding_branch): EncodingBranch( (double_3dconv_block1): Double3DConvBlock ... I could not get the Mult-Adds information correctly. I assume it is because the block was wrapped several times and could not be handled correctly? Could you please tell me the ways to solve this problem?

    The initial part of my model looks like this: [network diagram omitted]

    opened by james20141606 11
  • Add support for pruned models

    According to the pytorch documentation on pruning, the original parameter is replaced with one ending with _orig and a new buffer ending with _mask. The mask contains 0s and 1s based on which the correct parameters are chosen.

    All instances of param.nelements() have been replaced by a variable cur_params whose value is set based on whether it is a masked model or not. To keep consistency with the rest of the code base, the _orig is removed from the name variable right after the calculation of cur_params.
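
    A rough sketch of the masked count described above, assuming a module pruned with torch.nn.utils.prune (so "weight" was replaced by "weight_orig" plus a "weight_mask" buffer):

    import torch
    import torch.nn as nn

    def masked_param_count(module: nn.Module, name: str = "weight_orig") -> int:
        without_suffix = name[: -len("_orig")]             # e.g. "weight_orig" -> "weight"
        mask = getattr(module, f"{without_suffix}_mask")   # 0/1 tensor registered by pruning
        return int(torch.sum(mask))                        # count only the unpruned weights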

    opened by MajorCarrot 9
  • size estimation of model assumes floats everywhere

    I can see in model_statistics.py L81 that floats are assumed everywhere. In mixed precision some weights are in fp16 or tf16 (truncated float). Quantized models use int8 weights and a separate float as parameter. The estimated size of the model should be correct, and it should depend on what model we hand over to summary(...).
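
    A dtype-aware estimate is straightforward to sketch, since each tensor knows its own element size (model here stands for the nn.Module handed to summary):

    param_bytes = sum(p.nelement() * p.element_size() for p in model.parameters())
    buffer_bytes = sum(b.nelement() * b.element_size() for b in model.buffers())
    print(f"Params size (MB): {(param_bytes + buffer_bytes) / 1e6:.2f}")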

    opened by lizardzandwizardz 9
  • update support to torchvision mask/faster rcnn model summary

    update support to torchvision mask/faster rcnn model summary

    • update layer_info to support the OrderedDict and ImageList cases used within torchvision/detection
    • unittest passed
    opened by michiroooo 9
  • [SyntaxError]

    Hi, I just installed the latest version on a GCP instance with python 3.5 and got this error. It's odd as it works on my local machine with python 3.7.

    from torchsummary import summary
    
    Traceback (most recent call last):
    
      File "/usr/local/lib/python3.5/dist-packages/IPython/core/interactiveshell.py", line 3326, in run_code
        exec(code_obj, self.user_global_ns, self.user_ns)
    
      File "<ipython-input-1-bac45dd5d4db>", line 6, in <module>
        from torchsummary import summary
    
      File "/home/michalnarbutt/.local/lib/python3.5/site-packages/torchsummary/__init__.py", line 1, in <module>
        from .torchsummary import summary
    
      File "/home/michalnarbutt/.local/lib/python3.5/site-packages/torchsummary/torchsummary.py", line 30
        **kwargs: Any,
                     ^
    SyntaxError: invalid syntax
    
    
    opened by Ostyk 9
  •  'Conv2d' object has no attribute 'weight_mask'

    Hi, I'm getting an error for a simple VGG16 implementation:

    Traceback (most recent call last):
      File "/home/user/.local/lib/python3.8/site-packages/torchinfo/torchinfo.py", line 272, in forward_pass
        _ = model.to(device)(*x, **kwargs)
      File "/home/user/.local/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1102, in _call_impl
        return forward_call(*input, **kwargs)
      File "/home/user/Projects/net/vgg.py", line 97, in forward
        out = self.features(x)
      File "/home/user/.local/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1109, in _call_impl
        result = hook(self, input)
      File "/home/user/.local/lib/python3.8/site-packages/torchinfo/torchinfo.py", line 500, in pre_hook
        info.calculate_num_params()
      File "/home/user/.local/lib/python3.8/site-packages/torchinfo/layer_info.py", line 151, in calculate_num_params
        cur_params, name = self.get_param_count(name, param)
      File "/home/user/.local/lib/python3.8/site-packages/torchinfo/layer_info.py", line 139, in get_param_count
        torch.sum(rgetattr(self.module, f"{without_suffix}_mask"))
      File "/home/user/.local/lib/python3.8/site-packages/torchinfo/layer_info.py", line 19, in rgetattr
        module = getattr(module, attr_i)
      File "/home/user/.local/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1177, in __getattr__
        raise AttributeError("'{}' object has no attribute '{}'".format(
    AttributeError: 'Conv2d' object has no attribute 'weight_mask'
    The above exception was the direct cause of the following exception:
    Traceback (most recent call last):
      File "/usr/local/lib/python3.8/dist-packages/IPython/core/interactiveshell.py", line 3444, in run_code
        exec(code_obj, self.user_global_ns, self.user_ns)
      File "<ipython-input-45-e9b0e1aa526c>", line 1, in <module>
        summary(v, (1, 3, 224, 224), depth=3, col_names=["input_size", "output_size", "kernel_size", "num_params"])
      File "/home/user/.local/lib/python3.8/site-packages/torchinfo/torchinfo.py", line 201, in summary
        summary_list = forward_pass(
      File "/home/user/.local/lib/python3.8/site-packages/torchinfo/torchinfo.py", line 281, in forward_pass
        raise RuntimeError(
    RuntimeError: Failed to run torchinfo. See above stack traces for more details. Executed layers up to: []
    
    opened by realsarm 8
  • AttributeError when input type has no element_size() method

    The bug

    Hi, I see a problem with the code calculating the size of the layers:

    In layer_info.py line 109:

    if hasattr(inputs[0], "size") and callable(inputs[0].size):
        return list(inputs[0].size()), inputs[0].element_size()
    

    My problem is that I use a package with modified tensors which have no "element_size" method, i.e. the code crashes at that point.

    Expected behavior

    What about this:

    if hasattr(inputs[0], "size") and callable(inputs[0].size):
        if hasattr(inputs[0], "element_size") and callable(inputs[0].element_size):
            return list(inputs[0].size()), inputs[0].element_size()
        else:
            #Maybe add a warning here
            return list(inputs[0].size()), 0
    
    
    opened by lueisert 3
  • Percentage FLOPS or Multiply Adds - inspired by #199

    Option for column representing Percentage FLOPS or Multiply Adds. (similar to #199)

    I think this option would be really useful to see which part of the model should be optimized if necessary. It is also useful to have an idea about the scaling of the model as you make your model bigger and bigger.

    At first sight, it seems that this could be implemented in a way similar to that of #199.

    If this seems reasonable enough, I can come up with a PR.

    opened by mert-kurttutan 4
  • AttributeError: 'tuple' object has no attribute 'size'

    Fixed #141. I have tested it only on one huggingface model, but it should work for every model. The only problem is that I could not fix the verification of the out file.

    opened by fabiofumarola 3
  • One complex parameter should count as two params

    As all model parameter counting traces back to https://github.com/TylerYep/torchinfo/blob/8b3ae72c7cac677176f37450ee27b8c860f803cd/torchinfo/layer_info.py#L154-L170, there is no check on whether the parameter tensor is complex or real. If a parameter is complex, such as a + ib, then it actually represents two parameters (for the purpose of counting MACs/FLOPs).

    Of course this PR might not conform with torchinfo's development conventions; feel free to close it. I hope complex parameters will be considered in the next version.
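
    A one-line sketch of the proposed counting rule:

    import torch

    def real_param_count(p: torch.Tensor) -> int:
        # a complex parameter a + ib carries two real numbers per element
        return p.nelement() * (2 if p.is_complex() else 1)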

    opened by scaomath 2
  • nn.ParameterList omitted again in v1.7.1

    Hi there. I was trying to inspect the MMOE model from mmoe, which has an nn.ParameterList.

    
    class Expert(nn.Module):
        def __init__(self, input_size, output_size, hidden_size):
            super(Expert, self).__init__()
            self.fc1 = nn.Linear(input_size, hidden_size)
            self.fc2 = nn.Linear(hidden_size, output_size)
            self.relu = nn.ReLU()
            self.dropout = nn.Dropout(0.3)
    
        def forward(self, x):
            out = self.fc1(x)
            out = self.relu(out)
            out = self.dropout(out)
            out = self.fc2(out)
            return out
    
    
    class Tower(nn.Module):
        def __init__(self, input_size, output_size, hidden_size):
            super(Tower, self).__init__()
            self.fc1 = nn.Linear(input_size, hidden_size)
            self.fc2 = nn.Linear(hidden_size, output_size)
            self.relu = nn.ReLU()
            self.dropout = nn.Dropout(0.4)
            self.sigmoid = nn.Sigmoid()
        def forward(self, x):
            out = self.fc1(x)
            out = self.relu(out)
            out = self.dropout(out)
            out = self.fc2(out)
            out = self.sigmoid(out)
            return out
    
    class MMOE(nn.Module):
        def __init__(self, input_size, num_experts, experts_out, experts_hidden, towers_hidden, tasks):
            super(MMOE, self).__init__()
            self.input_size = input_size
            self.num_experts = num_experts
            self.experts_out = experts_out
            self.experts_hidden = experts_hidden
            self.towers_hidden = towers_hidden
            self.tasks = tasks
    
            self.softmax = nn.Softmax(dim=1)
    
            self.experts = nn.ModuleList([Expert(self.input_size, self.experts_out, self.experts_hidden) for i in range(self.num_experts)])
            self.w_gates = nn.ParameterList([nn.Parameter(torch.randn(input_size, num_experts), requires_grad=True) for i in range(self.tasks)])
            self.towers = nn.ModuleList([Tower(self.experts_out, 1, self.towers_hidden) for i in range(self.tasks)])
    
        def forward(self, x):
            experts_o = [e(x) for e in self.experts]
            experts_o_tensor = torch.stack(experts_o)
    
            gates_o = [self.softmax(x @ g) for g in self.w_gates]
    
            tower_input = [g.t().unsqueeze(2).expand(-1, -1, self.experts_out) * experts_o_tensor for g in gates_o]
            tower_input = [torch.sum(ti, dim=0) for ti in tower_input]
    
            final_output = [t(ti) for t, ti in zip(self.towers, tower_input)]
            return final_output
    
    model = MMOE(input_size=499, num_experts=6, experts_out=16, experts_hidden=32, towers_hidden=8, tasks=2)
    
    torchinfo.summary(model, input_size=(1024, 499),
                      col_names=[
                          "kernel_size", 
                          "input_size",
                          "output_size", 
                          "num_params", 
                          "trainable",
                          "mult_adds"
                          ],
                      col_width=16,
                      row_settings=["var_names", "depth"],
                      )
    
    

    I was on v1.7.1 and I got something like this.

    ========================================================================================================================================
    Layer (type (var_name):depth-idx)        Kernel Shape     Input Shape      Output Shape     Param #          Trainable        Mult-Adds
    ========================================================================================================================================
    MMOE (MMOE)                              --               [1024, 499]      [1024, 1]        5,988            True             --
    ├─ModuleList (experts): 1-1              --               --               --               --               True             --
    │    └─Expert (0): 2-1                   --               [1024, 499]      [1024, 16]       --               True             --
    │    │    └─Linear (fc1): 3-1            --               [1024, 499]      [1024, 32]       16,000           True             16,384,000
    │    │    └─ReLU (relu): 3-2             --               [1024, 32]       [1024, 32]       --               --               --
    │    │    └─Dropout (dropout): 3-3       --               [1024, 32]       [1024, 32]       --               --               --
    │    │    └─Linear (fc2): 3-4            --               [1024, 32]       [1024, 16]       528              True             540,672
    │    └─Expert (1): 2-2                   --               [1024, 499]      [1024, 16]       --               True             --
    │    │    └─Linear (fc1): 3-5            --               [1024, 499]      [1024, 32]       16,000           True             16,384,000
    │    │    └─ReLU (relu): 3-6             --               [1024, 32]       [1024, 32]       --               --               --
    │    │    └─Dropout (dropout): 3-7       --               [1024, 32]       [1024, 32]       --               --               --
    │    │    └─Linear (fc2): 3-8            --               [1024, 32]       [1024, 16]       528              True             540,672
    │    └─Expert (2): 2-3                   --               [1024, 499]      [1024, 16]       --               True             --
    │    │    └─Linear (fc1): 3-9            --               [1024, 499]      [1024, 32]       16,000           True             16,384,000
    │    │    └─ReLU (relu): 3-10            --               [1024, 32]       [1024, 32]       --               --               --
    │    │    └─Dropout (dropout): 3-11      --               [1024, 32]       [1024, 32]       --               --               --
    │    │    └─Linear (fc2): 3-12           --               [1024, 32]       [1024, 16]       528              True             540,672
    │    └─Expert (3): 2-4                   --               [1024, 499]      [1024, 16]       --               True             --
    │    │    └─Linear (fc1): 3-13           --               [1024, 499]      [1024, 32]       16,000           True             16,384,000
    │    │    └─ReLU (relu): 3-14            --               [1024, 32]       [1024, 32]       --               --               --
    │    │    └─Dropout (dropout): 3-15      --               [1024, 32]       [1024, 32]       --               --               --
    │    │    └─Linear (fc2): 3-16           --               [1024, 32]       [1024, 16]       528              True             540,672
    │    └─Expert (4): 2-5                   --               [1024, 499]      [1024, 16]       --               True             --
    │    │    └─Linear (fc1): 3-17           --               [1024, 499]      [1024, 32]       16,000           True             16,384,000
    │    │    └─ReLU (relu): 3-18            --               [1024, 32]       [1024, 32]       --               --               --
    │    │    └─Dropout (dropout): 3-19      --               [1024, 32]       [1024, 32]       --               --               --
    │    │    └─Linear (fc2): 3-20           --               [1024, 32]       [1024, 16]       528              True             540,672
    │    └─Expert (5): 2-6                   --               [1024, 499]      [1024, 16]       --               True             --
    │    │    └─Linear (fc1): 3-21           --               [1024, 499]      [1024, 32]       16,000           True             16,384,000
    │    │    └─ReLU (relu): 3-22            --               [1024, 32]       [1024, 32]       --               --               --
    │    │    └─Dropout (dropout): 3-23      --               [1024, 32]       [1024, 32]       --               --               --
    │    │    └─Linear (fc2): 3-24           --               [1024, 32]       [1024, 16]       528              True             540,672
    ├─Softmax (softmax): 1-2                 --               [1024, 6]        [1024, 6]        --               --               --
    ├─Softmax (softmax): 1-3                 --               [1024, 6]        [1024, 6]        --               --               --
    ├─ModuleList (towers): 1-4               --               --               --               --               True             --
    │    └─Tower (0): 2-7                    --               [1024, 16]       [1024, 1]        --               True             --
    │    │    └─Linear (fc1): 3-25           --               [1024, 16]       [1024, 8]        136              True             139,264
    │    │    └─ReLU (relu): 3-26            --               [1024, 8]        [1024, 8]        --               --               --
    │    │    └─Dropout (dropout): 3-27      --               [1024, 8]        [1024, 8]        --               --               --
    │    │    └─Linear (fc2): 3-28           --               [1024, 8]        [1024, 1]        9                True             9,216
    │    │    └─Sigmoid (sigmoid): 3-29      --               [1024, 1]        [1024, 1]        --               --               --
    │    └─Tower (1): 2-8                    --               [1024, 16]       [1024, 1]        --               True             --
    │    │    └─Linear (fc1): 3-30           --               [1024, 16]       [1024, 8]        136              True             139,264
    │    │    └─ReLU (relu): 3-31            --               [1024, 8]        [1024, 8]        --               --               --
    │    │    └─Dropout (dropout): 3-32      --               [1024, 8]        [1024, 8]        --               --               --
    │    │    └─Linear (fc2): 3-33           --               [1024, 8]        [1024, 1]        9                True             9,216
    │    │    └─Sigmoid (sigmoid): 3-34      --               [1024, 1]        [1024, 1]        --               --               --
    ========================================================================================================================================
    Total params: 105,446
    Trainable params: 105,446
    Non-trainable params: 0
    Total mult-adds (M): 101.84
    ========================================================================================================================================
    Input size (MB): 2.04
    Forward/backward pass size (MB): 2.51
    Params size (MB): 0.40
    Estimated Total Size (MB): 4.95
    ========================================================================================================================================
    

    This seems to be great, nearly all things are included, but nn.ParameterList (w_gates) is omitted. I went through #54 and #84; this seems to have been mentioned before, so I downgraded to v1.7.0.

    I got the result below, which includes the nn.ParameterList, but the result itself seems to be incorrect?

    ========================================================================================================================================
    Layer (type (var_name):depth-idx)        Kernel Shape     Input Shape      Output Shape     Param #          Trainable        Mult-Adds
    ========================================================================================================================================
    MMOE (MMOE)                              --               [1024, 499]      [1024, 1]        --               True             --
    ├─Softmax (softmax): 1-6                 --               [1024, 6]        [1024, 6]        --               --               --
    ├─ModuleList (experts): 1-2              --               --               --               16,528           True             --
    │    └─Expert (0): 2-1                   --               [1024, 499]      [1024, 16]       16,528           True             --
    │    │    └─Linear (fc1): 3-2            --               [1024, 499]      [1024, 32]       (recursive)      True             16,384,000
    │    │    └─Linear (fc1): 3-2            --               [1024, 499]      [1024, 32]       (recursive)      True             16,384,000
    │    │    └─ReLU (relu): 3-3             --               [1024, 32]       [1024, 32]       --               --               --
    │    │    └─Dropout (dropout): 3-4       --               [1024, 32]       [1024, 32]       --               --               --
    │    └─Expert (1): 2-3                   --               [1024, 499]      [1024, 16]       (recursive)      True             --
    │    └─Expert (0): 2                     --               --               --               --               --               --
    │    │    └─Linear (fc2): 3-5            --               [1024, 32]       [1024, 16]       528              True             540,672
    │    └─Expert (1): 2-3                   --               [1024, 499]      [1024, 16]       (recursive)      True             --
    │    │    └─Linear (fc1): 3-6            --               [1024, 499]      [1024, 32]       16,000           True             16,384,000
    │    │    └─ReLU (relu): 3-7             --               [1024, 32]       [1024, 32]       --               --               --
    │    └─Expert (2): 2-5                   --               [1024, 499]      [1024, 16]       (recursive)      True             --
    │    └─Expert (1): 2                     --               --               --               --               --               --
    │    │    └─Dropout (dropout): 3-8       --               [1024, 32]       [1024, 32]       --               --               --
    │    │    └─Linear (fc2): 3-9            --               [1024, 32]       [1024, 16]       528              True             540,672
    │    └─Expert (2): 2-5                   --               [1024, 499]      [1024, 16]       (recursive)      True             --
    │    │    └─Linear (fc1): 3-10           --               [1024, 499]      [1024, 32]       16,000           True             16,384,000
    │    └─Expert (3): 2-7                   --               [1024, 499]      [1024, 16]       (recursive)      True             --
    │    └─Expert (2): 2                     --               --               --               --               --               --
    │    │    └─ReLU (relu): 3-11            --               [1024, 32]       [1024, 32]       --               --               --
    │    │    └─Dropout (dropout): 3-12      --               [1024, 32]       [1024, 32]       --               --               --
    │    │    └─Linear (fc2): 3-13           --               [1024, 32]       [1024, 16]       528              True             540,672
    │    └─Expert (3): 2-7                   --               [1024, 499]      [1024, 16]       (recursive)      True             --
    │    └─Expert (4): 2-10                  --               [1024, 499]      [1024, 16]       (recursive)      True             --
    │    └─Expert (3): 2                     --               --               --               --               --               --
    │    │    └─Linear (fc1): 3-14           --               [1024, 499]      [1024, 32]       16,000           True             16,384,000
    │    │    └─ReLU (relu): 3-15            --               [1024, 32]       [1024, 32]       --               --               --
    │    │    └─Dropout (dropout): 3-16      --               [1024, 32]       [1024, 32]       --               --               --
    │    │    └─Linear (fc2): 3-17           --               [1024, 32]       [1024, 16]       528              True             540,672
    │    └─Expert (5): 2-12                  --               [1024, 499]      [1024, 16]       (recursive)      True             --
    │    └─Expert (4): 2-10                  --               [1024, 499]      [1024, 16]       (recursive)      True             --
    │    │    └─Linear (fc1): 3-18           --               [1024, 499]      [1024, 32]       16,000           True             16,384,000
    │    │    └─ReLU (relu): 3-19            --               [1024, 32]       [1024, 32]       --               --               --
    │    │    └─Dropout (dropout): 3-20      --               [1024, 32]       [1024, 32]       --               --               --
    ├─ParameterList (w_gates): 1-3           --               --               --               5,988            True             --
    ├─ModuleList (towers): 1-4               --               --               --               --               True             --
    │    └─Tower (0): 2-13                   --               [1024, 16]       [1024, 1]        (recursive)      True             --
    ├─ModuleList (experts): 1-2              --               --               --               16,528           True             --
    │    └─Expert (4): 2                     --               --               --               --               --               --
    │    │    └─Linear (fc2): 3-21           --               [1024, 32]       [1024, 16]       528              True             540,672
    │    └─Expert (5): 2-12                  --               [1024, 499]      [1024, 16]       (recursive)      True             --
    │    │    └─Linear (fc1): 3-22           --               [1024, 499]      [1024, 32]       16,000           True             16,384,000
    │    │    └─ReLU (relu): 3-23            --               [1024, 32]       [1024, 32]       --               --               --
    │    │    └─Dropout (dropout): 3-24      --               [1024, 32]       [1024, 32]       --               --               --
    │    │    └─Linear (fc2): 3-25           --               [1024, 32]       [1024, 16]       528              True             540,672
    ├─Softmax (softmax): 1-5                 --               [1024, 6]        [1024, 6]        --               --               --
    ├─ModuleList (towers): 1-4               --               --               --               --               True             --
    │    └─Tower (1): 2                      --               --               --               --               --               --
    │    │    └─Linear (fc2): 3-35           --               [1024, 8]        [1024, 1]        (recursive)      True             9,216
    ├─Softmax (softmax): 1-6                 --               [1024, 6]        [1024, 6]        --               --               --
    ├─ModuleList (towers): 1-4               --               --               --               --               True             --
    │    └─Tower (0): 2-13                   --               [1024, 16]       [1024, 1]        (recursive)      True             --
    │    │    └─Linear (fc1): 3-27           --               [1024, 16]       [1024, 8]        136              True             139,264
    │    │    └─ReLU (relu): 3-28            --               [1024, 8]        [1024, 8]        --               --               --
    │    │    └─Dropout (dropout): 3-29      --               [1024, 8]        [1024, 8]        --               --               --
    │    │    └─Linear (fc2): 3-30           --               [1024, 8]        [1024, 1]        9                True             9,216
    │    │    └─Sigmoid (sigmoid): 3-31      --               [1024, 1]        [1024, 1]        --               --               --
    │    └─Tower (1): 2-14                   --               [1024, 16]       [1024, 1]        9                True             --
    │    │    └─Linear (fc1): 3-32           --               [1024, 16]       [1024, 8]        136              True             139,264
    │    │    └─ReLU (relu): 3-33            --               [1024, 8]        [1024, 8]        --               --               --
    │    │    └─Dropout (dropout): 3-34      --               [1024, 8]        [1024, 8]        --               --               --
    │    │    └─Linear (fc2): 3-35           --               [1024, 8]        [1024, 1]        (recursive)      True             9,216
    │    │    └─Sigmoid (sigmoid): 3-36      --               [1024, 1]        [1024, 1]        --               --               --
    ========================================================================================================================================
    Total params: 105,446
    Trainable params: 105,446
    Non-trainable params: 0
    Total mult-adds (M): 118.24
    ========================================================================================================================================
    Input size (MB): 2.04
    Forward/backward pass size (MB): 2.24
    Params size (MB): 0.36
    Estimated Total Size (MB): 4.64
    ========================================================================================================================================
    
    opened by github-0-searcher 2
  • estimate model size is different with nvidia-smi usage


    Describe the bug: The model size estimated by torchinfo differs from the GPU memory usage reported by nvidia-smi.

    To Reproduce

    1. The code used and the command line are shown below.
    2. The code runs on the cuda:2 device.
    import torch
    import torch.nn as nn
    import timm
    import torchvision
    
    import argparse
    
    from torchinfo import summary
    
    
    
    #data load#
    device_num = 2
    device = torch.device("cuda:"+str(device_num))
    
    num_classes = 2
    # read the model name, augmentation, and scheduler settings
    
    parser = argparse.ArgumentParser()
    
    parser.add_argument('--model', required=True)
    args = parser.parse_args()
    
    # read the model name, augmentation, and scheduler settings
    
    each_model = args.model
    #best model
    
    model = ''
    if each_model == 'CvT-21' :
        model = torch.load('../ref_model/whole_CvT-21-384x384-IN-1k_2class.pt')
    
    elif each_model == 'MLP-Mixer-b16' :
        model = timm.create_model('mixer_b16_224', pretrained=True, num_classes=num_classes)
    
    elif each_model == 'Beit-base-patch16' :
        model = timm.create_model('beit_base_patch16_224', pretrained=True, num_classes=num_classes)
    
    elif each_model == 'ViT-base-16' :
        model = timm.create_model('vit_base_patch16_224', pretrained=True, num_classes=num_classes)
    
    elif each_model == 'ResNet101' :
        model = timm.create_model('resnet101', pretrained=True, num_classes=num_classes)
    
    elif each_model == 'MobileNetV2' :
        model = timm.create_model('mobilenetv2_100', pretrained=True, num_classes=num_classes)
    
    elif each_model == 'DenseNet121' :
        model = timm.create_model('densenet121', pretrained=True, num_classes=num_classes)
    
    elif each_model == 'EfficientNetB0' :
        model = timm.create_model('efficientnet_b0', pretrained=True, num_classes=num_classes)
    
    elif each_model == 'ShuffleNetV2' :
        model = torchvision.models.shufflenet_v2_x1_0(pretrained=True)
        num_f = model.fc.in_features
        model.fc = nn.Linear(num_f, num_classes)  # set the last linear layer's output to num_classes
    
    elif each_model == 'gmlp_s16' :
        model = timm.create_model('gmlp_s16_224', pretrained=True, num_classes=num_classes)
    
    
    elif each_model == 'resmlp_24' :
        model = timm.create_model('resmlp_24_224', pretrained=True, num_classes=num_classes)
    
    
    elif each_model == 'mobilevit-s' :
        model = timm.create_model('mobilevit_s', pretrained=True, num_classes=num_classes)
    
    
    elif each_model == 'mobilevit-xs' :
        model = timm.create_model('mobilevit_xs', pretrained=True, num_classes=num_classes)
    
    
    elif each_model == 'mobilevit-xxs' :
        model = timm.create_model('mobilevit_xxs', pretrained=True, num_classes=num_classes)
    
    
    model = model.to(device)
    model.eval()
    
    summary(model, input_size=(1, 3, 224, 224), mode='eval', device=device)
    
    batch_size = 1
    data_shape = (3, 224, 224)
    random_data = torch.rand((batch_size, *data_shape)).to(device)
                   
    
    with torch.no_grad():
    
        outputs = model(random_data)
    

    python img1_test_original_testset_serve_2c_gpumem_forgit.py --model 'MobileNetV2'

    Expected behavior: The memory usage reported by nvidia-smi matches torchinfo's estimated model size.

    Screenshots: (attached to the original issue)

    Additional context: The two values differ by around 1000 MB or more. I also checked https://github.com/TylerYep/torchinfo/issues/149#issue-1291452433, but I still could not get the nvidia-smi GPU usage to come close to torchinfo's estimated total size.

    Did I miss something in the code, or is the difference caused by something other than my code?

    Thanks!
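
    One way to narrow down where the gap comes from is to compare torchinfo's estimate with PyTorch's own allocator counters instead of with nvidia-smi. A minimal sketch, assuming `model`, `random_data`, and `device` are set up as in the script above:

    # Minimal sketch: compare parameter/buffer size and allocator counters (in MB)
    # against torchinfo's "Estimated Total Size". Assumes `model`, `random_data`,
    # and `device` come from the reproduction script above.
    import torch

    torch.cuda.reset_peak_memory_stats(device)
    param_mb = sum(p.numel() * p.element_size() for p in model.parameters()) / 2**20
    buffer_mb = sum(b.numel() * b.element_size() for b in model.buffers()) / 2**20

    with torch.no_grad():
        _ = model(random_data)
    torch.cuda.synchronize(device)

    print(f"params + buffers : {param_mb + buffer_mb:.1f} MB")
    print(f"allocated        : {torch.cuda.memory_allocated(device) / 2**20:.1f} MB")
    print(f"peak (forward)   : {torch.cuda.max_memory_allocated(device) / 2**20:.1f} MB")
    print(f"reserved         : {torch.cuda.memory_reserved(device) / 2**20:.1f} MB")

    The remaining gap to nvidia-smi is largely the CUDA context plus memory the caching allocator has reserved but is not currently using, neither of which appears in torchinfo's estimate.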

    + ShuffleNetV2 shows the same problem: python img1_test_original_testset_serve_2c_gpumem_forgit.py --model 'ShuffleNetV2'


    +2 I also did a simple check of just moving one input tensor onto the GPU device, and that alone yielded the GPU usage shown below.

    import torch
    device_num = 2
    device = torch.device("cuda:"+str(device_num))
    
    batch_size = 1
    data_shape = (3, 224, 224)
    random_data = torch.rand((batch_size, *data_shape)).to(device)
    

    (nvidia-smi screenshot attached to the original issue)

    Might this be involved in the issue?
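
    This last check points at the likely cause: for a single small tensor, most of what nvidia-smi reports is the CUDA context itself, which neither torchinfo nor torch.cuda.memory_allocated() counts. A small sketch of the same experiment with the allocator counters printed alongside (the tensor is 1×3×224×224 float32, i.e. roughly 0.57 MB):

    # Sketch of the same check, printing PyTorch's own counters for comparison.
    import torch

    device = torch.device("cuda:2")             # same device index as above
    x = torch.rand(1, 3, 224, 224).to(device)   # 1*3*224*224 float32, ~0.57 MB

    print(f"tensor itself   : {x.numel() * x.element_size() / 2**20:.2f} MB")
    print(f"torch allocated : {torch.cuda.memory_allocated(device) / 2**20:.2f} MB")
    print(f"torch reserved  : {torch.cuda.memory_reserved(device) / 2**20:.2f} MB")
    # The difference between these numbers and nvidia-smi is mostly the CUDA
    # context (driver state and loaded kernels), which is created as soon as the
    # first tensor touches the GPU and is not visible to torchinfo's estimate.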

    opened by YHYeooooong 6
Releases(v1.7.1)
Owner: Tyler Yep