Implementation of U-Net and SegNet for building segmentation

Last update: Dec 07, 2022

Overview

Specialized project

Created by Katrine Nguyen and Martin Wangen-Eriksen as part of our specialized project at the Norwegian University of Science and Technology (NTNU).

Models

Most of our code, including the U-Net model, is heavily inspired by the Unet-for-Person-Segmentation project. We wrote the SegNet model ourselves, based on other TensorFlow implementations of SegNet.

Data

The models are trained and tested on the Massachusetts Buildings Dataset from Kaggle. Each original image is 1500x1500 pixels and covers an area of 1500x1500 meters (1 m x 1 m per pixel). The original 137 images were cropped into 64x64-pixel tiles, and tiles without buildings were filtered out.
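The cropping and filtering step can be sketched as follows. This is a minimal illustration, not the code used in the project: the function names are hypothetical, and it assumes that pixels which do not fill a complete tile are dropped (64 does not divide 1500, so 23 full tiles fit per side and 28 pixels are left over). The README does not say how the edges were actually handled.

```python
import numpy as np

TILE = 64  # tile side length in pixels, as described above


def crop_to_tiles(array, tile=TILE):
    """Split an (H, W) or (H, W, C) array into non-overlapping square tiles.

    Assumption: leftover pixels at the right and bottom edges are dropped.
    """
    height, width = array.shape[:2]
    return [
        array[top:top + tile, left:left + tile]
        for top in range(0, height - tile + 1, tile)
        for left in range(0, width - tile + 1, tile)
    ]


def keep_tiles_with_buildings(image, mask, tile=TILE):
    """Return (image_tile, mask_tile) pairs whose mask has a non-zero pixel."""
    pairs = zip(crop_to_tiles(image, tile), crop_to_tiles(mask, tile))
    return [(img, msk) for img, msk in pairs if msk.any()]
```

Under these assumptions a 1500x1500 image yields 23 x 23 = 529 candidate tiles before filtering.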

To make the masks compatible with our models, they were converted from white (255,255,255) labels to greyscale with value 1. This is done in image_fix.py, found in the repo.
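The contents of image_fix.py are not shown here, so the following is only a sketch of an equivalent conversion. The function name and the threshold of 127 are assumptions; a threshold is used instead of an exact match on 255 because masks stored as .jpg may not be perfectly white after compression.

```python
import numpy as np


def binarize_mask(mask_rgb):
    """Convert an (H, W, 3) mask with white buildings to an (H, W) 0/1 mask."""
    # Assumption: a pixel is a building when every channel is above 127.
    is_building = (mask_rgb > 127).all(axis=-1)
    # uint8 keeps the result a single-channel greyscale image with values 0 and 1.
    return is_building.astype(np.uint8)
```

The result can be written back to disk as a greyscale image or fed to the model directly as a binary label map.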

Folder structure

Images and masks are stored in local directories and read by data.py and test.py. You can change these paths, but to use the code exactly as it is, follow the folder structure below.


.
├── ...
├── building-segmentation                # Directory for all images
│   ├── Images                           # Directory for raw images
│   │   ├── cropped_images_train_64      # Directory for cropped images where number specifies resolution, containing .jpg
│   │   ├── cropped_images_train_128     # Directory for cropped images where number specifies resolution, containing .jpg
│   │   └── ...                          # More directories with other resolutions
│   ├── Masks                            # Directory for all masks
│   │   ├── cropped_masks_train_64       # Directory for cropped masks where number specifies resolution, containing .jpg
│   │   ├── cropped_masks_train_128      # Directory for cropped masks where number specifies resolution, containing .jpg
│   │   └── ...                          # More directories with other resolutions
│   └── Test                             # Directory for test images
│       ├── test_64                      # Directory for test images where number specifies resolution, containing .jpg
│       └── ...                          # More directories with other resolutions
└── ...

# data.py (excerpt)
    import os
    from glob import glob

    # In main:
    dataset_path = "building-segmentation"

    # Collect the 64x64 training images and their masks
    images = glob(os.path.join(dataset_path, "images/cropped_images_train_64/*"))
    masks = glob(os.path.join(dataset_path, "masks/cropped_masks_train_64/*"))

# test.py (excerpt)
    from glob import glob

    test_images = glob("building-segmentation/test/test_64/*")
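Before training, it can help to confirm that the local directories match what the glob patterns expect. The sketch below is not part of the repo: the function name is hypothetical, and it assumes that each image and its mask share the same file name, which the README does not state. The directory names are the lowercase ones from the data.py paths; on a case-sensitive file system they need to match the names on disk exactly.

```python
from pathlib import Path


def count_training_pairs(dataset_path, size=64):
    """Return the number of image/mask pairs, or raise if any file is unpaired."""
    root = Path(dataset_path)
    image_dir = root / "images" / f"cropped_images_train_{size}"
    mask_dir = root / "masks" / f"cropped_masks_train_{size}"
    image_names = {path.name for path in image_dir.glob("*")}
    mask_names = {path.name for path in mask_dir.glob("*")}
    # Assumption: an image and its mask have identical file names.
    unmatched = image_names ^ mask_names
    if unmatched:
        raise ValueError(f"Files without a partner: {sorted(unmatched)[:5]}")
    return len(image_names)
```

Calling count_training_pairs("building-segmentation") returns 0 when the directories are empty or missing, which is a quick sign that the paths or their capitalization do not match.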

Owner

Martin.w-e
