基于Paddle框架的fcanet复现

Last update: Mar 07, 2022

Related tags

Deep Learning fcanet-Paddle

Overview

fcanet-Paddle

基于Paddle框架的fcanet复现

fcanet

本项目基于paddlepaddle框架复现fcanet，并参加百度第三届论文复现赛，将在2021年5月15日比赛完后提供AIStudio链接～敬请期待

参考项目：

frazerlin-fcanet

数据准备

本项目已挂载论文所使用的数据集，对于tgz及tar文件需要利用以下命令解压

tar -xvf benchmark.tgz
tar xvf VOCtrainval_11-May-2012.tar

整个工程具有以下目录结构

/home/aistudio
|───Data(数据集)
└───────benchmark_RELEASE
└───────VOCdevkit
└───────GrabCut
└───────Berkeley
└───fcanet(代码文件)
└───InitialPaddleModel(初始化权重)

训练

The official PyTorch implementation of CVPR 2020 paper "Interactive Image Segmentation with First Click Attention". 并未提供训练代码。通过邮件联系作者，作者由于企业合作项目原因，合作结束后会将会提供训练代码

测试

模型下载

提取码：2ira

AIStudio链接

验证集测试

python fcanet/evaluate.py --backbone [resnet/res2net] --dataset [GrabCut,Berkeley,DAVIS(not exists in this repo),VOCdevkit] (--sis)

如下图所示，默认的backbone均为101

resnet101测试示例

res2net101测试示例

backbone	dataset	mNoC	mIoU-NoC
resnet101	Berkeley	4.23	[0. 0.728 0.854 0.885 0.912 0.915 0.926 0.935 0.939 0.935 0.94 0.943 0.942 0.944 0.945 0.945 0.947 0.947 0.948 0.947 0.949]
resnet101	GrabCut	2.24	[0. 0.78 0.87 0.923 0.944 0.95 0.956 0.966 0.964 0.971 0.971 0.971 0.975 0.977 0.978 0.979 0.978 0.978 0.979 0.979 0.979]
resnet101	VOC2012	2.9810329734461627	[0. 0.715 0.838 0.885 0.909 0.926 0.937 0.945 0.951 0.957 0.962 0.964 0.967 0.969 0.971 0.973 0.974 0.976 0.977 0.978 0.979]
res2net101	Berkeley	3.98	[0. 0.788 0.872 0.901 0.921 0.93 0.933 0.938 0.938 0.943 0.943 0.943 0.943 0.945 0.947 0.948 0.949 0.949 0.95 0.951 0.95 ]
res2net101	GrabCut	2.16	[0. 0.819 0.877 0.927 0.916 0.931 0.948 0.96 0.966 0.967 0.969 0.971 0.973 0.976 0.977 0.976 0.978 0.977 0.98 0.977 0.979]
res2net101	VOC2012	2.793988911584476	[0. 0.757 0.841 0.882 0.908 0.925 0.937 0.945 0.952 0.958 0.963 0.966 0.968 0.971 0.973 0.974 0.976 0.977 0.978 0.98 0.98 ]

可视化测试

利用annotator.py可以实现可视化操作，感兴趣的读者可是利用Qt实现UI程序，实现效果如下所示

需要注意的是，AIStudio环境暂不支持这种可视化方式，你需要将此仓库部署到本地运行，你可能需要修改代码文件中的路径

python fcanet/annotator.py --backbone res2net --input fcanet/test.jpg --output test_mask.jpg

关于作者

姓名	郭权浩
学校	电子科技大学研2020级
研究方向	计算机视觉
主页	Deep Hao的主页
如有错误，请及时留言纠正，非常蟹蟹！
后续会有更多论文复现系列推出，欢迎大家有问题留言交流学习，共同进步成长！

基于Paddle框架的fcanet复现

Related tags

Overview

fcanet-Paddle

fcanet

数据准备

训练

测试

验证集测试

resnet101测试示例

res2net101测试示例

可视化测试

关于作者

Owner

QuanHao Guo

Deep Residual Learning for Image Recognition

PyTorch implementation of spectral graph ConvNets, NIPS’16

4st place solution for the PBVS 2022 Multi-modal Aerial View Object Classification Challenge - Track 1 (SAR) at PBVS2022

A large-scale video dataset for the training and evaluation of 3D human pose estimation models

BC3407-Group-5-Project - BC3407 Group Project With Python

Build Graph Nets in Tensorflow

PyTorch Implementation of DSB for Score Based Generative Modeling. Experiments managed using Hydra.

Experiment about Deep Person Re-identification with EfficientNet-v2

Code for Overinterpretation paper Overinterpretation reveals image classification model pathologies

This is a Python wrapper for TA-LIB based on Cython instead of SWIG.

UltraPose: Synthesizing Dense Pose with 1 Billion Points by Human-body Decoupling 3D Model

Point Cloud Registration using Representative Overlapping Points.

RodoSol-ALPR Dataset

Pre-Training Graph Neural Networks for Cold-Start Users and Items Representation.

PyTorch implementation of "Dataset Knowledge Transfer for Class-Incremental Learning Without Memory" (WACV2022)

Single-step adversarial training (AT) has received wide attention as it proved to be both efficient and robust.

🇰🇷 Text to Image in Korean

EFENet: Reference-based Video Super-Resolution with Enhanced Flow Estimation

nnDetection is a self-configuring framework for 3D (volumetric) medical object detection which can be applied to new data sets without manual intervention. It includes guides for 12 data sets that were used to develop and evaluate the performance of the proposed method.

Dataset for the Research2Clinics @ NeurIPS 2021 Paper: What Do You See in this Patient? Behavioral Testing of Clinical NLP Models