CvT-ASSD: Convolutional vision-Transformer Based Attentive Single Shot MultiBox Detector (ICTAI 2021, a CCF-C conference), The 33rd IEEE International Conference on Tools with Artificial Intelligence

Last update: Mar 07, 2022


Overview

CvT-ASSD

This repository also includes the CvT, CvT-SSD, and VGG-ASSD models.

Original code repository:

https://github.com/albert-jin/CvT-SSD

New code repository:

https://github.com/albert-jin/CvT-ASSD

In response to the call for open source, this project was officially open-sourced on 2021-07-12...

project architecture:

Mentions

You will probably need an Anaconda environment that contains all of the following packages:
- pytorch 1.9.0 py3.7_cuda10.2_cudnn7_0 pytorch
- cudatoolkit 10.2.89 h74a9793_1
- opencv-python 4.5.2.54 pypi_0 pypi
- visdom 0.1.8.9 pypi_0 pypi
- yacs 0.1.8 pypi_0 pypi
- jupyter 1.0.0 pypi_0 pypi
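As a quick sanity check of the environment, the pinned versions above can be compared against what is actually installed. The following is a minimal sketch, not part of the project: the helper name `check_environment` is hypothetical, and it assumes the conda package `pytorch` is visible under the distribution name `torch`. `cudatoolkit` is left out because it is not a Python distribution.

```python
try:
    from importlib import metadata  # Python 3.8+
except ImportError:
    # Python 3.7 (as in the listing above) needs the importlib_metadata backport.
    import importlib_metadata as metadata

# Versions copied from the package list; "torch" is the assumed distribution name.
PINNED = {
    "torch": "1.9.0",
    "opencv-python": "4.5.2.54",
    "visdom": "0.1.8.9",
    "yacs": "0.1.8",
    "jupyter": "1.0.0",
}


def check_environment(pinned=PINNED):
    """Return {name: (wanted, installed, matches)} for each pinned package."""
    report = {}
    for name, wanted in pinned.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            installed = None  # package is not installed at all
        # startswith tolerates local suffixes such as "1.9.0+cu102"
        matches = installed is not None and installed.startswith(wanted)
        report[name] = (wanted, installed, matches)
    return report


if __name__ == "__main__":
    for name, (wanted, installed, ok) in check_environment().items():
        print(f"{name}: wanted {wanted}, found {installed}, {'OK' if ok else 'MISMATCH'}")
```

A mismatch does not necessarily mean the code will fail; the listing only records the versions the authors used.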
For training, an NVIDIA GPU is strongly recommended for speed. We used two NVIDIA GTX 1080 Ti cards, but we recommend GPUs such as the Tesla V100 or RTX 3090 for their larger memory.
Before running the code for self-study or to reproduce the results reported in the paper "CvT-ASSD", add the CvT_SSD/model/ directory as a Sources Root, because much of the code imports modules from inside the model directory.
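Outside an IDE, a similar effect can be obtained by putting the directory on `sys.path` before any project import. This is a sketch under assumptions: the helper name `add_source_root` is hypothetical, and the relative path `CvT_SSD/model` is assumed to be resolved from the directory the script is started in.

```python
import os
import sys


def add_source_root(model_dir="CvT_SSD/model"):
    """Prepend model_dir (as an absolute path) to sys.path, once."""
    absolute = os.path.abspath(model_dir)
    if absolute not in sys.path:
        # Position 0 so modules here win over same-named modules elsewhere.
        sys.path.insert(0, absolute)
    return absolute


if __name__ == "__main__":
    print("Added to import path:", add_source_root())
```

Setting the `PYTHONPATH` environment variable to the same directory before launching Python is an alternative that needs no code changes.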
Download the PyTorch parameter files (those with the ".pth" extension) and move them into models/CvT/weights, as shown in 项目结构.PNG (the project structure image).
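To confirm the weights landed in the expected place, the directory can be listed for `.pth` files. This is only an illustrative sketch: the helper name `list_weight_files` is hypothetical, and the default path is taken from the note above and assumed to be relative to the working directory.

```python
from pathlib import Path


def list_weight_files(weights_dir="models/CvT/weights"):
    """Return the sorted names of all .pth files directly inside weights_dir."""
    directory = Path(weights_dir)
    if not directory.is_dir():
        raise FileNotFoundError(f"weights directory not found: {directory}")
    return sorted(p.name for p in directory.glob("*.pth"))


if __name__ == "__main__":
    try:
        print(list_weight_files())
    except FileNotFoundError as err:
        print(err)
```

Each file found this way could then be read with `torch.load(path, map_location="cpu")`; which model class the resulting state belongs to depends on the project code.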
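Object detection benchmarks (following the original SSD paper) generally use the VOC2007 test data as the model's test set. The training set can be one of the following combinations:
- 1. 07: VOC2007 trainval (training and validation sets)
- 2. 07+12: VOC2007 trainval + VOC2012 trainval (training and validation sets)
- 3. 07+12+COCO: pre-trained on COCO trainval35k, then fine-tuned on 07+12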
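The evaluation metric mAP uses the VOC07MApMetric provided by MXNet: recall is divided into 10 equal intervals, the precision values at these recall levels are averaged, and the result is then averaged over classes. For details, see https://blog.csdn.net/u014203453/article/details/77598997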
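The averaging described above corresponds to the standard VOC2007 11-point interpolated average precision: splitting recall into 10 equal intervals yields 11 thresholds (0.0, 0.1, ..., 1.0), and at each threshold the maximum precision among points with at least that recall is taken. The following pure-Python sketch illustrates the idea; it is not the VOC07MApMetric source, and the function names and input layout are assumptions made for illustration.

```python
def voc07_average_precision(recalls, precisions):
    """11-point interpolated AP for one class.

    recalls and precisions are parallel lists computed along the
    confidence-ranked detection list of that class.
    """
    ap = 0.0
    for i in range(11):
        threshold = i / 10.0
        # Precision values at all points whose recall reaches the threshold.
        candidates = [p for r, p in zip(recalls, precisions) if r >= threshold]
        # No point reaches this recall level: it contributes zero precision.
        ap += (max(candidates) if candidates else 0.0) / 11.0
    return ap


def mean_average_precision(per_class_curves):
    """Average the per-class AP; per_class_curves maps name -> (recalls, precisions)."""
    aps = [voc07_average_precision(r, p) for r, p in per_class_curves.values()]
    return sum(aps) / len(aps)


if __name__ == "__main__":
    curves = {
        "cat": ([0.5, 1.0], [1.0, 0.5]),
        "dog": ([1.0], [1.0]),
    }
    print(mean_average_precision(curves))
```

In the `cat` example, the six thresholds up to 0.5 see a maximum precision of 1.0 and the remaining five see 0.5, so its AP is 8.5/11.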

Owner

金伟强 - an AI rookie at Shanghai University~