2021 National Underwater Robotics Vision Optics

Last update: Nov 04, 2022

Overview

2021-National-Underwater-Robotics-Vision-Optics

2021年全国水下机器人算法大赛-光学赛道-B榜精度第18名 (Kilian_Di的团队：A榜[email protected]:95 56.36 B榜[email protected]:95 56.7） 2021年全国水下机器人算法大赛-声学赛道-B榜精度第5名 (Kilian_Di的团队：A榜[email protected]:95 52.3 B榜[email protected]:95 53.1）

请按照mmdetection官方文档配置环境，并运行Config文件对应的模型

代码内容和trick：

基本网络模型
- cascade rcnn
- resnext101 pretrained on COCO
- soft-nms
- 基于mmdetection
- mmcv-full==1.2.5
- pytorch==1.6.0
- torchvision==0.7.0
- cudatoolkit=10.1
添加的trick
- dcn
- global_context(gcb)
- RandomRotate90
- cutout
- Mixup
- 边框抖动
- 高斯噪声椒盐噪声
- Libra RCNN
- GIoU/CIoU/DIoU Loss
- Attention Block
- Multi-scale Training and Testing

Note：mmdet-new/下的models文件请直接复制mmdetection项目即可

Contact me

Email：[email protected]

Owner

Di Chang

Computer Vision/[email protected] /

GitHub Repository

PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers

CvT: Introducing Convolutions to Vision Transformers Pytorch implementation of CvT: Introducing Convolutions to Vision Transformers Usage: img = torch

193 Jan 03, 2023

Google Recaptcha solver.

byerecaptcha - Google Recaptcha solver. Model and some codes takes from embium's repository -Installation- pip install byerecaptcha -How to use- from

21 Dec 19, 2022

Multimodal Descriptions of Social Concepts: Automatic Modeling and Detection of (Highly Abstract) Social Concepts evoked by Art Images

MUSCO - Multimodal Descriptions of Social Concepts Automatic Modeling of (Highly Abstract) Social Concepts evoked by Art Images This project aims to i

0 Aug 22, 2021

General purpose GPU compute framework for cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends)

General purpose GPU compute framework for cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing usec

1k Jan 06, 2023

Open-Ended Commonsense Reasoning (NAACL 2021)

Open-Ended Commonsense Reasoning Quick links: [Paper] | [Video] | [Slides] | [Documentation] This is the repository of the paper, Differentiable Open-

31 Oct 19, 2022

Real-Time Semantic Segmentation in Mobile device

Real-Time Semantic Segmentation in Mobile device This project is an example project of semantic segmentation for mobile real-time app. The architectur

708 Jan 01, 2023

unet-family: Ultimate version

unet-family: Ultimate version 基于之前my-unet代码，我整理出来了这一份终极版本unet-family，方便其他人阅读。相比于之前的my-unet代码，代码分类更加规范，有条理对于clone下来的代码不需要修改各种复杂繁琐的路径问题，直接就可以运行。并且代码有

2 Sep 19, 2022

Think Big, Teach Small: Do Language Models Distil Occam’s Razor?

Think Big, Teach Small: Do Language Models Distil Occam’s Razor? Software related to the paper "Think Big, Teach Small: Do Language Models Distil Occa

0 Dec 07, 2021

codebase for "A Theory of the Inductive Bias and Generalization of Kernel Regression and Wide Neural Networks"

Eigenlearning This repo contains code for replicating the experiments of the paper A Theory of the Inductive Bias and Generalization of Kernel Regress

45 Dec 02, 2022

Python scripts for performing lane detection using the LSTR model in ONNX

ONNX LSTR Lane Detection Python scripts for performing lane detection using the Lane Shape Prediction with Transformers (LSTR) model in ONNX. Requirem

29 Aug 30, 2022

This repository implements variational graph auto encoder by Thomas Kipf.

Variational Graph Auto-encoder in Pytorch This repository implements variational graph auto-encoder by Thomas Kipf. For details of the model, refer to

215 Jan 02, 2023

SOFT: Softmax-free Transformer with Linear Complexity, NeurIPS 2021 Spotlight

SOFT: Softmax-free Transformer with Linear Complexity SOFT: Softmax-free Transformer with Linear Complexity, Jiachen Lu, Jinghan Yao, Junge Zhang, Xia

272 Dec 25, 2022

PyTorch implementation for the paper Visual Representation Learning with Self-Supervised Attention for Low-Label High-Data Regime

Visual Representation Learning with Self-Supervised Attention for Low-Label High-Data Regime Created by Prarthana Bhattacharyya. Disclaimer: This is n

5 Nov 08, 2022

Official repository for "Deep Recurrent Neural Network with Multi-scale Bi-directional Propagation for Video Deblurring".

RNN-MBP Deep Recurrent Neural Network with Multi-scale Bi-directional Propagation for Video Deblurring (AAAI-2022) by Chao Zhu, Hang Dong, Jinshan Pan

22 Aug 31, 2022