SSTDNet

Implement 'Single Shot Text Detector with Regional Attention, ICCV 2017 Spotlight' using pytorch.

This code is work for general object detection problem. not for (oriented) text detection problem. I will probably update to handle oriented bounding box as soon as possible :)

[How to use]

you need dataset.

dataset structure is..

/train/0.jpg, /train/0.txt, /valid/0.jpg, /valid/0.txt, ....
0.txt contain position and label of objects like below

(xmin, ymin, xmax, ymax, label)

1273.0 935.0 1407.0 1017.0 v1

911.0 893.0 979.0 953.0 v1

984.0 889.0 1053.0 948.0 v1
To encode label name to integer number, you should define labels in the 'class_lable_map.xlsx"
v1 1

v2 2

....
* start from 1. not from 0. 0 will be background (in the loss.py).

need some settings for dataset reader.

- see train.py. you can find some code for reading dataset
```
  'trainset = ListDataset(root="../train", gt_extension=".txt", labelmap_path="class_label_map.xlsx", is_train=True, transform=transform, input_image_size=512, num_crops=n_crops, original_img_size=2048)'
  
```
- you should set the 'input_image_size' and 'original_img_size'. 'input_image_size' is size of (cropped) image for train. And 'original_img_size' is size of (original) image. I made this parameter to handle high resolution image. if you don't need crop function, -1 for num_crops.
Train with your dataset!
you should define some parameter like learning rate, which optimizer to use, size of batch etc.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.gitignore		.gitignore
README.md		README.md
class_label_map.xlsx		class_label_map.xlsx
datagen.py		datagen.py
encoder.py		encoder.py
inception.py		inception.py
loss.py		loss.py
sstdnet.py		sstdnet.py
test.py		test.py
test_multi.py		test_multi.py
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.gitignore

.gitignore

README.md

README.md

class_label_map.xlsx

class_label_map.xlsx

datagen.py

datagen.py

encoder.py

encoder.py

inception.py

inception.py

loss.py

loss.py

sstdnet.py

sstdnet.py

test.py

test.py

test_multi.py

test_multi.py

train.py

train.py

utils.py

utils.py

Repository files navigation

SSTDNet

Implement 'Single Shot Text Detector with Regional Attention, ICCV 2017 Spotlight' using pytorch.

This code is work for general object detection problem. not for (oriented) text detection problem. I will probably update to handle oriented bounding box as soon as possible :)

About

Releases

Packages

Languages

HotaekHan/SSTDNet

Folders and files

Latest commit

History

Repository files navigation

SSTDNet

Implement 'Single Shot Text Detector with Regional Attention, ICCV 2017 Spotlight' using pytorch.

This code is work for general object detection problem. not for (oriented) text detection problem. I will probably update to handle oriented bounding box as soon as possible :)

About

Resources

Stars

Watchers

Forks

Languages