SwinTextSpotter

This is the pytorch implementation of Paper: SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting. The paper is available at this link.

We use the models pre-trained on ImageNet. The ImageNet pre-trained SwinTransformer backbone is obtained from SwinT_detectron2.

Installation

Python=3.8
PyTorch=1.8.0, torchvision=0.9.0, cudatoolkit=11.1
OpenCV for visualization

Steps

Install the repository (we recommend to use Anaconda for installation.)

conda create -n SWINTSv2 python=3.8 -y
conda activate SWINTSv2
conda install pytorch==1.8.0 torchvision==0.9.0 torchaudio==0.8.0 cudatoolkit=11.1 -c pytorch -c conda-forge
pip install opencv-python
pip install scipy
pip install shapely
pip install rapidfuzz
pip install timm
pip install Polygon3
git clone https://github.com/mxin262/SwinTextSpotterv2.git
cd SwinTextSpotterv2
python setup.py build develop

dataset path

datasets
|_ totaltext
|  |_ train_images
|  |_ test_images
|  |_ totaltext_train.json
|  |_ weak_voc_new.txt
|  |_ weak_voc_pair_list.txt
|_ mlt2017
|  |_ train_images
|  |_ annotations/icdar_2017_mlt.json
.......

Downloaded images

ICDAR2017-MLT [image]
Syntext-150k:
- Part1: 94,723 [dataset]
- Part2: 54,327 [dataset]
ICDAR2015 [image]
ICDAR2013 [image]
Total-Text_train_images [image]
Total-Text_test_images [image]
ReCTs [images&label] PW: 2b4q
LSVT [images&label] PW: 9uh1
ArT [images&label] PW: 2865
SynChinese130k [images][label]
Vintext_images [image]

Downloaded label[Google Drive] [BaiduYun] PW: 46vd

Downloader lexicion[Google Drive] and place it to corresponding dataset.

You can also prepare your custom dataset following the example scripts. [example scripts]

Totaltext

To evaluate on Total Text, CTW1500, ICDAR2015, first download the zipped annotations and unzip it

Pretrain SWINTSv2 (e.g., with Swin-Transformer backbone)

python projects/SWINTSv2/train_net.py \
  --num-gpus 8 \
  --config-file projects/SWINTSv2/configs/SWINTS-swin-pretrain.yaml

Fine-tune model on the mixed real dataset

python projects/SWINTSv2/train_net.py \
  --num-gpus 8 \
  --config-file projects/SWINTSv2/configs/SWINTS-swin-mixtrain.yaml

Fine-tune model

python projects/SWINTSv2/train_net.py \
  --num-gpus 8 \
  --config-file projects/SWINTSv2/configs/SWINTS-swin-finetune-totaltext.yaml

Evaluate SWINTSv2 (e.g., with Swin-Transformer backbone)

python projects/SWINTSv2/train_net.py \
  --config-file projects/SWINTSv2/configs/SWINTS-swin-finetune-totaltext.yaml \
  --eval-only MODEL.WEIGHTS ./output/model_final.pth

Visualize the detection and recognition results (e.g., with ResNet50 backbone)

python demo/demo.py \
  --config-file projects/SWINTSv2/configs/SWINTS-swin-finetune-totaltext.yaml \
  --input input1.jpg \
  --output ./output \
  --confidence-threshold 0.4 \
  --opts MODEL.WEIGHTS ./output/model_final.pth

Copyright

For commercial purpose usage, please contact Prof. Yuliang Liu: [email protected] and Prof. Lianwen Jin: [email protected]

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
build		build
configs		configs
demo		demo
detectron2.egg-info		detectron2.egg-info
detectron2		detectron2
dev		dev
docker		docker
docs		docs
projects/SWINTSv2		projects/SWINTSv2
tests		tests
tools		tools
GETTING_STARTED.md		GETTING_STARTED.md
INSTALL.md		INSTALL.md
MODEL_ZOO.md		MODEL_ZOO.md
README.md		README.md
chn_cls_list.txt		chn_cls_list.txt
eng_cls_dict.txt		eng_cls_dict.txt
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

SwinTextSpotter

Installation

Steps

Totaltext

Copyright

About

Uh oh!

Releases

Packages

Languages

mxin262/SwinTextSpotterv2

Folders and files

Latest commit

History

Repository files navigation

SwinTextSpotter

Installation

Steps

Totaltext

Copyright

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages