- Check the new dataset; it seems to have some problems.
- Try https://github.com/frotms/PaddleOCR2Pytorch.git to see whether PaddleOCR can be converted to PyTorch so its training script can be utilized.
- Data Parallel
- Python >= 3.10
- Third-party Packages
  - paddle
    - CPU version (recommended):
      `pip install paddlepaddle`
    - GPU version:
      `pip install paddlepaddle-gpu`
      - If using the GPU version, please ensure that the cuDNN libraries are located in the default path before running.
      - If the cuDNN libraries are not in the default environment, add their installation path to the session's `$LD_LIBRARY_PATH` before executing Python:
        `export LD_LIBRARY_PATH=/path/to/your/libraries:$LD_LIBRARY_PATH`
      - e.g., if cuDNN was installed via conda into a custom environment, run
        `export LD_LIBRARY_PATH=${HOME}/.conda/envs/tenv/lib:$LD_LIBRARY_PATH`
        and then execute the Python commands.
  - PaddleOCR:
    `pip install paddleocr`
    - After the initial PaddleOCR setup, the OCR models will be saved at:
      - `~/.paddleocr/whl/det/en/en_PP-OCRv3_det_infer.tar`
      - `~/.paddleocr/whl/rec/en/en_PP-OCRv4_rec_infer.tar`
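A minimal usage sketch to confirm the setup with a vanilla `paddleocr` install (i.e., before the `decode()` patch described next). The image path is only an example, reused from the inference demo below, and the result layout can vary between PaddleOCR versions:

```python
# Verify the paddle install, then run English OCR on a single image.
import paddle
from paddleocr import PaddleOCR

paddle.utils.run_check()                     # reports whether PaddlePaddle (and CUDA) is working

ocr = PaddleOCR(lang="en")                   # first run downloads the det/rec models listed above
result = ocr.ocr("./dataset/cars/0.png", cls=False)
for box, (text, conf) in result[0]:          # each entry: [bounding box, (text, confidence)]
    print(text, conf)
```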
I have modified the PaddleOCR code: in the `decode()` function at line 136 of `paddleocr/ppocr/postprocess/rec_postprocess.py`, I added an argument `raw_prob=None` and changed the output from

```python
result_list.append((text, np.mean(conf_list).tolist()))
```

to

```python
result_list.append(
    (text, np.mean(conf_list).tolist(), conf_list) if raw_prob is None
    else (text, np.mean(conf_list).tolist(), conf_list, raw_prob[batch_idx])
)
```

In addition, inside `CTCLabelDecode.__call__()`, I changed

```python
text = self.decode(
    preds_idx,
    preds_prob,
    is_remove_duplicate=True,
    return_word_box=return_word_box,
)
```

to

```python
text = self.decode(
    preds_idx,
    preds_prob,
    is_remove_duplicate=True,
    return_word_box=return_word_box,
    raw_prob=preds
)
```

in order to obtain the raw logits.
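A short sketch of how the extended output could be consumed (an illustration only: it assumes the patch above is applied, that `res` is one entry of the list built by the patched `decode()`, and that `raw_prob` has shape `(T, num_classes)` as is typical for CTC outputs):

```python
import numpy as np

# With raw_prob passed in, each decoded line is a 4-tuple instead of a 2-tuple.
text, mean_conf, conf_list, raw_prob = res
print(text, mean_conf)
print("per-character confidences:", [round(float(c), 3) for c in conf_list])

# Top-2 candidate classes per timestep, handy for inspecting ambiguous plate characters.
top2 = np.argsort(raw_prob, axis=-1)[:, -2:][:, ::-1]
print(top2[:5])
```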
- The LPDGAN model (paper: A Dataset and Model for Realistic License Plate Deblurring) is used for deblurring license plates to enhance the accuracy of OCR-based license plate recognition.
Train it with the command: `python train_LPDGAN.py`
- Important Args:
- `--data_root`: The root of the dataset, which should contain a `sharp` folder and the blur folders, e.g. `./dataset/tw_pos/train/`:

  ```
  .
  └── dataset/
      └── tw_pos/
          └── train/
              ├── sharp/
              │   ├── 1.jpg
              │   ├── 2.jpg
              │   └── ...
              ├── blur/
              │   ├── 1.jpg
              │   ├── 2.jpg
              │   └── ...
              ├── blur_little/
              │   ├── 1.jpg
              │   ├── 2.jpg
              │   └── ...
              ├── blur_mosiac/
              │   ├── 1.jpg
              │   ├── 2.jpg
              │   └── ...
              └── ...
  ```

  Each folder should hold the same images, differing only in the level of blurring (`sharp` is the target and should contain only clear images). A small sanity-check sketch is given after the example command below.
- `--blur_aug`: Specifies the kinds of blur to be used.
  - e.g., `--blur_aug blur_mosaic blur blur_little` will pair the images in those 3 folders with the sharp images for training LPDGAN.
- `--n_epochs` & `--n_epochs_decay`: The number of epochs using the original lr and the number of epochs over which the lr is decayed (see the scheduler sketch after this list).
- `--save_epoch_freq`: How many epochs between validations.
- `--model_save_root`: The root directory in which to save the model.
- `--val_data_root`: Same as the description of `--data_root` in the Evaluation part.
- `--label_file`: Same as the description in the Evaluation part.
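A sketch of the learning-rate schedule described by `--n_epochs` and `--n_epochs_decay`, assuming the common pix2pix-style rule (constant lr for `n_epochs`, then linear decay towards 0 over `n_epochs_decay`); this is an illustration, not the repository's exact scheduler code:

```python
import torch

n_epochs, n_epochs_decay, initial_lr = 100, 100, 2e-4    # values from the tables below

def lr_lambda(epoch: int) -> float:
    """Multiplier on initial_lr: 1.0 for the first n_epochs, then linear decay towards 0."""
    return 1.0 - max(0, epoch + 1 - n_epochs) / float(n_epochs_decay + 1)

model = torch.nn.Linear(4, 4)                             # placeholder module
optimizer = torch.optim.Adam(model.parameters(), lr=initial_lr)
scheduler = torch.optim.lr_scheduler.LambdaLR(optimizer, lr_lambda=lr_lambda)

for epoch in range(n_epochs + n_epochs_decay):
    # ... one training epoch ...
    scheduler.step()
```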
- E.g., `python train_LPDGAN.py --model_save_root ./LPDGAN/checkpoints/exp --save_epoch_freq 10 --val_data_root ./dataset/third_party/blur/ --label_file ./dataset/third_party/label.json`
  The epoch with the best validation OCR accuracy among the validated epochs will then be saved.
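Before training, it is worth checking that the folder layout matches what `--data_root` expects. A minimal sketch (a hypothetical helper, not part of the repo; folder names taken from the tree above):

```python
from pathlib import Path

data_root = Path("./dataset/tw_pos/train")
blur_aug = ["blur", "blur_little", "blur_mosiac"]   # folder names as they appear above

# Every blur folder should contain the same filenames as sharp/, since images are paired by name.
sharp_names = {p.name for p in (data_root / "sharp").iterdir()}
for folder in blur_aug:
    names = {p.name for p in (data_root / folder).iterdir()}
    missing = sorted(sharp_names - names)
    print(f"{folder}: {len(names)} images, {len(missing)} missing pairs", missing[:5])
```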
- Since the downstream task involves OCR for license plate recognition, we use OCR accuracy (1 - CER) as the evaluation metric on deblurred license plate images. Image restoration metrics like SSIM, PSNR, and Perceptual Loss are not used in this evaluation.
- `python ocr_eval.py --label_file {the label file} --data_root {the root of the test images} --deblur {the weights of the trained SwinTrans_G} --deblur_ckpt_dir {the directory of the pretrained SwinTrans_G, with a default value}`
- e.g., `python ocr_eval.py --data_root ./dataset/third_party/blur/ --label_file ./dataset/third_party/label.json --deblur net_G.pt`
- `label_file`:
  - A JSON object file where each key-value pair is in the format `filename: license_plate`.
  - e.g.: `{"1.jpg": "AAA0000", "2.jpg": "BB111", ... }`
- `data_root`: The root directory where the testing images are stored. During execution, each image filename in `label_file` is combined with `data_root` to form the path of each test image.
  - e.g., with `--data_root ./dataset/third_party/blur/` and the sample `label_file` above, the full image paths in `ocr_eval.py` will be `["./dataset/third_party/blur/1.jpg", "./dataset/third_party/blur/2.jpg", ...]`, and the labels will be generated in the order specified by `label_file` as `["AAA0000", "BB111", ...]`.
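To make the metric concrete, here is a self-contained sketch of how `label_file` and `data_root` are combined and how OCR accuracy (1 - CER) can be computed; `run_ocr` is a hypothetical stand-in for the recognizer, and this is not the repository's `ocr_eval.py`:

```python
import json
import os

def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance via dynamic programming (single rolling row)."""
    dp = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        prev, dp[0] = dp[0], i
        for j, cb in enumerate(b, 1):
            prev, dp[j] = dp[j], min(dp[j] + 1, dp[j - 1] + 1, prev + (ca != cb))
    return dp[-1]

def ocr_accuracy(pred: str, gt: str) -> float:
    """OCR accuracy = 1 - CER, clipped at 0 for very bad predictions."""
    return max(0.0, 1.0 - edit_distance(pred, gt) / max(len(gt), 1))

data_root = "./dataset/third_party/blur/"
with open("./dataset/third_party/label.json", encoding="utf-8") as f:
    labels = json.load(f)                            # {"1.jpg": "AAA0000", ...}

img_paths = [os.path.join(data_root, name) for name in labels]
gts = list(labels.values())

# preds = [run_ocr(p) for p in img_paths]            # run_ocr: hypothetical recognizer call
# print(sum(ocr_accuracy(p, g) for p, g in zip(preds, gts)) / len(gts))
```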
- Number of validation license plates: 1485

| LCS rate baseline | OCR accuracy baseline |
|---|---|
| 0.51 | 0.49 |
- For Text L1 Loss
- On the new dataset
- Shared settings:

| Batch size | initial lr | epochs lr fix | epochs lr linearly decay to 0 | validation cycle |
|---|---|---|---|---|
| 15 | $2\times 10^{-4}$ | 100 | 100 | 5 |

| Text Extraction Module | LCS rate | OCR accuracy |
|---|---|---|
| pretrained CRNN from easyocr | 205 tmux 2 | 205 tmux 2 |
| paddleOCR | 205 tmux 3 | 205 tmux 3 |

- Shared settings:

| Batch size | text extraction module | validation cycle |
|---|---|---|
| 15 | paddle OCR | 5 |

| initial lr | epochs lr fix | epochs lr linearly decay to 0 | LCS rate | OCR accuracy |
|---|---|---|---|---|
|  | 100 | 100 | 126 tmux 8 | 126 tmux 8 |
|  | 50 | 150 | TODO | TODO |
|  | 100 | 100 | TODO | TODO |
|  | 50 | 150 | 126 tmux 4 | 126 tmux 4 |

| initial lr | epochs lr fix | epochs lr linearly decay to 0 | LCS rate | OCR accuracy |
|---|---|---|---|---|
|  | 100 | 100 | 126 tmux 7 | 126 tmux 7 |
|  | 50 | 150 | TODO | TODO |
|  | 100 | 100 | 205 tmux 3 | 205 tmux 3 |
|  | 50 | 150 | 126 tmux 3 | 126 tmux 3 |
Refer to `unit_inference.py` for examples.

Function: `recognition_a_car()`
This function demonstrates how to recognize a license plate from a cropped car image.

Parameters:
- `--img`: Path to the cropped car image for license plate recognition.
- `--lp_yolo`: Path to the YOLOv8 model weights for license plate detection (default provided).
- `--deblur`: Use LPDGAN for deblurring (flag: store_true).
- `--lpdgan`: Path to the pre-trained LPDGAN generator weights (default provided).

Please note that providing only the `--lpdgan` path will not enable deblurring; deblurring is applied only when the `--deblur` option is explicitly set.
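For instance (a hedged illustration; the checkpoint path below is hypothetical), deblurring with a specific generator checkpoint requires both options together:

```bash
# --deblur turns deblurring on; --lpdgan only selects which generator weights to load.
python unit_inference.py --img ./dataset/cars/0.png --deblur --lpdgan ./LPDGAN/checkpoints/exp/net_G.pt
```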
The license plate recognition result is the `txt` string variable in `recognition_a_car()`.

Example:
`python unit_inference.py --img ./dataset/cars/0.png --deblur`
The image used in the demo command above is from private data. If you would like to run the demo, please request access at the following link: Google Drive Link.
- Please provide your identity and your reason for requesting access.
This project utilizes code and resources from the following repositories:
- LPDGAN
  - original paper: https://www.ijcai.org/proceedings/2024/0086.pdf
- Automatic-Number-Plate-Recognition-Using-YOLOv8-EasyOCR
- PaddleOCR2Pytorch/pytorchocr
We deeply appreciate the work of these developers and their contributions to the open-source community.