Pytorch-model-quantization

Part of AI/ML model deployment on edge devices includes reducing its memory size and weights type in order to ease on resources needed for using the model. This two short scripts are used for doing it semi automatically using derivative of export.py from yolov5 and quantize_torch_model.py The end part requires cli (command written at the end of quantize_torch_model.py) I executed it in yolov5 docker enviornment(https://github.com/ultralytics/yolov5/wiki/Docker-Quickstart) and in the yolov5 folder (git clone https://github.com/ultralytics/yolov5 # clone, cd yolov5, pip install -r requirements.txt # install) to simplify and adjust enviornment requirments. To ease the process, please put the model and export_unparsed.py in yolov5 folder

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
README.md		README.md
export_unparsed.py		export_unparsed.py
min_row_data.py		min_row_data.py
quantize_torch_model.py		quantize_torch_model.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Pytorch-model-quantization

About

Releases

Packages

Languages

arielsolomon/Pytorch-model-quantization

Folders and files

Latest commit

History

Repository files navigation

Pytorch-model-quantization

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages