conda env create -f environment.yml
- Create a dataset folder that will contain the train/test files and the train/test images (a directory-setup sketch follows this list).
- Download and store the train/test images at:
  - ./dataset/train_images
  - ./dataset/test_images
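For convenience, here is a minimal sketch of a hypothetical helper (not part of the repo) that creates this layout; the CSV filenames in the comment are assumptions:

```python
# create_dataset_dirs.py -- hypothetical helper, not part of the repo
from pathlib import Path

# Expected layout used by the scripts below:
#   ./dataset/train.csv, ./dataset/test.csv   (assumed filenames)
#   ./dataset/train_images/, ./dataset/test_images/
for sub in ("train_images", "test_images"):
    Path("dataset", sub).mkdir(parents=True, exist_ok=True)
```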
OCR using easyOCR
python easyocr_text_extract.py --start_idx <first_row_index> --end_idx <end_row_index>
EasyOCR uses a GPU to extract the text. Set the environment variable CUDA_VISIBLE_DEVICES
if you wish to use a GPU other than the default (GPU 0).
Modify the image and input/output file paths in the Python script as needed.
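As a reference, here is a minimal sketch of this kind of row-range OCR extraction. The EasyOCR calls are the library's real API, but the CSV path, column name, and output filename are assumptions and may not match the script's actual interface:

```python
import argparse
import os

import easyocr
import pandas as pd

parser = argparse.ArgumentParser()
parser.add_argument("--start_idx", type=int, required=True)
parser.add_argument("--end_idx", type=int, required=True)
args = parser.parse_args()

# GPU selection: export CUDA_VISIBLE_DEVICES before creating the reader,
# e.g. CUDA_VISIBLE_DEVICES=1 python easyocr_text_extract.py ...
reader = easyocr.Reader(["en"], gpu=True)

df = pd.read_csv("./dataset/train.csv")            # assumed input path
rows = df.iloc[args.start_idx:args.end_idx]

texts = []
for _, row in rows.iterrows():
    image_path = os.path.join("./dataset/train_images", row["image_name"])  # assumed column name
    # readtext returns a list of (bbox, text, confidence) tuples
    results = reader.readtext(image_path)
    texts.append(" ".join(text for _, text, _ in results))

rows = rows.assign(ocr_text=texts)
rows.to_csv(f"ocr_{args.start_idx}_{args.end_idx}.csv", index=False)  # assumed output path
```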
Zero Shot Inference using HuggingFaceM4/idefics2-8b
python zero_shot_idefice.py --start <start_index> --end <end_index> --input_csv <input_csv_path> --output_csv <output_csv_path>
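The script wraps a zero-shot generation call to the public HuggingFaceM4/idefics2-8b checkpoint through Transformers. Below is a minimal single-image sketch of that call; the prompt, image path, dtype, and generation settings are placeholders, not the script's actual configuration:

```python
import torch
from PIL import Image
from transformers import AutoProcessor, AutoModelForVision2Seq

checkpoint = "HuggingFaceM4/idefics2-8b"
processor = AutoProcessor.from_pretrained(checkpoint)
model = AutoModelForVision2Seq.from_pretrained(
    checkpoint, torch_dtype=torch.float16, device_map="auto"
)

image = Image.open("./dataset/test_images/example.jpg")   # assumed image path
question = "What text is written in this image?"          # assumed prompt

# Idefics2 expects a chat-style prompt with interleaved image/text turns
messages = [
    {"role": "user",
     "content": [{"type": "image"}, {"type": "text", "text": question}]}
]
prompt = processor.apply_chat_template(messages, add_generation_prompt=True)
inputs = processor(text=prompt, images=[image], return_tensors="pt").to(model.device)

generated = model.generate(**inputs, max_new_tokens=128)
print(processor.batch_decode(generated, skip_special_tokens=True)[0])
```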
Training
We used Hugging Face Transformers with Accelerate for distributed training on 4 V100 GPUs. Modify the number of GPUs in accelerate_config.yaml
as needed.
accelerate launch --config_file <config_file_path> train.py
There are default paths in train.py and the data collator for reading train.csv and train_images; change them to match your directory structure.
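The GPU count corresponds to the num_processes field of accelerate_config.yaml. For orientation, here is a minimal sketch of the standard Accelerate training pattern used under `accelerate launch`; the model, optimizer, dataloader, and checkpoint path are placeholders, and the repo's train.py may differ:

```python
from accelerate import Accelerator

def training_loop(model, optimizer, train_dataloader, num_epochs=1):
    # Process setup (number of GPUs, mixed precision, etc.) comes from the
    # config passed to `accelerate launch --config_file accelerate_config.yaml`
    accelerator = Accelerator()
    model, optimizer, train_dataloader = accelerator.prepare(model, optimizer, train_dataloader)

    model.train()
    for epoch in range(num_epochs):
        for batch in train_dataloader:
            outputs = model(**batch)        # assumes the collator returns model kwargs incl. labels
            loss = outputs.loss
            accelerator.backward(loss)      # replaces loss.backward() for distributed training
            optimizer.step()
            optimizer.zero_grad()

    # unwrap before saving so the checkpoint loads outside the distributed wrapper
    accelerator.wait_for_everyone()
    accelerator.unwrap_model(model).save_pretrained(
        "checkpoint",                       # placeholder output directory
        is_main_process=accelerator.is_main_process,
        save_function=accelerator.save,
    )
```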
Evaluation
python eval.py --start_idx <start_idx> --end_idx <end_idx>
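A minimal sketch of the row-range evaluation pattern follows, assuming eval.py compares a prediction column against a ground-truth column with exact match; the CSV path, column names, and metric are assumptions and may differ from the actual script:

```python
import argparse

import pandas as pd

parser = argparse.ArgumentParser()
parser.add_argument("--start_idx", type=int, required=True)
parser.add_argument("--end_idx", type=int, required=True)
args = parser.parse_args()

df = pd.read_csv("predictions.csv")            # assumed output of the inference step
rows = df.iloc[args.start_idx:args.end_idx]

# exact-match accuracy over the selected rows (assumed metric and column names)
matches = (rows["prediction"].astype(str).str.strip()
           == rows["ground_truth"].astype(str).str.strip())
print(f"Exact match on rows [{args.start_idx}, {args.end_idx}): {matches.mean():.4f}")
```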
Contributors
- Shreykumar Satapara
- Sayanta Adhikari
- Arkaprava Majumdar
- Rishabh Karnad