Please update the directory paths in each file to match your environment.
- Download data using the code provided by the host.
- Preprocess the train data: `preprocess_train.py`
- Create the JSON for finetuning with LLaMA-Factory: `dataprep.py` (a sketch of the expected record format appears after this list)
- Follow the guidelines in the LLaMA-Factory docs to register our dataset (refer to `data/example.json`).
- For finetuning, follow `finetune.ipynb`. Settings for finetuning are registered using the WebUI.
- Run `inference.py`.
- Post-process the predictions: `postprocess.py`
- Evaluation Metric: F1 Score
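
For reference, here is a minimal sketch of the record format `dataprep.py` is expected to emit, assuming LLaMA-Factory's ShareGPT-style multimodal format for Qwen2-VL. The CSV path and column names (`entity_name`, `entity_value`, `image_path`) are assumptions, not the exact script:

```python
# dataprep.py (sketch) -- convert the preprocessed train data into a
# LLaMA-Factory multimodal SFT dataset. Paths/columns are assumptions.
import json
import pandas as pd

df = pd.read_csv("data/train_preprocessed.csv")  # assumed path

records = []
for _, row in df.iterrows():
    records.append({
        "messages": [
            # The <image> token marks where LLaMA-Factory injects the image.
            {"role": "user",
             "content": f"<image>What is the {row['entity_name']}?"},
            {"role": "assistant", "content": str(row["entity_value"])},
        ],
        "images": [row["image_path"]],  # local path to the product image
    })

with open("data/our_dataset.json", "w") as f:
    json.dump(records, f, indent=2)

# Then register the dataset in LLaMA-Factory's data/dataset_info.json, e.g.:
# "our_dataset": {
#   "file_name": "our_dataset.json",
#   "formatting": "sharegpt",
#   "columns": {"messages": "messages", "images": "images"},
#   "tags": {"role_tag": "role", "content_tag": "content",
#            "user_tag": "user", "assistant_tag": "assistant"}
# }
```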
1. Inference Baseline: Qwen2VL-7B-Instruct-AWQ
   - score: 0.617
   - prompt: <Return only value>What is the {entity_name} of this product?
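
   For illustration, a minimal sketch of what this baseline call looks like with the standard `transformers` + `qwen_vl_utils` API; the actual `inference.py` may batch requests and parse outputs differently, and the image path and example entity are assumptions:

```python
# Baseline inference sketch for Qwen2-VL-7B-Instruct-AWQ (illustrative,
# not the exact inference.py). qwen_vl_utils ships with Qwen2-VL examples.
from transformers import Qwen2VLForConditionalGeneration, AutoProcessor
from qwen_vl_utils import process_vision_info

model = Qwen2VLForConditionalGeneration.from_pretrained(
    "Qwen/Qwen2-VL-7B-Instruct-AWQ", torch_dtype="auto", device_map="auto"
)
processor = AutoProcessor.from_pretrained("Qwen/Qwen2-VL-7B-Instruct-AWQ")

entity_name = "item_weight"  # example entity
messages = [{
    "role": "user",
    "content": [
        {"type": "image", "image": "images/sample.jpg"},  # assumed path
        {"type": "text",
         "text": f"<Return only value>What is the {entity_name} of this product?"},
    ],
}]

text = processor.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)
image_inputs, video_inputs = process_vision_info(messages)
inputs = processor(text=[text], images=image_inputs, videos=video_inputs,
                   padding=True, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=32)
# Decode only the newly generated tokens.
print(processor.batch_decode(out[:, inputs["input_ids"].shape[1]:],
                             skip_special_tokens=True)[0])
```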
2. Finetuning
   - prompt: What is the {entity_name}?
   a. 10k Samples
      - Post-processing variants:
        i. Replaced ranges with NA (invalid units also replaced with NA)
           - score: 0.678
        ii. Replaced ranges with the max value
           - score: 0.677
   b. 20k Samples with Cosine Scheduler
      - Inference results:
        - score: 0.679
      - Then finetuned on the curated 1600 samples:
        - lr_scheduler: reduce-lr-on-plateau
        - score: 0.865
   c. 20k Samples, preprocessed
      - Preprocessing:
        - Replaced ranges with the max value
        - Removed entity values with invalid units
      - Then finetuned on the curated 1600 samples:
        - lr_scheduler: reduce-lr-on-plateau
        - score:
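
   The post-processing variants in 2a and the preprocessing in 2c hinge on the same range and unit normalization. Here is a sketch of the idea; the regex and the allowed-unit whitelist are illustrative assumptions rather than the exact `postprocess.py`:

```python
# postprocess.py (sketch) -- normalize predictions like "10 to 20 gram".
import re

VALID_UNITS = {"gram", "kilogram", "centimetre", "volt", "watt"}  # assumed subset

RANGE_RE = re.compile(
    r"^(\d+(?:\.\d+)?)\s*(?:to|-)\s*(\d+(?:\.\d+)?)\s+([a-z ]+)$"
)

def postprocess(pred: str, keep_max: bool = True) -> str:
    pred = pred.strip().lower()
    m = RANGE_RE.match(pred)
    if m:
        if not keep_max:                 # variant i: replace ranges with NA
            return "NA"
        lo, hi, unit = m.groups()        # variant ii: keep the max value
        pred = f"{max(float(lo), float(hi)):g} {unit.strip()}"
    parts = pred.rsplit(" ", 1)
    if len(parts) != 2 or parts[1] not in VALID_UNITS:
        return "NA"                      # invalid units replaced with NA
    return pred

print(postprocess("10 to 20 gram"))         # -> "20 gram"
print(postprocess("10 to 20 gram", False))  # -> "NA"
```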
3. Data Curation
   - Missing {entity_value} replaced with NA
   - Inaccurate {entity_value} corrected
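
   A minimal sketch of the NA-filling step, assuming the labels live in a pandas DataFrame with an `entity_value` column (the correction of inaccurate values was done by hand, so only the NA replacement is shown; file path and column name are assumptions):

```python
# Curation sketch: replace missing labels with the literal string "NA".
import pandas as pd

df = pd.read_csv("data/curated_samples.csv")  # assumed path
df["entity_value"] = df["entity_value"].fillna("NA").replace("", "NA")
df.to_csv("data/curated_samples_clean.csv", index=False)
```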
4. Experimental Settings
   - batch_size: 8
   - learning_rate: 5e-5
   - gradient_accumulation: 8
   - scheduler: chosen per run (cosine or reduce-lr-on-plateau; see above)
   - tool: LLaMA-Factory
   - finetuning method: qlora-8bit
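
   The runs themselves were configured through the LLaMA-Factory WebUI; as a rough illustration only, the settings above map to something like the following `transformers`/`peft` objects (LoRA rank, alpha, and target modules are assumptions not recorded above):

```python
# Illustrative transformers/peft equivalent of the WebUI settings; the
# actual runs were launched from LLaMA-Factory, not this code.
from transformers import BitsAndBytesConfig, TrainingArguments
from peft import LoraConfig

quant_config = BitsAndBytesConfig(load_in_8bit=True)   # "qlora-8bit"

lora_config = LoraConfig(
    r=8, lora_alpha=16,                   # assumed LoRA rank/alpha
    target_modules=["q_proj", "v_proj"],  # assumed target modules
    task_type="CAUSAL_LM",
)

args = TrainingArguments(
    output_dir="outputs/qwen2vl-qlora",   # assumed output dir
    per_device_train_batch_size=8,        # batch_size: 8
    gradient_accumulation_steps=8,        # gradient_accumulation: 8
    learning_rate=5e-5,                   # learning_rate: 5e-5
    lr_scheduler_type="cosine",           # or "reduce_lr_on_plateau"
)
```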