This repository contains a unified collection of Llama-based projects:
- Dataset Builder: Tools for creating and preprocessing training datasets
- Llama Fine-tuning: Scripts and utilities for fine-tuning Llama models
- Llama RAG: Retrieval-Augmented Generation implementation using Llama
.
├── dataset_builder/ # Dataset creation and preprocessing
├── llama_finetune/ # Fine-tuning scripts and configurations
└── llama3_rag_project/ # RAG implementation
- Install Poetry if you haven't already:
curl -sSL https://install.python-poetry.org | python3 -
- Install dependencies:
poetry install
- Activate the virtual environment:
poetry shell
Each subproject can be used independently within the unified environment:
- Dataset Builder: Tools for creating training datasets
- Llama Fine-tuning: Fine-tune Llama models on custom datasets
- RAG: Implement retrieval-augmented generation using Llama
See individual project directories for specific usage instructions.