Lumina-Image 2.0: A Unified and Efficient Image Generative Model

📰 News

[2024-2-5] ComfyUI now supports Lumina-Image 2.0! 🎉 Thanks to ComfyUI@ComfyUI! 🙌 Feel free to try it out! 🚀
[2024-1-31] We have released the latest .pth format weight file on Google Drive.
[2024-1-25] 🚀🚀🚀 We are excited to release Lumina-Image 2.0, including:
- 🎯 Checkpoints, Fine-Tuning and Inference code.
- 🎯 Website & Demo are live now! Check out the Huiying and Gradio Demo!

📑 Open-source Plan

🎥 Demo

Demo.mp4

🎨 Qualitative Performance

📊 Quantitative Performance

🎮 Model Zoo

Resolution	Parameter	Text Encoder	VAE	Download URL
1024	2.6B	Gemma-2-2B	FLUX-VAE-16CH	Hugging Face

💻 Finetuning Code

1. Create a conda environment and install PyTorch

conda create -n Lumina2 -y
conda activate Lumina2
conda install python=3.11 pytorch==2.1.0 torchvision==0.16.0 torchaudio==2.1.0 pytorch-cuda=12.1 -c pytorch -c nvidia -y
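
After the install finishes, a quick sanity check can confirm that the pinned versions actually landed in the environment. The snippet below is a minimal sketch, not part of the repository: `check_versions` and `EXPECTED` are hypothetical names, and the expected versions are simply copied from the conda command above.

```python
from importlib import metadata

# Versions pinned by the conda command above (copied here as an assumption
# about what a correct environment should report).
EXPECTED = {"torch": "2.1.0", "torchvision": "0.16.0", "torchaudio": "2.1.0"}


def check_versions(expected):
    """Return {package: (installed_or_None, expected, matches)}."""
    report = {}
    for name, want in expected.items():
        try:
            have = metadata.version(name)
        except metadata.PackageNotFoundError:
            have = None  # package is not installed in this environment
        # Ignore local build tags such as "+cu121" when comparing.
        ok = have is not None and have.split("+")[0] == want
        report[name] = (have, want, ok)
    return report


if __name__ == "__main__":
    for name, (have, want, ok) in check_versions(EXPECTED).items():
        status = "OK" if ok else "MISMATCH"
        print(f"{name}: installed={have} expected={want} {status}")
```

Run it inside the activated Lumina2 environment; any MISMATCH line suggests the conda step should be repeated before moving on.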

2. Install dependencies

pip install -r requirements.txt

3. Install flash-attn

pip install flash-attn --no-build-isolation

4. Prepare data

List the paths to your data files in ./configs/data.yaml. Each image-text pair in your training data should follow this format:

{
    "image_path": "path/to/your/image",
    "prompt": "a description of the image"
}
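
Records in this format can be generated with a short script. The sketch below is illustrative only: `make_record` and `write_annotations` are hypothetical helpers, and it assumes the data file is a JSON list of such records, which the README does not state explicitly. Check the repository's data loader before relying on that layout.

```python
import json
from pathlib import Path

# Keys taken from the record format shown above.
REQUIRED_KEYS = ("image_path", "prompt")


def make_record(image_path, prompt):
    """Build one image-text record, rejecting empty prompts."""
    if not prompt.strip():
        raise ValueError(f"empty prompt for {image_path}")
    return {"image_path": str(image_path), "prompt": prompt.strip()}


def write_annotations(records, out_path):
    """Validate records and write them as a JSON list (assumed layout)."""
    for i, rec in enumerate(records):
        missing = [k for k in REQUIRED_KEYS if k not in rec]
        if missing:
            raise KeyError(f"record {i} is missing {missing}")
    text = json.dumps(records, indent=4, ensure_ascii=False)
    Path(out_path).write_text(text, encoding="utf-8")
    return len(records)
```

For example, `write_annotations([make_record("images/cat.jpg", "a cat on a sofa")], "train.json")` writes a single record; the resulting file path is what you would then list in ./configs/data.yaml.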

5. Start finetuning

bash scripts/run_1024_finetune.sh

🚀 Inference Code

We support multiple solvers including Midpoint Solver, Euler Solver, and DPM Solver for inference.
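
To give a feel for how two of these solvers differ, here is a minimal sketch of the textbook Euler and midpoint update rules applied to a toy ODE. It is not the repository's sampling code: `euler_step`, `midpoint_step`, `integrate`, and `toy_velocity` are hypothetical names, and the scalar velocity function stands in for a model's predicted velocity.

```python
import math


def euler_step(v, x, t, dt):
    """One Euler step: follow the velocity evaluated at the start of the step."""
    return x + dt * v(x, t)


def midpoint_step(v, x, t, dt):
    """One midpoint step: take a half step, then use the velocity found there."""
    x_mid = x + 0.5 * dt * v(x, t)
    return x + dt * v(x_mid, t + 0.5 * dt)


def integrate(v, x0, steps, step_fn, t0=0.0, t1=1.0):
    """Integrate dx/dt = v(x, t) from t0 to t1 with a fixed number of steps."""
    dt = (t1 - t0) / steps
    x, t = x0, t0
    for _ in range(steps):
        x = step_fn(v, x, t, dt)
        t += dt
    return x


def toy_velocity(x, t):
    """Toy field dx/dt = -x, whose exact solution is x(t) = x0 * exp(-t)."""
    return -x


if __name__ == "__main__":
    exact = math.exp(-1.0)
    for name, fn in (("euler", euler_step), ("midpoint", midpoint_step)):
        approx = integrate(toy_velocity, 1.0, 10, fn)
        print(f"{name}: error = {abs(approx - exact):.6f}")
```

The midpoint rule evaluates the velocity twice per step, so it costs roughly two model calls per step in exchange for a smaller integration error at the same step count.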

Note

Both the Gradio demo and the direct inference method use the .pth format weight file, which can be downloaded from Google Drive.

Note

You can also download the .pth weight files directly from Hugging Face, then set the --ckpt argument to the download directory.

Gradio Demo

python demo.py \
    --ckpt /path/to/your/ckpt \
    --res 1024 \
    --port 12123

Direct Batch Inference

bash scripts/sample.sh
