LargeDiT T2i, which text encoder should be used? #178

Miracle2333 · 2024-03-12T02:44:38Z

I loaded the pretrained text encoder from the official LLaMA-2 release, and the generated results are random noise. Which text encoder should be used? Could you specify the Hugging Face repo ID?

gaopengpjlab · 2024-03-12T03:05:50Z

We use a frozen LLaMA-7B as the text encoder. Please download our T2I checkpoint, which bundles the frozen text encoder and the diffusion backbone in a single checkpoint.

gaopengpjlab · 2024-03-12T03:08:44Z

https://huggingface.co/Alpha-VLLM/Large-DiT/tree/main/240308_3b_1024

gaopengpjlab · 2024-03-12T03:52:14Z

Please note that our pretrained checkpoints only support high-resolution image generation.

Miracle2333 · 2024-03-12T07:58:08Z

> We use a frozen LLaMA-7B as the text encoder. Please download our T2I checkpoint, which bundles the frozen text encoder and the diffusion backbone in a single checkpoint.

Hi,

I pulled the checkpoint from the Hugging Face repo and found that it doesn't contain the text encoder. In addition, the code in demo.py shows that the text encoder checkpoint has to be loaded from a separate Hugging Face repo. Could you specify which text encoder should be loaded here?
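For anyone who wants to check what the checkpoint folder actually ships, here is a minimal sketch that lists its files on the Hub. The repo ID and folder name are taken from the link above. The helper names `files_in_subfolder` and `list_checkpoint_files` are made up for illustration and are not part of the LargeDiT code. It assumes the `huggingface_hub` package is installed and that the repo is publicly readable.

```python
from typing import Iterable, List

REPO_ID = "Alpha-VLLM/Large-DiT"  # taken from the link in this thread
SUBFOLDER = "240308_3b_1024"      # taken from the link in this thread


def files_in_subfolder(paths: Iterable[str], subfolder: str) -> List[str]:
    """Keep only the repo paths that live directly or indirectly under `subfolder`."""
    # The trailing slash prevents matching sibling folders that merely share a prefix.
    prefix = subfolder.rstrip("/") + "/"
    return sorted(p for p in paths if p.startswith(prefix))


def list_checkpoint_files(repo_id: str = REPO_ID,
                          subfolder: str = SUBFOLDER) -> List[str]:
    """List the files of one checkpoint folder on the Hugging Face Hub (needs network)."""
    # Imported lazily so the pure helper above can be used without the package.
    from huggingface_hub import list_repo_files

    return files_in_subfolder(list_repo_files(repo_id), subfolder)
```

Calling `print("\n".join(list_checkpoint_files()))` prints the file names in that folder, which makes it easy to see whether any text encoder weights are included alongside the diffusion backbone.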
