Using captions & CLIP with Lora? #47
-
I actually haven't gotten around to testing captions. I'd be happy to help if you want to try it out with my repo. For now, it's set up to use filenames (although separate text files would not be hard to add), and the class images all get the same class prompt. You might find discussions over at the Stable Diffusion Dreambooth discord helpful: https://discord.gg/JFsWqZFb3e
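To make the filename approach concrete, here is a minimal sketch of how captions could be derived from filenames while every class image shares one class prompt. This is not the code from any repo mentioned in this thread; the function names `caption_from_filename` and `build_prompts`, and the underscore-to-space convention, are assumptions made for illustration.

```python
import re
from pathlib import Path


def caption_from_filename(image_path: str) -> str:
    """Derive a caption from the file name (hypothetical convention).

    Underscores become spaces, and a trailing duplicate counter
    such as " (2)" is dropped.
    """
    stem = Path(image_path).stem
    stem = re.sub(r"\s*\(\d+\)$", "", stem)  # strip " (2)" style suffixes
    return stem.replace("_", " ").strip()


def build_prompts(instance_paths, class_paths, class_prompt):
    """Map each image path to its training prompt.

    Instance images get a per-file caption taken from the file name;
    class images all share the same class prompt.
    """
    prompts = {p: caption_from_filename(p) for p in instance_paths}
    prompts.update({p: class_prompt for p in class_paths})
    return prompts
```

With this convention, a file named `a_photo_of_sks_dog (2).png` would be captioned `a photo of sks dog`, while every class image would simply receive something like `a photo of a dog`.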
-
Maybe! But I can't say for sure. Having multiple prompts per image might help as well. Thanks for the helpful suggestion!
-
Another question on the CLIP train: what are folks' favorite methods for batch processing images with CLIP? A tool like the Automatic1111 web UI, a script, or something else?
-
For others digging into captioning, it looks like the EveryDream repo and Discord are a great resource for this info as well (EveryDream is a Dreambooth-esque tool that uses captioning extensively). There's also a set of accompanying tools to help with batch captioning and the like. I'm still digging into its info and tooling, but it looks interesting both in its own right and as a way to help prep training content for Lora.
-
I have heard a lot of talk about captioning, but I'm having a hard time finding any concrete examples.
-
It sounds like using captions can improve training in Lora significantly (example). @cloneofsimo does that sound right to you? @brian6091 I see you have an option for captions in your Colab as well.
Do folks have any tips on using captions in general, or with Lora specifically?
For instance, .txt files or a single desc.csv file?
Any help or guidance would be much appreciated.
And if there's a resource on using captions or CLIP effectively in training that I haven't been able to find yet, feel free to point me to that as well. Thanks!
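On the .txt versus desc.csv question above, the two layouts carry the same information, so a loader can accept either one. Below is a rough sketch under stated assumptions: each sidecar `.txt` file shares its image's base name, and the CSV has two columns (file name, caption) with no header row. That column layout is a guess for illustration, not a documented format of any tool named in this thread, and the helper names are hypothetical.

```python
import csv
from pathlib import Path


def load_txt_captions(image_dir):
    """Read one caption per sidecar file, e.g. dog1.txt for dog1.png."""
    captions = {}
    for txt in sorted(Path(image_dir).glob("*.txt")):
        captions[txt.stem] = txt.read_text(encoding="utf-8").strip()
    return captions


def load_csv_captions(csv_path):
    """Read captions from a single CSV file.

    Assumed layout (hypothetical): file name in column 0, caption in
    column 1, no header row. Rows with fewer than two columns are skipped.
    """
    with open(csv_path, newline="", encoding="utf-8") as f:
        return {
            Path(row[0]).stem: row[1].strip()
            for row in csv.reader(f)
            if len(row) >= 2
        }
```

Both functions return a dict keyed by the image's base name, so the rest of a training script would not need to care which layout was used.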