Best Settings, anyone? #7
Comments
I don't think there is "best settings" for such complex things as videos, but rather "strategies for better results". What I feel would be needed:
While we're on the topic of caption generation, you could also hook up something like comfyui-ollama to help create descriptions for img2vid. What I would like to do is set up folder batching to handle multiple images at once while saving metadata (like the noise seed), in case I want to go back and tweak a usable candidate. The noise seed seems to play a massive role in the quality of the generation and in whether it focuses on the correct subject.
@scofano This is a good find, thank you. I'm going to pipe this into llama vision and see how that fares.
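The folder batching idea above (several images, each with its noise seed recorded for later tweaking) could be sketched roughly as follows. This is only a minimal illustration using the Python standard library, not a ComfyUI node or API: the function names `plan_batch` and `save_sidecar`, the JSON sidecar layout, and the 32-bit seed range are all assumptions made for the example.

```python
import json
import random
from pathlib import Path

# Assumption: which file extensions count as input images.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".webp"}


def plan_batch(input_dir, candidates_per_image=3, rng=None):
    """Return one job per (image, candidate), each with its own noise seed."""
    rng = rng or random.Random()
    jobs = []
    for image in sorted(Path(input_dir).iterdir()):
        if image.suffix.lower() not in IMAGE_SUFFIXES:
            continue  # skip anything that is not an image
        for index in range(candidates_per_image):
            jobs.append({
                "image": image.name,
                "candidate": index,
                # Hypothetical seed range; use whatever your sampler accepts.
                "noise_seed": rng.randrange(2**32),
            })
    return jobs


def save_sidecar(job, output_dir):
    """Write the job settings next to the output so a good seed can be reused."""
    out = Path(output_dir)
    out.mkdir(parents=True, exist_ok=True)
    stem = Path(job["image"]).stem
    path = out / f"{stem}_c{job['candidate']}.json"
    path.write_text(json.dumps(job, indent=2))
    return path
```

Each job dictionary would then be handed to whatever actually runs the workflow, and the sidecar file keeps the seed next to the result so a promising candidate can be regenerated with adjusted settings.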
@botslop For descriptions, I use florence2. You can then feed the image description to ollama as context. In case you are not using it already, this node saves images with their metadata: https://github.com/nkchocoai/ComfyUI-SaveImageWithMetaData
Yes, I have not figured out the settings yet. Most of my generations are either still images or just noise. I tried different settings but have had no luck so far (that's why I created this thread).
Just trying to put the pieces together here. Looks like someone on YouTube made a setup video. While the video itself doesn't go into too much detail on how to tweak the scheduler values, a comment from andro-meta provided some good insight.
As a general strategy, you could start by generating several candidates and look for a seed that captures the motion you want. Even if the result is distorted, it might pick up on elements that matter for the final result, like animating the subject instead of the background.
Did anyone try different size vs steps vs length combinations to find the best one?
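One way to compare size, steps, and length combinations systematically is to fix a single seed and enumerate every combination, so that only the settings change between runs. The sketch below is an assumption-laden illustration: `build_sweep` is a hypothetical helper, and the example values are placeholders rather than recommended settings.

```python
from itertools import product


def build_sweep(sizes, step_counts, lengths, seed):
    """Return one settings dict per size/steps/length combination, same seed."""
    return [
        {"width": w, "height": h, "steps": s, "length": n, "noise_seed": seed}
        for (w, h), s, n in product(sizes, step_counts, lengths)
    ]


# Placeholder values only; substitute the ranges you actually want to test.
sweep = build_sweep(
    sizes=[(512, 320), (768, 512)],
    step_counts=[20, 30],
    lengths=[25, 49],
    seed=12345,
)
for settings in sweep:
    print(settings)
```

Keeping the seed constant across the sweep makes it easier to tell whether a difference comes from the settings or from the noise.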