MultimodalConversableAgent in autogenstudio? #1169
Comments
@victordibia FYI
Hi @antoan, thanks for the note. If you could describe your envisioned use case in a bit more detail, that would be helpful once we get there.
In the meantime, @BeibinLi is thinking about implementing multimodal support in the core. Knowing the use case here would also help with that.
I see, thanks for letting me know. My use case involves periodic visual monitoring of an industrial hangar for anomalies via a camera stream, e.g., people present in the hangar where none should be. I initially intended to use a multimodal agent in conjunction with AutoGen Studio to render anomalous detection frames to the user, and a GUI is the only component I lack to complete the experience. Please let me know if this is sufficient.
There was already a P3 item for supporting contrib agents; I have appended multimodal to that list.
It is working as it is now. The skill file I'm using is this one:
is working
Is it currently possible or are there plans to support this in the future?