-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Task [pose estimation]: implement the general pose estimation API and utilities, plus integrate some initial models (e.g., mediapipe, DeepLabCut) #173
Comments
👋 Welcome to Senselab! Thank you for your interest and contribution. Senselab is a comprehensive Python package designed to process behavioral data, including voice and speech patterns, with a focus on reproducibility and robust methodologies. Your issue will be reviewed soon. Stay tuned! |
hi @brukew , sounds nice. Do you mind sharing some more detailed insights on your plan? For example,
I would recommend clarifying all these aspects before starting coding. |
thanks @fabiocat93, sounds good! Will update the issue as I make the plan. |
Thank you @brukew . This is good. Here are a couple of comments:
As a minor note, feel free to reply in a thread instead of simply editing the original text of the issue. this way, we can keep track of all the reasoning process (this is mostly helpful to me, since my memory is not that great! thanks!) |
Also, here is a good document overviewing pose estimation as a task: https://medium.com/augmented-startups/top-9-pose-estimation-models-of-2022-70d00b11db43 Consider that you are not necessarily required to implement everything yourself. People have been working in the domain for a while and you can use their models and their utility functions. For instance, here is a related project you should look at: Here are some more models I have tried recently and could be good to add at some point in the future: |
there is also SLEAP and DANNCE and TULIP from here https://www.tdunnlab.org/ |
@fabiocat93 Yes, both of your points make sense. The number of joints differs greatly between each model - I assume we want all information retained though so the pose skeleton object will just vary in size/content per model. I will look into how to ensure this is done best - maybe just hardcoding matches between the key points of each model and labeling it the same. How should I approach using existing toolkits? If I just want a minor visualization utility function for example, would I need to add the whole toolkit as a requirement and import it? or would it be fine to just take the code from their github? |
Good question. If you can isolate a specific function that you need, you can simply copy-paste that and report the source+mentioning their LICENSE. Obv, this assumes that their license allows doing that |
Also, I have been thinking about this in the evaluation sense—not necessarily training or fine-tuning a model. This could definitely be added in the future, but am I right in prioritizing evaluation? |
Hey @brukew, do you mind giving me a quick update? I know you keep Satra in the loop with your weekly reports, but I'd love to stay up to speed, too! Thanks! |
Hey, yeah. Finished up with an implementation for mediapipe - will push a pr for it soon. Having issues with dependencies when running poetry install - if you are available sometime today, would be nice to clear that up. |
hi guys @brukew, how is it going with this? I will extend your deadline (from Dec 13 to Jan 14). Please, let me know if you face any blockers |
Senselab Pose Estimation
Goal:
Integrate robust pose estimation workflows within Senselab
ViTPose performs best on infants - mediapipe and deeplabcut are low on accuracy
Support Plan
Start with MediaPipe
Follow up with:
Workflow
Version Planning
V1:
V2:
V3:
Inputs/Outputs
Inputs:
Outputs:
PoseImage
PoseVideo
At each step, proper documentation and tests are expected. Tutorials will also be implemented for each workflow (across models and modalities).
The text was updated successfully, but these errors were encountered: