Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Task: Support pose estimation on video #249

Open
4 tasks
brukew opened this issue Feb 3, 2025 · 1 comment
Open
4 tasks

Task: Support pose estimation on video #249

brukew opened this issue Feb 3, 2025 · 1 comment
Assignees
Labels
enhancement New feature or request

Comments

@brukew
Copy link
Contributor

brukew commented Feb 3, 2025

Description

This would support pose estimation on video inputs (via file or Video object). This can currently be done by looping through the video frames and running pose estimation on it, but a built-in method should be implemented for simplicity.

I imagine users may use pose estimation primarily on Video rather than individual images so this is needed.

Tasks

  • Figure out the exact implementation (Representation + Access)
  • Implement
  • Test
  • Revise docs and tutorial

Freeform Notes

Thinking about how this could be implemented

  • Representation
    • VideoPose object
    • List of ImagePose objects for each frame
  • Access Points
    • instance method in Video
    • Video input into API
@brukew brukew self-assigned this Feb 3, 2025
@fabiocat93
Copy link
Collaborator

Hey @brukew , some additional considerations:

  • Exploit temporal info for smoother pose estimation:
    -- The pose of a person in frame X can provide useful context for frame X+1.
    -- Consider leveraging past frame data to improve tracking accuracy, reduce jitter, and interpolate missing data.
    -- As a possible technique, I know Kalman filtering and interpolation. There may be something smarter, though.
  • Track individuals across frames using spatial information (e.g., useful for multi-person tracking scenarios).
    -- Consider incorporating facial recognition or spatial+tmporal info for better identity mapping.
  • Implement a low-resource version of the algorithm, maybe allowing skipping some frames (e.g., you analyze 1 frame every N)

@fabiocat93 fabiocat93 added the enhancement New feature or request label Feb 3, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants