I've noticed you've been training a lot of YoLo-WholeBody models, and some of them look fantastic!
I was wondering whether any of these models support estimated depth for the bone joints, so that the model outputs 3D points instead of 2D points.
Obviously the depth would only be an estimate, because the actual depth of a pixel can't be known without a proper depth map. Still, I've seen a few models that do a reasonably good job of generating Z values, which helps a lot when using whole-body poses for things like avatar animation.
I understand this depends on the availability of training images with proper depth data, but I take it such data is freely available these days?
Relevant Log Output
URL or source code for simple inference testing code
No response
Rather than having this model estimate depth, you could get the same result by using a depth camera, or simply by combining it with a depth-estimation ONNX model such as DepthAnythingV2.
However, the important point is that this model (WholebodyXXX) only estimates the visible area. The concept is fundamentally different from a Pose Estimation architecture.
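As a rough illustration of the "combine with a depth model" suggestion, here is a minimal sketch that attaches a Z value to each 2D point by sampling a depth map. It assumes you already have an (N, 2) array of pixel coordinates from the detector and an (H, W) depth map aligned with the same image (from a depth camera or a depth-estimation model). The function name `lift_keypoints_to_3d` and the `patch_radius` parameter are hypothetical and not part of any of the models mentioned here. Note that the sampled value is the depth of the visible surface at that pixel, and with a relative-depth model Z is only meaningful up to an unknown scale and shift.

```python
import numpy as np


def lift_keypoints_to_3d(keypoints_xy, depth_map, patch_radius=2):
    """Attach a Z value to each 2D keypoint by sampling a depth map.

    keypoints_xy: (N, 2) pixel coordinates (x, y) in the image (assumed input).
    depth_map:    (H, W) depth values aligned with the same image (assumed input).
    Returns an (N, 3) array of (x, y, z).
    """
    keypoints_xy = np.asarray(keypoints_xy, dtype=np.float64)
    depth_map = np.asarray(depth_map, dtype=np.float64)
    h, w = depth_map.shape
    points = np.empty((len(keypoints_xy), 3), dtype=np.float64)

    for i, (x, y) in enumerate(keypoints_xy):
        # Clamp the keypoint to the image so out-of-frame points still index safely.
        cx = int(np.clip(round(x), 0, w - 1))
        cy = int(np.clip(round(y), 0, h - 1))

        # Use the median over a small patch to reduce single-pixel depth noise.
        x0, x1 = max(cx - patch_radius, 0), min(cx + patch_radius + 1, w)
        y0, y1 = max(cy - patch_radius, 0), min(cy + patch_radius + 1, h)
        z = np.median(depth_map[y0:y1, x0:x1])

        points[i] = (x, y, z)
    return points


if __name__ == "__main__":
    # Synthetic depth map: left half at depth 5, right half at depth 9.
    demo_depth = np.full((10, 10), 5.0)
    demo_depth[:, 5:] = 9.0
    print(lift_keypoints_to_3d([[2, 3], [8, 8]], demo_depth))
```

Taking the median over a patch rather than a single pixel is a design choice to make the result less sensitive to depth noise and to points that land right on an object boundary. It does not recover the depth of occluded joints.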
Issue Type: Feature Request
OS: Windows
OS architecture: x86_64
Programming Language: Other
Framework: ONNX
Model name and Weights/Checkpoints URL: YoLo-WholeBody-XXX