- Initial version, main functionality:
- Body keypoint detection and rendering in Ubuntu 14 and 16.
- It can read an image directory, video or webcam.
- It can display the results or storing them on disk.
- Main improvements:
- Rendering max resolution from 720p to >32k images.
- Highly improved documentation.
- Functions or parameters renamed:
- Demo renamed from rtpose to openpose.
- Main bugs fixed:
- Demo uses exec instead of start, so it works with more OpenCV custom compiled versions.
- Main improvements:
- Added face keypoint detection.
- Added Windows 10 compatibility.
- Auto-detection of the number of GPUs.
- MPI visualization more similar to COCO one.
- Rendering max resolution from 720p to >32k images.
- GUI info adder working when the worker TDatum has more than 1 Datum.
- It prints out the error description before throwing the exception (so that it is written on the Windows cmd).
- Highly improved documentation.
- Functions or parameters renamed:
- Flag
--write_pose
renamed as--write_keypoint
and it also applies to face and/or hands. - Flag
--write_pose_json
renamed as--write_keypoint_json
and it also applies to face and/or hands. - Flag
--write_pose_format
renamed as--write_keypoint_format
and it also applies to face and/or hands. - PoseSaver and its JSON variant renamed as KeypointSaver.
- PoseJsonCocoSaver renamed as CocoJsonSaver.
- Flag
- Main bugs fixed:
- All visualization functions moved to same thread, so it works with most OpenCV custom compiled versions.
- Fixed error on debug mode:
Too many resources requested for launch
.
- Main improvements:
- Added hand keypoint detection.
- Windows branch merged to master branch.
- Face and hands use
Maximum
instead ofNms
, since there is only 1 person / detection. - Increased accuracy on multi-scale (added
Datum::scaleRatios
to save the relative scale ratio when multi-scale). - Increased speed ~5% by adding CPU rendering (but GPU is the default rendering).
- Rendering colors modified, visually better results.
- Rendering threshold for pose, face and hands becomes user-configurable.
- Check() functions give more feedback.
- WCocoJsonSaver finished and removed its 3599-image limit.
- Added
--camera_fps
so generated video will use that frame rate. - Reduced the number of printed information messages. Default logging priority threshold increased to Priority::Max.
- Google flags to OpenPose configuration parameters reader moved from each demo to utilities/flagsToOpenPose.
- Nms classes do not use
numberParts
forReshape
, they deduce the value. - Improved documentation.
- Functions or parameters renamed:
- Render flags renamed in the demo in order to incorporate the CPU/GPU rendering.
- Keypoints saved in JSON files (
--write_keypoint_json
) are now saved aspose_keypoints
,face_keypoints
,hand_left_keypoints
, andhand_right_keypoints
. They all were previously saved asbody_parts
. - Flag
--num_scales
renamed as--scale_number
. - All hand and pose flags renamed such as they start by
--hand_
and--face_
respectively.
- Main bugs fixed:
- Fixed bug in Array::getConstCvMat() if mVolume=0, now returning empty cv::Mat.
- Fixed bug:
--process_real_time
threw error with webcam. - Fixed bug: Face not working when input and output resolutions are different.
- Fixed some bugs that prevented debug version to run.
- Face saved in JSON files were called
--body_parts
. Now they are called--face_keypoints
.
- Main improvements:
- Windows library turned into DLL dynamic library (i.e. portable).
- Improved documentation.
- Functions or parameters renamed:
openpose/utilities/macros.hpp
moved toopenpose/utilities/macros.hpp
.
- Main improvements:
- Added OpenCV 3.3 compatibility.
- Caffe turned into DLL library.
- OpenPose is now completely portable across Windows 10 computers (with Nvidia graphic card).
- Added OpenPose 1.0.1 portable demo.
- Removed Python and some unnecessary boost dependencies on the VS project.
- Replaced all double quotes by angle brackets in include statements (issue #61).
- Added 3-D reconstruction demo.
- Auto-detection of the camera index.
- Speed up of ~30% in op::floatPtrToUCharCvMat.
- COCO extractor now extracts image ID from the image name itslef (format "string_%d"). Before, only working with validation test, now applicable to e.g. test sets.
- Changed display texts, added
OpenPose
name.
- Main bugs fixed:
- Pycaffe can now be imported from Python.
- Fixed
Tutorial/Wrapper
VS linking errors.
- Main improvements:
- Added CMake installer for Ubuntu.
- Added how to use keypoint data in
examples/tutorial_wrapper/
. - Added flag for warnings of type
-Wsign-compare
and removed in code. - Slightly improved accuracy by considering ears-shoulder connection (e.g. +0.4 mAP for 1 scale in validation set).
- Main bugs fixed:
- Windows version crashing with std::map copy.
- Main improvements:
- Speed increase when processing images with different aspect ratios. E.g. ~20% increase over 3.7k COCO validation images on 1 scale.
- Huge speed increase and memory reduction when processing multi-scale. E.g. over 3.7k COCO validation images on 4 scales: ~40% (~770 to ~450 sec) speed increase, ~25% memory reduction (from ~8.9 to ~6.7 GB / GPU).
- Slightly increase of accuracy given the fixed mini-bugs.
- Added IP camera support.
- Output images can have the input size, OpenPose able to change its size for each image and not required fixed size anymore.
- FrameDisplayer accepts variable size images by rescaling every time a frame with bigger width or height is displayed (gui module).
- OpOutputToCvMat & GuiInfoAdder does not require to know the output size at construction time, deduced from each image.
- CvMatToOutput and Renderers allow to keep input resolution as output for images (core module).
- New standalone face keypoint detector based on OpenCV face detector: much faster if body keypoint detection is not required but much less accurate.
- Face and hand keypoint detectors now can return each keypoint heatmap.
- The flag
USE_CUDNN
is no longer required;USE_CAFFE
andUSE_CUDA
(replacing the oldCPU_ONLY
) are no longer required to use the library, only to build it. In addition, Boost, Caffe, and its dependencies have been removed from the OpenPose header files. Only OpenCV include and lib folders are required when building a project using OpenPose. - OpenPose successfully compiles if the flags
USE_CAFFE
and/orUSE_CUDA
are not enabled, although it will give an error saying they are required. - COCO JSON file outputs 0 as score for non-detected keypoints.
- Added example for OpenPose for user asynchronous output and cleaned all
tutorial_wrapper/
examples. - Added
-1
option for--net_resolution
in order to auto-select the best possible aspect ratio given the user input. - Net resolution can be dynamically changed (e.g. for images with different size).
- Added example to add functionality/modules to OpenPose.
- Added
--disable_multi_thread
flag in order to allow debug and/or highly reduce the latency (e.g. when using webcam in real-time). - Allowed to output images without any rendering.
- Functions or parameters renamed:
- OpenPose able to change its size and initial size dynamically:
- Flag
--resolution
renamed as--output_resolution
. - FrameDisplayer, GuiInfoAdder and Gui constructors arguments modified (gui module).
- OpOutputToCvMat constructor removed (core module).
- New Renders classes to split GpuRenderers from CpuRenderers.
- Etc.
- Flag
- OpenPose able to change its net resolution size dynamically:
- Changed several functions on
core/
,pose/
,face/
, andhand/
modules.
- Changed several functions on
CPU_ONLY
changed byUSE_CUDA
to keep format.
- OpenPose able to change its size and initial size dynamically:
- Main bugs fixed:
- Scaling resize issue fixed: ~1-pixel offset due to not considering 0-based indexes.
- Ubuntu installer script now works even if Python pip was not installed previously.
- Flags to set first and last frame as well as jumping frames backward and forward now works on image directory reader.
- Main improvements:
- Heatmaps can be saved in floating format.
- More efficient non-processing version (i.e., if all keypoint extractors are disabled, and only image extraction and display/saving operations are performed).
- Heat maps scaling: Added
--heatmaps_scale
to OpenPoseDemo, added option not to scale the heatmaps, and added customfloat
format to save heatmaps in floating format. - Detector of the number of GPU also considers the initial GPU index given by the user.
- Added
--write_json
as new version of--write_keypoint_json
. It includes the body part candidates (if enabled), as well as any extra information added in the future (e.g. person ID). - Body part candidates can be retrieved in op::Datum and saved with
--write_json
.
- Functions or parameters renamed:
PoseParameters
splitted intoPoseParameters
andPoseParametersRender
and const parameters turned into functions for more clarity.
- Main bugs fixed:
- Render working on images > 4K (#324).
- Cleaned redundant arguments on
getAverageScore
andgetKeypointsArea
. - Slight speed up when heatmaps must be returned to the user (not doing a double copy anymore).
- Main improvements:
- Output of
--write_json
uses less hard disk space (enters and tabs removed). - Removed Boost dependencies.
- Caffe added as submodule.
- CMake installer compatible with Windows.
- Added freeglut download script (3-D reconstruction demo for Windows).
- Added Debug version for Windows (CMake).
- Runtime verbose about average speed configurable by user with
PROFILER_ENABLED
option (CMake/Makefile.config) and--profile_speed
flag. - Lighter Caffe version compiled by CMake in Ubuntu: disabled Caffe extra support (e.g., OpenCV, Python) and doc.
- Renamed CMake binaries (Ubuntu) to match old Makefile format:
_bin
by.bin
. - 3-D reconstruction demo cleaned, implemented in Ubuntu too, and now defined as module of OpenPose rather than just a demo.
- CMake as default installer in documentation.
- Added flag: number_people_max to optionally select the maximum number of people to be detected.
- 3-D reconstruction module forces the user to set
number_people_max 1
to avoid errors (as it assumes only 1 person per image). - Removed old
windows/
version. CMake is the only Windows version available. - Camera parameters (flir camera) are read from disk at runtime rather than being compiled.
- 3-D reconstruction module can be implemented with different camera brands or custom image sources.
- Flag
--write_json
includes 3-D keypoints. - 3-D reconstruction module can be used with images and videos. Flag
--3d_views
added to allow--image_dir
and--video
allow loading stereo images. - Flag
--camera_resolution
applicable to--flir_camera
. - Throw error message if requested GPU IDs does not exist (e.g. asking for 2 GPUs starting in ID 1 if there is only 2 GPUs in total).
- VideoSaver (
--write_video
) compatible with multi-camera setting. It will save all the different views concatenated. - OpenPose small GUI rescale the verbose text to the displayed image, to avoid the text to be either too big or small.
- OpenPose small GUI shows the frame number w.r.t. the original producer, rather than the frame id. E.g., if video is started at frame 30, OpenPose will display 30 rather than 0 in the first frame.
- OpenPose GUI: 'l' and 'k' functionality swapped.
- 3-D reconstruction module: Added flag
--3d_min_views
to select minimum number of cameras required for 3-D reconstruction. - Flag
--camera_fps
also applies to recorded video (--write_video
). - Flir camera producer
n
times faster forn
cameras (multi-threaded). If the number of cameras is greater than the number of the computer threads, the speed up might not be exactlyn
times.
- Output of
- Functions or parameters renamed:
- Flag
no_display
renamed asdisplay
, able to select betweenNoDisplay
,Display2D
,Display3D
, andDisplayAll
. - 3-D reconstruction demo is now inside the OpenPose demo binary.
- Renamed
*_keypoints
by*_keypoints_2d
to avoid confusion with 3d ones in--write_json
output file. - CvMatToOpInput requires PoseModel to know the normalization to be performed.
- Created
net/
module in order to reducecore/
number of classes and files and for future scalability.
- Flag
- Main bugs fixed:
- Slight speed up (~1%) for performing the non-maximum suppression stage only in the body part heatmaps channels, and not also in the PAF channels.
- Fixed core-dumped in PoseRenderer with GUI when changed element to be rendered to something else than skeleton.
- 3-D visualizer does not crash on exit anymore.
- Fake pause ('m' key pressed) works again.
- Main improvements:
- Flir cameras: Added software trigger and a dedicated thread to keep reading images to remove latency (analogously to webcamReader).
- 3-D reconstruction: Added non-linear minimization to further improve 3-D triangulation accuracy by ~5% (Ubuntu only).
- CMake: All libraries as single variable (simpler to add/remove libraries).
- Datum includes extrinsic and intrinsic camera parameters.
- Function
scaleKeypoints(Array<float>& keypoints, const float scale)
also accepts 3D keypoints. - 3D keypoints and camera parameters in meters (instead of millimeters) in order to reduce numerical errors.
- New
PoseExtractor
class to contain future ID and tracking algorithms as well as the current OpenPose keypoint detection algorithm.
- Functions or parameters renamed:
- Removed scale parameter from hand and face rectangle extractor (causing wrong results if custom
--output_resolution
). - Functions
scaleKeypoints
, other thanscaleKeypoints(Array<float>& keypoints, const float scale)
, renamed asscaleKeypoints2d
. (W)PoseExtractor
renamed to(W)PoseExtractorNet
to distinguish from newPoseExtractor
. Analogously with(W)FaceExtractorNet
and(W)HandExtractorNet
.
- Removed scale parameter from hand and face rectangle extractor (causing wrong results if custom
- Main bugs fixed:
- Fixed hand and face extraction and rendering scaling issues when
--output_resolution
is not the default one. - Part candidates (
--part_candidates
) are saved with the same scale than the final keypoints itself. - Fixed bug in keepTopNPeople.hpp (
--number_people_max
) that provoked core dumped if lots of values equal to the threshold.
- Fixed hand and face extraction and rendering scaling issues when
Download and/or check any OpenPose version from https://github.com/CMU-Perceptual-Computing-Lab/openpose/releases.