New Features
- Add center alignments for draw_texts in OpencvBackendVisualizer (#2958)
- Add wflw2coco script (#2961)
- Support 300VW Dataset (#3005)
- Add RTMW3D for 3D wholebody pose estimation task (#3037)
Improvements
- In browse dataset : CombinedDataset element are now browse in turn, and image saved into their dataset name folder (#2985)
Bug Fixes
- Fix loss computation in MSPNHead (#2993)
- Fix bug in inferencer (#2966)
- Make category_id in CocoWholeBodyDataset as numpy.array (#2963)
Documentation
- Add rtmlib examples (#2923)
- Fix readthedocs configuration (#2979)
- Add more detailed comments (#2982)
- Improve documentation folder structure of ExLPose (#2977)
New Contributors
- @AntDum made their first contribution in open-mmlab#2958
- @Yanyirong made their first contribution in open-mmlab#2961
- @drazicmartin made their first contribution in open-mmlab#2977
- @KeqiangSun made their first contribution in open-mmlab#3005
- @jitrc made their first contribution in open-mmlab#3004
- @zgjja made their first contribution in open-mmlab#2963
- @jibranbinsaleem made their first contribution in open-mmlab#3027
- @cpunion made their first contribution in open-mmlab#3026
Fix the bug when downloading config and checkpoint using mim
(see Issue #2918).
Release note: https://github.com/open-mmlab/mmpose/releases/tag/v1.3.0
Release note: https://github.com/open-mmlab/mmpose/releases/tag/v1.2.0
Release note: https://github.com/open-mmlab/mmpose/releases/tag/v1.1.0
Release note: https://github.com/open-mmlab/mmpose/releases/tag/v1.0.0
Highlights
- Release RTMPose, a high-performance real-time pose estimation algorithm with cross-platform deployment and inference support. See details at the project page
- Support several new algorithms: ViTPose (arXiv'2022), CID (CVPR'2022), DEKR (CVPR'2021)
- Add Inferencer, a convenient inference interface that perform pose estimation and visualization on images, videos and webcam streams with only one line of code
- Introduce Project, a new form for rapid and easy implementation of new algorithms and features in MMPose, which is more handy for community contributors
New Features
- Support RTMPose (#1971, #2024, #2028, #2030, #2040, #2057)
- Support Inferencer (#1969)
- Support ViTPose (#1876, #2056, #2058, #2065)
- Support CID (#1907)
- Support DEKR (#1834, #1901)
- Support training with multiple datasets (#1767, #1930, #1938, #2025)
- Add project to allow rapid and easy implementation of new models and features (#1914)
Improvements
- Improve documentation quality (#1846, #1858, #1872, #1899, #1925, #1945, #1952, #1990, #2023, #2042)
- Support visualizing keypoint indices (#2051)
- Support OpenPose style visualization (#2055)
- Accelerate image transpose in data pipelines with tensor operation (#1976)
- Support auto-import modules from registry (#1961)
- Support keypoint partition metric (#1944)
- Support SimCC 1D-heatmap visualization (#1912)
- Support saving predictions and data metainfo in demos (#1814, #1879)
- Support SimCC with DARK (#1870)
- Remove Gaussian blur for offset maps in UDP-regress (#1815)
- Refactor encoding interface of Codec for better extendibility and easier configuration (#1781)
- Support evaluating CocoMetric without annotation file (#1722)
- Improve unit tests (#1765)
Bug Fixes
- Fix repeated warnings from different ranks (#2053)
- Avoid frequent scope switching when using mmdet inference api (#2039)
- Remove EMA parameters and message hub data when publishing model checkpoints (#2036)
- Fix metainfo copying in dataset class (#2017)
- Fix top-down demo bug when there is no object detected (#2007)
- Fix config errors (#1882, #1906, #1995)
- Fix image demo failure when GUI is unavailable (#1968)
- Fix bug in AdaptiveWingLoss (#1953)
- Fix incorrect importing of RepeatDataset which is deprecated (#1943)
- Fix bug in bottom-up datasets that ignores images without instances (#1752, #1936)
- Fix upstream dependency issues (#1867, #1921)
- Fix evaluation issues and update results (#1763, #1773, #1780, #1850, #1868)
- Fix local registry missing warnings (#1849)
- Remove deprecated scripts for model deployment (#1845)
- Fix a bug in input transformation in BaseHead (#1843)
- Fix an interface mismatch with MMDetection in webcam demo (#1813)
- Fix a bug in heatmap visualization that causes incorrect scale (#1800)
- Add model metafiles (#1768)
New Features
- Support 4 light-weight pose estimation algorithms: SimCC (ECCV'2022), Debias-IPR (ICCV'2021), IPR (ECCV'2018), and DSNT (ArXiv'2018) (#1628)
Migrations
- Add Webcam API in MMPose 1.0 (#1638, #1662) @Ben-Louis
- Add codec for Associative Embedding (beta) (#1603) @ly015
Improvements
- Add a colab tutorial for MMPose 1.0 (#1660) @Tau-J
- Add model index in config folder (#1710, #1709, #1627) @ly015, @Tau-J, @Ben-Louis
- Update and improve documentation (#1692, #1656, #1681, #1677, #1664, #1659) @Tau-J, @Ben-Louis, @liqikai9
- Improve config structures and formats (#1651) @liqikai9
Bug Fixes
- Update mmengine version requirements (#1715) @Ben-Louis
- Update dependencies of pre-commit hooks (#1705) @Ben-Louis
- Fix mmcv version in DockerFile (#1704)
- Fix a bug in setting dataset metainfo in configs (#1684) @ly015
- Fix a bug in UDP training (#1682) @liqikai9
- Fix a bug in Dark decoding (#1676) @liqikai9
- Fix bugs in visualization (#1671, #1668, #1657) @liqikai9, @Ben-Louis
- Fix incorrect flops calculation (#1669) @liqikai9
- Fix
tensor.tile
compatibility issue for pytorch 1.6 (#1658) @ly015 - Fix compatibility with
MultilevelPixelData
(#1647) @liqikai9
We are excited to announce the release of MMPose 1.0.0beta. MMPose 1.0.0beta is the first version of MMPose 1.x, a part of the OpenMMLab 2.0 projects. Built upon the new training engine.
Highlights
-
New engines. MMPose 1.x is based on MMEngine, which provides a general and powerful runner that allows more flexible customizations and significantly simplifies the entrypoints of high-level interfaces.
-
Unified interfaces. As a part of the OpenMMLab 2.0 projects, MMPose 1.x unifies and refactors the interfaces and internal logics of train, testing, datasets, models, evaluation, and visualization. All the OpenMMLab 2.0 projects share the same design in those interfaces and logics to allow the emergence of multi-task/modality algorithms.
-
More documentation and tutorials. We add a bunch of documentation and tutorials to help users get started more smoothly. Read it here.
Breaking Changes
In this release, we made lots of major refactoring and modifications. Please refer to the migration guide for details and migration instructions.
This release is meant to fix the compatibility with the latest mmcv v1.6.1
Highlights
-
Support TCFormer backbone, CVPR'2022 (#1447, #1452) @zengwang430521
-
Add RLE models on COCO dataset (#1424) @Indigo6, @Ben-Louis, @ly015
-
Update swin models with better performance (#1467) @jin-s13
New Features
-
Support TCFormer backbone, CVPR'2022 (#1447, #1452) @zengwang430521
-
Add RLE models on COCO dataset (#1424) @Indigo6, @Ben-Louis, @ly015
-
Support layer decay optimizer constructor and learning rate decay optimizer constructor (#1423) @jin-s13
Improvements
-
Improve documentation quality (#1416, #1421, #1423, #1426, #1458, #1463) @ly015, @liqikai9
-
Support PAVI logger (#1434) @EvelynWang-0423
-
Add progress bar for some demos (#1454) @liqikai9
-
Webcam API supports quick device setting in terminal commands (#1466) @ly015
-
Update swin models with better performance (#1467) @jin-s13
Bug Fixes
-
Rename
custom_hooks_config
tocustom_hooks
in configs to align with the documentation (#1427) @ly015 -
Fix deadlock issue in Webcam API (#1430) @ly015
-
Fix smoother configs in video 3D demo (#1457) @ly015
Highlights
-
Support hand gesture recognition
- Try the demo for gesture recognition
- Learn more about the algorithm, dataset and experiment results
-
Major upgrade to the Webcam API
- Tutorials (EN|zh_CN)
- API Reference
- Demo
New Features
- Support gesture recognition algorithm MTUT CVPR'2019 and dataset NVGesture CVPR'2016 (#1380) @Ben-Louis
Improvements
-
Upgrade Webcam API and related documents (#1393, #1404, #1413) @ly015
-
Support exporting COCO inference result without the annotation file (#1368) @liqikai9
-
Replace markdownlint with mdformat in CI to avoid the dependence on ruby #1382 @ly015
-
Improve documentation quality (#1385, #1394, #1395, #1408) @chubei-oppen, @ly015, @liqikai9
Bug Fixes
-
Fix xywh->xyxy bbox conversion in dataset sanity check (#1367) @jin-s13
-
Fix a bug in two-stage 3D keypoint demo (#1373) @ly015
-
Fix out-dated settings in PVT configs (#1376) @ly015
-
Fix myst settings for document compiling (#1381) @ly015
-
Fix a bug in bbox transform (#1384) @ly015
-
Fix inaccurate description of
min_keypoints
in tracking apis (#1398) @pallgeuer -
Fix warning with
torch.meshgrid
(#1402) @pallgeuer -
Remove redundant transformer modules from
mmpose.datasets.backbones.utils
(#1405) @ly015
Highlights
-
Support RLE (Residual Log-likelihood Estimation), ICCV'2021 (#1259) @Indigo6, @ly015
-
Support Swin Transformer, ICCV'2021 (#1300) @yumendecc, @ly015
-
Support PVT, ICCV'2021 and PVTv2, CVMJ'2022 (#1343) @zengwang430521
-
Speed up inference and reduce CPU usage by optimizing the pre-processing pipeline (#1320) @chenxinfeng4, @liqikai9
New Features
-
Support RLE (Residual Log-likelihood Estimation), ICCV'2021 (#1259) @Indigo6, @ly015
-
Support Swin Transformer, ICCV'2021 (#1300) @yumendecc, @ly015
-
Support PVT, ICCV'2021 and PVTv2, CVMJ'2022 (#1343) @zengwang430521
Improvements
-
Speed up inference and reduce CPU usage by optimizing the pre-processing pipeline (#1320) @chenxinfeng4, @liqikai9
-
Video demo supports models that requires multi-frame inputs (#1300) @liqikai9, @jin-s13
-
Update benchmark regression list (#1328) @ly015, @liqikai9
-
Remove unnecessary warnings in
TopDownPoseTrack18VideoDataset
(#1335) @liqikai9 -
Improve documentation quality (#1313, #1305) @Ben-Louis, @ly015
-
Update deprecating settings in configs (#1317) @ly015
Bug Fixes
-
Fix a bug in human skeleton grouping that may skip the matching process unexpectedly when
ignore_to_much
is True (#1341) @daixinghome -
Fix a GPG key error that leads to CI failure (#1354) @ly015
-
Fix bugs in distributed training script (#1338, #1298) @ly015
-
Fix an upstream bug in xtoccotools that causes incorrect AP(M) results (#1308) @jin-s13, @ly015
-
Fix indentiation errors in the colab tutorial (#1298) @YuanZi1501040205
-
Fix incompatible model weight initialization with other OpenMMLab codebases (#1329) @274869388
-
Fix HRNet FP16 checkpoints download URL (#1309) @YinAoXiong
-
Fix typos in
body3d_two_stage_video_demo.py
(#1295) @mucozcan
Breaking Changes
-
Refactor bbox processing in datasets and pipelines (#1311) @ly015, @Ben-Louis
-
The bbox format conversion (xywh to center-scale) and random translation are moved from the dataset to the pipeline. The comparison between new and old version is as below:
v0.26.0v0.25.0Dataset (e.g. TopDownCOCODataset)
... # Data sample only contains bbox rec.append({ 'bbox': obj['clean_bbox][:4], ... })
... # Convert bbox from xywh to center-scale center, scale = self._xywh2cs(*obj['clean_bbox'][:4]) # Data sample contains center and scale rec.append({ 'bbox': obj['clean_bbox][:4], 'center': center, 'scale': scale, ... })
Pipeline Config(e.g. HRNet+COCO)
... train_pipeline = [ dict(type='LoadImageFromFile'), # Convert bbox from xywh to center-scale dict(type='TopDownGetBboxCenterScale', padding=1.25), # Randomly shift bbox center dict(type='TopDownRandomShiftBboxCenter', shift_factor=0.16, prob=0.3), ... ]
... train_pipeline = [ dict(type='LoadImageFromFile'), ... ]
AdvantageHighlights
-
Support Shelf and Campus datasets with pre-trained VoxelPose models, "3D Pictorial Structures for Multiple Human Pose Estimation", CVPR'2014 (#1225) @liqikai9, @wusize
-
Add
Smoother
module for temporal smoothing of the pose estimation with configurable filters (#1127) @ailingzengzzz, @ly015 -
Support SmoothNet for pose smoothing, "SmoothNet: A Plug-and-Play Network for Refining Human Poses in Videos", arXiv'2021 (#1279) @ailingzengzzz, @ly015
-
Add multiview 3D pose estimation demo (#1270) @wusize
New Features
-
Support Shelf and Campus datasets with pre-trained VoxelPose models, "3D Pictorial Structures for Multiple Human Pose Estimation", CVPR'2014 (#1225) @liqikai9, @wusize
-
Add
Smoother
module for temporal smoothing of the pose estimation with configurable filters (#1127) @ailingzengzzz, @ly015 -
Support SmoothNet for pose smoothing, "SmoothNet: A Plug-and-Play Network for Refining Human Poses in Videos", arXiv'2021 (#1279) @ailingzengzzz, @ly015
-
Add multiview 3D pose estimation demo (#1270) @wusize
-
Support multi-machine distributed training (#1248) @ly015
Improvements
-
Update HRFormer configs and checkpoints with relative position bias (#1245) @zengwang430521
-
Support using different random seed for each distributed node (#1257, #1229) @ly015
-
Improve documentation quality (#1275, #1255, #1258, #1249, #1247, #1240, #1235) @ly015, @jin-s13, @YoniChechik
Bug Fixes
-
Fix keypoint index in RHD dataset meta information (#1265) @liqikai9
-
Fix pre-commit hook unexpected behavior on Windows (#1282) @liqikai9
-
Remove python-dev installation in CI (#1276) @ly015
-
Unify hyphens in argument names in tools and demos (#1271) @ly015
-
Fix ambiguous channel size in
channel_shuffle
that may cause exporting failure (#1242) @PINTO0309 -
Fix a bug in Webcam API that causes single-class detectors fail (#1239) @674106399
-
Fix the issue that
custom_hook
can not be set in configs (#1236) @bladrome -
Fix incompatible MMCV version in DockerFile (#raykindle)
-
Skip invisible joints in visualization (#1228) @womeier
Highlights
-
Support HRFormer "HRFormer: High-Resolution Vision Transformer for Dense Predict", NeurIPS'2021 (#1203) @zengwang430521
-
Support Windows installation with pip (#1213) @jin-s13, @ly015
-
Add WebcamAPI documents (#1187) @ly015
New Features
-
Support HRFormer "HRFormer: High-Resolution Vision Transformer for Dense Predict", NeurIPS'2021 (#1203) @zengwang430521
-
Support Windows installation with pip (#1213) @jin-s13, @ly015
-
Support CPU training with mmcv < v1.4.4 (#1161) @EasonQYS, @ly015
-
Add "Valentine Magic" demo with WebcamAPI (#1189, #1191) @liqikai9
Improvements
-
Refactor multi-view 3D pose estimation framework towards better modularization and expansibility (#1196) @wusize
-
Add WebcamAPI documents and tutorials (#1187) @ly015
-
Refactor dataset evaluation interface to align with other OpenMMLab codebases (#1209) @ly015
-
Add deprecation message for deploy tools since MMDeploy has supported MMPose (#1207) @QwQ2000
-
Switch to OpenMMLab official pre-commit-hook for copyright check (#1214) @ly015
Bug Fixes
-
Fix hard-coded data collating and scattering in inference (#1175) @ly015
-
Fix model configs on JHMDB dataset (#1188) @jin-s13
-
Fix area calculation in pose tracking inference (#1197) @pallgeuer
-
Fix registry scope conflict of module wrapper (#1204) @ly015
-
Update MMCV installation in CI and documents (#1205)
-
Fix incorrect color channel order in visualization functions (#1212) @ly015
Highlights
- Add MMPose Webcam API: A simple yet powerful tools to develop interactive webcam applications with MMPose functions. (#1178, #1173, #1173, #1143, #1094, #1133, #1098, #1160) @ly015, @jin-s13, @liqikai9, @wusize, @luminxu, @zengwang430521 @mzr1996
New Features
-
Add MMPose Webcam API: A simple yet powerful tools to develop interactive webcam applications with MMPose functions. (#1178, #1173, #1173, #1143, #1094, #1133, #1098, #1160) @ly015, @jin-s13, @liqikai9, @wusize, @luminxu, @zengwang430521 @mzr1996
-
Support ConcatDataset (#1139) @Canwang-sjtu
-
Support CPU training and testing (#1157) @ly015
Improvements
-
Add multi-processing configurations to speed up distributed training and testing (#1146) @ly015
-
Add default runtime config (#1145)
-
Upgrade isort in pre-commit hook (#1179) @liqikai9
-
Update README and documents (#1171, #1167, #1153, #1149, #1148, #1147, #1140) @jin-s13, @wusize, @TommyZihao, @ly015
Bug Fixes
-
Fix undeterministic behavior in pre-commit hooks (#1136) @jin-s13
-
Deprecate the support for "python setup.py test" (#1179) @ly015
-
Fix incompatible settings with MMCV on HSigmoid default parameters (#1132) @ly015
-
Fix albumentation installation (#1184) @BIGWangYuDong
Highlights
-
Support VoxelPose "VoxelPose: Towards Multi-Camera 3D Human Pose Estimation in Wild Environment", ECCV'2020 (#1050) @wusize
-
Support Soft Wing loss "Structure-Coherent Deep Feature Learning for Robust Face Alignment", TIP'2021 (#1077) @jin-s13
-
Support Adaptive Wing loss "Adaptive Wing Loss for Robust Face Alignment via Heatmap Regression", ICCV'2019 (#1072) @jin-s13
New Features
-
Support VoxelPose "VoxelPose: Towards Multi-Camera 3D Human Pose Estimation in Wild Environment", ECCV'2020 (#1050) @wusize
-
Support Soft Wing loss "Structure-Coherent Deep Feature Learning for Robust Face Alignment", TIP'2021 (#1077) @jin-s13
-
Support Adaptive Wing loss "Adaptive Wing Loss for Robust Face Alignment via Heatmap Regression", ICCV'2019 (#1072) @jin-s13
-
Add LiteHRNet-18 Checkpoints trained on COCO. (#1120) @jin-s13
Improvements
-
Improve documentation quality (#1115, #1111, #1105, #1087, #1086, #1085, #1084, #1083, #1124, #1070, #1068) @jin-s13, @liqikai9, @ly015
-
Support CircleCI (#1074) @ly015
-
Skip unit tests in CI when only document files were changed (#1074, #1041) @QwQ2000, @ly015
-
Support file_client_args in LoadImageFromFile (#1076) @jin-s13
Bug Fixes
-
Fix a bug in Dark UDP postprocessing that causes error when the channel number is large. (#1079, #1116) @X00123, @jin-s13
-
Fix hard-coded
sigmas
in bottom-up image demo (#1107, #1101) @chenxinfeng4, @liqikai9 -
Fix unstable checks in unit tests (#1112) @ly015
-
Do not destroy NULL windows if
args.show==False
in demo scripts (#1104) @bladrome
Highlights
-
Support "Learning Temporal Pose Estimation from Sparsely-Labeled Videos", NeurIPS'2019 (#932, #1006, #1036, #1060) @liqikai9
-
Add ViPNAS-MobileNetV3 models (#1025) @luminxu, @jin-s13
-
Add inference speed benchmark (#1028, #1034, #1044) @liqikai9
New Features
-
Support "Learning Temporal Pose Estimation from Sparsely-Labeled Videos", NeurIPS'2019 (#932, #1006, #1036) @liqikai9
-
Add ViPNAS-MobileNetV3 models (#1025) @luminxu, @jin-s13
-
Add light-weight top-down models for whole-body keypoint detection (#1009, #1020, #1055) @luminxu, @ly015
-
Add HRNet checkpoints with various settings on PoseTrack18 (#1035) @liqikai9
Improvements
-
Add inference speed benchmark (#1028, #1034, #1044) @liqikai9
-
Update model metafile format (#1001) @ly015
-
Support minus output feature index in mobilenet_v3 (#1005) @luminxu
-
Improve documentation quality (#1018, #1026, #1027, #1031, #1038, #1046, #1056, #1057) @edybk, @luminxu, @ly015, @jin-s13
-
Set default random seed in training initialization (#1030) @ly015
-
Skip CI when only specific files changed (#1041, #1059) @QwQ2000, @ly015
-
Automatically cancel uncompleted action runs when new commit arrives (#1053) @ly015
Bug Fixes
-
Update pose tracking demo to be compatible with latest mmtracking (#1014) @jin-s13
-
Fix symlink creation failure when installed in Windows environments (#1039) @QwQ2000
-
Fix AP-10K dataset sigmas (#1040) @jin-s13
Highlights
-
Add AP-10K dataset for animal pose estimation (#987) @Annbless, @AlexTheBad, @jin-s13, @ly015
-
Support TorchServe (#979) @ly015
New Features
-
Add AP-10K dataset for animal pose estimation (#987) @Annbless, @AlexTheBad, @jin-s13, @ly015
-
Add HRNetv2 checkpoints on 300W and COFW datasets (#980) @jin-s13
-
Support TorchServe (#979) @ly015
Bug Fixes
-
Fix some deprecated or risky settings in configs (#963, #976, #992) @jin-s13, @wusize
-
Fix issues of default arguments of training and testing scripts (#970, #985) @liqikai9, @wusize
-
Fix heatmap and tag size mismatch in bottom-up with UDP (#994) @wusize
-
Fix python3.9 installation in CI (#983) @ly015
-
Fix model zoo document integrity issue (#990) @jin-s13
Improvements
-
Support non-square input shape for bottom-up (#991) @wusize
-
Add image and video resources for demo (#971) @liqikai9
-
Use CUDA docker images to accelerate CI (#973) @ly015
-
Add codespell hook and fix detected typos (#977) @ly015
Highlights
-
Add models for Associative Embedding with Hourglass network backbone (#906, #955) @jin-s13, @luminxu
-
Support COCO-Wholebody-Face and COCO-Wholebody-Hand datasets (#813) @jin-s13, @innerlee, @luminxu
-
Upgrade dataset interface (#901, #924) @jin-s13, @innerlee, @ly015, @liqikai9
-
New style of documentation (#945) @ly015
New Features
-
Add models for Associative Embedding with Hourglass network backbone (#906, #955) @jin-s13, @luminxu
-
Support COCO-Wholebody-Face and COCO-Wholebody-Hand datasets (#813) @jin-s13, @innerlee, @luminxu
-
Add pseudo-labeling tool to generate COCO style keypoint annotations with given bounding boxes (#928) @soltkreig
-
New style of documentation (#945) @ly015
Bug Fixes
-
Fix segmentation parsing in Macaque dataset preprocessing (#948) @jin-s13
-
Fix dependencies that may lead to CI failure in downstream projects (#936, #953) @RangiLyu, @ly015
-
Fix keypoint order in Human3.6M dataset (#940) @ttxskk
-
Fix unstable image loading for Interhand2.6M (#913) @zengwang430521
Improvements
-
Upgrade dataset interface (#901, #924) @jin-s13, @innerlee, @ly015, @liqikai9
-
Standardize model metafile format (#941) @ly015
-
Support
persistent_worker
and several other arguments in configs (#946) @jin-s13 -
Use MMCV root model registry to enable cross-project module building (#935) @RangiLyu
-
Improve the document quality (#916, #909, #942, #913, #956) @jin-s13, @ly015, @bit-scientist, @zengwang430521
Breaking Changes
- Upgrade dataset interface (#901) @jin-s13, @innerlee, @ly015
Bug Fixes
-
Fix redundant model weight loading in pytorch-to-onnx conversion (#850) @ly015
-
Fix a bug in update_model_index.py that may cause pre-commit hook failure(#866) @ly015
-
Fix a bug in interhand_3d_head (#890) @zengwang430521
-
Fix pose tracking demo failure caused by out-of-date configs (#891)
Improvements
-
Add automatic benchmark regression tools (#849, #880, #885) @liqikai9, @ly015
-
Add copyright information and checking hook (#872)
-
Add PR template (#875) @ly015
-
Add citation information (#876) @ly015
-
Improve the quality of the documents (#845, #845, #848, #867, #870, #873, #896) @jin-s13, @ly015, @zhiqwang
Highlights
-
Support "Lite-HRNet: A Lightweight High-Resolution Network" CVPR'2021 (#733,#800) @jin-s13
-
Add 3d body mesh demo (#771) @zengwang430521
-
Add Chinese documentation (#787, #798, #799, #802, #804, #805, #815, #816, #817, #819, #839) @ly015, @luminxu, @jin-s13, @liqikai9, @zengwang430521
-
Add Colab Tutorial (#834) @ly015
New Features
-
Support "Lite-HRNet: A Lightweight High-Resolution Network" CVPR'2021 (#733,#800) @jin-s13
-
Add 3d body mesh demo (#771) @zengwang430521
-
Add Chinese documentation (#787, #798, #799, #802, #804, #805, #815, #816, #817, #819, #839) @ly015, @luminxu, @jin-s13, @liqikai9, @zengwang430521
-
Add Colab Tutorial (#834) @ly015
-
Support training for InterHand v1.0 dataset (#761) @zengwang430521
Bug Fixes
-
Fix mpii [email protected] index (#773) @jin-s13
-
Fix multi-node distributed test (#818) @ly015
-
Fix docstring and init_weights error of ShuffleNetV1 (#814) @Junjun2016
-
Fix imshow_bbox error when input bboxes is empty (#796) @ly015
-
Fix model zoo doc generation (#778) @ly015
Breaking Changes
- Use MMCV EvalHook (#686) @ly015
Improvements
-
Add pytest.ini and fix docstring (#812) @jin-s13
-
Update MSELoss (#829) @Ezra-Yu
-
Move process_mmdet_results into inference.py (#831) @ly015
-
Update resource limit (#783) @jin-s13
-
Use COCO 2D pose model in 3D demo examples (#785) @ly015
-
Change model zoo titles in the doc from center-aligned to left-aligned (#792, #797) @ly015
-
Update out-of-date configs (#827) @jin-s13
-
Remove opencv-python-headless dependency by albumentations (#833) @ly015
-
Update QQ QR code in README_CN.md (#832) @ly015
Highlights
-
Support "ViPNAS: Efficient Video Pose Estimation via Neural Architecture Search" CVPR'2021 (#742,#755).
-
Add webcam demo tool (#729)
New Features
-
Support "ViPNAS: Efficient Video Pose Estimation via Neural Architecture Search" CVPR'2021 (#742,#755)
-
Support Webcam demo (#729)
-
Support Interhand 3d demo (#704)
-
Support 3d pose video demo (#727)
-
Add scripts to generate mim metafile (#749)
Bug Fixes
-
Change model download links from
http
tohttps
(#716)
Breaking Changes
- Switch to MMCV MODEL_REGISTRY (#669)
Improvements
-
Refactor MeshMixDataset (#752)
-
Rename 'GaussianHeatMap' to 'GaussianHeatmap' (#745)
-
Update out-of-date configs (#734)
-
Improve compatibility for breaking changes (#731)
-
Enable to control radius and thickness in visualization (#722)
-
Add regex dependency (#720)
Highlights
-
Support 3d video pose estimation (VideoPose3D).
-
Support 3d hand pose estimation (InterNet).
-
Improve presentation of modelzoo.
New Features
-
Support "InterHand2.6M: A Dataset and Baseline for 3D Interacting Hand Pose Estimation from a Single RGB Image" (ECCV‘20) (#624)
-
Support "3D human pose estimation in video with temporal convolutions and semi-supervised training" (CVPR'19) (#602, #681)
-
Support bottom-up whole-body pose estimation (#689)
-
Support mmcli (#634)
Bug Fixes
-
Fix opencv compatibility (#635)
-
Fix demo with UDP (#637)
-
Fix bottom-up model onnx conversion (#680)
-
Fix
GPU_IDS
in distributed training (#668)
Breaking Changes
-
Reorganize configs by tasks, algorithms, datasets, and techniques (#647)
-
Rename heads and detectors (#667)
Improvements
-
Add
radius
andthickness
parameters in visualization (#638) -
Add
trans_prob
parameter inTopDownRandomTranslation
(#650) -
Switch to
MMCV MODEL_REGISTRY
(#669)
Highlights
-
Support animal pose estimation with 7 popular datasets.
-
Support "A simple yet effective baseline for 3d human pose estimation" (ICCV'17).
New Features
-
Support "A simple yet effective baseline for 3d human pose estimation" (ICCV'17) (#554,#558,#566,#570,#589)
-
Support animal pose estimation (#559,#561,#563,#571,#603,#605)
-
Support Horse-10 dataset (#561), MacaquePose dataset (#561), Vinegar Fly dataset (#561), Desert Locust dataset (#561), Grevy's Zebra dataset (#561), ATRW dataset (#571), and Animal-Pose dataset (#603)
-
Support bottom-up pose tracking demo (#574)
-
Support NMS for bottom-up (#609)
Bug Fixes
-
Fix bugs in the top-down demo, when there are no people in the images (#569).
-
Fix the links in the doc (#612)
Improvements
-
Speed up top-down inference (#560)
Highlights
-
Support Wingloss.
-
Support RHD hand dataset.
New Features
-
Support Wingloss (#482)
-
Support Human3.6m dataset for 3d keypoint detection (#518, #527)
-
Support Interhand3D model for 3d hand detection (#536)
-
Support Multi-task detector (#480)
Bug Fixes
-
Fix [email protected] calculation (#516)
-
Fix unittest (#529)
-
Fix circular importing (#542)
-
Fix bugs in bottom-up keypoint score (#548)
Improvements
Highlights
- Support DeepPose algorithm.
New Features
-
Support interhand3d dataset (#468)
-
Support Albumentation pipeline (#469)
-
Support PhotometricDistortion pipeline (#485)
-
Set seed option for training (#493)
-
Add demos for face keypoint detection (#502)
Bug Fixes
-
Change channel order according to configs (#504)
-
Fix
num_factors
in UDP encoding (#495) -
Fix configs (#456)
Breaking Changes
Improvements
-
Add README in Chinese (#462)
-
Add tutorials about configs (#465)
-
Rename
stat.py
tostats.py
(#483) -
latex to bibtex (#471)
-
Update FAQ (#466)
Highlights
-
Support fashion landmark detection.
-
Support face keypoint detection.
-
Support pose tracking with MMTracking.
New Features
-
Support fashion landmark detection (DeepFashion) (#413)
-
Support face keypoint detection (300W, AFLW, COFW, WFLW) (#367)
-
Support pose tracking demo with MMTracking (#427)
-
Support face demo (#443)
Bug Fixes
Breaking Changes
- Refactor Heads (#382)
Improvements
-
Update readme (#409, #412, #415, #416, #419, #421, #422, #424, #425, #435, #436, #437, #444, #445)
-
Add GAP (global average pooling) neck (#414)
-
Support COCO test-dev test (#433)
Highlights
-
Support more human pose estimation methods.
-
Support pose tracking.
-
Support multi-batch inference.
-
Add some useful tools, including
analyze_logs
,get_flops
,print_config
. -
Support more backbone networks.
New Features
-
Support multi-batch inference (#390)
-
Support MHP dataset (#386)
-
Support pose tracking demo (#380)
-
Support mpii-trb demo (#372)
-
Support mobilenet for hand pose estimation (#377)
-
Support ResNest backbone (#370)
-
Support VGG backbone (#370)
-
Add some useful tools, including
analyze_logs
,get_flops
,print_config
(#324)
Bug Fixes
-
Fix bugs in pck evaluation (#328)
-
Fix CrowdPose annotations and update benchmarks (#384)
-
Fix config files for aic datasets (#340)
Breaking Changes
- Rename
image_thr
todet_bbox_thr
for top-down methods.
Improvements
-
Check linting for markdown (#379)
-
Add faq.md (#350)
-
Remove PyTorch 1.4 in CI (#338)
-
Add pypi badge in readme (#329)
Highlights
-
Support more human pose estimation methods.
-
Support video pose estimation datasets.
-
Support Onnx model conversion.
New Features
-
Support MSPN (#278)
-
Support new post-processing method for MSPN & RSN (#288)
-
Support sub-JHMDB dataset (#292)
-
Support urls for pre-trained models in config files (#232)
-
Support Onnx (#305)
Bug Fixes
Breaking Changes
post_process=True|False
andunbiased_decoding=True|False
are deprecated, usepost_process=None|default|unbiased
etc. instead (#288)
Improvements
-
Set the default map_location as 'cpu' to reduce gpu memory cost (#227)
-
Support return heatmaps and backbone features for bottom-up models (#229)
-
Automatically add modelzoo statistics to readthedocs (#252)
-
Fix Pylint issues (#258, #259, #260, #262, #265, #267, #268, #270, #271, #272, #273, #275, #276, #283, #285, #293, #294, #295)
-
Support PyTorch 1.7 in CI (#274)
-
Add docs/tutorials for running demos (#263)
Highlights
-
Support more human pose estimation datasets.
-
Support more 2D hand keypoint estimation datasets.
-
Support adversarial training for 3D human shape recovery.
-
Support multi-stage losses.
-
Support mpii demo.
New Features
-
Support PoseTrack18 dataset (#220)
-
Support InterHand2.6 dataset (#202)
-
Support adversarial training for 3D human shape recovery (#192)
-
Support multi-stage losses (#204)
Bug Fixes
- Fix config files (#190)
Improvements
Highlights
-
Support HMR for 3D human shape recovery.
-
Support WholeBody human pose estimation.
-
Support more 2D hand keypoint estimation datasets.
-
Add more popular backbones & enrich the modelzoo
- ShuffleNetv2
-
Support hand demo and whole-body demo.
New Features
-
Support HMR for 3D human shape recovery (#157, #160, #161, #162)
-
Support COCO-WholeBody dataset (#133)
-
Support CMU Panoptic HandDB dataset (#144)
-
Support H36M dataset (#159)
-
Support ShuffleNetv2 (#139)
-
Support saving best models based on key indicator (#127)
Bug Fixes
Improvements
-
Add tools to transform .mat format to .json format (#126)
-
Add hand demo (#115)
-
Add whole-body demo (#163)
-
Reuse mmcv utility function and update version files (#135, #137)
-
Improve README (#176)
-
Improve version.py (#173)
Highlights
-
Add more popular backbones & enrich the modelzoo
- ResNext
- SEResNet
- ResNetV1D
- MobileNetv2
- ShuffleNetv1
- CPM (Convolutional Pose Machine)
-
Add more popular datasets:
-
Support 2d hand keypoint estimation.
-
Support bottom-up inference.
New Features
-
Support OneHand10K dataset (#52)
-
Support AIChallenger dataset (#87)
-
Support multiple backbones (#26)
-
Support CPM model (#56)
Bug Fixes
-
Fix configs for MPII & MPII-TRB datasets (#93)
-
Fix the bug of missing
test_pipeline
in configs (#14)
Improvements
-
Update benchmark (#93)
-
Add Dockerfile (#44)
-
Improve unittest coverage and minor fix (#18)
-
Support CPUs for train/val/demo (#34)
-
Support bottom-up demo (#69)
-
Add tools to publish model (#62)
Highlights
- MMPose is released.
Main Features
-
Support both top-down and bottom-up pose estimation approaches.
-
Achieve higher training efficiency and higher accuracy than other popular codebases (e.g. AlphaPose, HRNet)
-
Support various backbone models: ResNet, HRNet, SCNet, Houglass and HigherHRNet.