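This repo implements models for action recognition, action detection (temporal localization), and shot boundary detection. The table below lists each model with its config, dataset, evaluation metric, and downloadable weights.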
Field | Model | Config | Dataset | Metric | Value (%) | Download |
---|---|---|---|---|---|---|
action recognition | PP-TSM | pptsm.yaml | Kinetics-400 | Top-1 | 76.16 | PPTSM.pdparams |
action recognition | PP-TSN | pptsn.yaml | Kinetics-400 | Top-1 | 75.06 | PPTSN.pdparams |
action recognition | AGCN | agcn.yaml | FSD | Top-1 | 62.29 | AGCN.pdparams |
action recognition | ST-GCN | stgcn.yaml | FSD | Top-1 | 59.07 | STGCN.pdparams |
action recognition | TimeSformer | timesformer.yaml | Kinetics-400 | Top-1 | 77.29 | TimeSformer.pdparams |
action recognition | SlowFast | slowfast_multigrid.yaml | Kinetics-400 | Top-1 | 75.84 | SlowFast.pdparams |
action recognition | TSM | tsm.yaml | Kinetics-400 | Top-1 | 70.86 | TSM.pdparams |
action recognition | TSN | tsn.yaml | Kinetics-400 | Top-1 | 69.81 | TSN.pdparams |
action recognition | AttentionLSTM | attention_lstm.yaml | YouTube-8M | Hit@1 | 89.0 | AttentionLstm.pdparams |
action detection | BMN | bmn.yaml | ActivityNet | AUC | 67.23 | BMN.pdparams |
shot boundary detection | TransNetV2 | transnetv2.yaml | ClipShots | F1 score | 76.1 |
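
Most rows in the table report Top-1 accuracy as a percentage. As a rough illustration of what that number means, the sketch below computes it from per-class scores. It is a generic, minimal example and not this repo's evaluation code; the function name `top1_accuracy` and the toy scores are hypothetical.

```python
def top1_accuracy(scores, labels):
    """Return Top-1 accuracy in percent.

    scores: list of per-sample lists, one score per class (hypothetical layout).
    labels: list of ground-truth class indices, one per sample.
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one sample is required")

    correct = 0
    for row, label in zip(scores, labels):
        # The predicted class is the index of the highest score.
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return 100.0 * correct / len(labels)


# Toy data: 4 samples, 3 classes. The last sample is misclassified.
scores = [
    [0.1, 0.7, 0.2],
    [0.6, 0.3, 0.1],
    [0.2, 0.2, 0.6],
    [0.5, 0.4, 0.1],
]
labels = [1, 0, 2, 1]

print(top1_accuracy(scores, labels))  # 3 of 4 correct, prints 75.0
```

A value such as 76.16 in the table would, under this definition, mean that the highest-scoring class matched the label for 76.16% of the evaluation samples.
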
- Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification, Xiang Long, Chuang Gan, Gerard de Melo, Jiajun Wu, Xiao Liu, Shilei Wen
- BMN: Boundary-Matching Network for Temporal Action Proposal Generation, Tianwei Lin, Xiao Liu, Xin Li, Errui Ding, Shilei Wen
- SlowFast Networks for Video Recognition, Christoph Feichtenhofer, Haoqi Fan, Jitendra Malik, Kaiming He
- Temporal Segment Networks: Towards Good Practices for Deep Action Recognition, Limin Wang, Yuanjun Xiong, Zhe Wang, Yu Qiao, Dahua Lin, Xiaoou Tang, Luc Van Gool
- Temporal Shift Module for Efficient Video Understanding, Ji Lin, Chuang Gan, Song Han
- Is Space-Time Attention All You Need for Video Understanding? Gedas Bertasius, Heng Wang, Lorenzo Torresani
- Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition, Sijie Yan, Yuanjun Xiong, Dahua Lin
- Two-Stream Adaptive Graph Convolutional Networks for Skeleton-Based Action Recognition, Lei Shi, Yifan Zhang, Jian Cheng, Hanqing Lu
- Skeleton-Based Action Recognition with Multi-Stream Adaptive Graph Convolutional Networks, Lei Shi, Yifan Zhang, Jian Cheng, Hanqing Lu
- TransNet V2: An effective deep network architecture for fast shot transition detection, Tomáš Souček, Jakub Lokoč