This folder contains the different baselines for the action spotting task on SoccerNet-V2.
This table summarizes the performances of the published papers on SoccerNetv2 until August 2021. For a more updated leaderboard, please visit EvalAI test and challenge leaderboards.
This is the code for the paper Camera Calibration and Player Localization in SoccerNet-v2 and Investigation of their Representations for Action Spotting (CVSports2021), that leverages field and players localization for action spotting.
Temporally Aware Pooling: NetVLAD++
This is the code for the paper Temporally-Aware Feature Pooling for Action Spotting in Video Broadcasts (CVSports2021), that introduces the baseline NetVLAD++, among other temporally-aware feature pooling modules.
CALF: A Context-Aware Loss Function for Action Spotting in Soccer Videos
A custom loss function is used to explicitly model the temporal context around action spots. The main idea behind this loss is to penalize the frames far-distant from the action and steadily decrease the penalty for the frames gradually closer to the action. The frames just before the action are not penalized to avoid providing misleading information as its occurrence is uncertain. However, those just after the action are heavily penalized as we know for sure that the action has occurred.
Pool: NetVLAD and MaxPool
Those baseline are based on the pooling methods for action spotting introduced in the original SoccerNet dataset.