-
Notifications
You must be signed in to change notification settings - Fork 65
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Wyscout v3 events sometimes have 10 player formations for (opponent) team #315
Comments
In my personal opinion, it is always better for the data to reflect reality as accurately as possible. In our case, it is important to know our own and our opponent's formation, as we analyze "behavior" with different schemes and clearly when there is one or more players less on the field, that changes. Likewise, it would have to be seen what most users use Kloppy for, since these "own" issues can be solved, as until now, by performing our own processing on eventing data, in this case. |
@koenvo @JanVanHaaren @probberechts thoughts? I'd like to start implementing this |
@dvilches thanks for sharing your take on option A vs B. I understand your need of having an accurate description of a team's behaviour to perform qualitative performance analysis. Since I'm using kloppy for reading in data from different providers, the aspect that we have a standardized output for different input vendors is more importantly for my use case than the level of detail that we get extra. Therefore, my preference for option B. In the future, however, I think we should elaborate the possible Enum values of For Wyscout, we can get the X player formation directly from the |
Hi @DriesDeprest, I agree with your perspective. That's why we're clarifying that we can resolve this issue "outside" of Kloppy, and that a quick solution for most users is more important than the "best solution" for us. |
I don't have a strong opinion but I'm leaning towards option B. In an ideal world, kloppy would be able to represent the actual formations for both teams at each point in a match, but the information that the data providers are offering might be too limited in some cases. |
I would probably do these formations differently. Either you standardize them and you create a very limited list of standard formations (including 4-4-2, 4-3-3, etc.) and anything that does not fit into it is a This needs a lot of refinement, but I would do something like this. class TeamFormation:
def __init__(self, formation):
self.formation = formation
self.lines, self.formation_type = self._parse_formation(formation)
def _parse_formation(self, formation):
try:
lines = [int(number) for number in formation.split("-")]
formation_type = FormationType.NON_STANDARD # if you want an attempt to standardize it can still be made here
return lines, formation_type
except ValueError:
raise ValueError("Invalid formation string. Use a format like '4-4-2'.")
# Create a 4-4-2 formation
formation = TeamFormation("4-4-2") |
@probberechts I like your approach and agree that this would be a better way to handle formations in general. Do you think it would make sense to first finish merge the PR which uses option B to handle <11 players formations in Wyscout V3, so that at least the code does not break? And that after that we create a separate issue for introducing a |
For a quick fix, I strongly prefer Option C. Option C:
|
Okay, I've implemented option C in #330. Let me know what you think. |
I noticed that the team / opponent team object in the events of Wyscout v3 event data, doesn't always have a formation where the total amount of players summed up equals to 11. For example, I've seen occurrences of events with a "4-4-1" (opponent) team formation. Currently, our serializer crashes when this is the case, as it does not recognize this formation.
When analysing the event data of a match where we have the 'troublesome' formations, I saw that this was the result of a team getting a red card booking and in the events that followed the formation of that team was described by a 10 player formation.
The team originally had a "4-4-1-1" formation, but after the red card this shifted to "4-4-1".
How do we want to handle this?
Option A:
Option B:
I think my preference would go to option B, to have standard behaviour across different providers.
The text was updated successfully, but these errors were encountered: