Izvorni znanstveni članak
https://doi.org/10.7906/indecs.24.4.5
Data-Driven Tactical Archetypes in European Professional Football: A K-Means Clustering Approach
Tomislav Medić
orcid.org/0009-0000-3573-5603
; University of Zagreb, Faculty of Economics & Business, Zagreb, Croatia
*
Antonio Pavlečić
; University of Zagreb, Faculty of Economics & Business, Zagreb, Croatia
Mirjana Pejić Bach
; University of Zagreb, Faculty of Economics & Business, Zagreb, Croatia
* Dopisni autor.
Sažetak
Positional labels in football no longer adequately reflect the fluid and role-based nature of modern play. This study examines whether player roles can be identified directly from match performance data, without relying on predefined positional categories. Using a dataset of 1988 outfield players from the top five European leagues (2024/2025 season), K-means clustering was applied to eight per-appearance performance metrics. The optimal number of clusters was determined using the Elbow method and Silhouette scores. The analysis produced five statistically distinct player archetypes: Conservative/Disciplined players, Defensive Enforcers, Primary Attackers/Finishers, Active Wingers, and Creative Playmakers. The results demonstrate that data-driven clustering captures functional differences in player behaviour that are not reflected in traditional positional classifications. In particular, the model distinguishes clearly between defensive disruption roles, high-output attacking profiles, and intermediary creative functions. These findings support a shift from position-based to behaviour-based player evaluation. The study contributes to football analytics by providing an interpretable framework for tactical archetyping based on unsupervised learning. Practical implications relate to scouting and recruitment, where player identification can be aligned with functional performance profiles. Limitations arise from the absence of defensive tracking data, and future research should incorporate richer datasets and alternative clustering approaches.
Ključne riječi
Football; K-means clustering; tactical archetypes; performance metrics; unsupervised machine learning
Hrčak ID:
347876
URI
Datum izdavanja:
31.8.2026.
Posjeta: 0 *