Application of Hierarchical Clustering to NBA Player Statistics (2024–2025 Season)

Authors

  • Fabricio-Javier Rivadeneira-Zambrano Universidad Laica Eloy Alfaro de Manabí, ULEAM
  • Silvia-Mercedes Carvajal-Rivadeneira Unidad Educativa Julio Pierregrosse, UEJP
  • Rodolfo-Andrés Rivadeneira-Zambrano Universidad Técnica de Manabí, UTM
  • Isaac-Fabricio Rivadeneira-Carvajal Universidad Laica Eloy Alfaro de Manabí, ULEAM
  • Joshua-Javier Rivadeneira-Carvajal Universidad Laica Eloy Alfaro de Manabí, ULEAM

DOI:

https://doi.org/10.56124/encriptar.v9i17.010

Keywords:

clustering, NBA, dissimilarity measures, segmentation, unsupervised learning

Abstract

Hierarchical clustering was applied to a dataset containing aggregated statistics for 731 NBA players from the 2024–2025 season. The goal was to uncover performance profiles without predefined labels, using an unsupervised, distance-based approach. We considered quantitative variables capturing offensive output (points, rebounds, assists), usage/volume (minutes played, field-goal and free-throw attempts), and defensive contributions (steals, blocks), together with indicators such as turnovers and personal fouls. After preprocessing to make variables comparable across scales (normalization), a dissimilarity matrix was computed and a dendrogram was constructed; the final partition was selected by cutting the hierarchy using separation and internal consistency criteria. The analysis yielded three clearly differentiated groups: a high-usage, high-production cluster; a role/support cluster with lower usage and more balanced contributions; and an intermediate cluster characterized by moderate involvement paired with relatively efficient production. Overall, the resulting segmentation supports player comparison, summarizes dominant season patterns, and provides a practical baseline for downstream tasks such as roster construction, and decision support.

Downloads

Download data is not yet available.

References

Basketball-Reference. (2025). NBA player totals — 2024–25 season. Recuperado el 3 de febrero de 2026, de https://www.basketball-reference.com/leagues/NBA_2025_totals.html

Dhulipala, L., Eisenstat, D., Lącki, J., Mirrokni, V., & Shi, J. (2021). Hierarchical agglomerative graph clustering in nearly-linear time (arXiv:2106.05610).

IBM Corp. (2021). IBM SPSS Statistics Algorithms (Version 27). IBM. https://public.dhe.ibm.

com/software/analytics/spss/documentation/statistics/27.0/en/client/Manuals/IBM_

SPSS_Statistics_Algorithms.pdf

IBM Corp. (s. f.). Hierarchical cluster analysis method. IBM Documentation. Recuperado el 3 de febrero de 2026, de https://www.ibm.com/docs/en/spss-statistics/cd?topic=analysis-hierarchical

Miyamoto, S. (2022). Theory of agglomerative hierarchical clustering. Springer.

Published

2026-02-26

How to Cite

Rivadeneira-Zambrano, F.-J. ., Carvajal-Rivadeneira, S.-M. ., Rivadeneira-Zambrano, R.-A. ., Rivadeneira-Carvajal, I.-F. ., & Rivadeneira-Carvajal, J.-J. . (2026). Application of Hierarchical Clustering to NBA Player Statistics (2024–2025 Season). Scientific Journal of Informatics ENCRYPT - ISSN: 2737-6389., 9(17), 193–208. https://doi.org/10.56124/encriptar.v9i17.010