Published April 28, 2025 | Version v3
Dataset Open

NBA Player Dataset & Prediction Model Artifacts

  • 1. ROR icon TU Wien

Description

Description

 

This dataset contains end-of-season box-score aggregates for NBA players over the 2012–13 through 2023–24 seasons, split into training and test sets for both regular season and playoffs. Each CSV has one row per player per season with columns for points, rebounds, steals, turnovers, 3-pt attempts, FG attempts, plus identifiers.

 

Brief overview of Files

  1. end-of-season box-score aggregates (2012–13 – 2023–24) split into train/test;

  2. the Jupyter notebook (Analysis.ipynb); All the code can be executed in there

  3. the trained model binary (nba_model.pkl); Serialized Random Forest model artifact

  4. Evaluation plots (LAL vs. whole‐league) for regular & playoff predictions are given as png outputs and uploaded in here

  5. FAIR4ML metadata (fair4ml_metadata.jsonld);
    see README.md and abbreviations.txt for file details.”

  6. For further information you can go to the github site (Link below)

File Details

Notebook

Analysis.ipynb: Involves the graphica output of the trained  and tested data.

 

Trained/ Test csv Data

Name Description PID
regular_train.csv For training purposes, the seasons 2012-2013 through 2021-2022 were selected as training purpose 4421e56c-4cd3-4ec1-a566-a89d7ec0bced
regular_test.csv: For testing purpose of the regular season, the 2022-2023 season was selected f9d84d5e-db01-4475-b7d1-80cfe9fe0e61
playoff_train.csv For training purposes of the playoff season, the seasons 2012-2013 through 2022-2023 were selected  bcb3cf2b-27df-48cc-8b76-9e49254783d0
playoff_test.csv For testing purpose of the playoff season, 2023-2024  season was selected de37d568-e97f-4cb9-bc05-2e600cc97102

Others

abbrevations.txt: Involves the fundemental abbrevations of the columns in csv data

 

Additional Notes 

Raw csv files are taken from Kaggle (Source: https://www.kaggle.com/datasets/shivamkumar121215/nba-stats-dataset-for-last-10-years/data)

Some preprocessing has to be done before uploading into dbrepo 

Plots have also been uploaded as an output for visual purposes.

A more detailed version can be found on github (Link: https://github.com/bubaltali/nba-prediction-analysis/)

Files

abbrevations.txt

Files (4.3 MiB)

Name Size
md5:3c0d2d594b575f86303e9d1e7b4d84cb
1.5 KiB Preview Download
md5:4cbf38044d79807ea36951fac80c43a8
548.0 KiB Preview Download
md5:b527fe022fb6a8c1443f7df7041d0059
903 Bytes Preview Download
md5:f9cc57e240a734b33f7b77d2aad5ac89
7.0 KiB Download
md5:d5b056c52d9fb3b6299ac5d21d40a1b0
1.7 MiB Download
md5:b4ca1b3e6c943562fe94989dc36b23a5
41.2 KiB Preview Download
md5:2524e4d5783d9385cec39cc2f19a2f2f
120.4 KiB Preview Download
md5:1b63f2271aae326bf98ed81cf2af7b6c
98.5 KiB Preview Download
md5:5f9a58274ffe4010f074fca017597012
115.9 KiB Preview Download
md5:5a2fdbcda527e1b046133dc6e7f09e5e
4.1 KiB Preview Download
md5:e6e56f93704aee7f83cdffa5d7e9ffce
103.6 KiB Preview Download
md5:7fd008027a2810234555ac423dda0d4b
144.0 KiB Preview Download
md5:2a96b6fe280ae7ed63d310dd7c2c1abd
411.7 KiB Preview Download
md5:fd6901eedc5a5f03699f5631dbf46dc9
1.1 MiB Preview Download

Additional details