Draft Smart: Identifying Hidden Value in WNBA Expansion

Authors

Ella Sylvester

Isabella Ferrandiz

Jada Douthit

Published

July 25, 2025

Introduction

In the WNBA, player salaries are governed by the league’s Collective Bargaining Agreement (CBA), which imposes strict financial constraints on team rosters. Each team must remain under a designated salary cap, and salary distributions often vary depending on a player’s experience. During an expansion draft, a new franchise can select one player from each existing team to build its roster. However, each team is allowed to “protect” a limited number of players from being drafted. With the current CBA set to expire at the end of this season, a significant number of players are entering 2026 as unrestricted free agents. Placing a constraint on expansion teams, as they are only allowed to select one unrestricted free agent that is not protected. Previous expansion draft rules allowed current teams to protect up to six players. With two teams coming into the 2026 WNBA Expansion Draft, we are making the assumption that current teams are going to be allowed to only protect 5 players.

Our main research question is: How can we use player performance and salary data to optimize an expansion draft strategy in the WNBA?

The goal of this project is to predict which players will be left unprotected by their current 2025 team and from this pool, quantify salary inefficiencies by comparing the player performance to their current compensation. Ultimately, we aim to recommend players who would be strategic targets for an expansion team seeking to maximize talent. However, there are some factors that front offices take into account that may not be publicly available when evaluating available players such as risks or inside intel.

Data

For our research, we web scraped publically available data from three sites: salary data from Spotrac, stats from Her Hoop Stats, and historical data from Basketball-Reference.

We evaluated a dataset that consisted of 150–160 players for the 2023 to 2025 seasons. During the 2023 and 2024 season, there were only 12 teams to evaluate, with an additional expansion team added in the 2025 season. We looked at three main components for each player. First is the player demographics, which consist of their name, age, position, and team affiliation. Next, we looked at performance metrics, which include box score statistics (per-game stats) and advanced box score stats (such as player efficiency ratings). Lastly, we examined financial metrics, which include salary earnings, years remaining on the contract, and contract status.

Table: Star players' salary and performance metrics.
Player Salary PER Position Team True Shooting % Effective FG % Total Rebounding %
Aliyah Boston $83,371  24.7 F-C IND 61.4% 57.1% 1,510.0%
Caitlin Clark $78,066  17.2 G IND 49.1% 44.1% 960.0%
Napheesa Collier $214,284  29.9 F MIN 62.1% 55.7% 1,320.0%
A'ja Wilson $200,000  32.1 C LVA 57.6% 49.3% 1,750.0%

Exploratory Data Analysis

We were curious to explore the relationship between on-court production and salary in the WNBA. Specifically, we examined how player efficiency rating (PER) corresponds to 2025 salary across the league. Important salary benchmarks in the league include the rookie-scale, maximum, and supermax contracts; we aimed to uncover where mismatches occur, including efficient players on low-paying rookie deals or veterans earning near the salary cap despite middling production. Highlighting standout players further helped surface cases of potential undervaluation or overpayment.

This scatterplot shows player efficiency rating (PER) vs. salary for active players in the 2025 WNBA season.

Methods

To help quantify player value in the WNBA, we fit and trained an XGBoost model to predict player salaries based on performance and contract-related factors. The model used three key variables: a multi-year weighted PER (with recent seasons weighted more heavily), player age, and the player’s share of their current team’s salary cap. Our goal was to identify players who may be underpaid relative to their on-court contributions. We focused on positive residuals, cases where the model predicted a higher salary than the player currently earns, as a way to flag potential value picks for expansion.

To evaluate model performance, we used 5-fold cross-validation. The dataset was divided into five parts; each fold served once as a test set, while the remaining folds were used for training. We compared five models:

  • Linear Regression

  • Polynomial Regression, adding a squared term for age

  • LASSO Regression, using L1 regularization to penalize complexity and perform feature selection

  • Ridge Regression, using L2 regularization to shrink coefficients and reduce variance;

  • XGBoost, a gradient boosting algorithm that builds an ensemble of shallow trees trained sequentially on residuals.

All models were evaluated using Root Mean Squared Error (RMSE). RMSE was summarized across folds using means, standard deviations, standard errors, and 95% confidence intervals; the XGBoost model was selected as the best performing model. Unlike a single large decision tree that may overfit, boosting improves performance incrementally by focusing each tree on correcting the errors of the current model. A learning rate controls how much each tree contributes, helping to balance flexibility and generalization.

To predict which players each WNBA team would protect in the expansion draft, we created a heuristic player ranking formula. It assigned a performance score to each player based on: Player Efficiency Rating (PER), Win Shares (WS), years remaining on their contract, and age.

  • PER and WS were selected to represent on-court impact, individual efficiency and contribution to team success.

  • Years remaining on contract, assuming teams are likely to retain players with long-term control.

  • Age, assuming teams will prioritize younger players with growth potential.

Individually, PER and WS were aggregated from the 2023-2025 seasons using a weighted average. Weighted performance for 2023: 10%, 2024: 30% and 2025: 60%. Reasoning for the weights was the idea that teams consider multiple years of performance, but place more emphasis on recent seasons to better reflect a player’s current play and impact. Variables were standardized using scale() to account for differences in scale and magnitude.

Initial weights in the ranking formula were chosen based on domain knowledge:

  • Weighted PER and Weighted WS were used to capture on-court performance

  • Contract years to represent a team want to retain players under contract.

  • Age penalty to players over 35 to reflect decreased likelihood of protection due to aging.

The formula combined the weighted metrics to produce a performance score, and players were ranked within their team. The top five scored players were assumed to be protected. Explore our Shiny App to see all team breakdowns.

\text{PER}(0.3) + \text{WS}(0.2) + \text{Contract Years}(0.1) + \text{Age penalty}(-0.3)

Evaluation of formula

To evaluate the robustness of the heuristic formula, a series of model iterations called “runs” were implemented. Each run applied a different combination of weights to metrics used in the protection formula.

For each run, the formula recalculates a score for every player and identifies the top five players on each team as “protected.” Dozens of runs were generated using different weight combinations, simulating a wide range of decision-making possibilities. This approach allowed observations of players who were consistently protected and which were more sensitive to weight changes. By tracking these protection decisions across runs, each player’s protection rate was calculated. It assessed the stability of the protection formula and identified swing players( those whose protection status changed frequently depending on weights).

Results

To determine the final expansion draft targets, we began with the full pool of unprotected players. Players were first categorized by position: centers, forwards, and guards. Once players were grouped by position, we ranked them in descending order based on their residuals from the salary prediction model. This highlighted players whose actual salaries were lower than their predicted value, showing undervalued talent.

The selection process prioritized players with the highest positive residuals within each position group, helping us identify the most undervalued prospects. This approach ensured our recommendations balanced positional needs while maximizing potential value across all key positions.

Our final results are a list of players we believe would be great prospects in the upcoming expansion draft due to their contact status, age, experience, and ability.

Top Expansion Draft Prospects
Name Age Position Team Actual Salary Predicted Salary Residual
Sug Sutton 26 G WAS $78,831 $89,522 $10,691.03
Janelle Salaun 23 F GSV $66,079 $76,203 $10,124.32
Marquesha Davis 24 G NYL $71,753 $79,068 $7,315.49
Makayla Timpson 22 F-C IND $69,267 $73,866 $4,599.10
Nyara Sabally 25 C NYL $79,999 $82,960 $2,961.34
Maya Caldwell 26 G ATL $66,079 $67,742 $1,663.48
Olivia Nelson-Ododa 24 C CON $80,823 $81,554 $731.49
Julie Vanloo 32 G LAS $66,079 $66,723 $643.55
Li Yueru 26 C DAL $66,079 $66,497 $418.02
Naz Hillmon 25 F ATL $80,823 $76,889 -$3,934.27
Kayla McBride 33 G MIN $205,000 $199,756 -$5,243.61

Discussion

We faced several limitations in our research and data collection. Public information on player protection status from past WNBA expansion drafts is extremely limited, and frequent roster changes, due to injuries, overseas commitments, personal matters, and contract movement, made it challenging to maintain consistent and accurate data. Since the 2025 WNBA season is still ongoing, shifting lineups and salaries may further affect the reliability of our predictions.

Looking ahead, upcoming changes to the WNBA’s Collective Bargaining Agreement (CBA), expected to reset later this year, will likely significantly reshape league economics. Players are pushing for higher salaries, revenue sharing, expanded benefits, and a reduced draft age. These changes will likely influence salary distributions and roster-building strategies and evolving landscape in the WNBA presents opportunity. Future research could incorporate projected CBA changes and player availability to build more robust models, simulating how an expansion team might perform competitively and financially in seasons to come.

Acknowledgments

Special thanks to James Hyman, Quang Nguyen, Dr. Ron Yurko and the CMSACamp TA’s for their feedback and guidance during the research process.