2026 NCAA Stats + Scouting Model

Moderators: Duke4life831, Marcus

greenOakX
Freshman
Posts: 62
And1: 56
Joined: Jan 14, 2025
 

2026 NCAA Stats + Scouting Model 

Post#1 » by greenOakX » Sun Jan 4, 2026 8:42 am

I’ve been working on a draft model designed to predict peak NBA performance using a combination of statistical production, anthropometric measurements, and scouting evaluations. I plan to continue refining the model, but I believe it’s already at a stage where it’s worth sharing. Below, I’ve presented the model’s current top 30 NCAA prospects for the 2026 draft class.

P.S. (Jan 8, 2026) - Model includes all NCAA players ranked on NBAdraft.net or ESPN top 100 big boards with the exception of Alijah Arenas (19 on NBAdraft.net, unranked on ESPN) who has not played an NCAA game.


Methodology (skip if you wish)

To measure peak performance, I use peak DPM. DPM is a hybrid box-score and regression-based metric, similar in spirit to EPM, LEBRON, xRAPM, or ESPN’s now-defunct RPM. I chose DPM for two main reasons. First, in a poll conducted a few years ago, NBA executives voted DPM the best publicly available metric, narrowly edging out EPM. Second, peak DPM data is freely accessible via nbarapm.com, whereas EPM requires a subscription. I also avoided RAPM because its estimates are highly unreliable for players with small minute samples, which is common in this dataset.

The dataset includes all drafted prospects from 2008 through 2021. I rely on the Barttorvik database for college basketball data, as it is both comprehensive and easy to work with, though it only extends back to 2008. I selected 2021 as the cutoff because more recent draftees still have significant room to develop before reaching their peak. While some 2020–21 draftees may still improve meaningfully, extending the cutoff earlier allows for a larger and more stable sample of players who have plausibly reached their peak performance.

For scouting input, I use historical Top 100 draft boards from NBADraft.net, as well as ESPN’s Top 100 boards when available. NBADraft.net provides complete boards going back to 2008, while ESPN boards were accessible for many—but not all—years. I am open to incorporating additional scouting sources, provided they cover multiple seasons within the 2008–2021 range.

Anthropometric data is sourced primarily from the NBA’s official website. When that information is unavailable, I supplement it with data from CraftedNBA or NBADraft.net to at least capture height and weight. If a player’s wingspan is not listed, I assume it to be 3.5 inches greater than height, which is approximately the NBA average. Since no official measurements exist yet for 2026 prospects, current projections rely on their listed height and weight from NBADraft.net, along with the same assumed +3.5-inch wingspan.

Model Inputs

The model currently incorporates the following inputs:

Consensus scouting rank*
Position
Draft age
BPM
Offensive rebound rate
Assist rate
Steal rate
Block rate
Position-adjusted wingspan**
Predicted three-point percentage***

* Consensus scouting rank: The average of a prospect’s ranking across multiple scouting boards (currently NBADraft.net and ESPN). Unranked prospects are assigned a default rank of 125.

** Position-adjusted wingspan: The difference between a player’s wingspan and the average wingspan for their listed position within the dataset. Positive values indicate longer-than-average wingspans; negative values indicate shorter-than-average wingspans.

*** Predicted three-point percentage: Career three-point percentage is estimated via linear regression using free-throw percentage, three-point percentage, and three-point attempt rate as explanatory variables.

Further Notes

L1 regularization was used during development to inform variable selection, but the final model applies no regularization. In testing, regularization did not meaningfully reduce out-of-sample mean squared error. The variables listed above represent the most comprehensive feature set I found that preserved predictive performance without introducing excessive overfitting.

Simpler specifications—such as models excluding anthropometric variables or ignoring block rate for non-bigs—produce similar out-of-sample error, but I prefer this model since it considers the most facets of the game.

P.S. (Jan 8) - Should've mentioned that players who do play a game in the NBA are assigned a peak DPM of -2 which appears to be close to what DPM uses as a default prior is for an undrafted player. Moreover, players whose peak DPM is less than -2 are assigned a peak DPM value of -2.


***********************************************
PROSPECTS (As of Jan 6, 2026)
***********************************************

RankProspectPositionCollegeClassConsensusEstimated Peak DPM
1Cameron BoozerPFDukeFr3.03.48
2Darryn PetersonPG/SGKansasFr1.52.60
3AJ DybantsaSFBYUFr1.52.44
4Caleb WilsonPFNorth CarolinaFr4.51.91
5Kingston FlemingsPGHoustonFr6.01.85
6Yaxel LendeborgPF/CMichiganSr17.01.52
7Hannes SteinbachPFWashingtonFr12.50.95
8Labaron PhilonPGAlabamaSo14.50.91
9Patrick NgongbaCDukeSo24.50.82
10Nate AmentSF/PFTennesseeFr7.00.78
11Jayden QuaintancePF/CKentuckySo21.50.58
12Zuby EjioforPF/CSt. John'sSr34.50.56
13Joseph TuglerSF/PFHoustonJr59.50.54
14Bennett StirtzPGIowaSr15.50.41
15Joshua JeffersonPFIowa StateSr54.50.37
16Mikel Brown Jr.PGLouisvilleFr4.50.36
17Koa PeatPFArizonaFr16.00.33
18JT ToppinPF/CTexas TechJr38.50.33
19Morez Johnson Jr.PFMichiganSo45.00.27
20Miles ByrdSGSan Diego St.Jr32.00.27
21Keaton WaglerSGIllinoisFr41.00.26
22Matt AbleSGNC StateFr26.50.17
23Tounde YessoufouSGBaylorFr22.00.16
24Thomas HaughSFFloridaJr14.00.15
25Tyler TannerPGVanderbiltSo97.00.15
26Darius Acuff Jr.PG/SGArkansasFr16.00.14
27Alex CondonPF/CFloridaJr37.50.14
28Aday MaraCMichiganJr59.00.13
29Isaiah EvansSG/SFDukeSo18.50.04
30Dailyn SwainSF/PFTexasJr60.00.03
31Meleek ThomasPG/SGArkansasFr21.00.02
32Cameron CarrSGBaylorSo15.0-0.10
33Tarris Reed Jr.CConnecticutSr67.5-0.12
34Braylon MullinsSGConnecticutFr14.0-0.18
35Johann GrunlohPF/CVirginiaFr49.0-0.19
36Acaden LewisPGVillanovaFr110.0-0.22
37Henri VeesaarPF/CNorth CarolinaJr28.5-0.26
38Brayden BurriesPG/SGArizonaFr33.5-0.28
39Eric ReibeCConnecticutFr42.0-0.28
40Motiejus KrivasCArizonaJr61.0-0.30
41KJ LewisSGGeorgetownJr72.0-0.30
42Christian AndersonPGTexas TechSo52.0-0.31
43Dame SarrSG/SFDukeFr33.5-0.31
44Nate BittleCOregonSr72.5-0.33
45David PunchPFTCUSo91.5-0.36
46Richie SaundersPG/SGBYUSr47.0-0.36
47Anthony Robinson IIPG/SGMissouriJr102.5-0.36
48Magoon GwathPF/CSan Diego St.So42.5-0.46
49David MirkovicPFIllinoisFr88.0-0.47
50Zvonimir IvisicPF/CIllinoisJr110.0-0.47
51Flory BidungaPF/CKansasSo72.5-0.48
52Alex KarabanSF/PFConnecticutSr44.0-0.55
53Elyjah FreemanSG/SFAuburnSo70.0-0.57
54Amaël L’EtangCDaytonSo69.0-0.60
55Chris Cenac JrPF/CHoustonFr10.5-0.61
56Boogie FlandPGFloridaSo57.5-0.64
57Braden SmithPGPurdueSr80.5-0.67
58Ja'Kobi GillespiePGTennesseeSr110.5-0.69
59Baba MillerCCincinnatiSr80.0-0.70
60Nolan WinterPF/CWisconsinJr81.5-0.74
61Kwame Evans Jr.PFOregonJr101.5-0.75
62Oscar CluffPF/CPurdueSr109.5-0.77
63Malik ReneauSF/PFMiami FLJr100.0-0.85
64Dillon MitchellPFSt. John'sSr94.5-0.86
65Paul McNeil, Jr.SGN.C. StateSo92.5-0.86
66Ryan ConwellSGLouisvilleSr93.5-0.88
67Neoklis AvdalasSG/SFVirginia TechFr24.0-0.91
68Collin ChandlerSGKentuckySo67.5-0.92
69Emanuel SharpPG/SGHoustonSr106.5-0.94
70PJ HaggertyPGKansas St.Jr82.5-0.96
71Jaden BradleyPGArizonaSr108.5-0.97
72Karter KnoxSFArkansasSo54.0-0.98
73Wesley Yates IIISGWashingtonSo64.5-1.00
74Ian JacksonSGSt. John'sSo60.0-1.01
75Kylan BoswellPG/SGIllinoisSr94.0-1.03
76Tomislav IvisicCIllinoisJr47.0-1.08
77Juke HarrisSFWake ForestSo84.0-1.12
78Darrion WilliamsSFN.C. StateSr83.5-1.12
79Robert WrightPG/SGBYUSo108.5-1.12
80Moustapha ThiamCCincinnatiSo78.0-1.14
81Tahaad PettifordPGAuburnSo66.0-1.15
82Keyshawn HallSG/SFAuburnSr77.5-1.16
83Baye NdongoPF/CGeorgia TechJr105.0-1.16
84Jalen HaralsonSG/SFNotre DameFr97.0-1.18
85Solo BallSGConnecticutJr57.5-1.22
86Taylor Bol BowenSFAlabamaJr110.5-1.25
87Bruce ThorntonPG/SGOhio St.Sr102.5-1.26
88Tyrone Riley IVSFSan FranciscoSo101.5-1.29
89Otega OwehSGKentuckySr107.0-1.31
90Malique EwinPF/CArkansasSr104.0-1.32
91Pryce SandfortSG/SFNebraskaSo106.0-1.38
92Mouhamed SyllaCGeorgia TechFr95.0-1.39
93John BlackwellSGWisconsinJr101.0-1.40
94Andrej StojakovicSG/SFIllinoisJr60.0-1.41
95Jaland LowePGKentuckyJr112.5-1.42
96Milan MomcilovicPFIowa St.Jr95.5-1.44
97Trevon BrazilePFArkansasSr103.5-1.44
98Trey Kaufman-RennSF/PFPurdueSr105.0-1.48
99Coen CarrSFMichigan St.Jr80.5-1.48
100Tucker DeVriesSG/SFIndianaSr95.0-1.58
101Tre WhiteSGKansasSr104.5-1.58
102D.J. WagnerPG/SGArkansasJr107.5-1.61
103Mackenzie MgbakoSF/PFTexas A&MJr74.0-1.64
104Milos UzanPG/SGHoustonSr57.0-1.67
105Naithan GeorgePG/SGSyracuseJr105.5-1.67
106Nick MartinelliSFNorthwesternSr108.0-1.69
107Josh DixSGCreightonSr103.0-1.74
108Tyler HarrisSG/SFVanderbiltJr87.5-1.79
109Jasper JohnsonPG/SGKentuckyFr99.5-1.79
110Felix OkparaCTennesseeSr111.5-1.80
111Tobi LawalPFVirginia TechSr77.0-1.89
112Joson SanonSGSt. John'sSo106.0-1.96
113Jaron Pierre Jr.PG/SGSMUSr70.5-2.14
114Donald HandSGBoston CollegeJr111.0-2.35
greenOakX
Freshman
Posts: 62
And1: 56
Joined: Jan 14, 2025
 

Re: 2026 NCAA Stats + Scouting Model 

Post#2 » by greenOakX » Sun Jan 4, 2026 9:03 am

IMPROVEMENTS

1. Use past seasons (not just the current season) in projections. This should help stabilize the projections for non-freshmen who have outlier (good or bad) final seasons. High school / AAU stats could be used for freshmen but I don't have any high-school data outside of recruit rank. (DONE)

2. Position-specific projections. I tried creating position-specific projections, and while I got significantly better predictions for bigs, and decent predictions for guards, the predictions for wings were a disaster.

3. Predict peak DPM for current players using NBA performance, and use that prediction in place of peak DPM. This would account for some of the younger players in the dataset not being at their true peak. Furthermore, it may allow me to use prospects from the most recent drafts in the training set.


I'll expand the prospect list outside the top 30 once I get the data for all the relevant prospects.
greenOakX
Freshman
Posts: 62
And1: 56
Joined: Jan 14, 2025
 

Re: 2026 NCAA Stats + Scouting Model 

Post#3 » by greenOakX » Thu Jan 8, 2026 6:45 am

The OP has been updated with a complete list of prospects. Any NCAA prospect appearing on either NBADraft.net or ESPN’s Top 100 big boards is included. For players with multiple NCAA seasons, the model uses multi-year data. All statistics reflect games played through January 6, 2026.

MORE TECHNICAL DETAILS

I’m largely satisfied with the current version of the model and do not expect to make further changes for the remainder of the season. After additional tuning to reduce out-of-sample prediction error, I made a few adjustments:

Removed variables (inclusion of these variables consistently made out-of-sample prediction error worse):

Wingspan
Predicted 3-point percentage

Added variables:

Rim frequency – percentage of a player’s 2-point shots taken at the rim
Dunk rate – made dunks per minute played

MODEL PERFORMANCE

To evaluate performance, I randomly subsampled 20% of the data as test data (with the remaining 80% used for training) and repeated this process 10,000 times. Below is a summary of the results (MSE = mean squared error):

Best Linear Model – the current model
NBA Draft Order – vanilla linear regression using only draft position to predict peak DPM
Scouting Only – vanilla linear regression using only consensus scouting rank to predict peak DPM

ModelMSER^2
Best Linear Model2.03830.2647
NBA Draft Order2.08870.2451
Scouting Only2.2420.1944


The results show that actual NBA draft position is a significantly better predictor of success than pre-draft big boards. It’s unclear how much of this advantage comes from NBA teams being better evaluators versus teams investing more heavily in the development of highly drafted players. I am pleased that the current model meaningfully outperforms NBA draft position.
User avatar
babyjax13
RealGM
Posts: 36,321
And1: 18,760
Joined: Jul 02, 2006
Location: Fresno, eating Birria
     

Re: 2026 NCAA Stats + Scouting Model 

Post#4 » by babyjax13 » Mon Jan 26, 2026 3:09 am

Is there collinearity with the block/assist/steal/rebounding rates and BPM? Since BPM includes box-score stats to calculate I would imagine there is (I am not sure if it matters much or the solution here - perhaps it is to have a draft age-BPM ranking, and a draft age-box score metric ranking).

FWIW I really like this : )
Image

JazzMatt13 wrote:just because I think aliens probably have to do with JFK, doesn't mean my theory that Jazz will never get Wiggins, isn't true.

JColl
greenOakX
Freshman
Posts: 62
And1: 56
Joined: Jan 14, 2025
 

Re: 2026 NCAA Stats + Scouting Model 

Post#5 » by greenOakX » Mon Jan 26, 2026 4:44 am

babyjax13 wrote:Is there collinearity with the block/assist/steal/rebounding rates and BPM? Since BPM includes box-score stats to calculate I would imagine there is (I am not sure if it matters much or the solution here - perhaps it is to have a draft age-BPM ranking, and a draft age-box score metric ranking).

FWIW I really like this : )


Happy your interested!

There are collinearity issues with BPM. In my datababse, BPM is highly correlated with TS (r=0.617), EFG (r=0.545), and to a lesser extent TOV (r=-0.402), but I don't use any of those stats in the model. It's worth noting that BPM is more strongly correlated to scouting rank than any statistic I use in the model - despite some of those stats being directly used in BPM. The correlation between scouting rank and BPM is -0.277, while STL% has the next strongest correlation with BPM at 0.233.

That being said, if, for example, you include both BPM and AST% in your model, in a way, you are including AST% twice. It's why some variables (3PT%, from my testing) can paradoxically end up with negative coefficients. It's not that having a higher 3PT% is bad - it's that BPM is already capturing 3PT%, and that the negative coefficient spit out by the model is the model's way of saying that BPM values 3PT% too high. As a tangent, the only coefficient in my final model whose sign is opposite what you'd expect is dunk rate (negative coefficient). Still, high dunk rate tend to have better model predictions because I also include frequency of shots at the rim as a variable and high dunk guys tend to be high rim rate guys. I suspect the model is trying to fade the garbage men with hyper-efficient shooting percentages who get a large percentage of their offense through assisted dunks. As a tangent, I suspect similar reasons as to why anthropometric data did not improve performance - scouting rank already takes into account that information. This is supported by the fact that anthropometric data significantly improves the performance of a stats-only model.

Ultimately, I'm not sure how big a problem collinearity actually is. Collinearity increases the uncertainty of the estimated coefficients - it does not inherently bias the results one way or another. The estimated coefficients for BPM (and the other variables) appear to be somewhat stable across different training sets, so I don't think collinearity is wreaking havoc on the model. At the end of the day, short of doing something like PCA, there's going to be some collinearity between the inputs. I tested models that did not use BPM, but they had significantly worse out-of-sample prediction error, which is why BPM ended up making it into the model.
User avatar
babyjax13
RealGM
Posts: 36,321
And1: 18,760
Joined: Jul 02, 2006
Location: Fresno, eating Birria
     

Re: 2026 NCAA Stats + Scouting Model 

Post#6 » by babyjax13 » Mon Jan 26, 2026 4:50 am

greenOakX wrote:
babyjax13 wrote:Is there collinearity with the block/assist/steal/rebounding rates and BPM? Since BPM includes box-score stats to calculate I would imagine there is (I am not sure if it matters much or the solution here - perhaps it is to have a draft age-BPM ranking, and a draft age-box score metric ranking).

FWIW I really like this : )


Happy your interested!

There are collinearity issues with BPM. In my datababse, BPM is highly correlated with TS (r=0.617), EFG (r=0.545), and to a lesser extent TOV (r=-0.402), but I don't use any of those stats in the model. It's worth noting that BPM is more strongly correlated to scouting rank than any statistic I use in the model - despite some of those stats being directly used in BPM. The correlation between scouting rank and BPM is -0.277, while STL% has the next strongest correlation with BPM at 0.233.

That being said, if, for example, you include both BPM and AST% in your model, in a way, you are including AST% twice. It's why some variables (3PT%, especially from what I found) paradoxically end up with negative coefficients. It's not that having a higher 3PT% is bad - it's that BPM is already capturing 3PT%, and that the negative coefficient spit out by the model is the model's way of saying that BPM values 3PT% too high. As a tangent, the only coefficient in my final model whose sign is opposite what you'd expect is dunk rate (negative coefficient). Still, high dunk rate tend to have better model predictions because I also include frequency of shots at the rim as a variable and high dunk guys tend to be high rim rate guys. I suspect the model is trying to fade the garbage men with hyper-efficient shooting percentages who get a large percentage of their offense through assisted dunks.

I'm not sure how big a problem collinearity actually is. Collinearity increases the uncertainty of the estimated coefficients - it does not inherently bias the results one way or another. The estimated coefficients for BPM (and the other variables) appear to be somewhat stable across different training sets, so I don't think collinearity is wreaking havoc on the model. At the end of the day, short of doing something like PCA, there's going to be some collinearity between the inputs. I did test models that did not use BPM, but they had significantly worse out-of-sample prediction error which is why I ended up including it in the model.

Interesting to read! FWIW I did a first watch on Keaton Wagler and a rewatch of Ejiofor because of this!
Image

JazzMatt13 wrote:just because I think aliens probably have to do with JFK, doesn't mean my theory that Jazz will never get Wiggins, isn't true.

JColl
greenOakX
Freshman
Posts: 62
And1: 56
Joined: Jan 14, 2025
 

Re: 2026 NCAA Stats + Scouting Model 

Post#7 » by greenOakX » Mon Jan 26, 2026 4:54 am

Yeah, the model really loves Zuby Ejiofor. As for Keaton Wagler, you can expect him to be a lot higher when I update this model. My guess is he'll be top 10.

Return to Mock Drafts