25-Year RAPM (RS and Playoffs)

eminence · Post #21 » by **eminence** » Fri Jun 23, 2023 1:43 pm

OhayoKD wrote:-> For RAPM, A penalty is applied to "outliers" so that they "converge towards zero". In other words, the gaps between players here are suppressed, and not actually indicative of what they would be in the real-world. "Closeness" can only really be gauged here in a relative sense(ex: gap between #1 and #2 vs #2 and #3), an extrapolation like "player a is worth 15% more than player b" doesn't really work.

An adjacent note on this - the 'penalty' is tied to sample size, so if level of play is maintained (not easy) it's 'easier' for a large sample player to pull their result higher.

Eg - If Player A is truly +6 for 200k possessions, they may get a +5 grade, while if Player B is truly +6 for only 100k possessions they may get a +4 grade.

*numbers above made up to show directionality, not magnitude of effect

AEnigma · Post #22 » by **AEnigma** » Fri Jun 23, 2023 1:49 pm

70sFan wrote:
OhayoKD wrote:
70sFan wrote:
Playoffs matter.

Volume also matters but it may be unfair of me to say "clear cut". KG's prime-years have small sample size and when we had an extended run i dont know 2004 was clearly worse than 2003

Do you think there is a way to estimate Shaq's total value with the available data? Would he be close to the top?

Not RAPM, but I am confident saying Shaq would be third in career accumulated plus/minus going back thirty years. And while it would be too much work for me to be interested in confirming, I suspect he would be no worse than… eighth… in accumulated point differential (basically a form of accumulated on/off), and the seven who could be realistically ahead of him on that measure happen to be the top seven listed for “overall” weighted RAPM (I doubt Manu and Draymond have the requisite minutes to keep pace, but without doing the work I will leave the possibility open).

I also feel reasonably confident that a playoff sample including 1994-96 would at least move him above Harden on that list.

DatAsh · Post #23 » by **DatAsh** » Fri Jun 23, 2023 2:22 pm

So looking at this data,

In the regular season, it's:

1. Lebron
2. Garnett
... gap
(everyone else)

In the post season, it's:

1. Lebron
2. Draymond
3. Manu
4. Garnett
...

I think it's important to realize that the postseason data is less than useless. There simply aren't enough available possessions to form a good conclusion. I say it's "less than useless" because people are inevitably going to try to ascribe "some" value to it, when really you'll get a more accurate picture of actual player goodness by ascribing 0 value to it. Really, it would be best to just lump those possessions into the regular season possessions.

Interesting data nonetheless. This is definitely more data pointing to the fact that Garnett was secretly better than Duncan, especially when you look at how good the Spurs were with Duncan off but Manu still on(which is why it's rating Manu so highly). That said, I still have Duncan ahead of KG, (#4 and #6 or #7), respectively. It's just so hard for me to not highly value those 5 championships, even if it's somewhat illogical in a team sport. I have to ascribe some value to it.

No surprise Lebron is at the top, though.

WestGOAT · Post #24 » by **WestGOAT** » Fri Jun 23, 2023 2:50 pm

eminence wrote:
WestGOAT wrote:Thanks for sharing, it's just a pity that most of the time there are no confidence intervals available for these numbers. I know they span an entire career, but still would be valuable to know how precise these numbers are. If I had the time for this, I'd love to give it a go myself.

Depending on career length I'd expect somewhere from +/-1 to +/-2 for the majority of guys. JE's 26 year RAPM as a comparison point: https://docs.google.com/spreadsheets/d/1OzfLtHanVmSCPy8Y3cvCj5uFG9k7cPbDO9sQq9JgbuU/edit#gid=0

Very nice! Where'd you get this from, where was this published originally?

eminence · Post #25 » by **eminence** » Fri Jun 23, 2023 2:59 pm

70sFan wrote:
OhayoKD wrote:
70sFan wrote:
Playoffs matter.

Volume also matters but it may be unfair of me to say "clear cut". KG's prime-years have small sample size and when we had an extended run i dont know 2004 was clearly worse than 2003

Do you think there is a way to estimate Shaq's total value with the available data? Would he be close to the top?

In terms of total value, we're missing about 1/4 of Shaqs career minutes from this dataset. I think one could reasonably estimate that as 1/4 of his career possessions and then estimate him around 210k career possessions (same range as KG). What it would do to his impact estimate - a lot more nebulous, though my personal guess would be that it'd stay pretty similar.

Assuming ~210k possessions and similar career impact estimate, he'd certaintly trail KG/Duncan/LeBron in an estimated career impact metric and likely Dirk/CP3 (so 6th) - based on this RS+PO sample (I assume they're weighted equally, but am not certain on that). Curry may also be in that range with the two added seasons since then.

eminence · Post #26 » by **eminence** » Fri Jun 23, 2023 2:59 pm

WestGOAT wrote:
eminence wrote:
WestGOAT wrote:Thanks for sharing, it's just a pity that most of the time there are no confidence intervals available for these numbers. I know they span an entire career, but still would be valuable to know how precise these numbers are. If I had the time for this, I'd love to give it a go myself.

Depending on career length I'd expect somewhere from +/-1 to +/-2 for the majority of guys. JE's 26 year RAPM as a comparison point: https://docs.google.com/spreadsheets/d/1OzfLtHanVmSCPy8Y3cvCj5uFG9k7cPbDO9sQq9JgbuU/edit#gid=0

Very nice! Where'd you get this from, where was this published originally?

JE was doing some aging curve work it seems.

https://www.apbr.org/metrics/viewtopic.php?p=38805

AdagioPace · Post #27 » by **AdagioPace** » Fri Jun 23, 2023 3:00 pm

DatAsh wrote:So looking at this data,

In the regular season, it's:

1. Lebron
2. Garnett
... gap
(everyone else)

In the post season, it's:

1. Lebron
2. Draymond
3. Manu
4. Garnett
...

I think it's important to realize that the postseason data is less than useless. There simply aren't enough available possessions to form a good conclusion. I say it's "less than useless" because people are inevitably going to try to ascribe "some" value to it, when really you'll get a more accurate picture of actual player goodness by ascribing 0 value to it. Really, it would be best to just lump those possessions into the regular season possessions.

Interesting data nonetheless. This is definitely more data pointing to the fact that Garnett was secretly better than Duncan, especially when you look at how good the Spurs were with Duncan off but Manu still on(which is why it's rating Manu so highly). That said, I still have Duncan ahead of KG, (#4 and #6 or #7), respectively. It's just so hard for me to not highly value those 5 championships, even if it's somewhat illogical in a team sport. I have to ascribe some value to it.

No surprise Lebron is at the top, though.

it's not only about championships/trophies here though (a team achievement). Timmeh simply has a different level of PS presence, production, possessions,....whatever you want to call them. RS-wise even D-Rob and Cp3 are not less valuable than TD.
The point is: you can't compare something massive to something not big enough (considering a minor x poss. difference) . yeah maybe KG was a bit unlucky but that's life.

Shaq without his PS would be a totally different discussion as evidenced from this thread. PS keeps him afloat among the greats.

Bad Gatorade · Post #28 » by **Bad Gatorade** » Fri Jun 23, 2023 3:10 pm

A few thoughts -

Has the data been adjusted for age at all? I know that Engelmann does so, and some of the "longer" term samples aren't as heavily impacted as a result (in fact, some players such as Stockton look amazing in his data sets because they follow a different ageing curve to the typical player).

I think that 5 year blocks are much better. Large samples can capture "too much" information - e.g. even in older samples that captured, say, 8 years of Durant's career, the vast improvement Durant made in his 3rd year in the league was captured by other players, and so he appeared highly underwhelming in RAPM for quite a while.

Even 5 year samples are imperfect - IIRC, Kyle Korver's 2012-2016 sample looked awesome, but it also looked very clearly better than 2011-2015 and 2013-2017 - even one year can influence things a lot. Honestly, we should be looking at as many non-baffling samples as possible.

I'm sceptical of postseason data in general being vastly different from regular season data, because we've already seen that single year splits (82 games) can yield many, many weird results... and 82 games is often over half of a playoff career. That's a huge chunk, not to mention limited lineup data on account of fewer total equations to work off (fewer players) plus shorter "off" stints and I think that there's a limit to how much we can ascertain without a prior. I think we can see this in the comparison between regular season vs playoffs vs combined data - some players (e.g. Harden) are closer to their playoff data in their "combined" splits, whereas players such as CP3 (with huge negative playoff splits relative to the regular season) don't seem to be heavily impacted when you combine the data.

The lack of a "complete" dataset also doesn't really allow us to troubleshoot results that make no sense - in a larger RAPM sample such as 1998-2019 playoff RAPM, to use CP3 as an example again, some of the players that have played alongside him (Bledsoe, Tucker, Jared Dudley) rank very highly even though they've got tiny samples. A holi stic sample, even one encompassing a large range, is still going to yield "suspicious" results. By the same token, one must consider when players in the playoffs played together in particular. Just... too many variables for me to take too seriously, IMO.

I'd probably be a tad hesitant to reward possessions in the playoffs on the virtue of circumstance - e.g. KG not having as many postseason possessions in his prime Minnesota years shouldn't go against him.

This isn't to say that postseason analysis should play zero part at all, but rather, I'd definitely hedge against a playoff only sample. I'm more than fine with, say, a combined sample that includes increased postseason weighting, or a postseason sample operating off a regular season prior. I've seen the latter on APBRmetrics and I agree quite strongly with the conclusions - you will get players like post-2013 LeBron, Draymond etc that lift their game in the playoffs, and this order of magnitude is larger than the decreases (which are occasionally notable, but generally nothing too crazy at all). I'm more inclined to believe that, as there are players not willing to show all of their tricks, increased effort etc that takes place in the playoffs.

Now, since Kobe has been a huge discussion point, I do think there's potential for his larger RAPM samples to be impacted by his early career results - IIRC, Kobe had a few seasons in the early 2000s missing games, with the Lakers barely missing a beat (almost shockingly so), and his early 2000s RAPM samples don't look as impressive as what we saw during his 2006-10 run. I don't think having Shaq + resilient team structure/GOAT level coaching should go against him as a player, although it goes both ways too with regard to "5 rangzzzz" or whatever justification we choose to use.

eminence · Post #29 » by **eminence** » Fri Jun 23, 2023 3:19 pm

Bad Gatorade wrote:Has the data been adjusted for age at all? I know that Engelmann does so, and some of the "longer" term samples aren't as heavily impacted as a result (in fact, some players such as Stockton look amazing in his data sets because they follow a different ageing curve to the typical player).

I believe it has a minutes played/team quality prior in place of an aging prior.

Bad Gatorade · Post #30 » by **Bad Gatorade** » Fri Jun 23, 2023 3:21 pm

eminence wrote:
Bad Gatorade wrote:Has the data been adjusted for age at all? I know that Engelmann does so, and some of the "longer" term samples aren't as heavily impacted as a result (in fact, some players such as Stockton look amazing in his data sets because they follow a different ageing curve to the typical player).

I believe it has a minutes played/team quality prior in place of an aging prior.

I think that's how Daniel Myers calculates his RAPM too - on the whole, a good prior.

eminence · Post #31 » by **eminence** » Fri Jun 23, 2023 3:25 pm

Bad Gatorade wrote:
eminence wrote:
Bad Gatorade wrote:Has the data been adjusted for age at all? I know that Engelmann does so, and some of the "longer" term samples aren't as heavily impacted as a result (in fact, some players such as Stockton look amazing in his data sets because they follow a different ageing curve to the typical player).

I believe it has a minutes played/team quality prior in place of an aging prior.

I think that's how Daniel Myers calculates his RAPM too - on the whole, a good prior.

I'm not as sold on it, as the prior does seem very similar to the data being fitted with it, the PM bit without the adjustment.

MyUniBroDavis · Post #32 » by **MyUniBroDavis** » Fri Jun 23, 2023 6:49 pm

70sFan wrote:Thank you for the data and the effort put in the visualization.

I think this confirms that we have two clear top 2 players of the last 25 years:

1 LeBron James (gap)
2. Tim Duncan (gap)

The rest is more interesting though, it would be interesting to take a deeper dive into best 5 years stretches or top 5 seasons combined.

This isn’t her tableu lol

OhayoKD · Post #33 » by **OhayoKD** » Fri Jun 23, 2023 6:51 pm

MyUniBroDavis wrote:
70sFan wrote:Thank you for the data and the effort put in the visualization.

I think this confirms that we have two clear top 2 players of the last 25 years:

1 LeBron James (gap)
2. Tim Duncan (gap)

The rest is more interesting though, it would be interesting to take a deeper dive into best 5 years stretches or top 5 seasons combined.

This isn’t her tableu lol

shut up timo

DQuinn1575 · Post #34 » by **DQuinn1575** » Fri Jun 23, 2023 8:21 pm

70sFan wrote:Thank you for the data and the effort put in the visualization.

I think this confirms that we have two clear top 2 players of the last 25 years:

1 LeBron James (gap)
2. Tim Duncan (gap)

The rest is more interesting though, it would be interesting to take a deeper dive into best 5 years stretches or top 5 seasons combined.

I don't think leading any of these measures confirm anything.
The google sheet showing career shows
1. LeBron
2. KG
3. CP3
4. Tatum
5. Stockton
6. Jokic
7. MJ
8. Duncan

If we confirm this means LeBron is the best player of the last 25, does it also confirm Tatum and Stockton are 4 and 5? And this is Stockton AFTER his probable prime?

https://docs.google.com/spreadsheets/d/1OzfLtHanVmSCPy8Y3cvCj5uFG9k7cPbDO9sQq9JgbuU/edit#gid=0

ShaqAttac · Post #35 » by **ShaqAttac** » Fri Jun 23, 2023 8:24 pm

so many pings

OhayoKD · Post #36 » by **OhayoKD** » Fri Jun 23, 2023 8:32 pm

70sFan wrote:
Jaivl wrote:
70sFan wrote:
Playoffs matter.

And KG is still ahead there -- Duncan doesn't get to take Manu's performance as his!

As OhayoKD said, the volume gap is massive. I don't really have low playoff sample against Garnett, but Duncan accumulated significantly more playoff value than him, it matters.

The other thing is that it's harder to maintain averages over longer runs and/or larger samples. That's why everyone(besides Lebron) became "playoff fallers" when they hit a certain threshold. It's not just the accumulation of value, it's also speaks to resiliency and replicability.

70sFan wrote:
OhayoKD wrote:
70sFan wrote:
Playoffs matter.

Volume also matters but it may be unfair of me to say "clear cut". KG's prime-years have small sample size and when we had an extended run i dont know 2004 was clearly worse than 2003

Do you think there is a way to estimate Shaq's total value with the available data? Would he be close to the top?

Hmm. Might want to check with Jaivl/Eminence if this is above board but maybe we can establish a fairly generous upper bound by taking his stuff from his best 5-year sample(also with Ahmed) and then multiplying that by his possessions played in the seasons that are missing?

Lower bound would just be using what's there. Regardless, KG and Duncan, at least with this dude's data, is probably out of reach.

Bad Gatorade wrote:I think that 5 year blocks are much better. Large samples can capture "too much" information - e.g. even in older samples that captured, say, 8 years of Durant's career, the vast improvement Durant made in his 3rd year in the league was captured by other players, and so he appeared highly underwhelming in RAPM for quite a while.

Even 5 year samples are imperfect - IIRC, Kyle Korver's 2012-2016 sample looked awesome, but it also looked very clearly better than 2011-2015 and 2013-2017 - even one year can influence things a lot. Honestly, we should be looking at as many non-baffling samples as possible.

I and eminence have posted this before, but Ahmed also has 5-year data here:

https://public.tableau.com/app/profile/ahmed.cheema8618/viz/FiveYearRAPMPeaks1997-2021/FiveYearRAPMPeaks1997-2021

Post #37 » by **Doctor MJ** » Fri Jun 23, 2023 8:51 pm

70sFan wrote:
Jaivl wrote:
70sFan wrote:
Playoffs matter.

And KG is still ahead there -- Duncan doesn't get to take Manu's performance as his!

As OhayoKD said, the volume gap is massive. I don't really have low playoff sample against Garnett, but Duncan accumulated significantly more playoff value than him, it matters.

As someone who has thought about achievement vs goodness, I think this is a watershed.

I'll say that the fact that Garnett has the edge in numbers in both RS & PS is an argument for him to be the better player, and I think it reasonable to have Garnett ahead of Duncan in a Top 100 list.

But as someone who is scaling back to focus more on achievement as the basketball world sees it, Duncan is coming out just ahead of Garnett too.

Post #38 » by **Doctor MJ** » Fri Jun 23, 2023 8:59 pm

AdagioPace wrote:Wow ultimate post. Thanks

I love how Draymond and Manu are close to each other, for the great enjoyment and sexual pleasure of PC board.

lol. Both are great, and Draymond has a minutes edge that's no small thing.

I do want to emphasize though that Draymond's edge comes from early in the playoffs:

If we make a list of top players by raw +/- in the playoffs we get:

Draymond Green +991
Manu Ginobili +955

April Only:

Draymond Green +554
Manu Ginobili +337

May Only:

Manu Ginobili +434
Draymond Green +300

June Only:

Manu Ginobili +184
Draymond Green +137

If someone wants to do a RAPM from only the 2nd round on I'd be interested in seeing it, but I'll just say I'm pretty confident that Ginobili would come out ahead of Green.

OhayoKD · Post #39 » by **OhayoKD** » Fri Jun 23, 2023 9:03 pm

Doctor MJ wrote:
70sFan wrote:
Jaivl wrote:And KG is still ahead there -- Duncan doesn't get to take Manu's performance as his!

As OhayoKD said, the volume gap is massive. I don't really have low playoff sample against Garnett, but Duncan accumulated significantly more playoff value than him, it matters.

As someone who has thought about achievement vs goodness, I think this is a watershed.

I'll say that the fact that Garnett has the edge in numbers in both RS & PS is an argument for him to be the better player, and I think it reasonable to have Garnett ahead of Duncan in a Top 100 list.

But as someone who is scaling back to focus more on achievement as the basketball world sees it, Duncan is coming out just ahead of Garnett too.

Ehh. Even just looking at the numbers I'd say Duncan looks better in the playoffs. Volume is a part of "better" not merely "achievement".

Also if we look at 5-year stuff(keeping in mind the gaps are more important than the raw values):

Duncan's gap for 99-03(look at the circles below the top circle) relative to the field still trails what we see from lebron, but it is bigger than what we see with KG. In fact, the gap there loooks bigger than what I'd expect for any non-bron stretch other than kareem inthe 70's and russell in the 60's

Post #40 » by **Doctor MJ** » Fri Jun 23, 2023 9:12 pm

ftr, I'm not super-in-love with ultra-long RAPM samples.

I think working in 3 or 5 year chunks, and in general putting more focus on the ceiling we see from a guy tells us how good he was, though of course that doesn't factor in all context either.

One thing I think is pretty funny is the idea that Nash disappointed +/- in the playoffs. I'm having trouble finding my initial study on this, but when I looked at a bunch of stars and compared their playoff Team Wins to their playoff OnWins (positive +/- in the game), Nash stood out above anyone else I saw.

A quick bkref query shows Nash:

69 OnWins
57 team wins
That's a +12 edge.

While I understand this is a very coarse metric and that someone's impact can still disappoint relative to the regular season even while coming off well with the metric, just for perspective here, if I do the same thing in the regular season:

748 OnWins
764 team wins
Down 16 instead of up 12.

If that seems bizarre to you given that Nash is known for his regular season impact, just know that this is the norm. Stars on great teams often have teams capable of outscoring the opponents on the bench. So this data, while unexpected to most, is actually pretty normal from my analysis.

The spike in the playoffs though, while not utterly unique, is a definite stand out.

What does it say about a guy when is RAPM says he disappoints some in the playoffs, but a study like this shows him as a positive outlier? At the very least I'd say, it means we shouldn't run too far too fast with the data.