[/quote]OhayoKD wrote:Looking at these results...Moonbeam wrote:OhayoKD wrote:I’m happy to take any feedback you may have or ideas for modifications I haven’t considered.
would it be greedy of me to ask for a graph charting magic, bird, hakeem, and jordan specifically?
No problem! Happy to add more graphs if you (or anyone else) would like.
Possible takeaways(not sure how much I should weigh this, but it seems like a promising approach)
-> Magic potentially the true "impact king"
-> MJ's era-relative impact peak might actually be during the 2nd-three-peat(expansion goes brr)
-> Delta between 80's Hakeem and "peak" Hakeem overplayed?
I imagine there is still some box-bias in these results but I'm guessing it's made up for by more stable adjustments...
Moonbeam wrote:lessthanjake wrote:Interesting stuff, and I think is very similar to Thinking Basketball’s WOWYR.
It feels to me like there’s really just not a lot of data for a lot of this though, particularly in these past eras where players missed very few games. For instance, Doc mentioned being surprised about Karl Malone being below Charles Barkley, so I’ll use Karl Malone as an example. Karl Malone is well below Barkley in the 1989-1993 timeframe. But what is the data for Karl Malone’s 1989-1993 timeline based on? Well, here’s the list of number of missed games by players who played over 18 minutes a game in a season for the Jazz in that time period (in years they actually averaged 18+ minutes per game):
Karl Malone: 3
John Stockton: 4
Mark Eaton: 3
Thurl Bailey: 0
Darrell Griffith: 0
Bob Hansen: 37
Blue Edwards: 22
Jeff Malone: 17
Tyrone Corbin: 13
Mike Brown: 0
Jay Humphries: 4
David Benoit: 0
Larry Krystkowiak: 11
So, at least as I understand it (and I’ve admittedly not read through the actual paper, so sorry if I’m misinterpreting anything!), the model is basically trying to figure out Karl Malone’s impact essentially based on regressing what occurred in those missed games. There’s some missed games there, but nothing particularly substantial for anyone and the most substantial number of missed games are from relatively minor players. I don’t really see how a WOWY-based model can in any way accurately assess Karl Malone’s impact without almost any missed games from Malone, Stockton, or Eaton, and with several other relevant players having 0 missed games at all. It seems like the results would inevitably just be based on statistical noise centered largely around what randomly happened to occur when pretty inconsequential players like Bob Hansen and Blue Edwards were out.
Another example of this is the 1989-1993 timeframe for Jordan. He does fairly well in this timeframe, but what is the data based on? Here’s the total missed games of people on the Bulls who played 18 MPG in a given season in that timeframe:
Michael Jordan: 7
Scottie Pippen: 10
Horace Grant: 14
BJ Armstrong: 0
Bill Cartwright: 55
Scott Williams: 11
John Paxson: 7
Stacey King: 0
Craig Hodges: 33
Sam Vincent: 12
Brad Sellers: 2
Dave Corzine: 1
There’s basically virtually zero missed-game data there, except for what happened in a bunch of missed games from Bill Cartwright and Craig Hodges. Players like that don’t *really* affect games that much, but when they make up a huge portion of the teams’ missed games, what randomly happens to occur in missed games by players like that can really skew a model like this. For instance, we see above that Craig Hodges missed 33 games in years he played 18+ MPG. This was all in the 1989 season. And, based on the charts provided, we actually see Jordan’s rating in this measure tank from the 1984-1988 time period to the 1985-1989 time period and he didn’t get super high until 1989 was out of the time period, so it seems reasonably obvious that something happened in 1989 that tanked his rating. The only person that missed a lot of games that season was Craig Hodges. The Bulls happened to go 32-17 in the games Craig Hodges played and 15-18 in the games Hodges Missed (and I’m sure the difference in average margin of victory is pretty significant too). So my guess is that the model thinks Craig Hodges was really impactful (and his missed games make up a significant portion of the entire set of missed games that’s being regressed), so what happened in those games has a significant impact on Jordan’s perceived impact in time periods that contain that year (and note that Pippen dropped that same year too—though a bit less, probably because he missed several games that Hodges missed too).
Thanks for the comment and getting into the detail. You're right in that the utility of these metrics are limited due to the nature of the data we have available --- I can't and wouldn't shy away from that. I wouldn't feel like I was being responsible if I just pushed out these numbers without some important caveats like that as well as others in the document I shared. I do think there's some value in this, though.
Speaking of your specific examples, there is a bit more to it than that. These models are still making use of data with all of the players healthy to form a sort of baseline, so it's not like the "With" data doesn't matter --- it still does. Moreover, when players leave a team, they would be considered "missing" for those seasons. In your Bulls example, for instance, Craig Hodges would be listed as missing for the entirety of the 1992-93 season with respect to being Jordan's teammate. Stacey King would be considered missing for all but the 1989-90 season due to the minutes threshold. Sam Vincent would be considered missing for all but the 1988-89 season. And so on and so forth. This extra missingness allows for transactions between seasons to inform the estimates a bit more than merely looking at missed games. I'll note that for these players, their time away from Chicago would inform their baseline impact. Sam Vincent's impact with Orlando for the 1989-90, 1990-91, and 1991-92 seasons will inform his baseline impact and therefore inform his contribution to the 1988-89 Bulls' scoring margins.
Hopefully this helps clear it up a bit.
I'm happy for you or anyone else to ask more questions or offer more critiques. They may help me improve these metrics going forward!
Correct me if I'm wrong, but aren't you also using the internal box-scaling of teammates to "stabalize" the samples similar to the way a LEBRON or EPM or RPM would?
Honestly, all considered I wouldn't be shocked if this graded out as "industry standard" for pre-data ball RAPM approximation. Someone should definitely share this with Ben.[/quote]
Nope, this is completely devoid of any individual box score statistics. I understand why some hybrid metrics have been put together, and these results could possibly be extended in a similar way, but I thought it would be interesting to look at “pure” WOWY data here. Yes, we’ll get the odd Ed Nealys as I mentioned in the document, but I think those occurrences are interesting in and of themselves, and can possibly be explained in the context of how these models are built. Others, like Paul Pressey, may very well emerge as bonafide impact powerhouses that would be otherwise ignored (or have their signals diluted) by combining with box score stuff. Ultimately, I think it’s important to have box score data and impact data separately to inform our evaluations. I’m not opposed to hybrid measures at all, but I think they should only be developed as a bridge between the two, and communicated as such.
I’m not familiar with all of the details of the different implementation of RAPM and other metrics, as much of that detail is kept under the hood from what I’ve seen, but I’ve felt pretty strongly here about making the code and the detail as transparent as possible. I’ve felt somewhat frustrated at times at a lack of transparency of other methods. Taking RAPM as an example, I think it is usually estimated with Bayesian methods, which require what are known as prior distributions to produce their estimates, but those details are usually kept under wraps from what I’ve seen. Maybe I just haven’t looked in the right places for those details. The thorough breakdown by Squared2020 of RAPM was such a breath of fresh air for that reason.