What's the strongest data-driven argument for Michael Jordan as GOAT?

Moderators: KingDavid, cupcakesnake, Domejandro, ken6199, infinite11285, Clav, Dirk, bwgood77, bisme37, zimpy27

KrAzY3
Starter
Posts: 2,104
And1: 2,617
Joined: Jun 26, 2016
Contact:
     

Re: What's the strongest data-driven argument for Michael Jordan as GOAT? 

Post#701 » by KrAzY3 » Thu Jul 13, 2023 3:21 pm

Once you really analyze things (yes even individual statistics), the argument for Lebron is basically longevity and the totals he accumulated due to a longer career.

If you care that LeBron is still playing now, and if you care that Jordan retired prematurely, then you might choose LeBron. If you care about championships, or darn near anything else you are going to go with Jordan. People who don't understand stats claim stats favor LeBron, but you put me in front of a basketball goal long enough and I can score more points than Jordan to. That doesn't make me better.
Yank3525
Starter
Posts: 2,396
And1: 2,777
Joined: Jan 28, 2013
     

Re: What's the strongest data-driven argument for Michael Jordan as GOAT? 

Post#702 » by Yank3525 » Thu Jul 13, 2023 3:30 pm

Chanel Bomber wrote:
NbaAllDay wrote:A lot of it will always just come back to how people weigh criteria.

If we give MJ the edge on impact data over th years he played, how many years of impact does Lebron end to match it? If Lebron is 80% as impactful for 50% longer, how does this stack up?

For a lot of people who give Lebron the nod, the long term impact outweighs the peak/Prime impact gap that MJ supposedly has.

There is also real legitimacy to context of teams they played with and teams they played against because as much as some like to believe they gauge players in a vacuum, that vacuum is different I'd that player plays under different circumstances, even if you believe that isn't the case.

It's something I've been thinking about as well.

We can reach the most well-informed level of debate, but the way we weigh those different criteria will always be somewhat arbitrary and tough to rationally arbitrate.

I would just say that - assuming a level playing field in terms of external circumstances - in the peak vs longevity argument (simplistically framing the 80% vs 50% example that you brought up), the value of longevity would be to enable the player to achieve as much if not more, but definitely not less.

And this is where people lose me with the longevity argument for LeBron. Because it's not met with the outcome that would be reasonably expected from it. It rings hollow.


Exactly. The longevity argument would hold more weight to me if LeBron was still cranking out years like 2019-20 instead of what he has done the last couple of years. He has basically become a part time player who has accumulated nice counting stats but doesn’t have the impact on winning that he once did.
70sFan
RealGM
Posts: 30,231
And1: 25,504
Joined: Aug 11, 2015
 

Re: What's the strongest data-driven argument for Michael Jordan as GOAT? 

Post#703 » by 70sFan » Thu Jul 13, 2023 6:39 pm

iggymcfrack wrote: I’m starting to wonder if maybe Stockton > Wilt is actually a reasonable take.

I hope you will wonder long enough to conclude that it's not the best path to follow...
OhayoKD
Head Coach
Posts: 6,042
And1: 3,935
Joined: Jun 22, 2022

Re: What's the strongest data-driven argument for Michael Jordan as GOAT? 

Post#704 » by OhayoKD » Thu Jul 13, 2023 9:35 pm

Will keep all this in GB as to not derail the project but uh...

You know what, let's start with some housekeeping
iggymcfrack wrote:
TheGOATRises007 wrote:
Chanel Bomber wrote:It would be interesting to see how RAPTOR and RAPM measure peak MJ’s on-court impact but I don’t think those metrics go that far back (on the phone now).

All those other isolated stats fail to capture the +/- (let alone adjusted +/-) data that in my opinion is a necessary ingredient to having a fully informed discussion.

Also the conversation on TS% needs to use league-adjusted TS as league average in efficiency rose significantly since the 1990s. Using raw TS% as a point of comparison is misguided.


https://fivethirtyeight.com/features/lebron-or-mj-raptor-picks-the-best-nba-players-of-the-past-40-years/

Raptor actually grades Jordan over LeBron with a sizable gap for a 7 year stretch.


I need to bookmark this for when it’s time to argue for CP3 and Stockton in the PC project. It’s amazing how well Stockton fares in the little bit of advanced analytics we have for the past. Everything I see about him makes me like him more and more. I’m starting to wonder if maybe Stockton > Wilt is actually a reasonable take.

RAPTOR does not have on/off pre-97(and even then it's heavily informed by the prior years). IOW is functionally a box-metric for the Jordan and Stockton and as has been noted during the project repeatedly...
Bill Russell — the only defender definitively better than Hakeem — finished higher than ninth in PER once. His average WS/48 placement was 8th (much smaller league). If we want to use someone more directly comparable in style, 1998-2001 Duncan averaged 8th in WS/48, 6th in PER, and 12th in BPM. Insightful stuff.

Just because you can see a few one-number metrics easily accessible on basketball-reference does not mean they have any real merit in these comparisons. Again, toy with formulas enough and you can get all sorts of results. We just discussed that with Engelmann’s “xRAPM”. IBM set up a formula that awarded the crown to Barkley three years running from 1986-88 (just outstanding Barkley years huh) and then to Dennis Rodman in 1992. Or we could use PIPM, which is imo the best pure box metric because it does a better job of weighing the defensive aspects of box inputs. There we see Hakeem start out his career 8th - 2nd - 3rd - 3rd - 2nd - 3rd - 5th - 6th. That reads a lot more honestly to me, but ultimately we just get back to these all being formulas toying with a few inputs not actually substituting for real impact.

If you are not confident enough to assess a player outside of what you can read off a stat-sheet, you can say so without trying to poison the well against those who actually took the time to develop their assessments.

....the box-score is a fickle mistress.

I will get to the non-box(aka impact) data present for Jordan and Lebron below (spoiler: it mostly favors Lebron), but let's start with the lede:
TheGOATRises007 wrote:
Bergmaniac wrote:As someone who is neutral in the whole GOAT debate (and mostly bored by it) the Jordan fans has always been way worse than the LeBron fans on this board when it comes to being rude and dismissive of the opinions they disagree with. So many of them act like they are personally offended when someone dares to claim someone else is their GOAT and go on and on how it's impossible for anyone reasonable to hold such an opinion.


On the GB board, I agree with you. I disagree regarding the PC board.

Gregoire wrote: There are definitely a group of LeBron superfans that are toxically aggressive, and it gets tiring.

VanWest82 wrote:Over there it's less than 10% with those making pro Jordan arguments getting shouted down and enduring personal attacks

So is said. Yet, the first "personal" attack(not to be confused with your arguments being criticized/challenged) I can find comes from...a Jordan voter:
Also why even bother to quote me when you know I don't take you seriously as a poster anyway? Seems like you're just looking to muck up the project by starting meaningless arguments.

(note that when dquinn similarly challenged the criteria for voting someone for lebron on page 1, they were not personally attacked?)

It was then, another Jordan voter who said it would be a "bad look" if MJ was voted below Russell:
i kinda think it would be a bad look for the project if he drops to 4, making anyone from the outside looking in want to dismiss the whole thing. i suppose we can say we don't care about such things, but i'm not sure it furthers discussion if that happens. but we shall see i guess.


And it was someone who prefers Jordan who decided to try and derail a thread by bringing up personal beef(after a rather embarrasing apology demand we'll get to later):
I'm not sure you're going to get it with this poster. I post this not to bring back old baggage, but to offer a bit of support -- You're not the only one who's struggled with this poster. Four posters including me left the latest Peaks project after verbal abuse and bad-faith arguments from this guy, and I would hate for people to start leaving this new project for the same reason.

Notably the poster above, has outright lied about what the poster they "struggled" with claimed in order to potray them as dishonest:
https://forums.realgm.com/boards/viewtopic.php?p=104349742#p104349742
Why?

Possibly to deflect from their calculations of playoff on/off swinging towards Lebron with a more accurate process and several posters realizing they had been "cooking"(chucking out inconvenient years, inconsistent process, ect)numbers to make Jordan look as good:
https://forums.realgm.com/boards/viewtopic.php?p=104353387#p104353387
While various posters have complained I(and the "mob") were being unfair, none have bothered trying to challenge this account of events despite me offering them multiple opportunities to do so

That lie above seems to have spread with gregoire citing it as he has yet another meltdown spurned by Jordan being evaluated differently:
Gregoire wrote:try to falsificate plus-minus metrics (some of them claimed that manually counted 500 MJ games only to prove that Lebron is better lol

It was of course angry jordan fans who flooded the thread I'm referencing largely contributing to it getting locked, and it's jordan fans again who've jumped into the ongoing project complaining about anti-mj bias.

And then, last but not least...
lessthanjake wrote:Lol, just read through the last few pages of this thread, and was surprised to find that my own posts in the PC board and the responses to them were discussed a bunch in this thread. My two cents is somewhere in the middle to be honest: I do actually think that there’s a lot of hostility on the PC board to anything that is perceived as pro-Jordan or anti-LeBron. I think I can reasonably say that I myself experienced quite a bit of hostility—including a boatload of sarcastic and mocking remarks—while arguing in favor of Jordan (though I won’t claim to be a delicate flower that doesn’t punch back).

Hmmm.


Well you certainly throw punches:
Spoiler:
LessthanJake wrote:It might behoove you to step back for a moment and question whether there might actually be something to learn here from someone with more knowledge than you.

Among other things, I also come at this discussion as an independently knowledgeable person about Jordan’s career and his teams, since I watched virtually all their games, and I’ve tried to impart some of that knowledge on you, which you obviously do not independently have.

It’s all just completely motivated-reasoning-based data manipulation, and, again, you surely realize that.

I don’t really feel the need to go through all this, because there’s a lot of obviously motivated reasoning going on here that I don’t think many people would find persuasive on its face, but just a few minor things

I don’t really feel the need to go through all this, because there’s a lot of obviously motivated reasoning going on here that I don’t think many people would find persuasive on its face, but just a few minor things:

And even on the separate question about what is more impressive in some sense, the point being made was applying this logic even to Tim Duncan’s Spurs, who did not play in a less-expanded league (the opposite actually). The argument that Duncan’s Spurs were as dominant a team as Jordan’s Bulls despite having worse records and lesser SRS, because of some argument about standard deviations is pretty obviously just motivated reasoning.

To be clear(on that last quote)...

A. that was not what was argued
Spoiler:
Nor does this
lessthanjake wrote:
eminence wrote:Haven't followed closely enough to tell - but this variance discussion, seems to have been started off a claim that z-scores are more favorable to the top Russell Celtics squads and top Duncans Spurs squads relative to MJs best Bulls. Or am I off there somehow?


Yes, I think that’s right. Though what I was objecting to specifically is the idea that that method would better identify likelihood of winning a championship than simply looking at SRS (as opposed to standard deviations above the mean in SRS) would. I assumed that the baseline factual claim—i.e. that Duncan’s Spurs were as many or more standard deviations above the mean as Jordan’s Bulls—was correct, without checking if that was actually true, so the discussion was not about the validity of the underlying factual claim.

Yeah, why don't we check that:
The "chance" Duncan was given in season 2 was probably smaller than what Jordan was given for most or all of his championships. And with that "chance", the Spurs went 16-4 and obliterated the competition(by san's std they were actually a bigger outlier than all but 2 Chicago sides).

'
All in all, I think then,it can be reasonably argued prime Duncan
...
-> Led two dominant teams(statistically better than most of the Bulls if you go by standard deviation(more relevant to winning championships than "srs")), one was probably with less
-> Led a third team not too far behind in 2005(not sure what "help" is there but there's still no Pippen equivalent) with less

Yeah, I don't know about you, but that does not read to me like a blanket comparison between Jordan's Bulls and Duncan's Spurs. Rather one between specific teams from that run.

Probably could have worded that less definitively, (San's playoff-heavy method is hardly the only approach) but I do think these claims count as "reasonably arguable"...


B. turns out standard deviations do correlate more with championships than srs
Spoiler:
[spoiler]
Let's look it it the other way, using data from 1984 onward. Here are the mean/median Z scores for champs and non-champs in different SRS bands.

SRS between 2 and 4:

Non-champs: Median 0.64, Mean 0.65 (163 teams)
Champs: Median 0.85, Mean 0.76 (3 teams)

SRS between 4 and 6:

Non-champs: Median 1.06, Mean 1.07 (102 teams)
Champs: Median 1.16, Mean 1.14 (13 Teams)

SRS between 6 and 8:

Non-champs: Median 1.44, Mean 1.48 (69 teams)
Champs: Median 1.60, Mean 1.58 (12 Teams)

SRS above 8:

Non-champs: Median 1.91, Mean 1.92 (11 teams)
Champs: Median 2.09, Mean 2.08 (11 teams)

Within each of these bands, the Z scores are higher for the champs than the non-champs.

What's more, building a model (logistic regression) for a title using either SRS or the Z score, the model with the Z score is more predictive (AIC of 120.07 vs 109.7 using SRS vs Z score for all league history, and AIC of 11.25 vs 8.09 using SRS vs Z score for teams since 1984).

Gap increases using pre-84


I bring this up because in the very next thread...
Spoiler:
lessthanjake wrote: The idea that you’d accuse me of having “sample size only matter[] when you want it to matter” is actually completely insane and wildly dishonest. You need to apologize for this promptly, otherwise it is clear you are discussing in bad faith.

Unbelievable. The fact that you can’t bring yourself to apologize for making such a blatantly dishonest attack is breathtaking. You don’t even address it, but instead go on some free-wheeling subject-changing maneuver to justify your stated refusal to apologize for something that was plainly unacceptable behavior. You were straightforwardly dishonest. Just apologize. It shouldn’t be difficult to do so if you’re a remotely mature human being. And it’s actually substantively important to discussion like this project that you apologize for behavior like this, because it will genuinely discourage people from honestly identifying appropriate caveats to the points they make and the evidence they provide if they know that people will just dishonestly attack them as if they did not do so. And that will ultimately lower the standard of discussion.

And you’re right—there was one particular part of your post I didn’t respond to. And that’s because all you did was take out half the data that was most inconvenient to you and then still only got to a conclusion that you yourself admitted wasn’t even particularly flattering for Hakeem. It was not something I felt the need to respond to, because I didn’t feel it made even a facially persuasive point.

Yeah... I think "delicate" might be an understatement. Also, despite Jake's demands for an apology, I think Anenigma is on the money:
anenigma wrote:Cool, let me know when you feel like apologising for actually doing everything that you just projected onto me. No interest in staying on subject, showing all data rather than manipulated snippets, properly responding to points made, maintaining honest portrayals, advancing sincere engagement, responding with any real maturity… So much for high standards of discussion. :-?

Let's start with the bold bit as, per usual, there are picks and cherries:
Jordan is a lot lower in raw WOWY (32nd in prime WOWY), so that’s one data point that doesn’t support his case—basically the only one.

Yeah, no.

Summary, extended samples, aka, "metrics"
Spoiler:
There is very little of what "you" consider impact data but of course you are rather inconsistent with what is "good-faith" running with WOWYR while complaining about much more straightforward "adjustments". As it so happens, "WOWYR" is the one and only type of metric Jordan actually looks "good" in relative to Lebron. Using your standard of what counts as an impact metric we have:

-> Playoff on/off
(Lebron looks better)
-> On/off
(Lebron looks better, 97/98 rank below 17 and 18 lebron years respectively)
-> On+ON/off
(Lebron looks better, 2nd and 5th best regular season teams rank 8th and 11th respectively)
-> WOWY
(Lebron looks better, and Jordan ranks 4th amongst his contemporaries, literally does not matter what you use)
-> Indirect samples(what eminece outlined in the #2), Lebron looks much better
-> AUPM
(Lebron looks better with the exception of 3-year consecutive where MJ is a bit behind Duncan)
-> Squared RAPM
(Lebron looks better in the same set)
-> Full RAPM
(Lebron looks signficantly better with the potential for Jordan to close in if his early years score better(there is reason to think they might not))

Larger Samples, multi-year extraps(something WOWYR and its variant also use but in a much noisier manner with tiny samples of "off"), one-year indirect
Spoiler:
Why don't we start with the large-samples which are not "limited"? We have 82 games telling us the Bulls were a 28-win team in 1984. They were a 30-win team without Jordan in 1986. That team then got better with additions such as oakley. And then, pre-triangle(a scheme that turned the Bulls from a good offense and a below average defense into a historically great offense and a -3 defense over the course of a season), the Bulls peaked at +2 at a 52 or 53-win pace(ben takes the former, eballa extrapolates the latter based on a 28 game-sample after a trade). If we unreasonably give Jordan all the credit for that improvement, out 82-game off sample leaves Jordan at +8.

That is

-> lower lift on a weaker team than what Kareem manages on the 77 Lakers if we ignore they traded significant pieces for Kareem.
-> within range of several kareem marks including when he was a rookie
(note the gap actually becomes quite large when you consider that srs was suppressed post-merger and championship teams of that era ranged from +4 to +6)
-> Dramatically worse than what we see with the same starting rotation(without Lebron) from the 11 cavs who go from 61-wins to 18-wins over a 21-game sample(19-win for the season overall), and worse than what we see over substantial without samples in the second cleveland stint(the most favorable appraisal of the cast gets lebron taking a 30-win team to 60 in the regular season) and small ones for Miami(40 to 60)
-> on the level of what we see with 69 Bill Russell, a retiring player-coach who had just knocked out a gauntlet tougher than any Micheal faced on the Bulls(marginally better replacement fwiw)
-> On par with what we see with 04/05 Duncan through injuries over a substantial sample(he would win a championship in 03 with a weaker version of what he had in 05)
-> On par with multiple substantial samples for Hakeem
-> Worse than substantial samples for Magic


We can also use 94 and 95 where the Bulls(with us inflating 93 by using the bulls full-strength rs-rating) only drop by 5.3-points in the regular season and 2-points in the playoffs making for an overall drop of "5" for the season(this is similar to what we get for 08/09 kg).

We can also look to 86 and 95 and the result remains the same. Over the largest possible samples, Jordan does not look on par with the other major Goat-candidates(Lebron, Russell, Kareem). He also does not gain separation over players he is assumed to be better than including his own contemporaries(magic is consistently advantaged, and Hakeem is competitive in the rs to go along with nigh unrivalled team-level playoff elevation)

If you are looking for the largest sample-sizes, that is what we have. I can make the most favorable assumptions and Jordan still does not reach the same heights as players he is considered comparable with. That is the lede. If you are going to "summarize all the available impact data for Jordan", there's no reason to bury it.

MJ:
'84 to '85 Bulls (w/MJ): +11 wins
'93 (w/MJ) to '94 Bulls: +4 wins
'95 (pre/MJ) to '95 Bulls (w/MJ): +20
LeBron:
'03 to '04 Cavs (w/LeBron): +17 wins
'10 (w/LeBron) to '11 Cavs: +46 wins
'10 to '11 Heat (w/LeBron): +12 wins
'14 (w/LeBron) to '15 Heat: +18 wins
'14 to '15 Cavs (w/LeBron): +26 wins
'18 (w/LeBron) to '19 Cavs: +31 wins
'18 to '19 Lakers (w/LeBron): +7 wins

As has repeatedly established(and ignored by those with positions the evidence is inconvenient for), non-box data consistently favors Lebron. In fact, with larger and more inclusive samples, Lebron's best signals look even better while Jordan's look worse. Even with WOWYR(which excludes much of the data for Lebron listed above and compounds a sample-size issue by tossing around noisy adjustments across the duration of a player's prime(EX: Bulls without 89 Pippen factors largely into how we assess 91 Pippen)), "Jordan being ahead of every goat candidate" is misleading. His advantage over Kareem is questionable as corrected data puts Kareem's WOWY ahead(the base of WOWYR). His advantage over Russell is entirely a matter of framing(do you value championships or srs?) as WOWYR suggests that Russell's 11 championships came with average help.

There is no non-box stat which generally puts Jordan #1, even that where Jordan is competing against a narrowed field(almost no on/off pre-97 for any player besides Micheal). Those unsourced github-sets(something Lessthanjake has refused to acknowledge while calling whatever doesn't favor Jordan "flawed") only really advantages Jordan over the likes of Malone and of course full RAPM for 97/98 places him well below Lebron under the same creators:
Image
(Cheema, the spax)

(keep in mind, it is not clear that second-three-peat MJ's "empirical impact" is lower than first-three peat MJ with the snippets we have arguably favoring the latter years). Lebron also dominates a field of players who can scale favorably against Mike when we use data that is available for both(Duncan, KG, Steph, Shaq, ect):
James is, arguably, the king of overall plus-minus stats. 2018 is the 25th season of league-wide plus-minus data, which covers nearly 40 percent of the shot-clock era and touches 12 of the top-20 players on this list. None have achieved LeBron’s heights: He holds four of the top-five scaled APM seasons on record, and six of the top eight. Since 2007, 10 of his 11 years land in the 99th percentile.

(scaled-apm, JE)

Among contemporaries, prime Magic(mostly excluded in the data less-than-jake pulled up), is consistently advantaged. So is David-Robinson(pollock, wowy, wowyr, concentrated), with Hakeem mostly grading out as a regular-season peer(and then sees the most team-level elevation in the postseason of anyone from that time-period).


Lessthanjake, bastion of fairness and honesty, repeatedly seeing this sort of non-box data, found various excuses to disregard it(and then attack the credibility of those who provided it) on grounds that were better applied to a metric they preferred(WOWYR):
Spoiler:
It’s logical extrapolation using basically completely unrelated data points and trying to draw a meaningful conclusion from them. You might as well pick a completely random team that you happen to think is similarly or less talented than the Bulls without Jordan and look at their wins/SRS and say that the difference is the upper bound of Jordan’s impact.

They are different rosters in that the 88 Bulls were better. I am comfortable giving Jordan an unfair amount of credit, yes. You quite literally just threw out 3-variants of WOWYR, a metric which uses 89 Pippen to assess 91 Pippen as a teammate. This is what "impact" comparisons pre-97 is. You look at the with, you look at the without, and you use your understanding of the game to try and adjust for context and isolate variables. The difference here is I'm using 82-games while WOWYR is using either a handful(overall) or 8 games a season(it is not at all clear what is counted and Ben explicitly makes a distinction between 1994 and "wowy") for one guy and 2.2 games for another(that's bill russell) and THEN decides to make "corrections" based on the 3 games a teammate missed 5 years back or the 70 games another teammate missed 10 years forward. This is why Ben explcitly outlines using "shorter-time periods" as a solution and then uses "shorter-time periods" as the backbone of his impact write-ups.

WOWYR is extrapolated from "WOWY" utilizing "corrections" that are then applied to every year within a stretch. IOW. They take teammate "impact" from different versions of a team, multiple years removed from each other(the thing you said was "useless!", "ridiculous!", and "bad-faith!") to "correct" WOWY. The difference being the sample here is much smaller than with the extrapolations you didn't like for that same reason.

Whether it's deliberate, or simply a byproduct of jake mostly having no clue what they're talking about...
AEnigma wrote:
lessthanjake wrote:“More variance” is otherwise known as a higher parity environment where the best teams win a smaller percent of their games.

No, variance here is essentially referring to how low scores and deeper rosters present easier opportunities for an upset. Three NHL teams have won more than sixty times across an eighty-two season; none of them won the cup nor even made the Finals. NHL and MLB teams could win the same percentage and be innately far more prone to upsets than NBA teams. And it is weird you would try to drag in a baseball analogy without understanding that on a base level.

...the end-result is often an nonsensical jumbling of buzz-words, silly assumptions, and implicit or explicit hostility disguised with relatively sophisticated sentence structure.

For example...
lessthanjake wrote:Actually kind of yeah. I’ve explained this a bit above. But if you have a really ball-dominant style, when you go to the bench, your teammates will have not really touched the ball much in the flow of the offense beyond just finishing some plays with a shot, so they’ll be less in rhythm overall.

Wow you "explained" this, huh. Gee, thanks! Just one problem:
Spoiler:
letskissbro wrote:Both Luka Doncic and James Harden have actually played under the exaggerated caricature that people have made "LeBron ball" out to be, yet their teams fare significantly better when they're off the court than LeBron's teams and often field positive net ratings with their stars on the bench.

LeBron's average times of possession in that 16-21 stretch from Ben's graph (minus 2019): 5.3, 6.4, 6.7, 7.4, 6.4
Corresponding net ratings for those teams while he sat: -4.3, -8.8, -0.5, -0.9, -2.0

Luka (20-23): 8.9, 8.9, 9.3, 9.1
Off NetRtg: +4.2, +0.3, +3.4, -2.7

Harden (17-23): 9.3, 8.8, 9.3, 8.6, 8.6, 9.2, 8.6
Off NetRtg: +3.7, +5.2, +1.1, -3.4, +2.4, +1.2, +2.8

If LeBron spends approximately six seconds with the ball in his hands, and that leads to over-reliance on him, why is it that Harden and Luka, who hold the ball for 50% longer, dribble the ball significantly more per touch, play under a similar team setup, and have comparable usage rates, see their teams perform well in their absence?

AEnigma wrote:
lessthanjake wrote:
AEnigma wrote:Uh huh. I do not know, are you sure that is not actually Jokic’s fault for not keeping his bench players in proper “rhythm” to create offence without him on the court? This seems to be a recurring issue for Jokic, and seeing how he has never led even a 5 SRS team despite playing with talented players like Jamal Murray and Aaron Gordon and Michael Porter, Jr., that gives me very serious and sincere concern about his ability to lead good teams. :blank:

Maybe it is. It’s less plausible, since he isn’t as ball-dominant, so other guys do get touches on the ball a good deal. Notably, you don’t hear NBA players talking about how it is an issue to get in rhythm playing alongside him, like you do regarding the really heliocentric ball-dominant guys. In fact, players talk the exact opposite about playing with Jokic. So it seems much less plausible.

Yet somehow his exit to the bench dwarfs the bench effects we see from “the really heliocentric ball-dominant guys” like Harden, Luka, Wall, Morant, Parker, DWill, Rose… Even Chris Paul and Steve Nash and peak helio Westbrook (which does need to be specified because for whatever mysterious reason the team survived his absences much better outside of that 2016-18 period…) see less of a drop-off than Steph Curry. Very strange, given this very real and very legitimate theory about how “helio” stars routinely ruin their benches.
:thinking:

But the better explanation is probably just that they’re not at all deep at the center position, so they’re often trotting out some pretty bad players when he’s off the court

Fascinating. I wonder if you might be onto something there: not being easily replaced means your team struggles more without you??? Going to need to double-check the numbers on that, because I was under the impression stars controlled their replacements and that having replacements incapable of recreating their effect was in fact a failing on them.

It's bullsh-t,

That is why you've been "mocked", not because you voted for Jordan(plenty of people did without facing "toxic" pushback), but because you present assumptions you cannot actually back-up as if they are obvious truths, questioned for centering your arguments around these assumptions, and instead of responding like a reasonable person, you lash out and accuse other of being toxic and "drinking their kool-aid":
But you’re really just repackaging preconceived notions into boldly-pronounced arguments that are based on absurdity, like that we can measure Jordan’s impact by comparing the results of two essentially completely different rosters.

This an example of projection.

It was after all, you who assumed that every team Russell has ever played on was stacked because a player who by 1969 was a 6th-man was voted in the top 100. You who assumed the Bulls were no better than the Heat in terms of help because the players at the top of the rosters were similar(while actively trying to dismiss health as a factor). And you who disregarded actual results without Jordan to confidently claim they were "bottom line: not a significant contender" because you had "watched enough to know their talent level" while apparently not picking up on how the triangle worked.

You have literally spent the last four threads spamming unsourced data while accusing others of statistical malpractice. You then had the audacity to demand an apology when you were rightly called out. If you have any self-awareness, I'd suggest you stop crying wolf. The backlash you've faced is on you, no one else.

PS: posters in the project have literally told me they were going to obscure their preference for peak as to not "spark controversy", so let me, on their behalf, say the quiet part out loud:
Yank3525 wrote:
Chanel Bomber wrote:
NbaAllDay wrote:A lot of it will always just come back to how people weigh criteria.

If we give MJ the edge on impact data over th years he played, how many years of impact does Lebron end to match it? If Lebron is 80% as impactful for 50% longer, how does this stack up?

For a lot of people who give Lebron the nod, the long term impact outweighs the peak/Prime impact gap that MJ supposedly has.

There is also real legitimacy to context of teams they played with and teams they played against because as much as some like to believe they gauge players in a vacuum, that vacuum is different I'd that player plays under different circumstances, even if you believe that isn't the case.

It's something I've been thinking about as well.

We can reach the most well-informed level of debate, but the way we weigh those different criteria will always be somewhat arbitrary and tough to rationally arbitrate.

I would just say that - assuming a level playing field in terms of external circumstances - in the peak vs longevity argument (simplistically framing the 80% vs 50% example that you brought up), the value of longevity would be to enable the player to achieve as much if not more, but definitely not less.

And this is where people lose me with the longevity argument for LeBron. Because it's not met with the outcome that would be reasonably expected from it. It rings hollow.


Exactly. The longevity argument would hold more weight to me if LeBron was still cranking out years like 2019-20 instead of what he has done the last couple of years. He has basically become a part time player who has accumulated nice counting stats but doesn’t have the impact on winning that he once did.

Lebron is better at basketball, as the data(when it's not comically cherrypicked) suggests he is. Longevity isn't necessary. Lebron was better young, he was better old, and he was better at his best. Jordan's stock in PC has not fallen because kareem played extra-years. It's fallen because people are now more interesting in winning than subjective definitions of "production", and the idea that it was simply longevity that dragged Jordan down is just a misrepresentation on what's been happening over the last year. It wasn't longevity that has seen multiple people argue for Hakeem, Duncan, Kareem, and Russell on the basis of how good they were. So let's stop reducing this debate to a regurgitation of Zach Lowe. It wasn't just "lebron fans", it was a bunch of posters who like different players coming together and thinking about basketball a bit differently. Why don't we analyze that, instead of hitting the same beats as first things first.
KrAzY3
Starter
Posts: 2,104
And1: 2,617
Joined: Jun 26, 2016
Contact:
     

Re: What's the strongest data-driven argument for Michael Jordan as GOAT? 

Post#705 » by KrAzY3 » Thu Jul 13, 2023 9:49 pm

OhayoKD wrote:Lebron is better at basketball, as the data(when it's not comically cherrypicked) suggests he is. Longevity isn't necessary. Lebron was better young, he was better old, and he was better at his best.

I can see where you make that argument, if you think winning, and in particular championships don't matter.

That's the entire point of playing basketball though. So it's like saying person A is better at basketball than person B, except when it came to beating the other team and then he got way worse. This is kind of the issue I have with boxing by the way, where a boxer can get knocked on his butt a few times, nearly get knocked out, just basically get beaten up and all the judges are like he won because he scored more points. That's forgetting what the point of the contest is, it's a fight, you're trying to beat up the other guy not just accumulate meaningless points.

I notice none, not one of these piles of data you threw at us delved into things like the fact that Jordan won more championships than he was projected to win while LeBron basically broke even. Or that LeBron lost multiple times with home court advantage, while Jordan never did. I'm pretty sure nothing you cited had to do with winning at all.

As long as you see the world in such a way that basketball is an individual sport, you can at least claim LeBron is better. But it isn't...
lessthanjake
Analyst
Posts: 3,517
And1: 3,142
Joined: Apr 13, 2013

Re: What's the strongest data-driven argument for Michael Jordan as GOAT? 

Post#706 » by lessthanjake » Thu Jul 13, 2023 9:50 pm

OhayoKD wrote:.


Yeah, I’m not going to substantively address what is, in large part, an angry set of personal attacks that essentially just rehashes an overwhelming boatload of prior discussion while interspersing personal insults. The vast majority of what’s in here that’s directed to me are things that I’ve already addressed in responses in the PC forum threads, so I’d direct anyone to peruse those if they’re curious. And I’d request that maybe these sorts of posts can please be ended, since they’re not very pleasant.
OhayoKD wrote:Lebron contributes more to all the phases of play than Messi does. And he is of course a defensive anchor unlike messi.
OhayoKD
Head Coach
Posts: 6,042
And1: 3,935
Joined: Jun 22, 2022

Re: What's the strongest data-driven argument for Michael Jordan as GOAT? 

Post#707 » by OhayoKD » Thu Jul 13, 2023 10:05 pm

KrAzY3 wrote:
OhayoKD wrote:Lebron is better at basketball, as the data(when it's not comically cherrypicked) suggests he is. Longevity isn't necessary. Lebron was better young, he was better old, and he was better at his best.

I can see where you make that argument, if you think winning, and in particular championships don't matter.

Sir, I have literally gone to bat for Bill Russell as the greatest peak/prime ever:
https://forums.realgm.com/boards/viewtopic.php?p=107550888#p107550888
https://forums.realgm.com/boards/viewtopic.php?p=106210076#p106210076

Lebron being better is entirely about championships. But I'm going to guess you don't reduce things to ring-counting in all circumstances.

lessthanjake wrote:Yeah, I’m not going to substantively address what is, in large part, an angry set of personal attacks that essentially just rehashes an overwhelming boatload of prior discussion while interspersing personal insults. The vast majority of what’s in here that’s directed to me are things that I’ve already addressed in responses in the PC forum threads, so I’d direct anyone to peruse those if they’re curious. And I’d request that maybe these sorts of posts can please be ended, since they’re not very pleasant.

Fine, by me. We can let posterity decide:
Letskissjake wrote:The idea that you’d accuse me of having “sample size only matter[] when you want it to matter” is actually completely insane and wildly dishonest. You need to apologize for this promptly, otherwise it is clear you are discussing in bad faith.

Unbelievable. The fact that you can’t bring yourself to apologize for making such a blatantly dishonest attack is breathtaking. You don’t even address it, but instead go on some free-wheeling subject-changing maneuver to justify your stated refusal to apologize for something that was plainly unacceptable behavior.

:clap:
KrAzY3
Starter
Posts: 2,104
And1: 2,617
Joined: Jun 26, 2016
Contact:
     

Re: What's the strongest data-driven argument for Michael Jordan as GOAT? 

Post#708 » by KrAzY3 » Thu Jul 13, 2023 10:37 pm

OhayoKD wrote:Sir, I have literally gone to bat for Bill Russell as the greatest peak/prime ever:
https://forums.realgm.com/boards/viewtopic.php?p=107550888#p107550888
https://forums.realgm.com/boards/viewtopic.php?p=106210076#p106210076

Lebron being better is entirely about championships. But I'm going to guess you don't reduce things to ring-counting in all circumstances.

The two most active threads I've been in are this one and the Bill Russell one as of late, I think people tend to underestimate what Russell is capable of due to the fact that he focused solely on winning to the point of deferring and even forgoing a chance to get individual stats or accolades entirely. You can't just look at stats and go oh hey this guy is better than this guy when things like that are factors.

Some people try to throw out guys like Bill Russell or Robert Horry like they are diversionary tactics (not accusing you) and the fact is I think Bill Russell has a GOAT argument, certainly an argument to be made for greatest center of all time and I think Robert Horry is one of the greatest role players in history. In fact if I had to pick a favorite it might be Robert Horry. I'm not really a Jordan fan.

But, I think people who focus on LeBron's individual stats and not the fact that he did lack mental fortitude sometimes in the postseason (2011 being the best example but not the only one), are kind of covering for him. LeBron might be the most talented basketball player in history. But, to make an argument for his greatness based on individual stats is to overlook the fact that even peak LeBron wasn't always the greatest player in a series, let alone ever. Consistent greatness is a part of being great. Bill Russell and Michael Jordan were always going to beat you if they had any advantage whatsoever. The same can not be said for LeBron.
OhayoKD
Head Coach
Posts: 6,042
And1: 3,935
Joined: Jun 22, 2022

Re: What's the strongest data-driven argument for Michael Jordan as GOAT? 

Post#709 » by OhayoKD » Thu Jul 13, 2023 11:10 pm

KrAzY3 wrote:
OhayoKD wrote:Sir, I have literally gone to bat for Bill Russell as the greatest peak/prime ever:
https://forums.realgm.com/boards/viewtopic.php?p=107550888#p107550888
https://forums.realgm.com/boards/viewtopic.php?p=106210076#p106210076

Lebron being better is entirely about championships. But I'm going to guess you don't reduce things to ring-counting in all circumstances.

The two most active threads I've been in are this one and the Bill Russell one as of late, I think people tend to underestimate what Russell is capable of due to the fact that he focused solely on winning to the point of deferring and even forgoing a chance to get individual stats or accolades entirely. You can't just look at stats and go oh hey this guy is better than this guy when things like that are factors.

So this gets back to what "stats" we use, because by "winning", Russell looks like the most individually most valuable peak/prime-ever imo(the argument for that can be found at the bottom half of the first link if you're curious). This is not a definitive claim, but I cannot construct a positive counter-argument, especially for years like 1962 where the best competition is like +2(srs). When a team entirely gains seperation on defense and you have one of the smartest players ever, who also is the best man and help defender of the time period, no real reason to be tossing out offensive box like that proves something.

Some people try to throw out guys like Bill Russell or Robert Horry like they are diversionary tactics (not accusing you) and the fact is I think Bill Russell has a GOAT argument, certainly an argument to be made for greatest center of all time and I think Robert Horry is one of the greatest role players in history. In fact if I had to pick a favorite it might be Robert Horry. I'm not really a Jordan fan.

To me it's just about being consistent with your reasoning. I think I keep to that.
But, I think people who focus on LeBron's individual stats and not the fact that he did lack mental fortitude sometimes in the postseason (2011 being the best example but not the only one), are kind of covering for him. LeBron might be the most talented basketball player in history. But, to make an argument for his greatness based on individual stats is to overlook the fact that even peak LeBron wasn't always the greatest player in a series, let alone ever. Consistent greatness is a part of being great. Bill Russell and Michael Jordan were always going to beat you if they had any advantage whatsoever. The same can not be said for LeBron.


I think the difference here is you frame things relative to expectations where I just want to see whose worth more. I do not think Jordan has ever been as good at making teams win as Lebron was in 2009, 2012, 2013, and 2016. I also do not think Lebron at his best needs as much to win or contend(and I think he has shown that, most emphatically in 2015 and 2016), even if simply looking at the ends of possessions(box-score) might suggest that. I don't think anyone has ever played basketball as well as Lebron did at 24 over the last 40 years and then the did it again and again. Willing to overlook a blemish when ultimately the comparison is more about expectation(in terms of pure value jordan has worse years than 2011) than what you've contributed.

We also probably see the mental side of things differently for both players, but that is another matter.
User avatar
Chanel Bomber
RealGM
Posts: 23,902
And1: 42,015
Joined: Sep 20, 2018
 

Re: What's the strongest data-driven argument for Michael Jordan as GOAT? 

Post#710 » by Chanel Bomber » Fri Jul 14, 2023 2:48 am

OhayoKD wrote:Will keep all this in GB as to not derail the project but uh...

You know what, let's start with some housekeeping
iggymcfrack wrote:
TheGOATRises007 wrote:
https://fivethirtyeight.com/features/lebron-or-mj-raptor-picks-the-best-nba-players-of-the-past-40-years/

Raptor actually grades Jordan over LeBron with a sizable gap for a 7 year stretch.


I need to bookmark this for when it’s time to argue for CP3 and Stockton in the PC project. It’s amazing how well Stockton fares in the little bit of advanced analytics we have for the past. Everything I see about him makes me like him more and more. I’m starting to wonder if maybe Stockton > Wilt is actually a reasonable take.

RAPTOR does not have on/off pre-97(and even then it's heavily informed by the prior years). IOW is functionally a box-metric for the Jordan and Stockton and as has been noted during the project repeatedly...
Bill Russell — the only defender definitively better than Hakeem — finished higher than ninth in PER once. His average WS/48 placement was 8th (much smaller league). If we want to use someone more directly comparable in style, 1998-2001 Duncan averaged 8th in WS/48, 6th in PER, and 12th in BPM. Insightful stuff.

Just because you can see a few one-number metrics easily accessible on basketball-reference does not mean they have any real merit in these comparisons. Again, toy with formulas enough and you can get all sorts of results. We just discussed that with Engelmann’s “xRAPM”. IBM set up a formula that awarded the crown to Barkley three years running from 1986-88 (just outstanding Barkley years huh) and then to Dennis Rodman in 1992. Or we could use PIPM, which is imo the best pure box metric because it does a better job of weighing the defensive aspects of box inputs. There we see Hakeem start out his career 8th - 2nd - 3rd - 3rd - 2nd - 3rd - 5th - 6th. That reads a lot more honestly to me, but ultimately we just get back to these all being formulas toying with a few inputs not actually substituting for real impact.

If you are not confident enough to assess a player outside of what you can read off a stat-sheet, you can say so without trying to poison the well against those who actually took the time to develop their assessments.

....the box-score is a fickle mistress.

I will get to the non-box(aka impact) data present for Jordan and Lebron below (spoiler: it mostly favors Lebron), but let's start with the lede:
TheGOATRises007 wrote:
Bergmaniac wrote:As someone who is neutral in the whole GOAT debate (and mostly bored by it) the Jordan fans has always been way worse than the LeBron fans on this board when it comes to being rude and dismissive of the opinions they disagree with. So many of them act like they are personally offended when someone dares to claim someone else is their GOAT and go on and on how it's impossible for anyone reasonable to hold such an opinion.


On the GB board, I agree with you. I disagree regarding the PC board.

Gregoire wrote: There are definitely a group of LeBron superfans that are toxically aggressive, and it gets tiring.

VanWest82 wrote:Over there it's less than 10% with those making pro Jordan arguments getting shouted down and enduring personal attacks

So is said. Yet, the first "personal" attack(not to be confused with your arguments being criticized/challenged) I can find comes from...a Jordan voter:
Also why even bother to quote me when you know I don't take you seriously as a poster anyway? Seems like you're just looking to muck up the project by starting meaningless arguments.

(note that when dquinn similarly challenged the criteria for voting someone for lebron on page 1, they were not personally attacked?)

It was then, another Jordan voter who said it would be a "bad look" if MJ was voted below Russell:
i kinda think it would be a bad look for the project if he drops to 4, making anyone from the outside looking in want to dismiss the whole thing. i suppose we can say we don't care about such things, but i'm not sure it furthers discussion if that happens. but we shall see i guess.


And it was someone who prefers Jordan who decided to try and derail a thread by bringing up personal beef(after a rather embarrasing apology demand we'll get to later):
I'm not sure you're going to get it with this poster. I post this not to bring back old baggage, but to offer a bit of support -- You're not the only one who's struggled with this poster. Four posters including me left the latest Peaks project after verbal abuse and bad-faith arguments from this guy, and I would hate for people to start leaving this new project for the same reason.

Notably the poster above, has outright lied about what the poster they "struggled" with claimed in order to potray them as dishonest:
https://forums.realgm.com/boards/viewtopic.php?p=104349742#p104349742
Why?

Possibly to deflect from their calculations of playoff on/off swinging towards Lebron with a more accurate process and several posters realizing they had been "cooking"(chucking out inconvenient years, inconsistent process, ect)numbers to make Jordan look as good:
https://forums.realgm.com/boards/viewtopic.php?p=104353387#p104353387
While various posters have complained I(and the "mob") were being unfair, none have bothered trying to challenge this account of events despite me offering them multiple opportunities to do so

That lie above seems to have spread with gregoire citing it as he has yet another meltdown spurned by Jordan being evaluated differently:
Gregoire wrote:try to falsificate plus-minus metrics (some of them claimed that manually counted 500 MJ games only to prove that Lebron is better lol

It was of course angry jordan fans who flooded the thread I'm referencing largely contributing to it getting locked, and it's jordan fans again who've jumped into the ongoing project complaining about anti-mj bias.

And then, last but not least...
lessthanjake wrote:Lol, just read through the last few pages of this thread, and was surprised to find that my own posts in the PC board and the responses to them were discussed a bunch in this thread. My two cents is somewhere in the middle to be honest: I do actually think that there’s a lot of hostility on the PC board to anything that is perceived as pro-Jordan or anti-LeBron. I think I can reasonably say that I myself experienced quite a bit of hostility—including a boatload of sarcastic and mocking remarks—while arguing in favor of Jordan (though I won’t claim to be a delicate flower that doesn’t punch back).

Hmmm.


Well you certainly throw punches:
Spoiler:
LessthanJake wrote:It might behoove you to step back for a moment and question whether there might actually be something to learn here from someone with more knowledge than you.

Among other things, I also come at this discussion as an independently knowledgeable person about Jordan’s career and his teams, since I watched virtually all their games, and I’ve tried to impart some of that knowledge on you, which you obviously do not independently have.

It’s all just completely motivated-reasoning-based data manipulation, and, again, you surely realize that.

I don’t really feel the need to go through all this, because there’s a lot of obviously motivated reasoning going on here that I don’t think many people would find persuasive on its face, but just a few minor things

I don’t really feel the need to go through all this, because there’s a lot of obviously motivated reasoning going on here that I don’t think many people would find persuasive on its face, but just a few minor things:

And even on the separate question about what is more impressive in some sense, the point being made was applying this logic even to Tim Duncan’s Spurs, who did not play in a less-expanded league (the opposite actually). The argument that Duncan’s Spurs were as dominant a team as Jordan’s Bulls despite having worse records and lesser SRS, because of some argument about standard deviations is pretty obviously just motivated reasoning.

To be clear(on that last quote)...

A. that was not what was argued
Spoiler:
Nor does this
lessthanjake wrote:
Yes, I think that’s right. Though what I was objecting to specifically is the idea that that method would better identify likelihood of winning a championship than simply looking at SRS (as opposed to standard deviations above the mean in SRS) would. I assumed that the baseline factual claim—i.e. that Duncan’s Spurs were as many or more standard deviations above the mean as Jordan’s Bulls—was correct, without checking if that was actually true, so the discussion was not about the validity of the underlying factual claim.

Yeah, why don't we check that:
The "chance" Duncan was given in season 2 was probably smaller than what Jordan was given for most or all of his championships. And with that "chance", the Spurs went 16-4 and obliterated the competition(by san's std they were actually a bigger outlier than all but 2 Chicago sides).

'
All in all, I think then,it can be reasonably argued prime Duncan
...
-> Led two dominant teams(statistically better than most of the Bulls if you go by standard deviation(more relevant to winning championships than "srs")), one was probably with less
-> Led a third team not too far behind in 2005(not sure what "help" is there but there's still no Pippen equivalent) with less

Yeah, I don't know about you, but that does not read to me like a blanket comparison between Jordan's Bulls and Duncan's Spurs. Rather one between specific teams from that run.

Probably could have worded that less definitively, (San's playoff-heavy method is hardly the only approach) but I do think these claims count as "reasonably arguable"...


B. turns out standard deviations do correlate more with championships than srs
Spoiler:
[spoiler]
Let's look it it the other way, using data from 1984 onward. Here are the mean/median Z scores for champs and non-champs in different SRS bands.

SRS between 2 and 4:

Non-champs: Median 0.64, Mean 0.65 (163 teams)
Champs: Median 0.85, Mean 0.76 (3 teams)

SRS between 4 and 6:

Non-champs: Median 1.06, Mean 1.07 (102 teams)
Champs: Median 1.16, Mean 1.14 (13 Teams)

SRS between 6 and 8:

Non-champs: Median 1.44, Mean 1.48 (69 teams)
Champs: Median 1.60, Mean 1.58 (12 Teams)

SRS above 8:

Non-champs: Median 1.91, Mean 1.92 (11 teams)
Champs: Median 2.09, Mean 2.08 (11 teams)

Within each of these bands, the Z scores are higher for the champs than the non-champs.

What's more, building a model (logistic regression) for a title using either SRS or the Z score, the model with the Z score is more predictive (AIC of 120.07 vs 109.7 using SRS vs Z score for all league history, and AIC of 11.25 vs 8.09 using SRS vs Z score for teams since 1984).

Gap increases using pre-84


I bring this up because in the very next thread...
Spoiler:
lessthanjake wrote: The idea that you’d accuse me of having “sample size only matter[] when you want it to matter” is actually completely insane and wildly dishonest. You need to apologize for this promptly, otherwise it is clear you are discussing in bad faith.

Unbelievable. The fact that you can’t bring yourself to apologize for making such a blatantly dishonest attack is breathtaking. You don’t even address it, but instead go on some free-wheeling subject-changing maneuver to justify your stated refusal to apologize for something that was plainly unacceptable behavior. You were straightforwardly dishonest. Just apologize. It shouldn’t be difficult to do so if you’re a remotely mature human being. And it’s actually substantively important to discussion like this project that you apologize for behavior like this, because it will genuinely discourage people from honestly identifying appropriate caveats to the points they make and the evidence they provide if they know that people will just dishonestly attack them as if they did not do so. And that will ultimately lower the standard of discussion.

And you’re right—there was one particular part of your post I didn’t respond to. And that’s because all you did was take out half the data that was most inconvenient to you and then still only got to a conclusion that you yourself admitted wasn’t even particularly flattering for Hakeem. It was not something I felt the need to respond to, because I didn’t feel it made even a facially persuasive point.

Yeah... I think "delicate" might be an understatement. Also, despite Jake's demands for an apology, I think Anenigma is on the money:
anenigma wrote:Cool, let me know when you feel like apologising for actually doing everything that you just projected onto me. No interest in staying on subject, showing all data rather than manipulated snippets, properly responding to points made, maintaining honest portrayals, advancing sincere engagement, responding with any real maturity… So much for high standards of discussion. :-?

Let's start with the bold bit as, per usual, there are picks and cherries:
Jordan is a lot lower in raw WOWY (32nd in prime WOWY), so that’s one data point that doesn’t support his case—basically the only one.

Yeah, no.

Summary, extended samples, aka, "metrics"
Spoiler:
There is very little of what "you" consider impact data but of course you are rather inconsistent with what is "good-faith" running with WOWYR while complaining about much more straightforward "adjustments". As it so happens, "WOWYR" is the one and only type of metric Jordan actually looks "good" in relative to Lebron. Using your standard of what counts as an impact metric we have:

-> Playoff on/off
(Lebron looks better)
-> On/off
(Lebron looks better, 97/98 rank below 17 and 18 lebron years respectively)
-> On+ON/off
(Lebron looks better, 2nd and 5th best regular season teams rank 8th and 11th respectively)
-> WOWY
(Lebron looks better, and Jordan ranks 4th amongst his contemporaries, literally does not matter what you use)
-> Indirect samples(what eminece outlined in the #2), Lebron looks much better
-> AUPM
(Lebron looks better with the exception of 3-year consecutive where MJ is a bit behind Duncan)
-> Squared RAPM
(Lebron looks better in the same set)
-> Full RAPM
(Lebron looks signficantly better with the potential for Jordan to close in if his early years score better(there is reason to think they might not))

Larger Samples, multi-year extraps(something WOWYR and its variant also use but in a much noisier manner with tiny samples of "off"), one-year indirect
Spoiler:
Why don't we start with the large-samples which are not "limited"? We have 82 games telling us the Bulls were a 28-win team in 1984. They were a 30-win team without Jordan in 1986. That team then got better with additions such as oakley. And then, pre-triangle(a scheme that turned the Bulls from a good offense and a below average defense into a historically great offense and a -3 defense over the course of a season), the Bulls peaked at +2 at a 52 or 53-win pace(ben takes the former, eballa extrapolates the latter based on a 28 game-sample after a trade). If we unreasonably give Jordan all the credit for that improvement, out 82-game off sample leaves Jordan at +8.

That is

-> lower lift on a weaker team than what Kareem manages on the 77 Lakers if we ignore they traded significant pieces for Kareem.
-> within range of several kareem marks including when he was a rookie
(note the gap actually becomes quite large when you consider that srs was suppressed post-merger and championship teams of that era ranged from +4 to +6)
-> Dramatically worse than what we see with the same starting rotation(without Lebron) from the 11 cavs who go from 61-wins to 18-wins over a 21-game sample(19-win for the season overall), and worse than what we see over substantial without samples in the second cleveland stint(the most favorable appraisal of the cast gets lebron taking a 30-win team to 60 in the regular season) and small ones for Miami(40 to 60)
-> on the level of what we see with 69 Bill Russell, a retiring player-coach who had just knocked out a gauntlet tougher than any Micheal faced on the Bulls(marginally better replacement fwiw)
-> On par with what we see with 04/05 Duncan through injuries over a substantial sample(he would win a championship in 03 with a weaker version of what he had in 05)
-> On par with multiple substantial samples for Hakeem
-> Worse than substantial samples for Magic


We can also use 94 and 95 where the Bulls(with us inflating 93 by using the bulls full-strength rs-rating) only drop by 5.3-points in the regular season and 2-points in the playoffs making for an overall drop of "5" for the season(this is similar to what we get for 08/09 kg).

We can also look to 86 and 95 and the result remains the same. Over the largest possible samples, Jordan does not look on par with the other major Goat-candidates(Lebron, Russell, Kareem). He also does not gain separation over players he is assumed to be better than including his own contemporaries(magic is consistently advantaged, and Hakeem is competitive in the rs to go along with nigh unrivalled team-level playoff elevation)

If you are looking for the largest sample-sizes, that is what we have. I can make the most favorable assumptions and Jordan still does not reach the same heights as players he is considered comparable with. That is the lede. If you are going to "summarize all the available impact data for Jordan", there's no reason to bury it.

MJ:
'84 to '85 Bulls (w/MJ): +11 wins
'93 (w/MJ) to '94 Bulls: +4 wins
'95 (pre/MJ) to '95 Bulls (w/MJ): +20
LeBron:
'03 to '04 Cavs (w/LeBron): +17 wins
'10 (w/LeBron) to '11 Cavs: +46 wins
'10 to '11 Heat (w/LeBron): +12 wins
'14 (w/LeBron) to '15 Heat: +18 wins
'14 to '15 Cavs (w/LeBron): +26 wins
'18 (w/LeBron) to '19 Cavs: +31 wins
'18 to '19 Lakers (w/LeBron): +7 wins

As has repeatedly established(and ignored by those with positions the evidence is inconvenient for), non-box data consistently favors Lebron. In fact, with larger and more inclusive samples, Lebron's best signals look even better while Jordan's look worse. Even with WOWYR(which excludes much of the data for Lebron listed above and compounds a sample-size issue by tossing around noisy adjustments across the duration of a player's prime(EX: Bulls without 89 Pippen factors largely into how we assess 91 Pippen)), "Jordan being ahead of every goat candidate" is misleading. His advantage over Kareem is questionable as corrected data puts Kareem's WOWY ahead(the base of WOWYR). His advantage over Russell is entirely a matter of framing(do you value championships or srs?) as WOWYR suggests that Russell's 11 championships came with average help.

There is no non-box stat which generally puts Jordan #1, even that where Jordan is competing against a narrowed field(almost no on/off pre-97 for any player besides Micheal). Those unsourced github-sets(something Lessthanjake has refused to acknowledge while calling whatever doesn't favor Jordan "flawed") only really advantages Jordan over the likes of Malone and of course full RAPM for 97/98 places him well below Lebron under the same creators:
Image
(Cheema, the spax)

(keep in mind, it is not clear that second-three-peat MJ's "empirical impact" is lower than first-three peat MJ with the snippets we have arguably favoring the latter years). Lebron also dominates a field of players who can scale favorably against Mike when we use data that is available for both(Duncan, KG, Steph, Shaq, ect):
James is, arguably, the king of overall plus-minus stats. 2018 is the 25th season of league-wide plus-minus data, which covers nearly 40 percent of the shot-clock era and touches 12 of the top-20 players on this list. None have achieved LeBron’s heights: He holds four of the top-five scaled APM seasons on record, and six of the top eight. Since 2007, 10 of his 11 years land in the 99th percentile.

(scaled-apm, JE)

Among contemporaries, prime Magic(mostly excluded in the data less-than-jake pulled up), is consistently advantaged. So is David-Robinson(pollock, wowy, wowyr, concentrated), with Hakeem mostly grading out as a regular-season peer(and then sees the most team-level elevation in the postseason of anyone from that time-period).


Lessthanjake, bastion of fairness and honesty, repeatedly seeing this sort of non-box data, found various excuses to disregard it(and then attack the credibility of those who provided it) on grounds that were better applied to a metric they preferred(WOWYR):
Spoiler:
It’s logical extrapolation using basically completely unrelated data points and trying to draw a meaningful conclusion from them. You might as well pick a completely random team that you happen to think is similarly or less talented than the Bulls without Jordan and look at their wins/SRS and say that the difference is the upper bound of Jordan’s impact.

They are different rosters in that the 88 Bulls were better. I am comfortable giving Jordan an unfair amount of credit, yes. You quite literally just threw out 3-variants of WOWYR, a metric which uses 89 Pippen to assess 91 Pippen as a teammate. This is what "impact" comparisons pre-97 is. You look at the with, you look at the without, and you use your understanding of the game to try and adjust for context and isolate variables. The difference here is I'm using 82-games while WOWYR is using either a handful(overall) or 8 games a season(it is not at all clear what is counted and Ben explicitly makes a distinction between 1994 and "wowy") for one guy and 2.2 games for another(that's bill russell) and THEN decides to make "corrections" based on the 3 games a teammate missed 5 years back or the 70 games another teammate missed 10 years forward. This is why Ben explcitly outlines using "shorter-time periods" as a solution and then uses "shorter-time periods" as the backbone of his impact write-ups.

WOWYR is extrapolated from "WOWY" utilizing "corrections" that are then applied to every year within a stretch. IOW. They take teammate "impact" from different versions of a team, multiple years removed from each other(the thing you said was "useless!", "ridiculous!", and "bad-faith!") to "correct" WOWY. The difference being the sample here is much smaller than with the extrapolations you didn't like for that same reason.

Whether it's deliberate, or simply a byproduct of jake mostly having no clue what they're talking about...
AEnigma wrote:
lessthanjake wrote:“More variance” is otherwise known as a higher parity environment where the best teams win a smaller percent of their games.

No, variance here is essentially referring to how low scores and deeper rosters present easier opportunities for an upset. Three NHL teams have won more than sixty times across an eighty-two season; none of them won the cup nor even made the Finals. NHL and MLB teams could win the same percentage and be innately far more prone to upsets than NBA teams. And it is weird you would try to drag in a baseball analogy without understanding that on a base level.

...the end-result is often an nonsensical jumbling of buzz-words, silly assumptions, and implicit or explicit hostility disguised with relatively sophisticated sentence structure.

For example...
lessthanjake wrote:Actually kind of yeah. I’ve explained this a bit above. But if you have a really ball-dominant style, when you go to the bench, your teammates will have not really touched the ball much in the flow of the offense beyond just finishing some plays with a shot, so they’ll be less in rhythm overall.

Wow you "explained" this, huh. Gee, thanks! Just one problem:
Spoiler:
letskissbro wrote:Both Luka Doncic and James Harden have actually played under the exaggerated caricature that people have made "LeBron ball" out to be, yet their teams fare significantly better when they're off the court than LeBron's teams and often field positive net ratings with their stars on the bench.

LeBron's average times of possession in that 16-21 stretch from Ben's graph (minus 2019): 5.3, 6.4, 6.7, 7.4, 6.4
Corresponding net ratings for those teams while he sat: -4.3, -8.8, -0.5, -0.9, -2.0

Luka (20-23): 8.9, 8.9, 9.3, 9.1
Off NetRtg: +4.2, +0.3, +3.4, -2.7

Harden (17-23): 9.3, 8.8, 9.3, 8.6, 8.6, 9.2, 8.6
Off NetRtg: +3.7, +5.2, +1.1, -3.4, +2.4, +1.2, +2.8

If LeBron spends approximately six seconds with the ball in his hands, and that leads to over-reliance on him, why is it that Harden and Luka, who hold the ball for 50% longer, dribble the ball significantly more per touch, play under a similar team setup, and have comparable usage rates, see their teams perform well in their absence?

AEnigma wrote:
lessthanjake wrote:Maybe it is. It’s less plausible, since he isn’t as ball-dominant, so other guys do get touches on the ball a good deal. Notably, you don’t hear NBA players talking about how it is an issue to get in rhythm playing alongside him, like you do regarding the really heliocentric ball-dominant guys. In fact, players talk the exact opposite about playing with Jokic. So it seems much less plausible.

Yet somehow his exit to the bench dwarfs the bench effects we see from “the really heliocentric ball-dominant guys” like Harden, Luka, Wall, Morant, Parker, DWill, Rose… Even Chris Paul and Steve Nash and peak helio Westbrook (which does need to be specified because for whatever mysterious reason the team survived his absences much better outside of that 2016-18 period…) see less of a drop-off than Steph Curry. Very strange, given this very real and very legitimate theory about how “helio” stars routinely ruin their benches.
:thinking:

But the better explanation is probably just that they’re not at all deep at the center position, so they’re often trotting out some pretty bad players when he’s off the court

Fascinating. I wonder if you might be onto something there: not being easily replaced means your team struggles more without you??? Going to need to double-check the numbers on that, because I was under the impression stars controlled their replacements and that having replacements incapable of recreating their effect was in fact a failing on them.

It's bullsh-t,

That is why you've been "mocked", not because you voted for Jordan(plenty of people did without facing "toxic" pushback), but because you present assumptions you cannot actually back-up as if they are obvious truths, questioned for centering your arguments around these assumptions, and instead of responding like a reasonable person, you lash out and accuse other of being toxic and "drinking their kool-aid":
But you’re really just repackaging preconceived notions into boldly-pronounced arguments that are based on absurdity, like that we can measure Jordan’s impact by comparing the results of two essentially completely different rosters.

This an example of projection.

It was after all, you who assumed that every team Russell has ever played on was stacked because a player who by 1969 was a 6th-man was voted in the top 100. You who assumed the Bulls were no better than the Heat in terms of help because the players at the top of the rosters were similar(while actively trying to dismiss health as a factor). And you who disregarded actual results without Jordan to confidently claim they were "bottom line: not a significant contender" because you had "watched enough to know their talent level" while apparently not picking up on how the triangle worked.

You have literally spent the last four threads spamming unsourced data while accusing others of statistical malpractice. You then had the audacity to demand an apology when you were rightly called out. If you have any self-awareness, I'd suggest you stop crying wolf. The backlash you've faced is on you, no one else.

PS: posters in the project have literally told me they were going to obscure their preference for peak as to not "spark controversy", so let me, on their behalf, say the quiet part out loud:
Yank3525 wrote:
Chanel Bomber wrote:It's something I've been thinking about as well.

We can reach the most well-informed level of debate, but the way we weigh those different criteria will always be somewhat arbitrary and tough to rationally arbitrate.

I would just say that - assuming a level playing field in terms of external circumstances - in the peak vs longevity argument (simplistically framing the 80% vs 50% example that you brought up), the value of longevity would be to enable the player to achieve as much if not more, but definitely not less.

And this is where people lose me with the longevity argument for LeBron. Because it's not met with the outcome that would be reasonably expected from it. It rings hollow.


Exactly. The longevity argument would hold more weight to me if LeBron was still cranking out years like 2019-20 instead of what he has done the last couple of years. He has basically become a part time player who has accumulated nice counting stats but doesn’t have the impact on winning that he once did.

Lebron is better at basketball, as the data(when it's not comically cherrypicked) suggests he is. Longevity isn't necessary. Lebron was better young, he was better old, and he was better at his best. Jordan's stock in PC has not fallen because kareem played extra-years. It's fallen because people are now more interesting in winning than subjective definitions of "production", and the idea that it was simply longevity that dragged Jordan down is just a misrepresentation on what's been happening over the last year. It wasn't longevity that has seen multiple people argue for Hakeem, Duncan, Kareem, and Russell on the basis of how good they were. So let's stop reducing this debate to a regurgitation of Zach Lowe. It wasn't just "lebron fans", it was a bunch of posters who like different players coming together and thinking about basketball a bit differently. Why don't we analyze that, instead of hitting the same beats as first things first.

It’s a shame if RAPTOR from the 1990s doesn’t incorporate +/- data. It’d kind of defeat the purpose.

I think RAPM’s an interesting metric which shows some relevant trends but the calculation doesn’t adjust for environment nearly enough (compared to RAPTOR for instance which does a much better job of extracting noise). I wouldn’t use RAPM at face value in this exercise is my point.

You mention cherry-picked stats but I don’t think there is yet an all-encompassing metric that incorporates finely-adjusted data that we can use for cross-generation comparisons. Although I need to learn more about WOWYR. But as a general point, we also need to think critically about the accuracy of those metrics by the way they are put together.

I think the metric that’s available to the public that came the closest is RAPTOR but if the on/off slash +/- component is missing for Jordan then it will make any comparison to LeBron useless.

An over-reliance on impact metrics then also opens the door for questions like is Jokic or Curry the GOAT (over James and Jordan)? If you’re willing to have these discussions, then fair enough.
MavsDirk41
Lead Assistant
Posts: 5,925
And1: 4,578
Joined: Dec 07, 2022
     

Re: What's the strongest data-driven argument for Michael Jordan as GOAT? 

Post#711 » by MavsDirk41 » Fri Jul 14, 2023 2:51 am

OhayoKD wrote:Will keep all this in GB as to not derail the project but uh...

You know what, let's start with some housekeeping
iggymcfrack wrote:
TheGOATRises007 wrote:
https://fivethirtyeight.com/features/lebron-or-mj-raptor-picks-the-best-nba-players-of-the-past-40-years/

Raptor actually grades Jordan over LeBron with a sizable gap for a 7 year stretch.


I need to bookmark this for when it’s time to argue for CP3 and Stockton in the PC project. It’s amazing how well Stockton fares in the little bit of advanced analytics we have for the past. Everything I see about him makes me like him more and more. I’m starting to wonder if maybe Stockton > Wilt is actually a reasonable take.

RAPTOR does not have on/off pre-97(and even then it's heavily informed by the prior years). IOW is functionally a box-metric for the Jordan and Stockton and as has been noted during the project repeatedly...
Bill Russell — the only defender definitively better than Hakeem — finished higher than ninth in PER once. His average WS/48 placement was 8th (much smaller league). If we want to use someone more directly comparable in style, 1998-2001 Duncan averaged 8th in WS/48, 6th in PER, and 12th in BPM. Insightful stuff.

Just because you can see a few one-number metrics easily accessible on basketball-reference does not mean they have any real merit in these comparisons. Again, toy with formulas enough and you can get all sorts of results. We just discussed that with Engelmann’s “xRAPM”. IBM set up a formula that awarded the crown to Barkley three years running from 1986-88 (just outstanding Barkley years huh) and then to Dennis Rodman in 1992. Or we could use PIPM, which is imo the best pure box metric because it does a better job of weighing the defensive aspects of box inputs. There we see Hakeem start out his career 8th - 2nd - 3rd - 3rd - 2nd - 3rd - 5th - 6th. That reads a lot more honestly to me, but ultimately we just get back to these all being formulas toying with a few inputs not actually substituting for real impact.

If you are not confident enough to assess a player outside of what you can read off a stat-sheet, you can say so without trying to poison the well against those who actually took the time to develop their assessments.

....the box-score is a fickle mistress.

I will get to the non-box(aka impact) data present for Jordan and Lebron below (spoiler: it mostly favors Lebron), but let's start with the lede:
TheGOATRises007 wrote:
Bergmaniac wrote:As someone who is neutral in the whole GOAT debate (and mostly bored by it) the Jordan fans has always been way worse than the LeBron fans on this board when it comes to being rude and dismissive of the opinions they disagree with. So many of them act like they are personally offended when someone dares to claim someone else is their GOAT and go on and on how it's impossible for anyone reasonable to hold such an opinion.


On the GB board, I agree with you. I disagree regarding the PC board.

Gregoire wrote: There are definitely a group of LeBron superfans that are toxically aggressive, and it gets tiring.

VanWest82 wrote:Over there it's less than 10% with those making pro Jordan arguments getting shouted down and enduring personal attacks

So is said. Yet, the first "personal" attack(not to be confused with your arguments being criticized/challenged) I can find comes from...a Jordan voter:
Also why even bother to quote me when you know I don't take you seriously as a poster anyway? Seems like you're just looking to muck up the project by starting meaningless arguments.

(note that when dquinn similarly challenged the criteria for voting someone for lebron on page 1, they were not personally attacked?)

It was then, another Jordan voter who said it would be a "bad look" if MJ was voted below Russell:
i kinda think it would be a bad look for the project if he drops to 4, making anyone from the outside looking in want to dismiss the whole thing. i suppose we can say we don't care about such things, but i'm not sure it furthers discussion if that happens. but we shall see i guess.


And it was someone who prefers Jordan who decided to try and derail a thread by bringing up personal beef(after a rather embarrasing apology demand we'll get to later):
I'm not sure you're going to get it with this poster. I post this not to bring back old baggage, but to offer a bit of support -- You're not the only one who's struggled with this poster. Four posters including me left the latest Peaks project after verbal abuse and bad-faith arguments from this guy, and I would hate for people to start leaving this new project for the same reason.

Notably the poster above, has outright lied about what the poster they "struggled" with claimed in order to potray them as dishonest:
https://forums.realgm.com/boards/viewtopic.php?p=104349742#p104349742
Why?

Possibly to deflect from their calculations of playoff on/off swinging towards Lebron with a more accurate process and several posters realizing they had been "cooking"(chucking out inconvenient years, inconsistent process, ect)numbers to make Jordan look as good:
https://forums.realgm.com/boards/viewtopic.php?p=104353387#p104353387
While various posters have complained I(and the "mob") were being unfair, none have bothered trying to challenge this account of events despite me offering them multiple opportunities to do so

That lie above seems to have spread with gregoire citing it as he has yet another meltdown spurned by Jordan being evaluated differently:
Gregoire wrote:try to falsificate plus-minus metrics (some of them claimed that manually counted 500 MJ games only to prove that Lebron is better lol

It was of course angry jordan fans who flooded the thread I'm referencing largely contributing to it getting locked, and it's jordan fans again who've jumped into the ongoing project complaining about anti-mj bias.

And then, last but not least...
lessthanjake wrote:Lol, just read through the last few pages of this thread, and was surprised to find that my own posts in the PC board and the responses to them were discussed a bunch in this thread. My two cents is somewhere in the middle to be honest: I do actually think that there’s a lot of hostility on the PC board to anything that is perceived as pro-Jordan or anti-LeBron. I think I can reasonably say that I myself experienced quite a bit of hostility—including a boatload of sarcastic and mocking remarks—while arguing in favor of Jordan (though I won’t claim to be a delicate flower that doesn’t punch back).

Hmmm.


Well you certainly throw punches:
Spoiler:
LessthanJake wrote:It might behoove you to step back for a moment and question whether there might actually be something to learn here from someone with more knowledge than you.

Among other things, I also come at this discussion as an independently knowledgeable person about Jordan’s career and his teams, since I watched virtually all their games, and I’ve tried to impart some of that knowledge on you, which you obviously do not independently have.

It’s all just completely motivated-reasoning-based data manipulation, and, again, you surely realize that.

I don’t really feel the need to go through all this, because there’s a lot of obviously motivated reasoning going on here that I don’t think many people would find persuasive on its face, but just a few minor things

I don’t really feel the need to go through all this, because there’s a lot of obviously motivated reasoning going on here that I don’t think many people would find persuasive on its face, but just a few minor things:

And even on the separate question about what is more impressive in some sense, the point being made was applying this logic even to Tim Duncan’s Spurs, who did not play in a less-expanded league (the opposite actually). The argument that Duncan’s Spurs were as dominant a team as Jordan’s Bulls despite having worse records and lesser SRS, because of some argument about standard deviations is pretty obviously just motivated reasoning.

To be clear(on that last quote)...

A. that was not what was argued
Spoiler:
Nor does this
lessthanjake wrote:
Yes, I think that’s right. Though what I was objecting to specifically is the idea that that method would better identify likelihood of winning a championship than simply looking at SRS (as opposed to standard deviations above the mean in SRS) would. I assumed that the baseline factual claim—i.e. that Duncan’s Spurs were as many or more standard deviations above the mean as Jordan’s Bulls—was correct, without checking if that was actually true, so the discussion was not about the validity of the underlying factual claim.

Yeah, why don't we check that:
The "chance" Duncan was given in season 2 was probably smaller than what Jordan was given for most or all of his championships. And with that "chance", the Spurs went 16-4 and obliterated the competition(by san's std they were actually a bigger outlier than all but 2 Chicago sides).

'
All in all, I think then,it can be reasonably argued prime Duncan
...
-> Led two dominant teams(statistically better than most of the Bulls if you go by standard deviation(more relevant to winning championships than "srs")), one was probably with less
-> Led a third team not too far behind in 2005(not sure what "help" is there but there's still no Pippen equivalent) with less

Yeah, I don't know about you, but that does not read to me like a blanket comparison between Jordan's Bulls and Duncan's Spurs. Rather one between specific teams from that run.

Probably could have worded that less definitively, (San's playoff-heavy method is hardly the only approach) but I do think these claims count as "reasonably arguable"...


B. turns out standard deviations do correlate more with championships than srs
Spoiler:
[spoiler]
Let's look it it the other way, using data from 1984 onward. Here are the mean/median Z scores for champs and non-champs in different SRS bands.

SRS between 2 and 4:

Non-champs: Median 0.64, Mean 0.65 (163 teams)
Champs: Median 0.85, Mean 0.76 (3 teams)

SRS between 4 and 6:

Non-champs: Median 1.06, Mean 1.07 (102 teams)
Champs: Median 1.16, Mean 1.14 (13 Teams)

SRS between 6 and 8:

Non-champs: Median 1.44, Mean 1.48 (69 teams)
Champs: Median 1.60, Mean 1.58 (12 Teams)

SRS above 8:

Non-champs: Median 1.91, Mean 1.92 (11 teams)
Champs: Median 2.09, Mean 2.08 (11 teams)

Within each of these bands, the Z scores are higher for the champs than the non-champs.

What's more, building a model (logistic regression) for a title using either SRS or the Z score, the model with the Z score is more predictive (AIC of 120.07 vs 109.7 using SRS vs Z score for all league history, and AIC of 11.25 vs 8.09 using SRS vs Z score for teams since 1984).

Gap increases using pre-84


I bring this up because in the very next thread...
Spoiler:
lessthanjake wrote: The idea that you’d accuse me of having “sample size only matter[] when you want it to matter” is actually completely insane and wildly dishonest. You need to apologize for this promptly, otherwise it is clear you are discussing in bad faith.

Unbelievable. The fact that you can’t bring yourself to apologize for making such a blatantly dishonest attack is breathtaking. You don’t even address it, but instead go on some free-wheeling subject-changing maneuver to justify your stated refusal to apologize for something that was plainly unacceptable behavior. You were straightforwardly dishonest. Just apologize. It shouldn’t be difficult to do so if you’re a remotely mature human being. And it’s actually substantively important to discussion like this project that you apologize for behavior like this, because it will genuinely discourage people from honestly identifying appropriate caveats to the points they make and the evidence they provide if they know that people will just dishonestly attack them as if they did not do so. And that will ultimately lower the standard of discussion.

And you’re right—there was one particular part of your post I didn’t respond to. And that’s because all you did was take out half the data that was most inconvenient to you and then still only got to a conclusion that you yourself admitted wasn’t even particularly flattering for Hakeem. It was not something I felt the need to respond to, because I didn’t feel it made even a facially persuasive point.

Yeah... I think "delicate" might be an understatement. Also, despite Jake's demands for an apology, I think Anenigma is on the money:
anenigma wrote:Cool, let me know when you feel like apologising for actually doing everything that you just projected onto me. No interest in staying on subject, showing all data rather than manipulated snippets, properly responding to points made, maintaining honest portrayals, advancing sincere engagement, responding with any real maturity… So much for high standards of discussion. :-?

Let's start with the bold bit as, per usual, there are picks and cherries:
Jordan is a lot lower in raw WOWY (32nd in prime WOWY), so that’s one data point that doesn’t support his case—basically the only one.

Yeah, no.

Summary, extended samples, aka, "metrics"
Spoiler:
There is very little of what "you" consider impact data but of course you are rather inconsistent with what is "good-faith" running with WOWYR while complaining about much more straightforward "adjustments". As it so happens, "WOWYR" is the one and only type of metric Jordan actually looks "good" in relative to Lebron. Using your standard of what counts as an impact metric we have:

-> Playoff on/off
(Lebron looks better)
-> On/off
(Lebron looks better, 97/98 rank below 17 and 18 lebron years respectively)
-> On+ON/off
(Lebron looks better, 2nd and 5th best regular season teams rank 8th and 11th respectively)
-> WOWY
(Lebron looks better, and Jordan ranks 4th amongst his contemporaries, literally does not matter what you use)
-> Indirect samples(what eminece outlined in the #2), Lebron looks much better
-> AUPM
(Lebron looks better with the exception of 3-year consecutive where MJ is a bit behind Duncan)
-> Squared RAPM
(Lebron looks better in the same set)
-> Full RAPM
(Lebron looks signficantly better with the potential for Jordan to close in if his early years score better(there is reason to think they might not))

Larger Samples, multi-year extraps(something WOWYR and its variant also use but in a much noisier manner with tiny samples of "off"), one-year indirect
Spoiler:
Why don't we start with the large-samples which are not "limited"? We have 82 games telling us the Bulls were a 28-win team in 1984. They were a 30-win team without Jordan in 1986. That team then got better with additions such as oakley. And then, pre-triangle(a scheme that turned the Bulls from a good offense and a below average defense into a historically great offense and a -3 defense over the course of a season), the Bulls peaked at +2 at a 52 or 53-win pace(ben takes the former, eballa extrapolates the latter based on a 28 game-sample after a trade). If we unreasonably give Jordan all the credit for that improvement, out 82-game off sample leaves Jordan at +8.

That is

-> lower lift on a weaker team than what Kareem manages on the 77 Lakers if we ignore they traded significant pieces for Kareem.
-> within range of several kareem marks including when he was a rookie
(note the gap actually becomes quite large when you consider that srs was suppressed post-merger and championship teams of that era ranged from +4 to +6)
-> Dramatically worse than what we see with the same starting rotation(without Lebron) from the 11 cavs who go from 61-wins to 18-wins over a 21-game sample(19-win for the season overall), and worse than what we see over substantial without samples in the second cleveland stint(the most favorable appraisal of the cast gets lebron taking a 30-win team to 60 in the regular season) and small ones for Miami(40 to 60)
-> on the level of what we see with 69 Bill Russell, a retiring player-coach who had just knocked out a gauntlet tougher than any Micheal faced on the Bulls(marginally better replacement fwiw)
-> On par with what we see with 04/05 Duncan through injuries over a substantial sample(he would win a championship in 03 with a weaker version of what he had in 05)
-> On par with multiple substantial samples for Hakeem
-> Worse than substantial samples for Magic


We can also use 94 and 95 where the Bulls(with us inflating 93 by using the bulls full-strength rs-rating) only drop by 5.3-points in the regular season and 2-points in the playoffs making for an overall drop of "5" for the season(this is similar to what we get for 08/09 kg).

We can also look to 86 and 95 and the result remains the same. Over the largest possible samples, Jordan does not look on par with the other major Goat-candidates(Lebron, Russell, Kareem). He also does not gain separation over players he is assumed to be better than including his own contemporaries(magic is consistently advantaged, and Hakeem is competitive in the rs to go along with nigh unrivalled team-level playoff elevation)

If you are looking for the largest sample-sizes, that is what we have. I can make the most favorable assumptions and Jordan still does not reach the same heights as players he is considered comparable with. That is the lede. If you are going to "summarize all the available impact data for Jordan", there's no reason to bury it.

MJ:
'84 to '85 Bulls (w/MJ): +11 wins
'93 (w/MJ) to '94 Bulls: +4 wins
'95 (pre/MJ) to '95 Bulls (w/MJ): +20
LeBron:
'03 to '04 Cavs (w/LeBron): +17 wins
'10 (w/LeBron) to '11 Cavs: +46 wins
'10 to '11 Heat (w/LeBron): +12 wins
'14 (w/LeBron) to '15 Heat: +18 wins
'14 to '15 Cavs (w/LeBron): +26 wins
'18 (w/LeBron) to '19 Cavs: +31 wins
'18 to '19 Lakers (w/LeBron): +7 wins

As has repeatedly established(and ignored by those with positions the evidence is inconvenient for), non-box data consistently favors Lebron. In fact, with larger and more inclusive samples, Lebron's best signals look even better while Jordan's look worse. Even with WOWYR(which excludes much of the data for Lebron listed above and compounds a sample-size issue by tossing around noisy adjustments across the duration of a player's prime(EX: Bulls without 89 Pippen factors largely into how we assess 91 Pippen)), "Jordan being ahead of every goat candidate" is misleading. His advantage over Kareem is questionable as corrected data puts Kareem's WOWY ahead(the base of WOWYR). His advantage over Russell is entirely a matter of framing(do you value championships or srs?) as WOWYR suggests that Russell's 11 championships came with average help.

There is no non-box stat which generally puts Jordan #1, even that where Jordan is competing against a narrowed field(almost no on/off pre-97 for any player besides Micheal). Those unsourced github-sets(something Lessthanjake has refused to acknowledge while calling whatever doesn't favor Jordan "flawed") only really advantages Jordan over the likes of Malone and of course full RAPM for 97/98 places him well below Lebron under the same creators:
Image
(Cheema, the spax)

(keep in mind, it is not clear that second-three-peat MJ's "empirical impact" is lower than first-three peat MJ with the snippets we have arguably favoring the latter years). Lebron also dominates a field of players who can scale favorably against Mike when we use data that is available for both(Duncan, KG, Steph, Shaq, ect):
James is, arguably, the king of overall plus-minus stats. 2018 is the 25th season of league-wide plus-minus data, which covers nearly 40 percent of the shot-clock era and touches 12 of the top-20 players on this list. None have achieved LeBron’s heights: He holds four of the top-five scaled APM seasons on record, and six of the top eight. Since 2007, 10 of his 11 years land in the 99th percentile.

(scaled-apm, JE)

Among contemporaries, prime Magic(mostly excluded in the data less-than-jake pulled up), is consistently advantaged. So is David-Robinson(pollock, wowy, wowyr, concentrated), with Hakeem mostly grading out as a regular-season peer(and then sees the most team-level elevation in the postseason of anyone from that time-period).


Lessthanjake, bastion of fairness and honesty, repeatedly seeing this sort of non-box data, found various excuses to disregard it(and then attack the credibility of those who provided it) on grounds that were better applied to a metric they preferred(WOWYR):
Spoiler:
It’s logical extrapolation using basically completely unrelated data points and trying to draw a meaningful conclusion from them. You might as well pick a completely random team that you happen to think is similarly or less talented than the Bulls without Jordan and look at their wins/SRS and say that the difference is the upper bound of Jordan’s impact.

They are different rosters in that the 88 Bulls were better. I am comfortable giving Jordan an unfair amount of credit, yes. You quite literally just threw out 3-variants of WOWYR, a metric which uses 89 Pippen to assess 91 Pippen as a teammate. This is what "impact" comparisons pre-97 is. You look at the with, you look at the without, and you use your understanding of the game to try and adjust for context and isolate variables. The difference here is I'm using 82-games while WOWYR is using either a handful(overall) or 8 games a season(it is not at all clear what is counted and Ben explicitly makes a distinction between 1994 and "wowy") for one guy and 2.2 games for another(that's bill russell) and THEN decides to make "corrections" based on the 3 games a teammate missed 5 years back or the 70 games another teammate missed 10 years forward. This is why Ben explcitly outlines using "shorter-time periods" as a solution and then uses "shorter-time periods" as the backbone of his impact write-ups.

WOWYR is extrapolated from "WOWY" utilizing "corrections" that are then applied to every year within a stretch. IOW. They take teammate "impact" from different versions of a team, multiple years removed from each other(the thing you said was "useless!", "ridiculous!", and "bad-faith!") to "correct" WOWY. The difference being the sample here is much smaller than with the extrapolations you didn't like for that same reason.

Whether it's deliberate, or simply a byproduct of jake mostly having no clue what they're talking about...
AEnigma wrote:
lessthanjake wrote:“More variance” is otherwise known as a higher parity environment where the best teams win a smaller percent of their games.

No, variance here is essentially referring to how low scores and deeper rosters present easier opportunities for an upset. Three NHL teams have won more than sixty times across an eighty-two season; none of them won the cup nor even made the Finals. NHL and MLB teams could win the same percentage and be innately far more prone to upsets than NBA teams. And it is weird you would try to drag in a baseball analogy without understanding that on a base level.

...the end-result is often an nonsensical jumbling of buzz-words, silly assumptions, and implicit or explicit hostility disguised with relatively sophisticated sentence structure.

For example...
lessthanjake wrote:Actually kind of yeah. I’ve explained this a bit above. But if you have a really ball-dominant style, when you go to the bench, your teammates will have not really touched the ball much in the flow of the offense beyond just finishing some plays with a shot, so they’ll be less in rhythm overall.

Wow you "explained" this, huh. Gee, thanks! Just one problem:
Spoiler:
letskissbro wrote:Both Luka Doncic and James Harden have actually played under the exaggerated caricature that people have made "LeBron ball" out to be, yet their teams fare significantly better when they're off the court than LeBron's teams and often field positive net ratings with their stars on the bench.

LeBron's average times of possession in that 16-21 stretch from Ben's graph (minus 2019): 5.3, 6.4, 6.7, 7.4, 6.4
Corresponding net ratings for those teams while he sat: -4.3, -8.8, -0.5, -0.9, -2.0

Luka (20-23): 8.9, 8.9, 9.3, 9.1
Off NetRtg: +4.2, +0.3, +3.4, -2.7

Harden (17-23): 9.3, 8.8, 9.3, 8.6, 8.6, 9.2, 8.6
Off NetRtg: +3.7, +5.2, +1.1, -3.4, +2.4, +1.2, +2.8

If LeBron spends approximately six seconds with the ball in his hands, and that leads to over-reliance on him, why is it that Harden and Luka, who hold the ball for 50% longer, dribble the ball significantly more per touch, play under a similar team setup, and have comparable usage rates, see their teams perform well in their absence?

AEnigma wrote:
lessthanjake wrote:Maybe it is. It’s less plausible, since he isn’t as ball-dominant, so other guys do get touches on the ball a good deal. Notably, you don’t hear NBA players talking about how it is an issue to get in rhythm playing alongside him, like you do regarding the really heliocentric ball-dominant guys. In fact, players talk the exact opposite about playing with Jokic. So it seems much less plausible.

Yet somehow his exit to the bench dwarfs the bench effects we see from “the really heliocentric ball-dominant guys” like Harden, Luka, Wall, Morant, Parker, DWill, Rose… Even Chris Paul and Steve Nash and peak helio Westbrook (which does need to be specified because for whatever mysterious reason the team survived his absences much better outside of that 2016-18 period…) see less of a drop-off than Steph Curry. Very strange, given this very real and very legitimate theory about how “helio” stars routinely ruin their benches.
:thinking:

But the better explanation is probably just that they’re not at all deep at the center position, so they’re often trotting out some pretty bad players when he’s off the court

Fascinating. I wonder if you might be onto something there: not being easily replaced means your team struggles more without you??? Going to need to double-check the numbers on that, because I was under the impression stars controlled their replacements and that having replacements incapable of recreating their effect was in fact a failing on them.

It's bullsh-t,

That is why you've been "mocked", not because you voted for Jordan(plenty of people did without facing "toxic" pushback), but because you present assumptions you cannot actually back-up as if they are obvious truths, questioned for centering your arguments around these assumptions, and instead of responding like a reasonable person, you lash out and accuse other of being toxic and "drinking their kool-aid":
But you’re really just repackaging preconceived notions into boldly-pronounced arguments that are based on absurdity, like that we can measure Jordan’s impact by comparing the results of two essentially completely different rosters.

This an example of projection.

It was after all, you who assumed that every team Russell has ever played on was stacked because a player who by 1969 was a 6th-man was voted in the top 100. You who assumed the Bulls were no better than the Heat in terms of help because the players at the top of the rosters were similar(while actively trying to dismiss health as a factor). And you who disregarded actual results without Jordan to confidently claim they were "bottom line: not a significant contender" because you had "watched enough to know their talent level" while apparently not picking up on how the triangle worked.

You have literally spent the last four threads spamming unsourced data while accusing others of statistical malpractice. You then had the audacity to demand an apology when you were rightly called out. If you have any self-awareness, I'd suggest you stop crying wolf. The backlash you've faced is on you, no one else.

PS: posters in the project have literally told me they were going to obscure their preference for peak as to not "spark controversy", so let me, on their behalf, say the quiet part out loud:
Yank3525 wrote:
Chanel Bomber wrote:It's something I've been thinking about as well.

We can reach the most well-informed level of debate, but the way we weigh those different criteria will always be somewhat arbitrary and tough to rationally arbitrate.

I would just say that - assuming a level playing field in terms of external circumstances - in the peak vs longevity argument (simplistically framing the 80% vs 50% example that you brought up), the value of longevity would be to enable the player to achieve as much if not more, but definitely not less.

And this is where people lose me with the longevity argument for LeBron. Because it's not met with the outcome that would be reasonably expected from it. It rings hollow.


Exactly. The longevity argument would hold more weight to me if LeBron was still cranking out years like 2019-20 instead of what he has done the last couple of years. He has basically become a part time player who has accumulated nice counting stats but doesn’t have the impact on winning that he once did.

Lebron is better at basketball, as the data(when it's not comically cherrypicked) suggests he is. Longevity isn't necessary. Lebron was better young, he was better old, and he was better at his best. Jordan's stock in PC has not fallen because kareem played extra-years. It's fallen because people are now more interesting in winning than subjective definitions of "production", and the idea that it was simply longevity that dragged Jordan down is just a misrepresentation on what's been happening over the last year. It wasn't longevity that has seen multiple people argue for Hakeem, Duncan, Kareem, and Russell on the basis of how good they were. So let's stop reducing this debate to a regurgitation of Zach Lowe. It wasn't just "lebron fans", it was a bunch of posters who like different players coming together and thinking about basketball a bit differently. Why don't we analyze that, instead of hitting the same beats as first things first.




All this crap on here and you said nothing substantial. “Lebron was better” lol thanks man good job
User avatar
homecourtloss
RealGM
Posts: 11,536
And1: 18,979
Joined: Dec 29, 2012

Re: What's the strongest data-driven argument for Michael Jordan as GOAT? 

Post#712 » by homecourtloss » Fri Jul 14, 2023 3:04 am

OhayoKD wrote:Will keep all this in GB as to not derail the project but uh...

You know what, let's start with some housekeeping
iggymcfrack wrote:
TheGOATRises007 wrote:
https://fivethirtyeight.com/features/lebron-or-mj-raptor-picks-the-best-nba-players-of-the-past-40-years/

Raptor actually grades Jordan over LeBron with a sizable gap for a 7 year stretch.


I need to bookmark this for when it’s time to argue for CP3 and Stockton in the PC project. It’s amazing how well Stockton fares in the little bit of advanced analytics we have for the past. Everything I see about him makes me like him more and more. I’m starting to wonder if maybe Stockton > Wilt is actually a reasonable take.

RAPTOR does not have on/off pre-97(and even then it's heavily informed by the prior years). IOW is functionally a box-metric for the Jordan and Stockton and as has been noted during the project repeatedly...
Bill Russell — the only defender definitively better than Hakeem — finished higher than ninth in PER once. His average WS/48 placement was 8th (much smaller league). If we want to use someone more directly comparable in style, 1998-2001 Duncan averaged 8th in WS/48, 6th in PER, and 12th in BPM. Insightful stuff.

Just because you can see a few one-number metrics easily accessible on basketball-reference does not mean they have any real merit in these comparisons. Again, toy with formulas enough and you can get all sorts of results. We just discussed that with Engelmann’s “xRAPM”. IBM set up a formula that awarded the crown to Barkley three years running from 1986-88 (just outstanding Barkley years huh) and then to Dennis Rodman in 1992. Or we could use PIPM, which is imo the best pure box metric because it does a better job of weighing the defensive aspects of box inputs. There we see Hakeem start out his career 8th - 2nd - 3rd - 3rd - 2nd - 3rd - 5th - 6th. That reads a lot more honestly to me, but ultimately we just get back to these all being formulas toying with a few inputs not actually substituting for real impact.

If you are not confident enough to assess a player outside of what you can read off a stat-sheet, you can say so without trying to poison the well against those who actually took the time to develop their assessments.

....the box-score is a fickle mistress.

I will get to the non-box(aka impact) data present for Jordan and Lebron below (spoiler: it mostly favors Lebron), but let's start with the lede:
TheGOATRises007 wrote:
Bergmaniac wrote:As someone who is neutral in the whole GOAT debate (and mostly bored by it) the Jordan fans has always been way worse than the LeBron fans on this board when it comes to being rude and dismissive of the opinions they disagree with. So many of them act like they are personally offended when someone dares to claim someone else is their GOAT and go on and on how it's impossible for anyone reasonable to hold such an opinion.


On the GB board, I agree with you. I disagree regarding the PC board.

Gregoire wrote: There are definitely a group of LeBron superfans that are toxically aggressive, and it gets tiring.

VanWest82 wrote:Over there it's less than 10% with those making pro Jordan arguments getting shouted down and enduring personal attacks

So is said. Yet, the first "personal" attack(not to be confused with your arguments being criticized/challenged) I can find comes from...a Jordan voter:
Also why even bother to quote me when you know I don't take you seriously as a poster anyway? Seems like you're just looking to muck up the project by starting meaningless arguments.

(note that when dquinn similarly challenged the criteria for voting someone for lebron on page 1, they were not personally attacked?)

It was then, another Jordan voter who said it would be a "bad look" if MJ was voted below Russell:
i kinda think it would be a bad look for the project if he drops to 4, making anyone from the outside looking in want to dismiss the whole thing. i suppose we can say we don't care about such things, but i'm not sure it furthers discussion if that happens. but we shall see i guess.


And it was someone who prefers Jordan who decided to try and derail a thread by bringing up personal beef(after a rather embarrasing apology demand we'll get to later):
I'm not sure you're going to get it with this poster. I post this not to bring back old baggage, but to offer a bit of support -- You're not the only one who's struggled with this poster. Four posters including me left the latest Peaks project after verbal abuse and bad-faith arguments from this guy, and I would hate for people to start leaving this new project for the same reason.

Notably the poster above, has outright lied about what the poster they "struggled" with claimed in order to potray them as dishonest:
https://forums.realgm.com/boards/viewtopic.php?p=104349742#p104349742
Why?

Possibly to deflect from their calculations of playoff on/off swinging towards Lebron with a more accurate process and several posters realizing they had been "cooking"(chucking out inconvenient years, inconsistent process, ect)numbers to make Jordan look as good:
https://forums.realgm.com/boards/viewtopic.php?p=104353387#p104353387
While various posters have complained I(and the "mob") were being unfair, none have bothered trying to challenge this account of events despite me offering them multiple opportunities to do so

That lie above seems to have spread with gregoire citing it as he has yet another meltdown spurned by Jordan being evaluated differently:
Gregoire wrote:try to falsificate plus-minus metrics (some of them claimed that manually counted 500 MJ games only to prove that Lebron is better lol

It was of course angry jordan fans who flooded the thread I'm referencing largely contributing to it getting locked, and it's jordan fans again who've jumped into the ongoing project complaining about anti-mj bias.

And then, last but not least...
lessthanjake wrote:Lol, just read through the last few pages of this thread, and was surprised to find that my own posts in the PC board and the responses to them were discussed a bunch in this thread. My two cents is somewhere in the middle to be honest: I do actually think that there’s a lot of hostility on the PC board to anything that is perceived as pro-Jordan or anti-LeBron. I think I can reasonably say that I myself experienced quite a bit of hostility—including a boatload of sarcastic and mocking remarks—while arguing in favor of Jordan (though I won’t claim to be a delicate flower that doesn’t punch back).

Hmmm.


Well you certainly throw punches:
Spoiler:
LessthanJake wrote:It might behoove you to step back for a moment and question whether there might actually be something to learn here from someone with more knowledge than you.

Among other things, I also come at this discussion as an independently knowledgeable person about Jordan’s career and his teams, since I watched virtually all their games, and I’ve tried to impart some of that knowledge on you, which you obviously do not independently have.

It’s all just completely motivated-reasoning-based data manipulation, and, again, you surely realize that.

I don’t really feel the need to go through all this, because there’s a lot of obviously motivated reasoning going on here that I don’t think many people would find persuasive on its face, but just a few minor things

I don’t really feel the need to go through all this, because there’s a lot of obviously motivated reasoning going on here that I don’t think many people would find persuasive on its face, but just a few minor things:

And even on the separate question about what is more impressive in some sense, the point being made was applying this logic even to Tim Duncan’s Spurs, who did not play in a less-expanded league (the opposite actually). The argument that Duncan’s Spurs were as dominant a team as Jordan’s Bulls despite having worse records and lesser SRS, because of some argument about standard deviations is pretty obviously just motivated reasoning.

To be clear(on that last quote)...

A. that was not what was argued
Spoiler:
Nor does this
lessthanjake wrote:
Yes, I think that’s right. Though what I was objecting to specifically is the idea that that method would better identify likelihood of winning a championship than simply looking at SRS (as opposed to standard deviations above the mean in SRS) would. I assumed that the baseline factual claim—i.e. that Duncan’s Spurs were as many or more standard deviations above the mean as Jordan’s Bulls—was correct, without checking if that was actually true, so the discussion was not about the validity of the underlying factual claim.

Yeah, why don't we check that:
The "chance" Duncan was given in season 2 was probably smaller than what Jordan was given for most or all of his championships. And with that "chance", the Spurs went 16-4 and obliterated the competition(by san's std they were actually a bigger outlier than all but 2 Chicago sides).

'
All in all, I think then,it can be reasonably argued prime Duncan
...
-> Led two dominant teams(statistically better than most of the Bulls if you go by standard deviation(more relevant to winning championships than "srs")), one was probably with less
-> Led a third team not too far behind in 2005(not sure what "help" is there but there's still no Pippen equivalent) with less

Yeah, I don't know about you, but that does not read to me like a blanket comparison between Jordan's Bulls and Duncan's Spurs. Rather one between specific teams from that run.

Probably could have worded that less definitively, (San's playoff-heavy method is hardly the only approach) but I do think these claims count as "reasonably arguable"...


B. turns out standard deviations do correlate more with championships than srs
Spoiler:
[spoiler]
Let's look it it the other way, using data from 1984 onward. Here are the mean/median Z scores for champs and non-champs in different SRS bands.

SRS between 2 and 4:

Non-champs: Median 0.64, Mean 0.65 (163 teams)
Champs: Median 0.85, Mean 0.76 (3 teams)

SRS between 4 and 6:

Non-champs: Median 1.06, Mean 1.07 (102 teams)
Champs: Median 1.16, Mean 1.14 (13 Teams)

SRS between 6 and 8:

Non-champs: Median 1.44, Mean 1.48 (69 teams)
Champs: Median 1.60, Mean 1.58 (12 Teams)

SRS above 8:

Non-champs: Median 1.91, Mean 1.92 (11 teams)
Champs: Median 2.09, Mean 2.08 (11 teams)

Within each of these bands, the Z scores are higher for the champs than the non-champs.

What's more, building a model (logistic regression) for a title using either SRS or the Z score, the model with the Z score is more predictive (AIC of 120.07 vs 109.7 using SRS vs Z score for all league history, and AIC of 11.25 vs 8.09 using SRS vs Z score for teams since 1984).

Gap increases using pre-84


I bring this up because in the very next thread...
Spoiler:
lessthanjake wrote: The idea that you’d accuse me of having “sample size only matter[] when you want it to matter” is actually completely insane and wildly dishonest. You need to apologize for this promptly, otherwise it is clear you are discussing in bad faith.

Unbelievable. The fact that you can’t bring yourself to apologize for making such a blatantly dishonest attack is breathtaking. You don’t even address it, but instead go on some free-wheeling subject-changing maneuver to justify your stated refusal to apologize for something that was plainly unacceptable behavior. You were straightforwardly dishonest. Just apologize. It shouldn’t be difficult to do so if you’re a remotely mature human being. And it’s actually substantively important to discussion like this project that you apologize for behavior like this, because it will genuinely discourage people from honestly identifying appropriate caveats to the points they make and the evidence they provide if they know that people will just dishonestly attack them as if they did not do so. And that will ultimately lower the standard of discussion.

And you’re right—there was one particular part of your post I didn’t respond to. And that’s because all you did was take out half the data that was most inconvenient to you and then still only got to a conclusion that you yourself admitted wasn’t even particularly flattering for Hakeem. It was not something I felt the need to respond to, because I didn’t feel it made even a facially persuasive point.

Yeah... I think "delicate" might be an understatement. Also, despite Jake's demands for an apology, I think Anenigma is on the money:
anenigma wrote:Cool, let me know when you feel like apologising for actually doing everything that you just projected onto me. No interest in staying on subject, showing all data rather than manipulated snippets, properly responding to points made, maintaining honest portrayals, advancing sincere engagement, responding with any real maturity… So much for high standards of discussion. :-?

Let's start with the bold bit as, per usual, there are picks and cherries:
Jordan is a lot lower in raw WOWY (32nd in prime WOWY), so that’s one data point that doesn’t support his case—basically the only one.

Yeah, no.

Summary, extended samples, aka, "metrics"
Spoiler:
There is very little of what "you" consider impact data but of course you are rather inconsistent with what is "good-faith" running with WOWYR while complaining about much more straightforward "adjustments". As it so happens, "WOWYR" is the one and only type of metric Jordan actually looks "good" in relative to Lebron. Using your standard of what counts as an impact metric we have:

-> Playoff on/off
(Lebron looks better)
-> On/off
(Lebron looks better, 97/98 rank below 17 and 18 lebron years respectively)
-> On+ON/off
(Lebron looks better, 2nd and 5th best regular season teams rank 8th and 11th respectively)
-> WOWY
(Lebron looks better, and Jordan ranks 4th amongst his contemporaries, literally does not matter what you use)
-> Indirect samples(what eminece outlined in the #2), Lebron looks much better
-> AUPM
(Lebron looks better with the exception of 3-year consecutive where MJ is a bit behind Duncan)
-> Squared RAPM
(Lebron looks better in the same set)
-> Full RAPM
(Lebron looks signficantly better with the potential for Jordan to close in if his early years score better(there is reason to think they might not))

Larger Samples, multi-year extraps(something WOWYR and its variant also use but in a much noisier manner with tiny samples of "off"), one-year indirect
Spoiler:
Why don't we start with the large-samples which are not "limited"? We have 82 games telling us the Bulls were a 28-win team in 1984. They were a 30-win team without Jordan in 1986. That team then got better with additions such as oakley. And then, pre-triangle(a scheme that turned the Bulls from a good offense and a below average defense into a historically great offense and a -3 defense over the course of a season), the Bulls peaked at +2 at a 52 or 53-win pace(ben takes the former, eballa extrapolates the latter based on a 28 game-sample after a trade). If we unreasonably give Jordan all the credit for that improvement, out 82-game off sample leaves Jordan at +8.

That is

-> lower lift on a weaker team than what Kareem manages on the 77 Lakers if we ignore they traded significant pieces for Kareem.
-> within range of several kareem marks including when he was a rookie
(note the gap actually becomes quite large when you consider that srs was suppressed post-merger and championship teams of that era ranged from +4 to +6)
-> Dramatically worse than what we see with the same starting rotation(without Lebron) from the 11 cavs who go from 61-wins to 18-wins over a 21-game sample(19-win for the season overall), and worse than what we see over substantial without samples in the second cleveland stint(the most favorable appraisal of the cast gets lebron taking a 30-win team to 60 in the regular season) and small ones for Miami(40 to 60)
-> on the level of what we see with 69 Bill Russell, a retiring player-coach who had just knocked out a gauntlet tougher than any Micheal faced on the Bulls(marginally better replacement fwiw)
-> On par with what we see with 04/05 Duncan through injuries over a substantial sample(he would win a championship in 03 with a weaker version of what he had in 05)
-> On par with multiple substantial samples for Hakeem
-> Worse than substantial samples for Magic


We can also use 94 and 95 where the Bulls(with us inflating 93 by using the bulls full-strength rs-rating) only drop by 5.3-points in the regular season and 2-points in the playoffs making for an overall drop of "5" for the season(this is similar to what we get for 08/09 kg).

We can also look to 86 and 95 and the result remains the same. Over the largest possible samples, Jordan does not look on par with the other major Goat-candidates(Lebron, Russell, Kareem). He also does not gain separation over players he is assumed to be better than including his own contemporaries(magic is consistently advantaged, and Hakeem is competitive in the rs to go along with nigh unrivalled team-level playoff elevation)

If you are looking for the largest sample-sizes, that is what we have. I can make the most favorable assumptions and Jordan still does not reach the same heights as players he is considered comparable with. That is the lede. If you are going to "summarize all the available impact data for Jordan", there's no reason to bury it.

MJ:
'84 to '85 Bulls (w/MJ): +11 wins
'93 (w/MJ) to '94 Bulls: +4 wins
'95 (pre/MJ) to '95 Bulls (w/MJ): +20
LeBron:
'03 to '04 Cavs (w/LeBron): +17 wins
'10 (w/LeBron) to '11 Cavs: +46 wins
'10 to '11 Heat (w/LeBron): +12 wins
'14 (w/LeBron) to '15 Heat: +18 wins
'14 to '15 Cavs (w/LeBron): +26 wins
'18 (w/LeBron) to '19 Cavs: +31 wins
'18 to '19 Lakers (w/LeBron): +7 wins

As has repeatedly established(and ignored by those with positions the evidence is inconvenient for), non-box data consistently favors Lebron. In fact, with larger and more inclusive samples, Lebron's best signals look even better while Jordan's look worse. Even with WOWYR(which excludes much of the data for Lebron listed above and compounds a sample-size issue by tossing around noisy adjustments across the duration of a player's prime(EX: Bulls without 89 Pippen factors largely into how we assess 91 Pippen)), "Jordan being ahead of every goat candidate" is misleading. His advantage over Kareem is questionable as corrected data puts Kareem's WOWY ahead(the base of WOWYR). His advantage over Russell is entirely a matter of framing(do you value championships or srs?) as WOWYR suggests that Russell's 11 championships came with average help.

There is no non-box stat which generally puts Jordan #1, even that where Jordan is competing against a narrowed field(almost no on/off pre-97 for any player besides Micheal). Those unsourced github-sets(something Lessthanjake has refused to acknowledge while calling whatever doesn't favor Jordan "flawed") only really advantages Jordan over the likes of Malone and of course full RAPM for 97/98 places him well below Lebron under the same creators:
Image
(Cheema, the spax)

(keep in mind, it is not clear that second-three-peat MJ's "empirical impact" is lower than first-three peat MJ with the snippets we have arguably favoring the latter years). Lebron also dominates a field of players who can scale favorably against Mike when we use data that is available for both(Duncan, KG, Steph, Shaq, ect):
James is, arguably, the king of overall plus-minus stats. 2018 is the 25th season of league-wide plus-minus data, which covers nearly 40 percent of the shot-clock era and touches 12 of the top-20 players on this list. None have achieved LeBron’s heights: He holds four of the top-five scaled APM seasons on record, and six of the top eight. Since 2007, 10 of his 11 years land in the 99th percentile.

(scaled-apm, JE)

Among contemporaries, prime Magic(mostly excluded in the data less-than-jake pulled up), is consistently advantaged. So is David-Robinson(pollock, wowy, wowyr, concentrated), with Hakeem mostly grading out as a regular-season peer(and then sees the most team-level elevation in the postseason of anyone from that time-period).


Lessthanjake, bastion of fairness and honesty, repeatedly seeing this sort of non-box data, found various excuses to disregard it(and then attack the credibility of those who provided it) on grounds that were better applied to a metric they preferred(WOWYR):
Spoiler:
It’s logical extrapolation using basically completely unrelated data points and trying to draw a meaningful conclusion from them. You might as well pick a completely random team that you happen to think is similarly or less talented than the Bulls without Jordan and look at their wins/SRS and say that the difference is the upper bound of Jordan’s impact.

They are different rosters in that the 88 Bulls were better. I am comfortable giving Jordan an unfair amount of credit, yes. You quite literally just threw out 3-variants of WOWYR, a metric which uses 89 Pippen to assess 91 Pippen as a teammate. This is what "impact" comparisons pre-97 is. You look at the with, you look at the without, and you use your understanding of the game to try and adjust for context and isolate variables. The difference here is I'm using 82-games while WOWYR is using either a handful(overall) or 8 games a season(it is not at all clear what is counted and Ben explicitly makes a distinction between 1994 and "wowy") for one guy and 2.2 games for another(that's bill russell) and THEN decides to make "corrections" based on the 3 games a teammate missed 5 years back or the 70 games another teammate missed 10 years forward. This is why Ben explcitly outlines using "shorter-time periods" as a solution and then uses "shorter-time periods" as the backbone of his impact write-ups.

WOWYR is extrapolated from "WOWY" utilizing "corrections" that are then applied to every year within a stretch. IOW. They take teammate "impact" from different versions of a team, multiple years removed from each other(the thing you said was "useless!", "ridiculous!", and "bad-faith!") to "correct" WOWY. The difference being the sample here is much smaller than with the extrapolations you didn't like for that same reason.

Whether it's deliberate, or simply a byproduct of jake mostly having no clue what they're talking about...
AEnigma wrote:
lessthanjake wrote:“More variance” is otherwise known as a higher parity environment where the best teams win a smaller percent of their games.

No, variance here is essentially referring to how low scores and deeper rosters present easier opportunities for an upset. Three NHL teams have won more than sixty times across an eighty-two season; none of them won the cup nor even made the Finals. NHL and MLB teams could win the same percentage and be innately far more prone to upsets than NBA teams. And it is weird you would try to drag in a baseball analogy without understanding that on a base level.

...the end-result is often an nonsensical jumbling of buzz-words, silly assumptions, and implicit or explicit hostility disguised with relatively sophisticated sentence structure.

For example...
lessthanjake wrote:Actually kind of yeah. I’ve explained this a bit above. But if you have a really ball-dominant style, when you go to the bench, your teammates will have not really touched the ball much in the flow of the offense beyond just finishing some plays with a shot, so they’ll be less in rhythm overall.

Wow you "explained" this, huh. Gee, thanks! Just one problem:
Spoiler:
letskissbro wrote:Both Luka Doncic and James Harden have actually played under the exaggerated caricature that people have made "LeBron ball" out to be, yet their teams fare significantly better when they're off the court than LeBron's teams and often field positive net ratings with their stars on the bench.

LeBron's average times of possession in that 16-21 stretch from Ben's graph (minus 2019): 5.3, 6.4, 6.7, 7.4, 6.4
Corresponding net ratings for those teams while he sat: -4.3, -8.8, -0.5, -0.9, -2.0

Luka (20-23): 8.9, 8.9, 9.3, 9.1
Off NetRtg: +4.2, +0.3, +3.4, -2.7

Harden (17-23): 9.3, 8.8, 9.3, 8.6, 8.6, 9.2, 8.6
Off NetRtg: +3.7, +5.2, +1.1, -3.4, +2.4, +1.2, +2.8

If LeBron spends approximately six seconds with the ball in his hands, and that leads to over-reliance on him, why is it that Harden and Luka, who hold the ball for 50% longer, dribble the ball significantly more per touch, play under a similar team setup, and have comparable usage rates, see their teams perform well in their absence?

AEnigma wrote:
lessthanjake wrote:Maybe it is. It’s less plausible, since he isn’t as ball-dominant, so other guys do get touches on the ball a good deal. Notably, you don’t hear NBA players talking about how it is an issue to get in rhythm playing alongside him, like you do regarding the really heliocentric ball-dominant guys. In fact, players talk the exact opposite about playing with Jokic. So it seems much less plausible.

Yet somehow his exit to the bench dwarfs the bench effects we see from “the really heliocentric ball-dominant guys” like Harden, Luka, Wall, Morant, Parker, DWill, Rose… Even Chris Paul and Steve Nash and peak helio Westbrook (which does need to be specified because for whatever mysterious reason the team survived his absences much better outside of that 2016-18 period…) see less of a drop-off than Steph Curry. Very strange, given this very real and very legitimate theory about how “helio” stars routinely ruin their benches.
:thinking:

But the better explanation is probably just that they’re not at all deep at the center position, so they’re often trotting out some pretty bad players when he’s off the court

Fascinating. I wonder if you might be onto something there: not being easily replaced means your team struggles more without you??? Going to need to double-check the numbers on that, because I was under the impression stars controlled their replacements and that having replacements incapable of recreating their effect was in fact a failing on them.

It's bullsh-t,

That is why you've been "mocked", not because you voted for Jordan(plenty of people did without facing "toxic" pushback), but because you present assumptions you cannot actually back-up as if they are obvious truths, questioned for centering your arguments around these assumptions, and instead of responding like a reasonable person, you lash out and accuse other of being toxic and "drinking their kool-aid":
But you’re really just repackaging preconceived notions into boldly-pronounced arguments that are based on absurdity, like that we can measure Jordan’s impact by comparing the results of two essentially completely different rosters.

This an example of projection.

It was after all, you who assumed that every team Russell has ever played on was stacked because a player who by 1969 was a 6th-man was voted in the top 100. You who assumed the Bulls were no better than the Heat in terms of help because the players at the top of the rosters were similar(while actively trying to dismiss health as a factor). And you who disregarded actual results without Jordan to confidently claim they were "bottom line: not a significant contender" because you had "watched enough to know their talent level" while apparently not picking up on how the triangle worked.

You have literally spent the last four threads spamming unsourced data while accusing others of statistical malpractice. You then had the audacity to demand an apology when you were rightly called out. If you have any self-awareness, I'd suggest you stop crying wolf. The backlash you've faced is on you, no one else.

PS: posters in the project have literally told me they were going to obscure their preference for peak as to not "spark controversy", so let me, on their behalf, say the quiet part out loud:
Yank3525 wrote:
Chanel Bomber wrote:It's something I've been thinking about as well.

We can reach the most well-informed level of debate, but the way we weigh those different criteria will always be somewhat arbitrary and tough to rationally arbitrate.

I would just say that - assuming a level playing field in terms of external circumstances - in the peak vs longevity argument (simplistically framing the 80% vs 50% example that you brought up), the value of longevity would be to enable the player to achieve as much if not more, but definitely not less.

And this is where people lose me with the longevity argument for LeBron. Because it's not met with the outcome that would be reasonably expected from it. It rings hollow.


Exactly. The longevity argument would hold more weight to me if LeBron was still cranking out years like 2019-20 instead of what he has done the last couple of years. He has basically become a part time player who has accumulated nice counting stats but doesn’t have the impact on winning that he once did.

Lebron is better at basketball, as the data(when it's not comically cherrypicked) suggests he is. Longevity isn't necessary. Lebron was better young, he was better old, and he was better at his best. Jordan's stock in PC has not fallen because kareem played extra-years. It's fallen because people are now more interesting in winning than subjective definitions of "production", and the idea that it was simply longevity that dragged Jordan down is just a misrepresentation on what's been happening over the last year. It wasn't longevity that has seen multiple people argue for Hakeem, Duncan, Kareem, and Russell on the basis of how good they were. So let's stop reducing this debate to a regurgitation of Zach Lowe. It wasn't just "lebron fans", it was a bunch of posters who like different players coming together and thinking about basketball a bit differently. Why don't we analyze that, instead of hitting the same beats as first things first.


Great post.
lessthanjake wrote:Kyrie was extremely impactful without LeBron, and basically had zero impact whatsoever if LeBron was on the court.

lessthanjake wrote: By playing in a way that prevents Kyrie from getting much impact, LeBron ensures that controlling for Kyrie has limited effect…
MavsDirk41
Lead Assistant
Posts: 5,925
And1: 4,578
Joined: Dec 07, 2022
     

Re: What's the strongest data-driven argument for Michael Jordan as GOAT? 

Post#713 » by MavsDirk41 » Fri Jul 14, 2023 3:07 am

homecourtloss wrote:
OhayoKD wrote:Will keep all this in GB as to not derail the project but uh...

You know what, let's start with some housekeeping
iggymcfrack wrote:
I need to bookmark this for when it’s time to argue for CP3 and Stockton in the PC project. It’s amazing how well Stockton fares in the little bit of advanced analytics we have for the past. Everything I see about him makes me like him more and more. I’m starting to wonder if maybe Stockton > Wilt is actually a reasonable take.

RAPTOR does not have on/off pre-97(and even then it's heavily informed by the prior years). IOW is functionally a box-metric for the Jordan and Stockton and as has been noted during the project repeatedly...
Bill Russell — the only defender definitively better than Hakeem — finished higher than ninth in PER once. His average WS/48 placement was 8th (much smaller league). If we want to use someone more directly comparable in style, 1998-2001 Duncan averaged 8th in WS/48, 6th in PER, and 12th in BPM. Insightful stuff.

Just because you can see a few one-number metrics easily accessible on basketball-reference does not mean they have any real merit in these comparisons. Again, toy with formulas enough and you can get all sorts of results. We just discussed that with Engelmann’s “xRAPM”. IBM set up a formula that awarded the crown to Barkley three years running from 1986-88 (just outstanding Barkley years huh) and then to Dennis Rodman in 1992. Or we could use PIPM, which is imo the best pure box metric because it does a better job of weighing the defensive aspects of box inputs. There we see Hakeem start out his career 8th - 2nd - 3rd - 3rd - 2nd - 3rd - 5th - 6th. That reads a lot more honestly to me, but ultimately we just get back to these all being formulas toying with a few inputs not actually substituting for real impact.

If you are not confident enough to assess a player outside of what you can read off a stat-sheet, you can say so without trying to poison the well against those who actually took the time to develop their assessments.

....the box-score is a fickle mistress.

I will get to the non-box(aka impact) data present for Jordan and Lebron below (spoiler: it mostly favors Lebron), but let's start with the lede:
TheGOATRises007 wrote:
On the GB board, I agree with you. I disagree regarding the PC board.

Gregoire wrote: There are definitely a group of LeBron superfans that are toxically aggressive, and it gets tiring.

VanWest82 wrote:Over there it's less than 10% with those making pro Jordan arguments getting shouted down and enduring personal attacks

So is said. Yet, the first "personal" attack(not to be confused with your arguments being criticized/challenged) I can find comes from...a Jordan voter:
Also why even bother to quote me when you know I don't take you seriously as a poster anyway? Seems like you're just looking to muck up the project by starting meaningless arguments.

(note that when dquinn similarly challenged the criteria for voting someone for lebron on page 1, they were not personally attacked?)

It was then, another Jordan voter who said it would be a "bad look" if MJ was voted below Russell:
i kinda think it would be a bad look for the project if he drops to 4, making anyone from the outside looking in want to dismiss the whole thing. i suppose we can say we don't care about such things, but i'm not sure it furthers discussion if that happens. but we shall see i guess.


And it was someone who prefers Jordan who decided to try and derail a thread by bringing up personal beef(after a rather embarrasing apology demand we'll get to later):
I'm not sure you're going to get it with this poster. I post this not to bring back old baggage, but to offer a bit of support -- You're not the only one who's struggled with this poster. Four posters including me left the latest Peaks project after verbal abuse and bad-faith arguments from this guy, and I would hate for people to start leaving this new project for the same reason.

Notably the poster above, has outright lied about what the poster they "struggled" with claimed in order to potray them as dishonest:
https://forums.realgm.com/boards/viewtopic.php?p=104349742#p104349742
Why?

Possibly to deflect from their calculations of playoff on/off swinging towards Lebron with a more accurate process and several posters realizing they had been "cooking"(chucking out inconvenient years, inconsistent process, ect)numbers to make Jordan look as good:
https://forums.realgm.com/boards/viewtopic.php?p=104353387#p104353387
While various posters have complained I(and the "mob") were being unfair, none have bothered trying to challenge this account of events despite me offering them multiple opportunities to do so

That lie above seems to have spread with gregoire citing it as he has yet another meltdown spurned by Jordan being evaluated differently:
Gregoire wrote:try to falsificate plus-minus metrics (some of them claimed that manually counted 500 MJ games only to prove that Lebron is better lol

It was of course angry jordan fans who flooded the thread I'm referencing largely contributing to it getting locked, and it's jordan fans again who've jumped into the ongoing project complaining about anti-mj bias.

And then, last but not least...
lessthanjake wrote:Lol, just read through the last few pages of this thread, and was surprised to find that my own posts in the PC board and the responses to them were discussed a bunch in this thread. My two cents is somewhere in the middle to be honest: I do actually think that there’s a lot of hostility on the PC board to anything that is perceived as pro-Jordan or anti-LeBron. I think I can reasonably say that I myself experienced quite a bit of hostility—including a boatload of sarcastic and mocking remarks—while arguing in favor of Jordan (though I won’t claim to be a delicate flower that doesn’t punch back).

Hmmm.


Well you certainly throw punches:
Spoiler:
LessthanJake wrote:It might behoove you to step back for a moment and question whether there might actually be something to learn here from someone with more knowledge than you.

Among other things, I also come at this discussion as an independently knowledgeable person about Jordan’s career and his teams, since I watched virtually all their games, and I’ve tried to impart some of that knowledge on you, which you obviously do not independently have.

It’s all just completely motivated-reasoning-based data manipulation, and, again, you surely realize that.

I don’t really feel the need to go through all this, because there’s a lot of obviously motivated reasoning going on here that I don’t think many people would find persuasive on its face, but just a few minor things

I don’t really feel the need to go through all this, because there’s a lot of obviously motivated reasoning going on here that I don’t think many people would find persuasive on its face, but just a few minor things:

And even on the separate question about what is more impressive in some sense, the point being made was applying this logic even to Tim Duncan’s Spurs, who did not play in a less-expanded league (the opposite actually). The argument that Duncan’s Spurs were as dominant a team as Jordan’s Bulls despite having worse records and lesser SRS, because of some argument about standard deviations is pretty obviously just motivated reasoning.

To be clear(on that last quote)...

A. that was not what was argued
Spoiler:
Nor does this

Yeah, why don't we check that:


Yeah, I don't know about you, but that does not read to me like a blanket comparison between Jordan's Bulls and Duncan's Spurs. Rather one between specific teams from that run.

Probably could have worded that less definitively, (San's playoff-heavy method is hardly the only approach) but I do think these claims count as "reasonably arguable"...


B. turns out standard deviations do correlate more with championships than srs
Spoiler:
[spoiler]
Let's look it it the other way, using data from 1984 onward. Here are the mean/median Z scores for champs and non-champs in different SRS bands.

SRS between 2 and 4:

Non-champs: Median 0.64, Mean 0.65 (163 teams)
Champs: Median 0.85, Mean 0.76 (3 teams)

SRS between 4 and 6:

Non-champs: Median 1.06, Mean 1.07 (102 teams)
Champs: Median 1.16, Mean 1.14 (13 Teams)

SRS between 6 and 8:

Non-champs: Median 1.44, Mean 1.48 (69 teams)
Champs: Median 1.60, Mean 1.58 (12 Teams)

SRS above 8:

Non-champs: Median 1.91, Mean 1.92 (11 teams)
Champs: Median 2.09, Mean 2.08 (11 teams)

Within each of these bands, the Z scores are higher for the champs than the non-champs.

What's more, building a model (logistic regression) for a title using either SRS or the Z score, the model with the Z score is more predictive (AIC of 120.07 vs 109.7 using SRS vs Z score for all league history, and AIC of 11.25 vs 8.09 using SRS vs Z score for teams since 1984).

Gap increases using pre-84


I bring this up because in the very next thread...
Spoiler:
lessthanjake wrote: The idea that you’d accuse me of having “sample size only matter[] when you want it to matter” is actually completely insane and wildly dishonest. You need to apologize for this promptly, otherwise it is clear you are discussing in bad faith.

Unbelievable. The fact that you can’t bring yourself to apologize for making such a blatantly dishonest attack is breathtaking. You don’t even address it, but instead go on some free-wheeling subject-changing maneuver to justify your stated refusal to apologize for something that was plainly unacceptable behavior. You were straightforwardly dishonest. Just apologize. It shouldn’t be difficult to do so if you’re a remotely mature human being. And it’s actually substantively important to discussion like this project that you apologize for behavior like this, because it will genuinely discourage people from honestly identifying appropriate caveats to the points they make and the evidence they provide if they know that people will just dishonestly attack them as if they did not do so. And that will ultimately lower the standard of discussion.

And you’re right—there was one particular part of your post I didn’t respond to. And that’s because all you did was take out half the data that was most inconvenient to you and then still only got to a conclusion that you yourself admitted wasn’t even particularly flattering for Hakeem. It was not something I felt the need to respond to, because I didn’t feel it made even a facially persuasive point.

Yeah... I think "delicate" might be an understatement. Also, despite Jake's demands for an apology, I think Anenigma is on the money:
anenigma wrote:Cool, let me know when you feel like apologising for actually doing everything that you just projected onto me. No interest in staying on subject, showing all data rather than manipulated snippets, properly responding to points made, maintaining honest portrayals, advancing sincere engagement, responding with any real maturity… So much for high standards of discussion. :-?

Let's start with the bold bit as, per usual, there are picks and cherries:
Jordan is a lot lower in raw WOWY (32nd in prime WOWY), so that’s one data point that doesn’t support his case—basically the only one.

Yeah, no.

Summary, extended samples, aka, "metrics"
Spoiler:
There is very little of what "you" consider impact data but of course you are rather inconsistent with what is "good-faith" running with WOWYR while complaining about much more straightforward "adjustments". As it so happens, "WOWYR" is the one and only type of metric Jordan actually looks "good" in relative to Lebron. Using your standard of what counts as an impact metric we have:

-> Playoff on/off
(Lebron looks better)
-> On/off
(Lebron looks better, 97/98 rank below 17 and 18 lebron years respectively)
-> On+ON/off
(Lebron looks better, 2nd and 5th best regular season teams rank 8th and 11th respectively)
-> WOWY
(Lebron looks better, and Jordan ranks 4th amongst his contemporaries, literally does not matter what you use)
-> Indirect samples(what eminece outlined in the #2), Lebron looks much better
-> AUPM
(Lebron looks better with the exception of 3-year consecutive where MJ is a bit behind Duncan)
-> Squared RAPM
(Lebron looks better in the same set)
-> Full RAPM
(Lebron looks signficantly better with the potential for Jordan to close in if his early years score better(there is reason to think they might not))

Larger Samples, multi-year extraps(something WOWYR and its variant also use but in a much noisier manner with tiny samples of "off"), one-year indirect
Spoiler:
Why don't we start with the large-samples which are not "limited"? We have 82 games telling us the Bulls were a 28-win team in 1984. They were a 30-win team without Jordan in 1986. That team then got better with additions such as oakley. And then, pre-triangle(a scheme that turned the Bulls from a good offense and a below average defense into a historically great offense and a -3 defense over the course of a season), the Bulls peaked at +2 at a 52 or 53-win pace(ben takes the former, eballa extrapolates the latter based on a 28 game-sample after a trade). If we unreasonably give Jordan all the credit for that improvement, out 82-game off sample leaves Jordan at +8.

That is

-> lower lift on a weaker team than what Kareem manages on the 77 Lakers if we ignore they traded significant pieces for Kareem.
-> within range of several kareem marks including when he was a rookie
(note the gap actually becomes quite large when you consider that srs was suppressed post-merger and championship teams of that era ranged from +4 to +6)
-> Dramatically worse than what we see with the same starting rotation(without Lebron) from the 11 cavs who go from 61-wins to 18-wins over a 21-game sample(19-win for the season overall), and worse than what we see over substantial without samples in the second cleveland stint(the most favorable appraisal of the cast gets lebron taking a 30-win team to 60 in the regular season) and small ones for Miami(40 to 60)
-> on the level of what we see with 69 Bill Russell, a retiring player-coach who had just knocked out a gauntlet tougher than any Micheal faced on the Bulls(marginally better replacement fwiw)
-> On par with what we see with 04/05 Duncan through injuries over a substantial sample(he would win a championship in 03 with a weaker version of what he had in 05)
-> On par with multiple substantial samples for Hakeem
-> Worse than substantial samples for Magic


We can also use 94 and 95 where the Bulls(with us inflating 93 by using the bulls full-strength rs-rating) only drop by 5.3-points in the regular season and 2-points in the playoffs making for an overall drop of "5" for the season(this is similar to what we get for 08/09 kg).

We can also look to 86 and 95 and the result remains the same. Over the largest possible samples, Jordan does not look on par with the other major Goat-candidates(Lebron, Russell, Kareem). He also does not gain separation over players he is assumed to be better than including his own contemporaries(magic is consistently advantaged, and Hakeem is competitive in the rs to go along with nigh unrivalled team-level playoff elevation)

If you are looking for the largest sample-sizes, that is what we have. I can make the most favorable assumptions and Jordan still does not reach the same heights as players he is considered comparable with. That is the lede. If you are going to "summarize all the available impact data for Jordan", there's no reason to bury it.

MJ:
'84 to '85 Bulls (w/MJ): +11 wins
'93 (w/MJ) to '94 Bulls: +4 wins
'95 (pre/MJ) to '95 Bulls (w/MJ): +20
LeBron:
'03 to '04 Cavs (w/LeBron): +17 wins
'10 (w/LeBron) to '11 Cavs: +46 wins
'10 to '11 Heat (w/LeBron): +12 wins
'14 (w/LeBron) to '15 Heat: +18 wins
'14 to '15 Cavs (w/LeBron): +26 wins
'18 (w/LeBron) to '19 Cavs: +31 wins
'18 to '19 Lakers (w/LeBron): +7 wins

As has repeatedly established(and ignored by those with positions the evidence is inconvenient for), non-box data consistently favors Lebron. In fact, with larger and more inclusive samples, Lebron's best signals look even better while Jordan's look worse. Even with WOWYR(which excludes much of the data for Lebron listed above and compounds a sample-size issue by tossing around noisy adjustments across the duration of a player's prime(EX: Bulls without 89 Pippen factors largely into how we assess 91 Pippen)), "Jordan being ahead of every goat candidate" is misleading. His advantage over Kareem is questionable as corrected data puts Kareem's WOWY ahead(the base of WOWYR). His advantage over Russell is entirely a matter of framing(do you value championships or srs?) as WOWYR suggests that Russell's 11 championships came with average help.

There is no non-box stat which generally puts Jordan #1, even that where Jordan is competing against a narrowed field(almost no on/off pre-97 for any player besides Micheal). Those unsourced github-sets(something Lessthanjake has refused to acknowledge while calling whatever doesn't favor Jordan "flawed") only really advantages Jordan over the likes of Malone and of course full RAPM for 97/98 places him well below Lebron under the same creators:
Image
(Cheema, the spax)

(keep in mind, it is not clear that second-three-peat MJ's "empirical impact" is lower than first-three peat MJ with the snippets we have arguably favoring the latter years). Lebron also dominates a field of players who can scale favorably against Mike when we use data that is available for both(Duncan, KG, Steph, Shaq, ect):
James is, arguably, the king of overall plus-minus stats. 2018 is the 25th season of league-wide plus-minus data, which covers nearly 40 percent of the shot-clock era and touches 12 of the top-20 players on this list. None have achieved LeBron’s heights: He holds four of the top-five scaled APM seasons on record, and six of the top eight. Since 2007, 10 of his 11 years land in the 99th percentile.

(scaled-apm, JE)

Among contemporaries, prime Magic(mostly excluded in the data less-than-jake pulled up), is consistently advantaged. So is David-Robinson(pollock, wowy, wowyr, concentrated), with Hakeem mostly grading out as a regular-season peer(and then sees the most team-level elevation in the postseason of anyone from that time-period).


Lessthanjake, bastion of fairness and honesty, repeatedly seeing this sort of non-box data, found various excuses to disregard it(and then attack the credibility of those who provided it) on grounds that were better applied to a metric they preferred(WOWYR):
Spoiler:
It’s logical extrapolation using basically completely unrelated data points and trying to draw a meaningful conclusion from them. You might as well pick a completely random team that you happen to think is similarly or less talented than the Bulls without Jordan and look at their wins/SRS and say that the difference is the upper bound of Jordan’s impact.

They are different rosters in that the 88 Bulls were better. I am comfortable giving Jordan an unfair amount of credit, yes. You quite literally just threw out 3-variants of WOWYR, a metric which uses 89 Pippen to assess 91 Pippen as a teammate. This is what "impact" comparisons pre-97 is. You look at the with, you look at the without, and you use your understanding of the game to try and adjust for context and isolate variables. The difference here is I'm using 82-games while WOWYR is using either a handful(overall) or 8 games a season(it is not at all clear what is counted and Ben explicitly makes a distinction between 1994 and "wowy") for one guy and 2.2 games for another(that's bill russell) and THEN decides to make "corrections" based on the 3 games a teammate missed 5 years back or the 70 games another teammate missed 10 years forward. This is why Ben explcitly outlines using "shorter-time periods" as a solution and then uses "shorter-time periods" as the backbone of his impact write-ups.

WOWYR is extrapolated from "WOWY" utilizing "corrections" that are then applied to every year within a stretch. IOW. They take teammate "impact" from different versions of a team, multiple years removed from each other(the thing you said was "useless!", "ridiculous!", and "bad-faith!") to "correct" WOWY. The difference being the sample here is much smaller than with the extrapolations you didn't like for that same reason.

Whether it's deliberate, or simply a byproduct of jake mostly having no clue what they're talking about...
AEnigma wrote:No, variance here is essentially referring to how low scores and deeper rosters present easier opportunities for an upset. Three NHL teams have won more than sixty times across an eighty-two season; none of them won the cup nor even made the Finals. NHL and MLB teams could win the same percentage and be innately far more prone to upsets than NBA teams. And it is weird you would try to drag in a baseball analogy without understanding that on a base level.

...the end-result is often an nonsensical jumbling of buzz-words, silly assumptions, and implicit or explicit hostility disguised with relatively sophisticated sentence structure.

For example...
lessthanjake wrote:Actually kind of yeah. I’ve explained this a bit above. But if you have a really ball-dominant style, when you go to the bench, your teammates will have not really touched the ball much in the flow of the offense beyond just finishing some plays with a shot, so they’ll be less in rhythm overall.

Wow you "explained" this, huh. Gee, thanks! Just one problem:
Spoiler:
letskissbro wrote:Both Luka Doncic and James Harden have actually played under the exaggerated caricature that people have made "LeBron ball" out to be, yet their teams fare significantly better when they're off the court than LeBron's teams and often field positive net ratings with their stars on the bench.

LeBron's average times of possession in that 16-21 stretch from Ben's graph (minus 2019): 5.3, 6.4, 6.7, 7.4, 6.4
Corresponding net ratings for those teams while he sat: -4.3, -8.8, -0.5, -0.9, -2.0

Luka (20-23): 8.9, 8.9, 9.3, 9.1
Off NetRtg: +4.2, +0.3, +3.4, -2.7

Harden (17-23): 9.3, 8.8, 9.3, 8.6, 8.6, 9.2, 8.6
Off NetRtg: +3.7, +5.2, +1.1, -3.4, +2.4, +1.2, +2.8

If LeBron spends approximately six seconds with the ball in his hands, and that leads to over-reliance on him, why is it that Harden and Luka, who hold the ball for 50% longer, dribble the ball significantly more per touch, play under a similar team setup, and have comparable usage rates, see their teams perform well in their absence?

AEnigma wrote:Yet somehow his exit to the bench dwarfs the bench effects we see from “the really heliocentric ball-dominant guys” like Harden, Luka, Wall, Morant, Parker, DWill, Rose… Even Chris Paul and Steve Nash and peak helio Westbrook (which does need to be specified because for whatever mysterious reason the team survived his absences much better outside of that 2016-18 period…) see less of a drop-off than Steph Curry. Very strange, given this very real and very legitimate theory about how “helio” stars routinely ruin their benches.
:thinking:


Fascinating. I wonder if you might be onto something there: not being easily replaced means your team struggles more without you??? Going to need to double-check the numbers on that, because I was under the impression stars controlled their replacements and that having replacements incapable of recreating their effect was in fact a failing on them.

It's bullsh-t,

That is why you've been "mocked", not because you voted for Jordan(plenty of people did without facing "toxic" pushback), but because you present assumptions you cannot actually back-up as if they are obvious truths, questioned for centering your arguments around these assumptions, and instead of responding like a reasonable person, you lash out and accuse other of being toxic and "drinking their kool-aid":
But you’re really just repackaging preconceived notions into boldly-pronounced arguments that are based on absurdity, like that we can measure Jordan’s impact by comparing the results of two essentially completely different rosters.

This an example of projection.

It was after all, you who assumed that every team Russell has ever played on was stacked because a player who by 1969 was a 6th-man was voted in the top 100. You who assumed the Bulls were no better than the Heat in terms of help because the players at the top of the rosters were similar(while actively trying to dismiss health as a factor). And you who disregarded actual results without Jordan to confidently claim they were "bottom line: not a significant contender" because you had "watched enough to know their talent level" while apparently not picking up on how the triangle worked.

You have literally spent the last four threads spamming unsourced data while accusing others of statistical malpractice. You then had the audacity to demand an apology when you were rightly called out. If you have any self-awareness, I'd suggest you stop crying wolf. The backlash you've faced is on you, no one else.

PS: posters in the project have literally told me they were going to obscure their preference for peak as to not "spark controversy", so let me, on their behalf, say the quiet part out loud:
Yank3525 wrote:
Exactly. The longevity argument would hold more weight to me if LeBron was still cranking out years like 2019-20 instead of what he has done the last couple of years. He has basically become a part time player who has accumulated nice counting stats but doesn’t have the impact on winning that he once did.

Lebron is better at basketball, as the data(when it's not comically cherrypicked) suggests he is. Longevity isn't necessary. Lebron was better young, he was better old, and he was better at his best. Jordan's stock in PC has not fallen because kareem played extra-years. It's fallen because people are now more interesting in winning than subjective definitions of "production", and the idea that it was simply longevity that dragged Jordan down is just a misrepresentation on what's been happening over the last year. It wasn't longevity that has seen multiple people argue for Hakeem, Duncan, Kareem, and Russell on the basis of how good they were. So let's stop reducing this debate to a regurgitation of Zach Lowe. It wasn't just "lebron fans", it was a bunch of posters who like different players coming together and thinking about basketball a bit differently. Why don't we analyze that, instead of hitting the same beats as first things first.


Great post.



Jordan over James in the following

RAPTOR
PER
BPM
Regular season winning pct
Postseason winning pct
Championships
League mvps
Finals mvps
Scoring titles
All defensive teams
DPOY

But none of the matters lol
User avatar
Chanel Bomber
RealGM
Posts: 23,902
And1: 42,015
Joined: Sep 20, 2018
 

Re: What's the strongest data-driven argument for Michael Jordan as GOAT? 

Post#714 » by Chanel Bomber » Fri Jul 14, 2023 3:24 am

There’s no point to this arms race whereby we see which player ticks the most boxes in metrics that overvalue boxscore numbers or inversely fail to sufficiently extract the noise from +/- data.

Until we can agree that there is one metric (or metrics plural) that is soundly calculated and that allows for cross-generation comparisons (regardless of its output), it will be a struggle to compare these players from a statistical perspective.

I just don’t think stacking flawed metrics/stats and seeing which pile goes higher is a pertinent approach (for either side of the argument).
ty 4191
Veteran
Posts: 2,598
And1: 2,017
Joined: Feb 18, 2021
   

Re: What's the strongest data-driven argument for Michael Jordan as GOAT? 

Post#715 » by ty 4191 » Fri Jul 14, 2023 3:25 am

OhayoKD wrote:Will keep all this in GB as to not derail the project but uh...

SNIP

Lebron is better at basketball, as the data(when it's not comically cherrypicked) suggests he is. Longevity isn't necessary. Lebron was better young, he was better old, and he was better at his best. Jordan's stock in PC has not fallen because kareem played extra-years. It's fallen because people are now more interesting in winning than subjective definitions of "production", and the idea that it was simply longevity that dragged Jordan down is just a misrepresentation on what's been happening over the last year. It wasn't longevity that has seen multiple people argue for Hakeem, Duncan, Kareem, and Russell on the basis of how good they were. So let's stop reducing this debate to a regurgitation of Zach Lowe. It wasn't just "lebron fans", it was a bunch of posters who like different players coming together and thinking about basketball a bit differently. Why don't we analyze that, instead of hitting the same beats as first things first.


This is compelling, seminal, amazing work here, Ohayo. Please keep it up!! Outstanding!!!!!!!!! 8-)
ShaqAttac
Rookie
Posts: 1,189
And1: 370
Joined: Oct 18, 2022

Re: What's the strongest data-driven argument for Michael Jordan as GOAT? 

Post#716 » by ShaqAttac » Fri Jul 14, 2023 4:01 am

homecourtloss wrote:
OhayoKD wrote:Will keep all this in GB as to not derail the project but uh...

You know what, let's start with some housekeeping
iggymcfrack wrote:
I need to bookmark this for when it’s time to argue for CP3 and Stockton in the PC project. It’s amazing how well Stockton fares in the little bit of advanced analytics we have for the past. Everything I see about him makes me like him more and more. I’m starting to wonder if maybe Stockton > Wilt is actually a reasonable take.

RAPTOR does not have on/off pre-97(and even then it's heavily informed by the prior years). IOW is functionally a box-metric for the Jordan and Stockton and as has been noted during the project repeatedly...
Bill Russell — the only defender definitively better than Hakeem — finished higher than ninth in PER once. His average WS/48 placement was 8th (much smaller league). If we want to use someone more directly comparable in style, 1998-2001 Duncan averaged 8th in WS/48, 6th in PER, and 12th in BPM. Insightful stuff.

Just because you can see a few one-number metrics easily accessible on basketball-reference does not mean they have any real merit in these comparisons. Again, toy with formulas enough and you can get all sorts of results. We just discussed that with Engelmann’s “xRAPM”. IBM set up a formula that awarded the crown to Barkley three years running from 1986-88 (just outstanding Barkley years huh) and then to Dennis Rodman in 1992. Or we could use PIPM, which is imo the best pure box metric because it does a better job of weighing the defensive aspects of box inputs. There we see Hakeem start out his career 8th - 2nd - 3rd - 3rd - 2nd - 3rd - 5th - 6th. That reads a lot more honestly to me, but ultimately we just get back to these all being formulas toying with a few inputs not actually substituting for real impact.

If you are not confident enough to assess a player outside of what you can read off a stat-sheet, you can say so without trying to poison the well against those who actually took the time to develop their assessments.

....the box-score is a fickle mistress.

I will get to the non-box(aka impact) data present for Jordan and Lebron below (spoiler: it mostly favors Lebron), but let's start with the lede:
TheGOATRises007 wrote:
On the GB board, I agree with you. I disagree regarding the PC board.

Gregoire wrote: There are definitely a group of LeBron superfans that are toxically aggressive, and it gets tiring.

VanWest82 wrote:Over there it's less than 10% with those making pro Jordan arguments getting shouted down and enduring personal attacks

So is said. Yet, the first "personal" attack(not to be confused with your arguments being criticized/challenged) I can find comes from...a Jordan voter:
Also why even bother to quote me when you know I don't take you seriously as a poster anyway? Seems like you're just looking to muck up the project by starting meaningless arguments.

(note that when dquinn similarly challenged the criteria for voting someone for lebron on page 1, they were not personally attacked?)

It was then, another Jordan voter who said it would be a "bad look" if MJ was voted below Russell:
i kinda think it would be a bad look for the project if he drops to 4, making anyone from the outside looking in want to dismiss the whole thing. i suppose we can say we don't care about such things, but i'm not sure it furthers discussion if that happens. but we shall see i guess.


And it was someone who prefers Jordan who decided to try and derail a thread by bringing up personal beef(after a rather embarrasing apology demand we'll get to later):
I'm not sure you're going to get it with this poster. I post this not to bring back old baggage, but to offer a bit of support -- You're not the only one who's struggled with this poster. Four posters including me left the latest Peaks project after verbal abuse and bad-faith arguments from this guy, and I would hate for people to start leaving this new project for the same reason.

Notably the poster above, has outright lied about what the poster they "struggled" with claimed in order to potray them as dishonest:
https://forums.realgm.com/boards/viewtopic.php?p=104349742#p104349742
Why?

Possibly to deflect from their calculations of playoff on/off swinging towards Lebron with a more accurate process and several posters realizing they had been "cooking"(chucking out inconvenient years, inconsistent process, ect)numbers to make Jordan look as good:
https://forums.realgm.com/boards/viewtopic.php?p=104353387#p104353387
While various posters have complained I(and the "mob") were being unfair, none have bothered trying to challenge this account of events despite me offering them multiple opportunities to do so

That lie above seems to have spread with gregoire citing it as he has yet another meltdown spurned by Jordan being evaluated differently:
Gregoire wrote:try to falsificate plus-minus metrics (some of them claimed that manually counted 500 MJ games only to prove that Lebron is better lol

It was of course angry jordan fans who flooded the thread I'm referencing largely contributing to it getting locked, and it's jordan fans again who've jumped into the ongoing project complaining about anti-mj bias.

And then, last but not least...
lessthanjake wrote:Lol, just read through the last few pages of this thread, and was surprised to find that my own posts in the PC board and the responses to them were discussed a bunch in this thread. My two cents is somewhere in the middle to be honest: I do actually think that there’s a lot of hostility on the PC board to anything that is perceived as pro-Jordan or anti-LeBron. I think I can reasonably say that I myself experienced quite a bit of hostility—including a boatload of sarcastic and mocking remarks—while arguing in favor of Jordan (though I won’t claim to be a delicate flower that doesn’t punch back).

Hmmm.


Well you certainly throw punches:
Spoiler:
LessthanJake wrote:It might behoove you to step back for a moment and question whether there might actually be something to learn here from someone with more knowledge than you.

Among other things, I also come at this discussion as an independently knowledgeable person about Jordan’s career and his teams, since I watched virtually all their games, and I’ve tried to impart some of that knowledge on you, which you obviously do not independently have.

It’s all just completely motivated-reasoning-based data manipulation, and, again, you surely realize that.

I don’t really feel the need to go through all this, because there’s a lot of obviously motivated reasoning going on here that I don’t think many people would find persuasive on its face, but just a few minor things

I don’t really feel the need to go through all this, because there’s a lot of obviously motivated reasoning going on here that I don’t think many people would find persuasive on its face, but just a few minor things:

And even on the separate question about what is more impressive in some sense, the point being made was applying this logic even to Tim Duncan’s Spurs, who did not play in a less-expanded league (the opposite actually). The argument that Duncan’s Spurs were as dominant a team as Jordan’s Bulls despite having worse records and lesser SRS, because of some argument about standard deviations is pretty obviously just motivated reasoning.

To be clear(on that last quote)...

A. that was not what was argued
Spoiler:
Nor does this

Yeah, why don't we check that:


Yeah, I don't know about you, but that does not read to me like a blanket comparison between Jordan's Bulls and Duncan's Spurs. Rather one between specific teams from that run.

Probably could have worded that less definitively, (San's playoff-heavy method is hardly the only approach) but I do think these claims count as "reasonably arguable"...


B. turns out standard deviations do correlate more with championships than srs
Spoiler:
[spoiler]
Let's look it it the other way, using data from 1984 onward. Here are the mean/median Z scores for champs and non-champs in different SRS bands.

SRS between 2 and 4:

Non-champs: Median 0.64, Mean 0.65 (163 teams)
Champs: Median 0.85, Mean 0.76 (3 teams)

SRS between 4 and 6:

Non-champs: Median 1.06, Mean 1.07 (102 teams)
Champs: Median 1.16, Mean 1.14 (13 Teams)

SRS between 6 and 8:

Non-champs: Median 1.44, Mean 1.48 (69 teams)
Champs: Median 1.60, Mean 1.58 (12 Teams)

SRS above 8:

Non-champs: Median 1.91, Mean 1.92 (11 teams)
Champs: Median 2.09, Mean 2.08 (11 teams)

Within each of these bands, the Z scores are higher for the champs than the non-champs.

What's more, building a model (logistic regression) for a title using either SRS or the Z score, the model with the Z score is more predictive (AIC of 120.07 vs 109.7 using SRS vs Z score for all league history, and AIC of 11.25 vs 8.09 using SRS vs Z score for teams since 1984).

Gap increases using pre-84


I bring this up because in the very next thread...
Spoiler:
lessthanjake wrote: The idea that you’d accuse me of having “sample size only matter[] when you want it to matter” is actually completely insane and wildly dishonest. You need to apologize for this promptly, otherwise it is clear you are discussing in bad faith.

Unbelievable. The fact that you can’t bring yourself to apologize for making such a blatantly dishonest attack is breathtaking. You don’t even address it, but instead go on some free-wheeling subject-changing maneuver to justify your stated refusal to apologize for something that was plainly unacceptable behavior. You were straightforwardly dishonest. Just apologize. It shouldn’t be difficult to do so if you’re a remotely mature human being. And it’s actually substantively important to discussion like this project that you apologize for behavior like this, because it will genuinely discourage people from honestly identifying appropriate caveats to the points they make and the evidence they provide if they know that people will just dishonestly attack them as if they did not do so. And that will ultimately lower the standard of discussion.

And you’re right—there was one particular part of your post I didn’t respond to. And that’s because all you did was take out half the data that was most inconvenient to you and then still only got to a conclusion that you yourself admitted wasn’t even particularly flattering for Hakeem. It was not something I felt the need to respond to, because I didn’t feel it made even a facially persuasive point.

Yeah... I think "delicate" might be an understatement. Also, despite Jake's demands for an apology, I think Anenigma is on the money:
anenigma wrote:Cool, let me know when you feel like apologising for actually doing everything that you just projected onto me. No interest in staying on subject, showing all data rather than manipulated snippets, properly responding to points made, maintaining honest portrayals, advancing sincere engagement, responding with any real maturity… So much for high standards of discussion. :-?

Let's start with the bold bit as, per usual, there are picks and cherries:
Jordan is a lot lower in raw WOWY (32nd in prime WOWY), so that’s one data point that doesn’t support his case—basically the only one.

Yeah, no.

Summary, extended samples, aka, "metrics"
Spoiler:
There is very little of what "you" consider impact data but of course you are rather inconsistent with what is "good-faith" running with WOWYR while complaining about much more straightforward "adjustments". As it so happens, "WOWYR" is the one and only type of metric Jordan actually looks "good" in relative to Lebron. Using your standard of what counts as an impact metric we have:

-> Playoff on/off
(Lebron looks better)
-> On/off
(Lebron looks better, 97/98 rank below 17 and 18 lebron years respectively)
-> On+ON/off
(Lebron looks better, 2nd and 5th best regular season teams rank 8th and 11th respectively)
-> WOWY
(Lebron looks better, and Jordan ranks 4th amongst his contemporaries, literally does not matter what you use)
-> Indirect samples(what eminece outlined in the #2), Lebron looks much better
-> AUPM
(Lebron looks better with the exception of 3-year consecutive where MJ is a bit behind Duncan)
-> Squared RAPM
(Lebron looks better in the same set)
-> Full RAPM
(Lebron looks signficantly better with the potential for Jordan to close in if his early years score better(there is reason to think they might not))

Larger Samples, multi-year extraps(something WOWYR and its variant also use but in a much noisier manner with tiny samples of "off"), one-year indirect
Spoiler:
Why don't we start with the large-samples which are not "limited"? We have 82 games telling us the Bulls were a 28-win team in 1984. They were a 30-win team without Jordan in 1986. That team then got better with additions such as oakley. And then, pre-triangle(a scheme that turned the Bulls from a good offense and a below average defense into a historically great offense and a -3 defense over the course of a season), the Bulls peaked at +2 at a 52 or 53-win pace(ben takes the former, eballa extrapolates the latter based on a 28 game-sample after a trade). If we unreasonably give Jordan all the credit for that improvement, out 82-game off sample leaves Jordan at +8.

That is

-> lower lift on a weaker team than what Kareem manages on the 77 Lakers if we ignore they traded significant pieces for Kareem.
-> within range of several kareem marks including when he was a rookie
(note the gap actually becomes quite large when you consider that srs was suppressed post-merger and championship teams of that era ranged from +4 to +6)
-> Dramatically worse than what we see with the same starting rotation(without Lebron) from the 11 cavs who go from 61-wins to 18-wins over a 21-game sample(19-win for the season overall), and worse than what we see over substantial without samples in the second cleveland stint(the most favorable appraisal of the cast gets lebron taking a 30-win team to 60 in the regular season) and small ones for Miami(40 to 60)
-> on the level of what we see with 69 Bill Russell, a retiring player-coach who had just knocked out a gauntlet tougher than any Micheal faced on the Bulls(marginally better replacement fwiw)
-> On par with what we see with 04/05 Duncan through injuries over a substantial sample(he would win a championship in 03 with a weaker version of what he had in 05)
-> On par with multiple substantial samples for Hakeem
-> Worse than substantial samples for Magic


We can also use 94 and 95 where the Bulls(with us inflating 93 by using the bulls full-strength rs-rating) only drop by 5.3-points in the regular season and 2-points in the playoffs making for an overall drop of "5" for the season(this is similar to what we get for 08/09 kg).

We can also look to 86 and 95 and the result remains the same. Over the largest possible samples, Jordan does not look on par with the other major Goat-candidates(Lebron, Russell, Kareem). He also does not gain separation over players he is assumed to be better than including his own contemporaries(magic is consistently advantaged, and Hakeem is competitive in the rs to go along with nigh unrivalled team-level playoff elevation)

If you are looking for the largest sample-sizes, that is what we have. I can make the most favorable assumptions and Jordan still does not reach the same heights as players he is considered comparable with. That is the lede. If you are going to "summarize all the available impact data for Jordan", there's no reason to bury it.

MJ:
'84 to '85 Bulls (w/MJ): +11 wins
'93 (w/MJ) to '94 Bulls: +4 wins
'95 (pre/MJ) to '95 Bulls (w/MJ): +20
LeBron:
'03 to '04 Cavs (w/LeBron): +17 wins
'10 (w/LeBron) to '11 Cavs: +46 wins
'10 to '11 Heat (w/LeBron): +12 wins
'14 (w/LeBron) to '15 Heat: +18 wins
'14 to '15 Cavs (w/LeBron): +26 wins
'18 (w/LeBron) to '19 Cavs: +31 wins
'18 to '19 Lakers (w/LeBron): +7 wins

As has repeatedly established(and ignored by those with positions the evidence is inconvenient for), non-box data consistently favors Lebron. In fact, with larger and more inclusive samples, Lebron's best signals look even better while Jordan's look worse. Even with WOWYR(which excludes much of the data for Lebron listed above and compounds a sample-size issue by tossing around noisy adjustments across the duration of a player's prime(EX: Bulls without 89 Pippen factors largely into how we assess 91 Pippen)), "Jordan being ahead of every goat candidate" is misleading. His advantage over Kareem is questionable as corrected data puts Kareem's WOWY ahead(the base of WOWYR). His advantage over Russell is entirely a matter of framing(do you value championships or srs?) as WOWYR suggests that Russell's 11 championships came with average help.

There is no non-box stat which generally puts Jordan #1, even that where Jordan is competing against a narrowed field(almost no on/off pre-97 for any player besides Micheal). Those unsourced github-sets(something Lessthanjake has refused to acknowledge while calling whatever doesn't favor Jordan "flawed") only really advantages Jordan over the likes of Malone and of course full RAPM for 97/98 places him well below Lebron under the same creators:
Image
(Cheema, the spax)

(keep in mind, it is not clear that second-three-peat MJ's "empirical impact" is lower than first-three peat MJ with the snippets we have arguably favoring the latter years). Lebron also dominates a field of players who can scale favorably against Mike when we use data that is available for both(Duncan, KG, Steph, Shaq, ect):
James is, arguably, the king of overall plus-minus stats. 2018 is the 25th season of league-wide plus-minus data, which covers nearly 40 percent of the shot-clock era and touches 12 of the top-20 players on this list. None have achieved LeBron’s heights: He holds four of the top-five scaled APM seasons on record, and six of the top eight. Since 2007, 10 of his 11 years land in the 99th percentile.

(scaled-apm, JE)

Among contemporaries, prime Magic(mostly excluded in the data less-than-jake pulled up), is consistently advantaged. So is David-Robinson(pollock, wowy, wowyr, concentrated), with Hakeem mostly grading out as a regular-season peer(and then sees the most team-level elevation in the postseason of anyone from that time-period).


Lessthanjake, bastion of fairness and honesty, repeatedly seeing this sort of non-box data, found various excuses to disregard it(and then attack the credibility of those who provided it) on grounds that were better applied to a metric they preferred(WOWYR):
Spoiler:
It’s logical extrapolation using basically completely unrelated data points and trying to draw a meaningful conclusion from them. You might as well pick a completely random team that you happen to think is similarly or less talented than the Bulls without Jordan and look at their wins/SRS and say that the difference is the upper bound of Jordan’s impact.

They are different rosters in that the 88 Bulls were better. I am comfortable giving Jordan an unfair amount of credit, yes. You quite literally just threw out 3-variants of WOWYR, a metric which uses 89 Pippen to assess 91 Pippen as a teammate. This is what "impact" comparisons pre-97 is. You look at the with, you look at the without, and you use your understanding of the game to try and adjust for context and isolate variables. The difference here is I'm using 82-games while WOWYR is using either a handful(overall) or 8 games a season(it is not at all clear what is counted and Ben explicitly makes a distinction between 1994 and "wowy") for one guy and 2.2 games for another(that's bill russell) and THEN decides to make "corrections" based on the 3 games a teammate missed 5 years back or the 70 games another teammate missed 10 years forward. This is why Ben explcitly outlines using "shorter-time periods" as a solution and then uses "shorter-time periods" as the backbone of his impact write-ups.

WOWYR is extrapolated from "WOWY" utilizing "corrections" that are then applied to every year within a stretch. IOW. They take teammate "impact" from different versions of a team, multiple years removed from each other(the thing you said was "useless!", "ridiculous!", and "bad-faith!") to "correct" WOWY. The difference being the sample here is much smaller than with the extrapolations you didn't like for that same reason.

Whether it's deliberate, or simply a byproduct of jake mostly having no clue what they're talking about...
AEnigma wrote:No, variance here is essentially referring to how low scores and deeper rosters present easier opportunities for an upset. Three NHL teams have won more than sixty times across an eighty-two season; none of them won the cup nor even made the Finals. NHL and MLB teams could win the same percentage and be innately far more prone to upsets than NBA teams. And it is weird you would try to drag in a baseball analogy without understanding that on a base level.

...the end-result is often an nonsensical jumbling of buzz-words, silly assumptions, and implicit or explicit hostility disguised with relatively sophisticated sentence structure.

For example...
lessthanjake wrote:Actually kind of yeah. I’ve explained this a bit above. But if you have a really ball-dominant style, when you go to the bench, your teammates will have not really touched the ball much in the flow of the offense beyond just finishing some plays with a shot, so they’ll be less in rhythm overall.

Wow you "explained" this, huh. Gee, thanks! Just one problem:
Spoiler:
letskissbro wrote:Both Luka Doncic and James Harden have actually played under the exaggerated caricature that people have made "LeBron ball" out to be, yet their teams fare significantly better when they're off the court than LeBron's teams and often field positive net ratings with their stars on the bench.

LeBron's average times of possession in that 16-21 stretch from Ben's graph (minus 2019): 5.3, 6.4, 6.7, 7.4, 6.4
Corresponding net ratings for those teams while he sat: -4.3, -8.8, -0.5, -0.9, -2.0

Luka (20-23): 8.9, 8.9, 9.3, 9.1
Off NetRtg: +4.2, +0.3, +3.4, -2.7

Harden (17-23): 9.3, 8.8, 9.3, 8.6, 8.6, 9.2, 8.6
Off NetRtg: +3.7, +5.2, +1.1, -3.4, +2.4, +1.2, +2.8

If LeBron spends approximately six seconds with the ball in his hands, and that leads to over-reliance on him, why is it that Harden and Luka, who hold the ball for 50% longer, dribble the ball significantly more per touch, play under a similar team setup, and have comparable usage rates, see their teams perform well in their absence?

AEnigma wrote:Yet somehow his exit to the bench dwarfs the bench effects we see from “the really heliocentric ball-dominant guys” like Harden, Luka, Wall, Morant, Parker, DWill, Rose… Even Chris Paul and Steve Nash and peak helio Westbrook (which does need to be specified because for whatever mysterious reason the team survived his absences much better outside of that 2016-18 period…) see less of a drop-off than Steph Curry. Very strange, given this very real and very legitimate theory about how “helio” stars routinely ruin their benches.
:thinking:


Fascinating. I wonder if you might be onto something there: not being easily replaced means your team struggles more without you??? Going to need to double-check the numbers on that, because I was under the impression stars controlled their replacements and that having replacements incapable of recreating their effect was in fact a failing on them.

It's bullsh-t,

That is why you've been "mocked", not because you voted for Jordan(plenty of people did without facing "toxic" pushback), but because you present assumptions you cannot actually back-up as if they are obvious truths, questioned for centering your arguments around these assumptions, and instead of responding like a reasonable person, you lash out and accuse other of being toxic and "drinking their kool-aid":
But you’re really just repackaging preconceived notions into boldly-pronounced arguments that are based on absurdity, like that we can measure Jordan’s impact by comparing the results of two essentially completely different rosters.

This an example of projection.

It was after all, you who assumed that every team Russell has ever played on was stacked because a player who by 1969 was a 6th-man was voted in the top 100. You who assumed the Bulls were no better than the Heat in terms of help because the players at the top of the rosters were similar(while actively trying to dismiss health as a factor). And you who disregarded actual results without Jordan to confidently claim they were "bottom line: not a significant contender" because you had "watched enough to know their talent level" while apparently not picking up on how the triangle worked.

You have literally spent the last four threads spamming unsourced data while accusing others of statistical malpractice. You then had the audacity to demand an apology when you were rightly called out. If you have any self-awareness, I'd suggest you stop crying wolf. The backlash you've faced is on you, no one else.

PS: posters in the project have literally told me they were going to obscure their preference for peak as to not "spark controversy", so let me, on their behalf, say the quiet part out loud:
Yank3525 wrote:
Exactly. The longevity argument would hold more weight to me if LeBron was still cranking out years like 2019-20 instead of what he has done the last couple of years. He has basically become a part time player who has accumulated nice counting stats but doesn’t have the impact on winning that he once did.

Lebron is better at basketball, as the data(when it's not comically cherrypicked) suggests he is. Longevity isn't necessary. Lebron was better young, he was better old, and he was better at his best. Jordan's stock in PC has not fallen because kareem played extra-years. It's fallen because people are now more interesting in winning than subjective definitions of "production", and the idea that it was simply longevity that dragged Jordan down is just a misrepresentation on what's been happening over the last year. It wasn't longevity that has seen multiple people argue for Hakeem, Duncan, Kareem, and Russell on the basis of how good they were. So let's stop reducing this debate to a regurgitation of Zach Lowe. It wasn't just "lebron fans", it was a bunch of posters who like different players coming together and thinking about basketball a bit differently. Why don't we analyze that, instead of hitting the same beats as first things first.


Great post.

RIP Draymond copper
JordansBulls
RealGM
Posts: 60,472
And1: 5,352
Joined: Jul 12, 2006
Location: HCA (Homecourt Advantage)

Re: What's the strongest data-driven argument for Michael Jordan as GOAT? 

Post#717 » by JordansBulls » Fri Jul 14, 2023 4:24 am

Lebron has lost to virtually every elite big in his era. Ben Wallace, Tim Duncan, Dwight Howard with HCA, KG with HCA, Dirk with HCA, Durant, Jokic to name a few. Only beat old 36 year old Duncan when Lebron was at his peak and needed heroics for that from the other team making a dumb decision to not have there elite big in the game at the end. He has a losing record on the highest stage which isn't a good thing. He also won multiple bronze medals for America with Peak Duncan and Iverson on his squad and then Wade, Dwight, Melo. Was down in a series every year of his career at least 3-2 with fans in the arena. Missed the playoffs as much as titles won. Other players have more titles in the same amount of years or less without stacking the deck and moving from one team to the next. Only player to ever get swept in 3 different decades 4-0 in a series.

Lebron is soo good that he has a losing record on the highest stage, the most finals losses of players who won league mvp, 2 bronze medals, 3 series in which he lost with HCA and down every year where fans were in the arena 3-2. Also lost in round 2 with Shaq and stacked the deck his career and still doesn't have more titles than Curry who came in after him and beat him head to head more times. :o
He also only won 1 title in 11 years for the franchise that drafted him. Dirk did as good as did Isiah, Hakeem to name a few.
Has also missed the playoffs as much as titles won.
Jordan also has more accolades than Lebron in 5 less seasons at that.
Image
"Talent wins games, but teamwork and intelligence wins championships."
- Michael Jordan
WarriorGM
General Manager
Posts: 8,933
And1: 4,225
Joined: Aug 19, 2017

Re: What's the strongest data-driven argument for Michael Jordan as GOAT? 

Post#718 » by WarriorGM » Fri Jul 14, 2023 4:40 am

OhayoKD wrote:
There is no non-box stat which generally puts Jordan #1, even that where Jordan is competing against a narrowed field(almost no on/off pre-97 for any player besides Micheal). Those unsourced github-sets(something Lessthanjake has refused to acknowledge while calling whatever doesn't favor Jordan "flawed") only really advantages Jordan over the likes of Malone and of course full RAPM for 97/98 places him well below Lebron under the same creators:

Image

[...]

Lebron is better at basketball, as the data(when it's not comically cherrypicked) suggests he is. Longevity isn't necessary. Lebron was better young, he was better old, and he was better at his best. Jordan's stock in PC has not fallen because kareem played extra-years. It's fallen because people are now more interesting in winning than subjective definitions of "production", and the idea that it was simply longevity that dragged Jordan down is just a misrepresentation on what's been happening over the last year. It wasn't longevity that has seen multiple people argue for Hakeem, Duncan, Kareem, and Russell on the basis of how good they were. So let's stop reducing this debate to a regurgitation of Zach Lowe. It wasn't just "lebron fans", it was a bunch of posters who like different players coming together and thinking about basketball a bit differently. Why don't we analyze that, instead of hitting the same beats as first things first.


If we are to believe that graph then John Stockton was the best player in the league from 1998-2002 when he was age 35-39.

The assertion KG was the best player in the league for 4 5-year periods is risible enough but the above kind of makes one wonder if we should just hand back your homework for a redo.

Indeed the list of players RAPM churns out is less than intuitive which I think explains why its heyday was probably 5 years ago and why others have kept on coming out with new variations. The first thought that entered my mind after seeing the most recent RAPM list I saw last year was "this makes no sense".

Of course if we look deeper in the appendices of the RAPM user manual we are told that the way RAPM is often simplistically presented is not how you are supposed to interpret RAPM but really it is obscure enough a metric its use in discussions such as these should be qualified heavily and probably best avoided altogether. Instead it's being used as a centerpiece in arguments. Then you have the separate issue of how Engelmann's RAPM counts but someone else's calculation doesn't. Data cherrypicking at its finest.
DB23
Senior
Posts: 696
And1: 617
Joined: Jun 10, 2018

Re: What's the strongest data-driven argument for Michael Jordan as GOAT? 

Post#719 » by DB23 » Fri Jul 14, 2023 4:49 am

OhayoKD wrote:Will keep all this in GB as to not derail the project but uh...

You know what, let's start with some housekeeping
iggymcfrack wrote:
TheGOATRises007 wrote:
https://fivethirtyeight.com/features/lebron-or-mj-raptor-picks-the-best-nba-players-of-the-past-40-years/

Raptor actually grades Jordan over LeBron with a sizable gap for a 7 year stretch.


I need to bookmark this for when it’s time to argue for CP3 and Stockton in the PC project. It’s amazing how well Stockton fares in the little bit of advanced analytics we have for the past. Everything I see about him makes me like him more and more. I’m starting to wonder if maybe Stockton > Wilt is actually a reasonable take.

RAPTOR does not have on/off pre-97(and even then it's heavily informed by the prior years). IOW is functionally a box-metric for the Jordan and Stockton and as has been noted during the project repeatedly...
Bill Russell — the only defender definitively better than Hakeem — finished higher than ninth in PER once. His average WS/48 placement was 8th (much smaller league). If we want to use someone more directly comparable in style, 1998-2001 Duncan averaged 8th in WS/48, 6th in PER, and 12th in BPM. Insightful stuff.

Just because you can see a few one-number metrics easily accessible on basketball-reference does not mean they have any real merit in these comparisons. Again, toy with formulas enough and you can get all sorts of results. We just discussed that with Engelmann’s “xRAPM”. IBM set up a formula that awarded the crown to Barkley three years running from 1986-88 (just outstanding Barkley years huh) and then to Dennis Rodman in 1992. Or we could use PIPM, which is imo the best pure box metric because it does a better job of weighing the defensive aspects of box inputs. There we see Hakeem start out his career 8th - 2nd - 3rd - 3rd - 2nd - 3rd - 5th - 6th. That reads a lot more honestly to me, but ultimately we just get back to these all being formulas toying with a few inputs not actually substituting for real impact.

If you are not confident enough to assess a player outside of what you can read off a stat-sheet, you can say so without trying to poison the well against those who actually took the time to develop their assessments.

....the box-score is a fickle mistress.

I will get to the non-box(aka impact) data present for Jordan and Lebron below (spoiler: it mostly favors Lebron), but let's start with the lede:
TheGOATRises007 wrote:
Bergmaniac wrote:As someone who is neutral in the whole GOAT debate (and mostly bored by it) the Jordan fans has always been way worse than the LeBron fans on this board when it comes to being rude and dismissive of the opinions they disagree with. So many of them act like they are personally offended when someone dares to claim someone else is their GOAT and go on and on how it's impossible for anyone reasonable to hold such an opinion.


On the GB board, I agree with you. I disagree regarding the PC board.

Gregoire wrote: There are definitely a group of LeBron superfans that are toxically aggressive, and it gets tiring.

VanWest82 wrote:Over there it's less than 10% with those making pro Jordan arguments getting shouted down and enduring personal attacks

So is said. Yet, the first "personal" attack(not to be confused with your arguments being criticized/challenged) I can find comes from...a Jordan voter:
Also why even bother to quote me when you know I don't take you seriously as a poster anyway? Seems like you're just looking to muck up the project by starting meaningless arguments.

(note that when dquinn similarly challenged the criteria for voting someone for lebron on page 1, they were not personally attacked?)

It was then, another Jordan voter who said it would be a "bad look" if MJ was voted below Russell:
i kinda think it would be a bad look for the project if he drops to 4, making anyone from the outside looking in want to dismiss the whole thing. i suppose we can say we don't care about such things, but i'm not sure it furthers discussion if that happens. but we shall see i guess.


And it was someone who prefers Jordan who decided to try and derail a thread by bringing up personal beef(after a rather embarrasing apology demand we'll get to later):
I'm not sure you're going to get it with this poster. I post this not to bring back old baggage, but to offer a bit of support -- You're not the only one who's struggled with this poster. Four posters including me left the latest Peaks project after verbal abuse and bad-faith arguments from this guy, and I would hate for people to start leaving this new project for the same reason.

Notably the poster above, has outright lied about what the poster they "struggled" with claimed in order to potray them as dishonest:
https://forums.realgm.com/boards/viewtopic.php?p=104349742#p104349742
Why?

Possibly to deflect from their calculations of playoff on/off swinging towards Lebron with a more accurate process and several posters realizing they had been "cooking"(chucking out inconvenient years, inconsistent process, ect)numbers to make Jordan look as good:
https://forums.realgm.com/boards/viewtopic.php?p=104353387#p104353387
While various posters have complained I(and the "mob") were being unfair, none have bothered trying to challenge this account of events despite me offering them multiple opportunities to do so

That lie above seems to have spread with gregoire citing it as he has yet another meltdown spurned by Jordan being evaluated differently:
Gregoire wrote:try to falsificate plus-minus metrics (some of them claimed that manually counted 500 MJ games only to prove that Lebron is better lol

It was of course angry jordan fans who flooded the thread I'm referencing largely contributing to it getting locked, and it's jordan fans again who've jumped into the ongoing project complaining about anti-mj bias.

And then, last but not least...
lessthanjake wrote:Lol, just read through the last few pages of this thread, and was surprised to find that my own posts in the PC board and the responses to them were discussed a bunch in this thread. My two cents is somewhere in the middle to be honest: I do actually think that there’s a lot of hostility on the PC board to anything that is perceived as pro-Jordan or anti-LeBron. I think I can reasonably say that I myself experienced quite a bit of hostility—including a boatload of sarcastic and mocking remarks—while arguing in favor of Jordan (though I won’t claim to be a delicate flower that doesn’t punch back).

Hmmm.


Well you certainly throw punches:
Spoiler:
LessthanJake wrote:It might behoove you to step back for a moment and question whether there might actually be something to learn here from someone with more knowledge than you.

Among other things, I also come at this discussion as an independently knowledgeable person about Jordan’s career and his teams, since I watched virtually all their games, and I’ve tried to impart some of that knowledge on you, which you obviously do not independently have.

It’s all just completely motivated-reasoning-based data manipulation, and, again, you surely realize that.

I don’t really feel the need to go through all this, because there’s a lot of obviously motivated reasoning going on here that I don’t think many people would find persuasive on its face, but just a few minor things

I don’t really feel the need to go through all this, because there’s a lot of obviously motivated reasoning going on here that I don’t think many people would find persuasive on its face, but just a few minor things:

And even on the separate question about what is more impressive in some sense, the point being made was applying this logic even to Tim Duncan’s Spurs, who did not play in a less-expanded league (the opposite actually). The argument that Duncan’s Spurs were as dominant a team as Jordan’s Bulls despite having worse records and lesser SRS, because of some argument about standard deviations is pretty obviously just motivated reasoning.

To be clear(on that last quote)...

A. that was not what was argued
Spoiler:
Nor does this
lessthanjake wrote:
Yes, I think that’s right. Though what I was objecting to specifically is the idea that that method would better identify likelihood of winning a championship than simply looking at SRS (as opposed to standard deviations above the mean in SRS) would. I assumed that the baseline factual claim—i.e. that Duncan’s Spurs were as many or more standard deviations above the mean as Jordan’s Bulls—was correct, without checking if that was actually true, so the discussion was not about the validity of the underlying factual claim.

Yeah, why don't we check that:
The "chance" Duncan was given in season 2 was probably smaller than what Jordan was given for most or all of his championships. And with that "chance", the Spurs went 16-4 and obliterated the competition(by san's std they were actually a bigger outlier than all but 2 Chicago sides).

'
All in all, I think then,it can be reasonably argued prime Duncan
...
-> Led two dominant teams(statistically better than most of the Bulls if you go by standard deviation(more relevant to winning championships than "srs")), one was probably with less
-> Led a third team not too far behind in 2005(not sure what "help" is there but there's still no Pippen equivalent) with less

Yeah, I don't know about you, but that does not read to me like a blanket comparison between Jordan's Bulls and Duncan's Spurs. Rather one between specific teams from that run.

Probably could have worded that less definitively, (San's playoff-heavy method is hardly the only approach) but I do think these claims count as "reasonably arguable"...


B. turns out standard deviations do correlate more with championships than srs
Spoiler:
[spoiler]
Let's look it it the other way, using data from 1984 onward. Here are the mean/median Z scores for champs and non-champs in different SRS bands.

SRS between 2 and 4:

Non-champs: Median 0.64, Mean 0.65 (163 teams)
Champs: Median 0.85, Mean 0.76 (3 teams)

SRS between 4 and 6:

Non-champs: Median 1.06, Mean 1.07 (102 teams)
Champs: Median 1.16, Mean 1.14 (13 Teams)

SRS between 6 and 8:

Non-champs: Median 1.44, Mean 1.48 (69 teams)
Champs: Median 1.60, Mean 1.58 (12 Teams)

SRS above 8:

Non-champs: Median 1.91, Mean 1.92 (11 teams)
Champs: Median 2.09, Mean 2.08 (11 teams)

Within each of these bands, the Z scores are higher for the champs than the non-champs.

What's more, building a model (logistic regression) for a title using either SRS or the Z score, the model with the Z score is more predictive (AIC of 120.07 vs 109.7 using SRS vs Z score for all league history, and AIC of 11.25 vs 8.09 using SRS vs Z score for teams since 1984).

Gap increases using pre-84


I bring this up because in the very next thread...
Spoiler:
lessthanjake wrote: The idea that you’d accuse me of having “sample size only matter[] when you want it to matter” is actually completely insane and wildly dishonest. You need to apologize for this promptly, otherwise it is clear you are discussing in bad faith.

Unbelievable. The fact that you can’t bring yourself to apologize for making such a blatantly dishonest attack is breathtaking. You don’t even address it, but instead go on some free-wheeling subject-changing maneuver to justify your stated refusal to apologize for something that was plainly unacceptable behavior. You were straightforwardly dishonest. Just apologize. It shouldn’t be difficult to do so if you’re a remotely mature human being. And it’s actually substantively important to discussion like this project that you apologize for behavior like this, because it will genuinely discourage people from honestly identifying appropriate caveats to the points they make and the evidence they provide if they know that people will just dishonestly attack them as if they did not do so. And that will ultimately lower the standard of discussion.

And you’re right—there was one particular part of your post I didn’t respond to. And that’s because all you did was take out half the data that was most inconvenient to you and then still only got to a conclusion that you yourself admitted wasn’t even particularly flattering for Hakeem. It was not something I felt the need to respond to, because I didn’t feel it made even a facially persuasive point.

Yeah... I think "delicate" might be an understatement. Also, despite Jake's demands for an apology, I think Anenigma is on the money:
anenigma wrote:Cool, let me know when you feel like apologising for actually doing everything that you just projected onto me. No interest in staying on subject, showing all data rather than manipulated snippets, properly responding to points made, maintaining honest portrayals, advancing sincere engagement, responding with any real maturity… So much for high standards of discussion. :-?

Let's start with the bold bit as, per usual, there are picks and cherries:
Jordan is a lot lower in raw WOWY (32nd in prime WOWY), so that’s one data point that doesn’t support his case—basically the only one.

Yeah, no.

Summary, extended samples, aka, "metrics"
Spoiler:
There is very little of what "you" consider impact data but of course you are rather inconsistent with what is "good-faith" running with WOWYR while complaining about much more straightforward "adjustments". As it so happens, "WOWYR" is the one and only type of metric Jordan actually looks "good" in relative to Lebron. Using your standard of what counts as an impact metric we have:

-> Playoff on/off
(Lebron looks better)
-> On/off
(Lebron looks better, 97/98 rank below 17 and 18 lebron years respectively)
-> On+ON/off
(Lebron looks better, 2nd and 5th best regular season teams rank 8th and 11th respectively)
-> WOWY
(Lebron looks better, and Jordan ranks 4th amongst his contemporaries, literally does not matter what you use)
-> Indirect samples(what eminece outlined in the #2), Lebron looks much better
-> AUPM
(Lebron looks better with the exception of 3-year consecutive where MJ is a bit behind Duncan)
-> Squared RAPM
(Lebron looks better in the same set)
-> Full RAPM
(Lebron looks signficantly better with the potential for Jordan to close in if his early years score better(there is reason to think they might not))

Larger Samples, multi-year extraps(something WOWYR and its variant also use but in a much noisier manner with tiny samples of "off"), one-year indirect
Spoiler:
Why don't we start with the large-samples which are not "limited"? We have 82 games telling us the Bulls were a 28-win team in 1984. They were a 30-win team without Jordan in 1986. That team then got better with additions such as oakley. And then, pre-triangle(a scheme that turned the Bulls from a good offense and a below average defense into a historically great offense and a -3 defense over the course of a season), the Bulls peaked at +2 at a 52 or 53-win pace(ben takes the former, eballa extrapolates the latter based on a 28 game-sample after a trade). If we unreasonably give Jordan all the credit for that improvement, out 82-game off sample leaves Jordan at +8.

That is

-> lower lift on a weaker team than what Kareem manages on the 77 Lakers if we ignore they traded significant pieces for Kareem.
-> within range of several kareem marks including when he was a rookie
(note the gap actually becomes quite large when you consider that srs was suppressed post-merger and championship teams of that era ranged from +4 to +6)
-> Dramatically worse than what we see with the same starting rotation(without Lebron) from the 11 cavs who go from 61-wins to 18-wins over a 21-game sample(19-win for the season overall), and worse than what we see over substantial without samples in the second cleveland stint(the most favorable appraisal of the cast gets lebron taking a 30-win team to 60 in the regular season) and small ones for Miami(40 to 60)
-> on the level of what we see with 69 Bill Russell, a retiring player-coach who had just knocked out a gauntlet tougher than any Micheal faced on the Bulls(marginally better replacement fwiw)
-> On par with what we see with 04/05 Duncan through injuries over a substantial sample(he would win a championship in 03 with a weaker version of what he had in 05)
-> On par with multiple substantial samples for Hakeem
-> Worse than substantial samples for Magic


We can also use 94 and 95 where the Bulls(with us inflating 93 by using the bulls full-strength rs-rating) only drop by 5.3-points in the regular season and 2-points in the playoffs making for an overall drop of "5" for the season(this is similar to what we get for 08/09 kg).

We can also look to 86 and 95 and the result remains the same. Over the largest possible samples, Jordan does not look on par with the other major Goat-candidates(Lebron, Russell, Kareem). He also does not gain separation over players he is assumed to be better than including his own contemporaries(magic is consistently advantaged, and Hakeem is competitive in the rs to go along with nigh unrivalled team-level playoff elevation)

If you are looking for the largest sample-sizes, that is what we have. I can make the most favorable assumptions and Jordan still does not reach the same heights as players he is considered comparable with. That is the lede. If you are going to "summarize all the available impact data for Jordan", there's no reason to bury it.

MJ:
'84 to '85 Bulls (w/MJ): +11 wins
'93 (w/MJ) to '94 Bulls: +4 wins
'95 (pre/MJ) to '95 Bulls (w/MJ): +20
LeBron:
'03 to '04 Cavs (w/LeBron): +17 wins
'10 (w/LeBron) to '11 Cavs: +46 wins
'10 to '11 Heat (w/LeBron): +12 wins
'14 (w/LeBron) to '15 Heat: +18 wins
'14 to '15 Cavs (w/LeBron): +26 wins
'18 (w/LeBron) to '19 Cavs: +31 wins
'18 to '19 Lakers (w/LeBron): +7 wins

As has repeatedly established(and ignored by those with positions the evidence is inconvenient for), non-box data consistently favors Lebron. In fact, with larger and more inclusive samples, Lebron's best signals look even better while Jordan's look worse. Even with WOWYR(which excludes much of the data for Lebron listed above and compounds a sample-size issue by tossing around noisy adjustments across the duration of a player's prime(EX: Bulls without 89 Pippen factors largely into how we assess 91 Pippen)), "Jordan being ahead of every goat candidate" is misleading. His advantage over Kareem is questionable as corrected data puts Kareem's WOWY ahead(the base of WOWYR). His advantage over Russell is entirely a matter of framing(do you value championships or srs?) as WOWYR suggests that Russell's 11 championships came with average help.

There is no non-box stat which generally puts Jordan #1, even that where Jordan is competing against a narrowed field(almost no on/off pre-97 for any player besides Micheal). Those unsourced github-sets(something Lessthanjake has refused to acknowledge while calling whatever doesn't favor Jordan "flawed") only really advantages Jordan over the likes of Malone and of course full RAPM for 97/98 places him well below Lebron under the same creators:
Image
(Cheema, the spax)

(keep in mind, it is not clear that second-three-peat MJ's "empirical impact" is lower than first-three peat MJ with the snippets we have arguably favoring the latter years). Lebron also dominates a field of players who can scale favorably against Mike when we use data that is available for both(Duncan, KG, Steph, Shaq, ect):
James is, arguably, the king of overall plus-minus stats. 2018 is the 25th season of league-wide plus-minus data, which covers nearly 40 percent of the shot-clock era and touches 12 of the top-20 players on this list. None have achieved LeBron’s heights: He holds four of the top-five scaled APM seasons on record, and six of the top eight. Since 2007, 10 of his 11 years land in the 99th percentile.

(scaled-apm, JE)

Among contemporaries, prime Magic(mostly excluded in the data less-than-jake pulled up), is consistently advantaged. So is David-Robinson(pollock, wowy, wowyr, concentrated), with Hakeem mostly grading out as a regular-season peer(and then sees the most team-level elevation in the postseason of anyone from that time-period).


Lessthanjake, bastion of fairness and honesty, repeatedly seeing this sort of non-box data, found various excuses to disregard it(and then attack the credibility of those who provided it) on grounds that were better applied to a metric they preferred(WOWYR):
Spoiler:
It’s logical extrapolation using basically completely unrelated data points and trying to draw a meaningful conclusion from them. You might as well pick a completely random team that you happen to think is similarly or less talented than the Bulls without Jordan and look at their wins/SRS and say that the difference is the upper bound of Jordan’s impact.

They are different rosters in that the 88 Bulls were better. I am comfortable giving Jordan an unfair amount of credit, yes. You quite literally just threw out 3-variants of WOWYR, a metric which uses 89 Pippen to assess 91 Pippen as a teammate. This is what "impact" comparisons pre-97 is. You look at the with, you look at the without, and you use your understanding of the game to try and adjust for context and isolate variables. The difference here is I'm using 82-games while WOWYR is using either a handful(overall) or 8 games a season(it is not at all clear what is counted and Ben explicitly makes a distinction between 1994 and "wowy") for one guy and 2.2 games for another(that's bill russell) and THEN decides to make "corrections" based on the 3 games a teammate missed 5 years back or the 70 games another teammate missed 10 years forward. This is why Ben explcitly outlines using "shorter-time periods" as a solution and then uses "shorter-time periods" as the backbone of his impact write-ups.

WOWYR is extrapolated from "WOWY" utilizing "corrections" that are then applied to every year within a stretch. IOW. They take teammate "impact" from different versions of a team, multiple years removed from each other(the thing you said was "useless!", "ridiculous!", and "bad-faith!") to "correct" WOWY. The difference being the sample here is much smaller than with the extrapolations you didn't like for that same reason.

Whether it's deliberate, or simply a byproduct of jake mostly having no clue what they're talking about...
AEnigma wrote:
lessthanjake wrote:“More variance” is otherwise known as a higher parity environment where the best teams win a smaller percent of their games.

No, variance here is essentially referring to how low scores and deeper rosters present easier opportunities for an upset. Three NHL teams have won more than sixty times across an eighty-two season; none of them won the cup nor even made the Finals. NHL and MLB teams could win the same percentage and be innately far more prone to upsets than NBA teams. And it is weird you would try to drag in a baseball analogy without understanding that on a base level.

...the end-result is often an nonsensical jumbling of buzz-words, silly assumptions, and implicit or explicit hostility disguised with relatively sophisticated sentence structure.

For example...
lessthanjake wrote:Actually kind of yeah. I’ve explained this a bit above. But if you have a really ball-dominant style, when you go to the bench, your teammates will have not really touched the ball much in the flow of the offense beyond just finishing some plays with a shot, so they’ll be less in rhythm overall.

Wow you "explained" this, huh. Gee, thanks! Just one problem:
Spoiler:
letskissbro wrote:Both Luka Doncic and James Harden have actually played under the exaggerated caricature that people have made "LeBron ball" out to be, yet their teams fare significantly better when they're off the court than LeBron's teams and often field positive net ratings with their stars on the bench.

LeBron's average times of possession in that 16-21 stretch from Ben's graph (minus 2019): 5.3, 6.4, 6.7, 7.4, 6.4
Corresponding net ratings for those teams while he sat: -4.3, -8.8, -0.5, -0.9, -2.0

Luka (20-23): 8.9, 8.9, 9.3, 9.1
Off NetRtg: +4.2, +0.3, +3.4, -2.7

Harden (17-23): 9.3, 8.8, 9.3, 8.6, 8.6, 9.2, 8.6
Off NetRtg: +3.7, +5.2, +1.1, -3.4, +2.4, +1.2, +2.8

If LeBron spends approximately six seconds with the ball in his hands, and that leads to over-reliance on him, why is it that Harden and Luka, who hold the ball for 50% longer, dribble the ball significantly more per touch, play under a similar team setup, and have comparable usage rates, see their teams perform well in their absence?

AEnigma wrote:
lessthanjake wrote:Maybe it is. It’s less plausible, since he isn’t as ball-dominant, so other guys do get touches on the ball a good deal. Notably, you don’t hear NBA players talking about how it is an issue to get in rhythm playing alongside him, like you do regarding the really heliocentric ball-dominant guys. In fact, players talk the exact opposite about playing with Jokic. So it seems much less plausible.

Yet somehow his exit to the bench dwarfs the bench effects we see from “the really heliocentric ball-dominant guys” like Harden, Luka, Wall, Morant, Parker, DWill, Rose… Even Chris Paul and Steve Nash and peak helio Westbrook (which does need to be specified because for whatever mysterious reason the team survived his absences much better outside of that 2016-18 period…) see less of a drop-off than Steph Curry. Very strange, given this very real and very legitimate theory about how “helio” stars routinely ruin their benches.
:thinking:

But the better explanation is probably just that they’re not at all deep at the center position, so they’re often trotting out some pretty bad players when he’s off the court

Fascinating. I wonder if you might be onto something there: not being easily replaced means your team struggles more without you??? Going to need to double-check the numbers on that, because I was under the impression stars controlled their replacements and that having replacements incapable of recreating their effect was in fact a failing on them.

It's bullsh-t,

That is why you've been "mocked", not because you voted for Jordan(plenty of people did without facing "toxic" pushback), but because you present assumptions you cannot actually back-up as if they are obvious truths, questioned for centering your arguments around these assumptions, and instead of responding like a reasonable person, you lash out and accuse other of being toxic and "drinking their kool-aid":
But you’re really just repackaging preconceived notions into boldly-pronounced arguments that are based on absurdity, like that we can measure Jordan’s impact by comparing the results of two essentially completely different rosters.

This an example of projection.

It was after all, you who assumed that every team Russell has ever played on was stacked because a player who by 1969 was a 6th-man was voted in the top 100. You who assumed the Bulls were no better than the Heat in terms of help because the players at the top of the rosters were similar(while actively trying to dismiss health as a factor). And you who disregarded actual results without Jordan to confidently claim they were "bottom line: not a significant contender" because you had "watched enough to know their talent level" while apparently not picking up on how the triangle worked.

You have literally spent the last four threads spamming unsourced data while accusing others of statistical malpractice. You then had the audacity to demand an apology when you were rightly called out. If you have any self-awareness, I'd suggest you stop crying wolf. The backlash you've faced is on you, no one else.

PS: posters in the project have literally told me they were going to obscure their preference for peak as to not "spark controversy", so let me, on their behalf, say the quiet part out loud:
Yank3525 wrote:
Chanel Bomber wrote:It's something I've been thinking about as well.

We can reach the most well-informed level of debate, but the way we weigh those different criteria will always be somewhat arbitrary and tough to rationally arbitrate.

I would just say that - assuming a level playing field in terms of external circumstances - in the peak vs longevity argument (simplistically framing the 80% vs 50% example that you brought up), the value of longevity would be to enable the player to achieve as much if not more, but definitely not less.

And this is where people lose me with the longevity argument for LeBron. Because it's not met with the outcome that would be reasonably expected from it. It rings hollow.


Exactly. The longevity argument would hold more weight to me if LeBron was still cranking out years like 2019-20 instead of what he has done the last couple of years. He has basically become a part time player who has accumulated nice counting stats but doesn’t have the impact on winning that he once did.

Lebron is better at basketball, as the data(when it's not comically cherrypicked) suggests he is. Longevity isn't necessary. Lebron was better young, he was better old, and he was better at his best. Jordan's stock in PC has not fallen because kareem played extra-years. It's fallen because people are now more interesting in winning than subjective definitions of "production", and the idea that it was simply longevity that dragged Jordan down is just a misrepresentation on what's been happening over the last year. It wasn't longevity that has seen multiple people argue for Hakeem, Duncan, Kareem, and Russell on the basis of how good they were. So let's stop reducing this debate to a regurgitation of Zach Lowe. It wasn't just "lebron fans", it was a bunch of posters who like different players coming together and thinking about basketball a bit differently. Why don't we analyze that, instead of hitting the same beats as first things first.


You sound like my marketing department, spending a lot of time and effort making the numbers look how you want them to justify your position whilst ignoring the most important part. Results and context.
Gregoire
Analyst
Posts: 3,533
And1: 672
Joined: Jul 29, 2012

Re: What's the strongest data-driven argument for Michael Jordan as GOAT? 

Post#720 » by Gregoire » Fri Jul 14, 2023 5:03 am

ty 4191 wrote:
OhayoKD wrote:Will keep all this in GB as to not derail the project but uh...

SNIP

Lebron is better at basketball, as the data(when it's not comically cherrypicked) suggests he is. Longevity isn't necessary. Lebron was better young, he was better old, and he was better at his best. Jordan's stock in PC has not fallen because kareem played extra-years. It's fallen because people are now more interesting in winning than subjective definitions of "production", and the idea that it was simply longevity that dragged Jordan down is just a misrepresentation on what's been happening over the last year. It wasn't longevity that has seen multiple people argue for Hakeem, Duncan, Kareem, and Russell on the basis of how good they were. So let's stop reducing this debate to a regurgitation of Zach Lowe. It wasn't just "lebron fans", it was a bunch of posters who like different players coming together and thinking about basketball a bit differently. Why don't we analyze that, instead of hitting the same beats as first things first.


This is compelling, seminal, amazing work here, Ohayo. Please keep it up!! Outstanding!!!!!!!!! 8-)


Oh, man, its what Im talking about... Two more James fanboys get here and try to make up numbers as they usually do only from frustration that his idol is not near basketball player MJ was. :D
Heej wrote:
These no calls on LeBron are crazy. A lot of stars got foul calls to protect them.
falcolombardi wrote:
Come playoffs 18 lebron beats any version of jordan
AEnigma wrote:
Jordan is not as smart a help defender as Kidd

Return to The General Board