Penalized Regression of WOWY data

Moderators: Doctor MJ, trex_8063, penbeast0, PaulieWal, Clyde Frazier

User avatar
Moonbeam
Forum Mod - Blazers
Forum Mod - Blazers
Posts: 10,340
And1: 5,102
Joined: Feb 21, 2009
Location: Sydney, Australia
     

Re: Penalized Regression of WOWY data 

Post#101 » by Moonbeam » Fri Aug 4, 2023 1:13 am

Moonbeam wrote:
homecourtloss wrote:
Moonbeam wrote:Here are the MVP winners of the 2010s, minus Giannis (I have to tweak the code to add more than 6 players). The data runs through 2022 as I still have to get box scores for 2023.

Image


Could you run for Draymond, Gobert, Dikembe, Mourning, and RAPM warrior Bo Outlaw?


Image

Bo doesn’t fare so well here.


Fixed the post-96 data here. Bo Outlaw looks a bit better than he did.

Image
User avatar
Moonbeam
Forum Mod - Blazers
Forum Mod - Blazers
Posts: 10,340
And1: 5,102
Joined: Feb 21, 2009
Location: Sydney, Australia
     

Re: Penalized Regression of WOWY data 

Post#102 » by Moonbeam » Fri Aug 4, 2023 1:17 am

Moonbeam wrote:
AEnigma wrote:
Moonbeam wrote:
Image

Bo doesn’t fare so well here.

Bo not too surprising imo; he is ultimately a role-player, and there is no WOWY approach where he seems to come across as a secret superstar (or even a secret Bobby Jones).

What is a lot more interesting is Dikembe and how yet again we come across a substantial disconnect with WOWYR and the conclusions often drawn from it.

On that note, could I see a graph with Kobe, Pierce, Vince, McGrady, Ray Allen, and Eddie Jones?

And with the expansion backward, I would be similarly interested in a graph with Thurmond, Unseld, Lanier, Hayes, Cowens, and Reed.

As always, whenever convenient. :love:


Here are the 90s/00s stars:

Image

And here are the 60s/70s centers:

Image

By the way, I think the modern guys can give us some nice "sanity checks" on this methodology. The Cheema 5-year RAPM scores I compared RWOWY-Ridge to in the document are prior-informed, whereas these are not prior-informed, so that may explain some discrepancies, but it's still good to benchmark against what is available through more precise +/- data.


Here is the update for the 90s/00s stars:

Image
User avatar
Moonbeam
Forum Mod - Blazers
Forum Mod - Blazers
Posts: 10,340
And1: 5,102
Joined: Feb 21, 2009
Location: Sydney, Australia
     

Re: Penalized Regression of WOWY data 

Post#103 » by Moonbeam » Fri Aug 4, 2023 1:21 am

Moonbeam wrote:
Doctor MJ wrote:So, the next query on my mind are current players that aren't already listed but who could end up in the Top 100. List is going to be longer, so I'll just make groups of 4 to make them as contemporaneous as possible. Please feel ZERO pressure to do any of this. I just have enough interest that I can probably ask questions at an annoying rate.

Paul George
Kawhi Leonard
Jimmy Butler
Klay Thompson

Anthony Davis
Damian Lillard
Giannis Antetokounmpo
Nikola Jokic

Devin Booker
Joel Embiid
Jayson Tatum
Luka Doncic


Here you go. Some of these to me suggest some winning bias, potentially? Not sure. Also, my console is not fond of accented characters, so my apologies for the '?' symbols that it converts everything it doesn't recognize to. :lol:

Image

Image

Image


Here are the updates:

Image

Image

Image
User avatar
Moonbeam
Forum Mod - Blazers
Forum Mod - Blazers
Posts: 10,340
And1: 5,102
Joined: Feb 21, 2009
Location: Sydney, Australia
     

Re: Penalized Regression of WOWY data 

Post#104 » by Moonbeam » Fri Aug 4, 2023 1:23 am

Moonbeam wrote:
eminence wrote:An Eminence favorites request:
Andrei Kirilenko
Baron Davis
Ricky Rubio
Derrick Favors
Joe Ingles


Here you go. Rubio and Jingles looking pretty comparable here!

Image


Here is the fixed version:

Image
User avatar
Moonbeam
Forum Mod - Blazers
Forum Mod - Blazers
Posts: 10,340
And1: 5,102
Joined: Feb 21, 2009
Location: Sydney, Australia
     

Re: Penalized Regression of WOWY data 

Post#105 » by Moonbeam » Fri Aug 4, 2023 1:25 am

Moonbeam wrote:
70sFan wrote:I think comparing the 1980s high scoring wings wojld be cool to see as well:

Adrian Dantley
Alex English
Bernard King
Mark Aguirre
Marques Johnson
Dominique Wilkins


Some of my favorite players ever in this group. I had some graphs including these guys in the document, but now I've got their earlier seasons too.

Image


Fixed the tail end of Nique and Aguirre with the post-96 window:

Image
User avatar
eminence
RealGM
Posts: 17,112
And1: 11,905
Joined: Mar 07, 2015

Re: Penalized Regression of WOWY data 

Post#106 » by eminence » Fri Aug 4, 2023 1:26 am

Rudy Rudy Rudy
I bought a boat.
User avatar
Moonbeam
Forum Mod - Blazers
Forum Mod - Blazers
Posts: 10,340
And1: 5,102
Joined: Feb 21, 2009
Location: Sydney, Australia
     

Re: Penalized Regression of WOWY data 

Post#107 » by Moonbeam » Fri Aug 4, 2023 1:27 am

Moonbeam wrote:
lessthanjake wrote:A somewhat specific/random request from me:

Gilbert Arenas
Caron Butler
Antawn Jamison
Larry Hughes
Brendan Haywood

I’m kind of bizarrely convinced from having watched them for years that Brendan Haywood was actually the most impactful player on the early/mid 2000’s Wizards, and I’m just curious if I can add some fuel to my fire on that take.


Image

Lots of overlap among these guys.


The fixed version gives Haywood a bit more love.

Image
User avatar
Moonbeam
Forum Mod - Blazers
Forum Mod - Blazers
Posts: 10,340
And1: 5,102
Joined: Feb 21, 2009
Location: Sydney, Australia
     

Re: Penalized Regression of WOWY data 

Post#108 » by Moonbeam » Fri Aug 4, 2023 1:32 am

AEnigma wrote:
Moonbeam wrote:

What about Reggie Miller, Tim Hardaway, Penny Hardaway, Grant Hill, Allen Iverson, and Chauncey Billups?

Thanks again. Fascinating stuff.


Here you go!

Image
lessthanjake
Analyst
Posts: 3,468
And1: 3,102
Joined: Apr 13, 2013

Re: Penalized Regression of WOWY data 

Post#109 » by lessthanjake » Fri Aug 4, 2023 1:32 am

Moonbeam wrote:
Moonbeam wrote:
lessthanjake wrote:A somewhat specific/random request from me:

Gilbert Arenas
Caron Butler
Antawn Jamison
Larry Hughes
Brendan Haywood

I’m kind of bizarrely convinced from having watched them for years that Brendan Haywood was actually the most impactful player on the early/mid 2000’s Wizards, and I’m just curious if I can add some fuel to my fire on that take.


Image

Lots of overlap among these guys.


The fixed version gives Haywood a bit more love.

Image


Yesssss!!!
OhayoKD wrote:Lebron contributes more to all the phases of play than Messi does. And he is of course a defensive anchor unlike messi.
User avatar
Moonbeam
Forum Mod - Blazers
Forum Mod - Blazers
Posts: 10,340
And1: 5,102
Joined: Feb 21, 2009
Location: Sydney, Australia
     

Re: Penalized Regression of WOWY data 

Post#110 » by Moonbeam » Fri Aug 4, 2023 1:37 am

DraymondGold wrote:So it looks like the only standard ~Top 25 candidates we're missing (i.e. not in this thread or in the paper in OP) would be,

-Young 70s Kareem (we only have 77+ Kareem. e.g. could compare him vs other NBA 70s stars you're interested in, such as Walton or Cowens or McAdoo or Frazier or Havlicek)
-The 90s–2010s Big men: Shaq vs Duncan vs KG vs Dirk
-the 00s/10s guards: Chris Paul vs Wade

If you have time (no rush, and no worries if not!), I'd love to see these comparisons :D


Here you go:

Image

Image

Image
ceiling raiser
Lead Assistant
Posts: 4,531
And1: 3,754
Joined: Jan 27, 2013

Re: Penalized Regression of WOWY data 

Post#111 » by ceiling raiser » Fri Aug 4, 2023 1:37 am

Hi Moonbeam, this is really promising. Thanks for your work on this. Do you have a dataset with the top 10 guys by percentile by year for RWOWY-Ridge?
Now that's the difference between first and last place.
User avatar
Moonbeam
Forum Mod - Blazers
Forum Mod - Blazers
Posts: 10,340
And1: 5,102
Joined: Feb 21, 2009
Location: Sydney, Australia
     

Re: Penalized Regression of WOWY data 

Post#112 » by Moonbeam » Fri Aug 4, 2023 1:43 am

homecourtloss wrote:I feel like we’re getting an exclusive service not available anywhere else for free. Is it possible to run one for Drexler and Terry Porter?

And then one for Ewing, Pippen, Barkley, Reggie Miller, and Payton?


Here you go! Surprised Drexler outpaced Porter that much.

Image

Image
User avatar
Moonbeam
Forum Mod - Blazers
Forum Mod - Blazers
Posts: 10,340
And1: 5,102
Joined: Feb 21, 2009
Location: Sydney, Australia
     

Re: Penalized Regression of WOWY data 

Post#113 » by Moonbeam » Fri Aug 4, 2023 1:47 am

Doctor MJ wrote:I'm going through this just like AEnigma did, so I think I'll just interlace my comments with his (with my own thoughts in blue)

AEnigma wrote:Love to see all the new graphs!
Initial reactions (with an eye to the top 100 project):



Bob Davies fans betrayed

I wouldn't quite say that. I think we have to remember that Davies retired in 1955 as a 35 year old so there isn't any period on the chart where he's actually playing the whole time, and he's never in prime in the periods in question. It's really the years 1945-46 to 1950-51 where the Rochester Royals make their mark, and Davies' heyday is in the '40s.

On the same graph:

Slater Martin looks exceptional. Notice he's literally ahead of Cousy basically their entire career.

Andy Phillip also looks strong.

I have to say I'm a little weirded out by Cousy looking so strong at the end of his Celtics run given that they improved the year after he retired. I feel like in general the WOWY metrics are at their best with sustained runs where the player in question and all major teammates are all playing throughout it. When guys retire or switch teams, I'm sure there are times when that's quite telling, but other times it kind feels to me like there's a kind of inertia going on.


- Prime Sharman somewhat affirmed, but for top 100 purposes the question continues to be about his prime length. Would have a stronger case with separation over Cousy.

I'd say I agree here. To me it's always been a question of whether Sharman actually deserves to be ahead of Cousy, but I've never been able to pull that trigger, and this data certainly isn't going to help with that.

Paul Seymour is a guy I've championed - though not realistically as a Top 100 guy - so I should speak to that here. Note that Seymour takes over as coach in '56-57 and effectively demotes himself. While it's possible that his +/- was excellent in this time period, even if he was, in such limited minutes, I'd be surprised if he made much of a mark after that.


- Interesting Arizin results

Indeed. I have to acknowledge some surprise. Given the way the team fell of a cliff without him after his initial run and then rose back up to record the best offense of the '50s while winning the title, I'd have expected he look better by a WOWY measure. What I will say though is that I feel like the multi-year way that Moonbeam is doing this, while entirely reasonable to reduce noise, probably doesn't show Arizin's impact as well as a more granular measure.

I also think that frankly Wilt Chamberlain's arrival messes a lot up for his early teammates in these measures. With Arizin really never seeing his ability to volume score with above league average efficiency cease, I think in another situation his WOWY metrics would look more graceful as he aged.

Pollard looks excellent. Not all that surprised.

Yardley looks excellent in the later years, which makes sense statistically, but it's interesting because his deep playoff runs are basically entirely separate from his best WOWY scores.


- Schayes maybe not as outstanding as we would want

I actually think Schayes looks quite good, it's just that like Yardley, he peaks after the team peaks. So long as we refrain from adopting a rings narrative with him, I think we're good. In between Mikan & Russell, I'd say Schayes career stands above everyone else in that period.

Mikkelsen is eye-opening. After seeing that Pollard & Martin looked so good, I was wondering about Mikkelsen. Had he been outstanding, I think we'd need to worry about the Laker dominance just making all the guys look amazing. Turns out, not so much.



- Mikan affirmed


So, in general, Mikan looking so good makes sense, but I have to say it's hard for me to understand why we don't see more of a dip in the end. After MIkan retires in 1954, the team is still quite good the next year, and when Mikan returns the next year - while they are better with him than without him - they don't really seem like they are better than the previous year. I'd have thought we'd at least seem him fall to something other than the 90+ percentile.

The end numbers of Neil Johnston and especially Larry Foust are part of what I'd point to to urge caution. I don't believe that Foust was a below average player through his prime and became elite as an old man, for example.



I'll jump in h ere to just mention Larry Costello as a guy who was an excellent player. Wouldn't expect to champion him in the Top 100, but I don't think he's far away from that.


- Neither Greer nor Jones looking like a true standout, but Jones has his playoff elevation.

So in this graph it's Tom Gola who stands out to me. A college mega-star who was particularly noteworthy in how much he hated playing with Wilt Chamberlain - at least the way the scheme was implemented. He clearly felt like the result that was something that took away what was special about him, and I think the data here seems to agree with him. Once again I'm left thinking that the team Wilt joined actually had solid pieces that a coach like Alex Hannum probably could have done something great with, given that he had spikes of success with Wilt in his later stints with him.

I too am surprised that both Greer and Jones look so close to average most of their years.


My first thought is that I'm surprised Hagan doesn't look better, but looking more closely, and looking at the next graph, Hagan clearly has a pretty good run - better than Pettit during the championship time period. I think what surprises me is that I know that in the ABA Hagan looked pretty damn good in the first year before tailing off. That really gives me the impression that he probably could have been doing more for the Hawks in the NBA in the mid-60s, but of course, there's no reason that this metric would infer this.

I'm really not comfortable with Baylor's graph here. By this measure Baylor really figures out how to be impactful after he's injured and carrying on as a first option with a better first option in Jerry West being suppressed. I think we're getting some confounding variables here.

Surprised that Twyman is so, so low. I think that's not really fair to the guy.



- Pettit even less so


Yeah, Pettit looks like a good player, but really not that amazing.

Heinsohn is the stud by this graph, and that's something I'm going to have to chew on. Feel like I should ask for a Celtics' only graph from the '50s & '60s. It's well within the realm of possibility that Heinsohn made more of a WOWY difference because of the Celtics lack of other options, but that Pettit would have been considerably stronger by the same metric were he on those Celtics.


- You already showed Wilt in this thread, but will emphasise it is neat how this goes against some of the common narratives about his “impact”


This is all true, though I do think that the story would look different with more granular WOWY. Hard to imagine a metric that was specifically focused just on the years '67-68 & '68-69 would show Wilt to be anything like elite. Again, Moonbeam focusing on larger samples makes sense for noise reduction, but from a perspective of "I guess all those criticisms of Wilt amounted to nothing", I would object on the grounds that the metric is allowing Wilt to look more consistent in his WOWY impact than he literally was.

Kerr looks like another guy with iffy high numbers at the end of his career.


- Wilkens looking worse than I may have thought


Yeah, his post-Hawk numbers just aren't the same here.

And for completeness I'll just note that we have Oscar & Wilt on the same graph, and both look outstanding.



I'm disappointed that Dick Van Arsdale doesn't look better. Everything I've seen of him seems to indicate he'd be an extremely valuable role player.

Telling that Goodrich has a spike of value around the time of the Laker chip and that's it. I think Goodrich legit was a critical part of what made that team amazing, but I think in general he was just a gunner even when that wasn't what the team needed.

Interesting that Jeff Mullins seems to look like the best of the trio. Not a guy I've thought a lot about.



The next graph is interesting. Billy Cunningham comes off as a legit star for a sustained period of time.

Tom Van Arsdale, Dick's twin, comes off awful, and I'm not surprised. I've not read about their youth but it seems like they adopted a playing style together where Tom was the scorer and Dick was the guy filling in the gaps. In the pros, that made Tom not good enough at the only thing he was good at, whereas Dick scaled better. I find this twin-as-AB-experiments to be very interesting whenever we see them. No guarantee they had literally the same talent, but probably Tom could have learned to play more like Dick from a young age.



Next graph has Dave DeBusschere as the most consistently impressive of the lot, with weird graphs for the other guys especially Jerry Lucas. As I've said, guys spiking their WOWY at the end of their career raise my eyebrows.



- Happy to say Beaty affirmed

Indeed, interesting that both he and Nate Thurmond seem to have the general edge over Willis Reed.

Next couple graphs are ABA, and I'd say Jimmy Jones impresses the most, though Donnie Freeman seems remarkably consistent.

We finally get Connie Hawkins...and honestly I'm not surprised that he falls off quickly, but I'm glad to at least see he starts off strong.

In the graph, I'd say it's a pretty tight race but that John Havlicek has a slight edge over Rick Barry, with Chet Walker as something of a wild card. Glad to see Walker come off looking impressive.


Next graph is not very exciting. Bob Netolicky's early years seem like something I need to look more closely at.

Interesting the way Wes Unseld and Mel Daniels parallel each other before Daniels falls off first. Makes sense.

- I myself have noticed Frazier’s more paltry late career WOWY indicators; makes for a tough assessment when his box production did not actually drop off too much.

His prime impact certainly looks legit though, we're not seeing numbers like this from Reed or DeBusschere.

Dave Bing looks awful, and I'm afraid I'll never be able to think about him without thinking about the fact that he made the NBA's Top 75. Nothing personal against the guy, but it's just indefensible on any actual basketball level.


Earl Monroe comes off looking more consistently solid than I'd have expect.

Warren Jabali starts off strong before falling off, which is of course the story of his drug-addled career.


- Decent support for Dandridge, although the drop-off replacing 1974 with 1979 (his box score peak) is odd.

- Billy Knight surprising

I'm surprised too, but I think what's going on here is that I'm very impressed by Dandridge's role in making the Bullets more matchup-resilient in the playoffs, but to the WOWY metric, the Bullets look like they were already quite good. Good to see him scoring highly at least for those earlier year.

Yikes Knight looks bad.

Erving as I said before: More consistently excellent than I'd expected.



The next graph is kind of a trainwreck. I've long been skeptical about Elvin Hayes, and I'd say this graph makes it look justified. Haywood and McGinnis also look like guys with a lot of blah years.

Dan Issel looks like the best of the bunch but even with him, I was expecting his calling card to be consistency through longevity, and that's not what we're seeing.

Poor Sidney Wicks looking like the opposite of John Wick.



- McAdoo underwhelming (never been a big fan of his archetype), but the other five 1970s centres look like confident top 50 contenders (or in Walton’s case, would-be).


So to me this is a graph to really focus on. Kareem looks great by all reasonable standards...but also looks more like what I see his impact compared to the guys who seem to surf the 100th percentile like Russell, Wilt...and Walton along certain stints.

Bob Lanier looks really stellar, he's definitely going to be someone I have to think more about.

Cowens looks excellent as well.

Gilmore has the issues that I'd expect. I'm not particularly disappointed, but I will be considering whether Cowens & Lanier should rank ahead of him.

Yeah, McAdoo, I'd have expected better.[/color]


- How dare you excise Gus Williams :cry:


I'm not a Gus Williams true believer, but would be interested in seeing him - and would be cool to see DJ & Sikma in the same graph.


I'm really surprised at how meh Gervin looks.

I think people in the Top 100 who are so skeptical of Bird need to see this data. He's absolutely among the guys who seem to surf the 100th percentile. Any guy who can do that is showing signs of being an outlier.

Interesting seeing how good Walter Davis looks early on. Another guy whose career gets messed up with drug issues.

Marques is another of those guys, and he looks good up front, but I'd have thought he'd look even better.

Bernard King looks absolutely legit for that brief window.

Dantley & English look iffy.


Bobby Jones looks quite good, particularly when we consider that limited minutes should be hurting him here.

Dan Roundfield and Larry Kenon looks quite good too.

Mo Lucas, a bit soft.



So, Robert Parish looks better than Moses Malone. That's really something. I'm not ready to argue for Parish before Moses, but it's certainly going to help me champion Parish later on if the opportunity avails.

Jack Sikma also looks awesome, with Bill Laimbeer not looking bad either, but I'd have thought Laimbeer would look better than Sikma.


Magic Johnson a clear cut 100th percentile surfer. Just incredible.

Isiah looking good but not like a guy who was a worthy rival to Magic or Bird.

Kinda expected Mo Cheeks to look stronger here. He doesn't look bad, but he also doesn't seem like someone I should be championing in the 100.


Dennis Johnson looks really awesome. Sidney Moncrief looks almost as awesome, but I was higher on Moncrief and had more questions about DJ.

Something that should be noted about DJ's performance here: The way he jumped from Seattle to Phoenix to Boston on great teams the whole time may be helping him more than it should. Obviously, he deserves credit for being a vital part of all these good teams - and we shouldn't forget that - but I could see the algorithm assuming DJ was carrying more impact with him than was actually so.

David Thompson absolutely looks like a legend early on before falling off, which seems right.


Feel like I already said this before but in the next graph, the way Worthy & Wilkins seems to change places is really funny. I'm inclined to say that the way Wilkins spikes late in his run is a bit suspicious of the algorithm whereas the way Worthy falls off feels more like a concern for Worthy post-Magic.

And yeah, Mullin seems like the guy whose niche is just plain valuable everywhere and he could do it for a long time.



In the next graph, to me Larry Nance ends up looking better than everyone else, and that's so interesting because Kevin McHale is in the graph and Bird & Parish look so impressive. Though as I look back up the graphs, the data doesn't really seem to be saying Parish was more valuable in his prime so much as that he and McHale were similarly valuable in prime, but that Parish has the longevity edge.

Love seeing Buck Williams here. Always a guy I'm considering championing. Not sure if I will this time.


I think the 90s bigs is the same as before: Similar peaks for all 4 guys with David Robinson being the most consistent of the bunch. Makes sense, and if the regular season were the only season, he'd be above Olajuwon on my list. As is, Olajuwon > Robinson > Ewing >> Daugherty remains my assessment.


With the '90s point guards, we get very close primes with something of a gap between the rest and Tim Hardaway. Stockton definitely takes the prize with his sustained excellence, and Payton I think comes in next, though he didn't maintain his excellence the same way. I do think KJ looks quite good with him lasting longer than we generally think.


- Interesting Reggie Miller results, if not too affecting for the most substantial playoff riser among that group


I have to say I'm really surprised Miller doesn't look stronger here. Having a weaker prime showing that Jordan & Drexler isn't really surprising, but the way Dumars maintains that lead over Reggie surprises me. As you say, Playoff Reggie is absolutely a thing, but I'm going to want to chew on this more. Miller is a guy I've championed as a matter of course for a long time on our projects and I probably will this time too because I'm just higher on him than most, but I thought he'd look stronger here.

Alvin Robertson really looks awful and he's not making those DPOY voters look like they knew what they were doing at all.

'90s small forwards: Pippen looks very solid - though Majerle and Schrempf look pretty solid too.

'90s power forwards: Yeah this thing where Barkley looks consistently better than Malone for most of their career surprises me.

'90s-00s center: Shaq's a 100th percentile surfer. Zo looks superb and considerably better than fellow Hoya Deke.


Lots of great comments here. I agree that these results are smoothed out with some "inertia" due to the 5-year timespan. I am generating 3-year windows at the moment which would make for an interesting comparison.
User avatar
Moonbeam
Forum Mod - Blazers
Forum Mod - Blazers
Posts: 10,340
And1: 5,102
Joined: Feb 21, 2009
Location: Sydney, Australia
     

Re: Penalized Regression of WOWY data 

Post#114 » by Moonbeam » Fri Aug 4, 2023 1:52 am

eminence wrote:
Doctor MJ wrote:So, just wanted to have a post specifically for the 100th percentile surfers. Basically guys who regularly hit that 100th percentile in sustained runs in the 90s and above.

George Mikan
Bill Russell
Wilt Chamberlain
Oscar Robertson
Jerry West
Bill Walton
Larry Bird
Magic Johnson
Michael Jordan
Shaquille O'Neal

Honestly, seems about right. Curious who else is like that when we see more graphs.


My guesses would be Duncan/KG/Dirk/LeBron/CP3/Steph based of the more granular stuff, but who knows.

I would enjoy having some of this stuff in a spreadsheet/table to browse for sure.


I am thinking about the best way to share this data. I've still got to finish generating the 3-year windows (and maybe I'll even do a single-season version, though I don't know that it will be that valuable). One possibility is to develop an app/site that allows people to choose the window size (and possibly the model if I get around to some extensions) and create these graphs for themselves for players they select. That will take some learning on my part, but it is something I've been wanting to learn already.

A gigantic spreadsheet is another option, but I don't know how easy that would be to work with given the number of players and the different variations (Ridge, Lasso, Elastic Net).
User avatar
Moonbeam
Forum Mod - Blazers
Forum Mod - Blazers
Posts: 10,340
And1: 5,102
Joined: Feb 21, 2009
Location: Sydney, Australia
     

Re: Penalized Regression of WOWY data 

Post#115 » by Moonbeam » Fri Aug 4, 2023 1:53 am

Doctor MJ wrote:
eminence wrote:
Doctor MJ wrote:So, just wanted to have a post specifically for the 100th percentile surfers. Basically guys who regularly hit that 100th percentile in sustained runs in the 90s and above.

George Mikan
Bill Russell
Wilt Chamberlain
Oscar Robertson
Jerry West
Bill Walton
Larry Bird
Magic Johnson
Michael Jordan
Shaquille O'Neal

Honestly, seems about right. Curious who else is like that when we see more graphs.


My guesses would be Duncan/KG/Dirk/LeBron/CP3/Steph based of the more granular stuff, but who knows.

I would enjoy having some of this stuff in a spreadsheet/table to browse for sure.


Oh definitely, but I love the graphs Moonbeam is doing. I can process things considerably faster when they are so visual.

Feel like saying:

Identifying this tier of player dominance feels like a really big deal to me given the skepticism people are expressing about Bird. When I'm doing my lists, it's the ultra outliers I want to consider first. Now that doesn't mean I'm super high on Mikan compared to others because league difficulty is a sperate thing, but among contemporaries it's a critical starting point.

As I say this, the fact this is all regular season data is a thing to be cautious with. This weekend I want to think more about Dirk because I think he might be in this tier for the regular season...but he does have some significant playoff impact issues for a good chunk of his career, and since Dirk is someone I might be looking to nominate after Oscar so I should make sure I feel I'm properly factoring that in before I do so.

Of course Dirk's not the only one who may have these issues. Bird has been talked about as someone with issues along these lines.

Regardless, I do think the onus is finding arguments for the non-100th-percentile guys over the 100th-percentile guys.


Just a note that the data in these graphs isn't just regular season data, but playoff data (and playin data!) as well.
User avatar
Moonbeam
Forum Mod - Blazers
Forum Mod - Blazers
Posts: 10,340
And1: 5,102
Joined: Feb 21, 2009
Location: Sydney, Australia
     

Re: Penalized Regression of WOWY data 

Post#116 » by Moonbeam » Fri Aug 4, 2023 1:56 am

ceiling raiser wrote:Hi Moonbeam, this is really promising. Thanks for you work on this. Do you have a dataset with the top 10 guys by percentile by year for RWOWY-Ridge?


That information is all loaded into the files I'm using to generate these graphs, but I haven't separated it out yet. I am still thinking about the best way to present the raw data, but if you have particular windows you're interested in, feel free to let me know!
User avatar
Moonbeam
Forum Mod - Blazers
Forum Mod - Blazers
Posts: 10,340
And1: 5,102
Joined: Feb 21, 2009
Location: Sydney, Australia
     

Re: Penalized Regression of WOWY data 

Post#117 » by Moonbeam » Fri Aug 4, 2023 1:58 am

Doctor MJ wrote:A few quick thoughts:

First, I mentioned in my long post about concerns about guys who switch between good teams potentially getting too much credit. I think it's worth considering what players have a DJ-like contender-to-contender run and seeing how many guys come out looking as good as DJ and guys who don't.

Second, I notice that all of this is percentile-based. So long as we're talking about contemporaries that's really perfect, but when we consider between eras, I think we should be actively considering if there's a massive difference between eras, and in particular, if there are any massive jumps between different years. Moonbeam's 5-year approach will smooth this out, and that's not a bad thing in and of itself, but still, if we're seeing similar percentile levels in different eras that represent, say, more than double the scoreboard-impact compared to each other, I think it would influence our interpretation.

Third, at this point I'm an addict just waiting for my next hit and am just expecting to see stuff from all the big guys in the 21st century when Moonbeam has it ready, and I hope Moonbeam can continue to see this as a fun thing, but we're all putting work on a busy man. So Moonbeam, just let us know if we're driving you nuts.

I think ideally what I'd really love is the ability for us to do queries as we go along a project like the Top 100. "These are the guys we're discussing right now, and this is how this data looks". If this is something Moonbeam is cool to do, I'm happy to keep asking pretty please, but it probably behooves the rest of us to consider learning a thing.

Beyond that, I think what I'd be most interested in seeing are particular cores presented together. To just name some going through history. (Please don't feel compelled to do all or most of these, I'd expect that the dynasties would be most interesting.)

- Rochester Royals if we can get good numbers at least back to their joining of the BAA. (NBL back to '45-46 would be amazing, but the data is super sparse)
Key players: Bob Davies, Arnie Risen, Bobby Wanzer, Jack Coleman, Arnie Johnson

- Minneapolis Lakers ideally back to their joining of the BAA.
Key players: George Mikan, Jim Pollard, Herm Schaeffer, Slater Martin, Vern Mikkelsen, Clyde Lovellette

- Syracuse Nationals
Key players: Dolph Schayes, Paul Seymour, Red Rocha, Earl Lloyd, George King, Red Kerr

- Philadelphia Warriors
Key players: Paul Arizin, Neil Johnston, Jack George, Tom Gola, Wilt Chamberlain

- Boston Celtics
Key players: Bob Cousy, Ed Macauley, Bill Sharman, Bill Russell, Tom Heinsohn, Frank Ramsey

- Boston Celtics
Key players: Bill Russell, Sam Jones, John Havlicek, KC Jones, Tom Sanders, Bailey Howell

- Boston Celtics
Key players: John Havlicek, Dave Cowens, Jo Jo White, Paul Silas, Don Chaney, Don Nelson

- St. Louis Hawks
Key players: Bob Pettit, Cliff Hagan, Lenny Wilkens, Clyde Lovellette, Zelmo Beaty, Lou Hudson

- Philadelphia 76ers
Key players: Wilt Chamberlain, Hal Greer, Chet Walker, Billy Cunningham, Luke Jackson, Wali Jones

- Los Angeles Lakers
Key players: Elgin Baylor, Jerry West, Dick Barnett, Rudy LaRusso, Wilt Chamberlain, Gail Goodrich

- New York Knicks
Key players: Walt Frazier, Willis Reed, Dave DeBusschere, Dick Barnett, Earl Monroe, Bill Bradley

- Milwaukee Bucks
Key players: Kareem Abdul-Jabbar, Oscar Robertson, Bob Dandridge, Jon McGlocklin, Greg Smith

Okay I'm going to stop there because this is already beyond ridiculous and I think it's clear how I'm thinking. By all means if it makes sense, we could just focus on dynasties up to the present.

Thanks again Moonbeam!


Definitely happy to provide these graphs soon, but I've got some less fun work to do in the meantime. :lol:

I don't want you or anyone else to feel guilty about asking me for these. It's both fun for me and it's giving me ideas for a more formal article I'm hoping to submit for publication. I've never published any sports analytics articles before, but why not give it a whirl?
ceiling raiser
Lead Assistant
Posts: 4,531
And1: 3,754
Joined: Jan 27, 2013

Re: Penalized Regression of WOWY data 

Post#118 » by ceiling raiser » Fri Aug 4, 2023 2:00 am

Moonbeam wrote:
ceiling raiser wrote:Hi Moonbeam, this is really promising. Thanks for you work on this. Do you have a dataset with the top 10 guys by percentile by year for RWOWY-Ridge?


That information is all loaded into the files I'm using to generate these graphs, but I haven't separated it out yet. I am still thinking about the best way to present the raw data, but if you have particular windows you're interested in, feel free to let me know!

I like the three year windows. In particular, I want to take a look at Kareem's career, and the other big minute centers who overlapped with him. Chamberlain, Reed, Cowens, Walton, Moses.
Now that's the difference between first and last place.
OhayoKD
Head Coach
Posts: 6,042
And1: 3,934
Joined: Jun 22, 2022

Re: Penalized Regression of WOWY data 

Post#119 » by OhayoKD » Fri Aug 4, 2023 2:17 am

AEnigma wrote:That still counts lol. It is just one of those things where people should be careful not to track backward — “oh wow imagine what his ‘impact’ must have been at his physical peak!” Not that different from people looking at 1971/72 Kareem and then extrapolating to 1977 Kareem. It is why we still need to look at the players themselves rather than just whether they have some massive impact indicator outside of their precise peak range.

It is to Jordan’s credit that at 33-35 he was the best player in the world, just as it was for Lebron, and just as it was for Russell, and just as it theoretically could have been for Magic. And we can fairly analyse the competition at play for those years — Karl Malone and pre-peak Shaq maybe underwhelms compared to Jerry West and Willis Reed and a down-year Wilt — but it does not change that he was able to hold that rarefied status when most players are firmly post-prime. Those are the years which enshrined him, whether they be at #1 or #4 or #6.

Still, it's ironic that the years that "enshrine" him are his "post-prime" given that is typically presented as the case for Lebron and Kareem.

ShaqAttac wrote:
Doctor MJ wrote:So, just wanted to have a post specifically for the 100th percentile surfers. Basically guys who regularly hit that 100th percentile in sustained runs in the 90s and above.

George Mikan
Bill Russell
Wilt Chamberlain
Oscar Robertson
Jerry West
Bill Walton
Larry Bird
Magic Johnson
Michael Jordan
Shaquille O'Neal

Honestly, seems about right. Curious who else is like that when we see more graphs.

isnt mj only going top in expansion?

he was lower than hakeem, bird n magic earlier

Well yeah, but in a strictly "how does he compare to the the other best players" that isn't really wrong. Does pose some other problems(we have data-ball for 2nd three-peat Mj and it doesn't look that good if that's an emperical peak as opposed to a "post-prime" period)

All said, Lebron, Russell, and Magic all look like pretty strong(and sustained) impact kings. Kareem looks like a weaker one, but one nonetheless. Jordan, Duncan, Shaq, and Curry top at points but do not run an era like the players listed in the first(emperically at least).

Of course it's important to keep in mind what is actually being included or excluded here(moonbeam cannot make players miss games in their best years), and it is not a substitute for year-to-year analysis. KG barely misses games during his best years so takes a tumble. Kareem is barely missing years in various potential impact outleirs(70, 71, 72, 74, 77) while Lebron, Magic and Jordan see teams miss a large section of the season near their best years.

As you would with RAPM, and RAPM approximation should always be used side-by-side with raw signals

Interesting how different it looks pre-shrinkage:
Image
AEnigma
Assistant Coach
Posts: 4,130
And1: 5,977
Joined: Jul 24, 2022

Re: Penalized Regression of WOWY data 

Post#120 » by AEnigma » Fri Aug 4, 2023 2:19 am

Moonbeam wrote:Image

Image

Garnett and Jokic are some of the more fascinating deviations from what conventional WOWY may have suggested, and I think in that they might present a question to explore further: could it be possible that this regression is a tad skewed toward a sort of “on-court” result? Because that Boston spike is really jumping out at me relative to the “pure” WOWY indications from Garnett’s time in Minnesota.

Return to Player Comparisons