Data Sets

Moderator: Doctor MJ

User avatar
Silver Bullet
General Manager
Posts: 8,313
And1: 8
Joined: Dec 24, 2006

Data Sets 

Post#1 » by Silver Bullet » Thu Apr 22, 2010 10:53 pm

Hey guys I need some help

Compiling data sets is the most time consuming and arduous process in Statistical Analysis. It can take hours or days, depending upon the dataset to compile.

It would help us all greatly if we share our datasets and put together a database of all the sets.

So if you wouldn't mind sharing some of the data sets you have compiled over the past few months years, it would be a great help.

I'll start of with my most recent one:

Shooting Percentages for all guards averaging over 32mpg and 20ppg (1964-2010)
http://rapidshare.com/files/378985085/Guard_Shooting_.xlsx.html

Some other ones I'm looking to compile but may not have time for, so if somebody would be willing to share, it would help a lot.

Shooting Percentages for all point guards (1960-2010)
Shooting Percentages for all shooting guards (1960-2010)
Shooting Percentages for all small forwards (1960-2010)
Shooting Percentages for all power forwards (1960-2010)
Shooting Percentages for all centers (1960-2010)
ElGee
Assistant Coach
Posts: 4,041
And1: 1,202
Joined: Mar 08, 2010
Contact:

Re: Data Sets 

Post#2 » by ElGee » Fri Apr 23, 2010 1:37 am

Silver Bullet wrote:Hey guys I need some help

Compiling data sets is the most time consuming and arduous process in Statistical Analysis. It can take hours or days, depending upon the dataset to compile.

It would help us all greatly if we share our datasets and put together a database of all the sets.

So if you wouldn't mind sharing some of the data sets you have compiled over the past few months years, it would be a great help.

I'll start of with my most recent one:

Shooting Percentages for all guards averaging over 32mpg and 20ppg (1964-2010)
http://rapidshare.com/files/378985085/Guard_Shooting_.xlsx.html

Some other ones I'm looking to compile but may not have time for, so if somebody would be willing to share, it would help a lot.

Shooting Percentages for all point guards (1960-2010)
Shooting Percentages for all shooting guards (1960-2010)
Shooting Percentages for all small forwards (1960-2010)
Shooting Percentages for all power forwards (1960-2010)
Shooting Percentages for all centers (1960-2010)


Interesting collaboration - but the ones listed are relatively simple calculations with B-R's filter, no?
Check out and discuss my book, now on Kindle! http://www.backpicks.com/thinking-basketball/
User avatar
Silver Bullet
General Manager
Posts: 8,313
And1: 8
Joined: Dec 24, 2006

Re: Data Sets 

Post#3 » by Silver Bullet » Fri Apr 23, 2010 2:17 am

ElGee wrote:
Silver Bullet wrote:Hey guys I need some help

Compiling data sets is the most time consuming and arduous process in Statistical Analysis. It can take hours or days, depending upon the dataset to compile.

It would help us all greatly if we share our datasets and put together a database of all the sets.

So if you wouldn't mind sharing some of the data sets you have compiled over the past few months years, it would be a great help.

I'll start of with my most recent one:

Shooting Percentages for all guards averaging over 32mpg and 20ppg (1964-2010)
http://rapidshare.com/files/378985085/Guard_Shooting_.xlsx.html

Some other ones I'm looking to compile but may not have time for, so if somebody would be willing to share, it would help a lot.

Shooting Percentages for all point guards (1960-2010)
Shooting Percentages for all shooting guards (1960-2010)
Shooting Percentages for all small forwards (1960-2010)
Shooting Percentages for all power forwards (1960-2010)
Shooting Percentages for all centers (1960-2010)


Interesting collaboration - but the ones listed are relatively simple calculations with B-R's filter, no?


Well, BR only filters by guard, forward and center.

And it would probably be close to 50 pages of data that you'd then have to clean up, because B-R puts in headers, every 10 or 15 lines.
bbstats
Ballboy
Posts: 4
And1: 0
Joined: Apr 26, 2010

Re: Data Sets 

Post#4 » by bbstats » Tue Apr 27, 2010 1:42 pm

how to clean headers:

import via excel ("from web" in 2007) for each page

select all the cells you want, run an auto-filter, sort the rank column from smallest to largest, delete the ones that don't have numbers.
azuresou1
Head Coach
Posts: 7,416
And1: 1,072
Joined: Jun 15, 2009
   

Re: Data Sets 

Post#5 » by azuresou1 » Tue Apr 27, 2010 6:09 pm

Silver I hope you didn't take that much time to do your sorting, because bbstats' solution does everything in, like, 20 minutes tops.
User avatar
Silver Bullet
General Manager
Posts: 8,313
And1: 8
Joined: Dec 24, 2006

Re: Data Sets 

Post#6 » by Silver Bullet » Fri Apr 30, 2010 12:56 am

bbstats wrote:how to clean headers:

import via excel ("from web" in 2007) for each page

select all the cells you want, run an auto-filter, sort the rank column from smallest to largest, delete the ones that don't have numbers.


I knew there had to be a simple solution :)

Unfortunately, I did spend eons cleaning it out one by one.
User avatar
CellarDoor
Retired Mod
Retired Mod
Posts: 11,146
And1: 972
Joined: May 11, 2008
         

Re: Data Sets 

Post#7 » by CellarDoor » Fri Apr 30, 2010 4:54 pm

Silver Bullet wrote:
bbstats wrote:how to clean headers:

import via excel ("from web" in 2007) for each page

select all the cells you want, run an auto-filter, sort the rank column from smallest to largest, delete the ones that don't have numbers.


I knew there had to be a simple solution :)

Unfortunately, I did spend eons cleaning it out one by one.

Ouch.
tsherkin wrote:You can run away if you like, but I'm not done with this nonsense, I'm going rip apart everything you've said so everyone else here knows that you're completely lacking in basic basketball knowledge...

Return to Statistical Analysis