Data Sets

Posted: Thu Apr 22, 2010 10:53 pm
by Silver Bullet
Hey guys, I need some help.

Compiling data sets is the most time-consuming and arduous part of statistical analysis. It can take hours or days, depending on the data set.

It would help us all greatly if we shared our data sets and put together a database of them.

So if you wouldn't mind sharing some of the data sets you have compiled over the past few months or years, it would be a great help.

I'll start off with my most recent one:

Shooting Percentages for all guards averaging over 32mpg and 20ppg (1964-2010)
http://rapidshare.com/files/378985085/Guard_Shooting_.xlsx.html

Here are some others I'd like to compile but may not have time for. If somebody is willing to share any of them, it would help a lot.

Shooting Percentages for all point guards (1960-2010)
Shooting Percentages for all shooting guards (1960-2010)
Shooting Percentages for all small forwards (1960-2010)
Shooting Percentages for all power forwards (1960-2010)
Shooting Percentages for all centers (1960-2010)

Re: Data Sets

Posted: Fri Apr 23, 2010 1:37 am
by ElGee
Interesting collaboration - but the ones listed are relatively simple calculations with B-R's filter, no?

Re: Data Sets

Posted: Fri Apr 23, 2010 2:17 am
by Silver Bullet
Well, B-R only filters by guard, forward, and center.

And it would probably be close to 50 pages of data that you'd then have to clean up, because B-R puts in headers every 10 or 15 lines.

Re: Data Sets

Posted: Tue Apr 27, 2010 1:42 pm
by bbstats
How to clean headers:

Import each page via Excel ("From Web" in 2007).

Select all the cells you want, run an AutoFilter, sort the rank column from smallest to largest, and delete the rows that don't have numbers.
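
For anyone who would rather script the same cleanup, here is a minimal Python sketch of the idea above: keep the first header, then keep only rows whose rank cell is a number. It assumes the pages have already been saved as CSV text; the column name "Rk", the function name drop_repeated_headers, and the sample rows are all made up for illustration and are not taken from B-R.

```python
import csv
import io


def drop_repeated_headers(rows, rank_col=0):
    """Keep the first row as the header, then only rows whose rank cell is a whole number."""
    rows = list(rows)
    if not rows:
        return []
    header, body = rows[0], rows[1:]
    # Repeated header rows have text (e.g. "Rk") in the rank column, so they fail isdigit().
    cleaned = [r for r in body if len(r) > rank_col and r[rank_col].strip().isdigit()]
    return [header] + cleaned


# Hypothetical sample in the shape described in the thread: the header repeats partway down.
SAMPLE = """Rk,Player,FG%
1,Player A,0.500
2,Player B,0.480
Rk,Player,FG%
3,Player C,0.455
"""

cleaned_rows = drop_repeated_headers(csv.reader(io.StringIO(SAMPLE)))
for row in cleaned_rows:
    print(",".join(row))
```

This is the same trick as sorting on the rank column and deleting the non-numeric rows, except the original row order is left alone. If the rank column is not the first one in your export, pass a different rank_col.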

Re: Data Sets

Posted: Tue Apr 27, 2010 6:09 pm
by azuresou1
Silver, I hope you didn't spend that much time on your sorting, because bbstats' solution does everything in, like, 20 minutes tops.

Re: Data Sets

Posted: Fri Apr 30, 2010 12:56 am
by Silver Bullet
I knew there had to be a simple solution :)

Unfortunately, I did spend eons cleaning it out one by one.

Re: Data Sets

Posted: Fri Apr 30, 2010 4:54 pm
by CellarDoor
Ouch.