Log in

View Full Version : paper: 2009 Team 1114 Championship Scouting Database


Karthik
07-04-2009, 21:38
Thread created automatically to discuss a document in CD-Media.

2009 Team 1114 Championship Scouting Database (http://www.chiefdelphi.com/media/papers/2236?) by Karthik

Jared Russell
07-04-2009, 21:43
Just. Awesome.

Karthik
07-04-2009, 21:44
Greetings all,

Attached is the 2009 Team 1114 Championship Database. This year's database includes full results for every team who competed in the 2009 season as well. Version 1.0 does not have the divisional assignments. We will issue an updated version on the evening of the day the divisions are announced. This will be the final update of the database.


The database includes:
- An interface allow you pull an individual team's record
- Full listing of awards, record & finish
- Team scoring averages
- "Calculated Contribution" which is the same calculation being refered to as OPR on these forums. This calculation usings linear alegbra to determine what a team's average input to their alliance was at each regional. (Only using qualifying match results) We found an average correlation between R = 0.7 and 0.85 to observed emperical scouting data
- A master sheet for a sortable comparison of all FIRST teams

Version 2.0 will include
- Master sheets for each division and full divisional assigments

The data we have was all mined from the FIRST website. There may be some errors, but I'm confident the data is 97.1114% accurate. If you notice any errors please respond to this thread, and we will correct them for Version 2.0.


Prior to 2008 we never released any of our regression analysis (Calculated Contribution) that we had been doing since 2004. Since people have become more knowledgeable on the subject we decided to make the change. Please do not take a poor score as a slight or an insult. We simply used the actual scores from matches to perform a calculation. We feel that this tool is the best available metric if you are unable to watch the actual matches. Since none of us can attend every regional, it should be a valuable tool. That being said, because of the defensive nature of Lunacy, the regression analysis is not as valuable as it was in more offensive games such as Overdrive.

Thanks to Geoff Allan and Roberto Rotolo of Team 1114 for creating this year's database.

If you have any questions, please ask.

Edit: Version 2.0 is now up, with divisions.

wo-bot 141
07-04-2009, 21:46
team 141 also won the Entrepreneurship Award at MI state even though we didn't go to state. it was not on our awards list.

We were picked 3rd at Wisconsin not 14th thanks.

Vikesrock
07-04-2009, 21:46
As far as I can tell any team that competed at 10,000 Lakes or Northstar as their first regional is missing from the database.

Also 2970 was the second pick at Northstar and 2549 was 15th.

Ericgehrken
07-04-2009, 21:52
This is awesome!! Thank you!
Where are the Connecticut Regional Teams?

smurfgirl
07-04-2009, 21:53
Thanks for sharing this resource! I make sure my team checks this out every year, at the least as a source of some raw numbers. What they choose to do with them is another story. (:

Akash Rastogi
07-04-2009, 21:53
Some of the data for us is off by a bit. Is there still data to be entered for teams?

Karthik
07-04-2009, 21:55
As far as I can tell any team that competed at 10,000 Lakes or Northstar as their first regional is missing from the database.

Also 2970 was the second pick at Northstar and 2549 was 15th.

Thanks for catching this. There'll be an update released to address this database error.

AcesPease
07-04-2009, 22:00
I searched and found that 716 and 1124 are missing from the database.

IndySam
07-04-2009, 22:01
The record for 829 at Boilermaker is incorrect.
We were 6-5, 16th seed and Semi-finalist.

GreerD
07-04-2009, 22:10
Wow. This is great. Thank you to 1114!

Karthik
07-04-2009, 22:10
I searched and found that 716 and 1124 are missing from the database.

Yup. We received a bad dump of data, so there are about 136 missing teams. We will be releasing a new version very shortly.

The record for 829 at Boilermaker is incorrect.
We were 6-5, 16th seed and Semi-finalist.

The elimination performance is being fixed. As for the record, according to the FIRST site we have you at 5-5.

http://www2.usfirst.org/2009comp/events/IN/matchresults.html

If these standings are incorrect and someone has the correct version, please pass them along via PM.

Also, we are looking for complete alliance selection data from both Minnesota events. If anyone has the full draft order, it would be greatly appreciated. (FYI, The FIRST site lists the teams in random order for elimination matches, so mining this data is one of our tougher tasks)

Tom Schindler
07-04-2009, 22:21
Converter to read with office 2003:

http://www.microsoft.com/downloads/details.aspx?familyid=941B3470-3AE9-4AEE-8F43-C6BB74CD1466&displaylang=en

Thanks Karthik, you guys are awesome for providing this!

Tom

Chris Fultz
07-04-2009, 22:29
Thanks to everyone on 1114 who worked on this - exceptional work.
What an awesome database.

Karthik
07-04-2009, 22:41
Yup. We received a bad dump of data, so there are about 136 missing teams. We will be releasing a new version very shortly.


Alright. Version 1.1 is now up. The database error has been fixed, and this version should not be missing any teams. Thank you for your patience, and special thanks to Geoff and Roberto for working so quickly to get this new version up.

Also updated are the alliances for MN2, as well as a few other miscellaneous errors that were pointed out.

I have removed version 1.0 to reduce any possible confusion. Look for version 2.0 the evening divisions come out.

Chris is me
07-04-2009, 22:42
This sounds and looks amazing but unfortunately doesn't work in OOo 3, so I can't use it. Not like I was expecting it to, but it would be nice. If you release an 03 version it's more likely to work in OOo.

AlexD744
07-04-2009, 22:46
How come I can't open this, what is it supposed to be opened with. Excel isn't working.

Vikesrock
07-04-2009, 22:49
How come I can't open this, what is it supposed to be opened with. Excel isn't working.

It's an Excel 2007 file. In an earlier post in this thread Karthik said they are working on getting a 2003 version up soon.

Josh Goodman
07-04-2009, 22:51
Man I really need XCELL07. Now.

AlexD744
07-04-2009, 22:54
I have 03, okay well soon enough, Thank You!!

Karthik
07-04-2009, 23:04
I have 03, okay well soon enough, Thank You!!

Soon enough is now. :)

An Excel 2003 version of the database has now been uploaded. It's a 7.0 MB file, zipped to ~2.0MB.

Some formatting may display incorrectly in the 2003 version.

Jonathan Norris
07-04-2009, 23:15
Thanks to Geoff Allan and Roberto Rotolo of Team 1114 for creating this year's database.

Roberto did a great job once again this year!

Thanks Simbots!

jblay
07-04-2009, 23:16
thanks a lot for this. It is a great tool for looking back on a season and also in preparing for nationals.

AndyB
07-04-2009, 23:17
Once again, fantastic job 1114! Aside from TBA, this is probably the best resource available for Championship pre-scouting.

chessking132
07-04-2009, 23:24
Nice job it is always cool to see how your team compares to other teams out there.

Matthew Simpson
Team 75 Driver

Dan2081
08-04-2009, 00:19
This is soooooo cool!
.. but a little dissapointing seeing our good numbers but not qualifying for atl :(

Josh Goodman
08-04-2009, 09:06
THANKS KARTHIK! :D :D :D

EricLeifermann
08-04-2009, 09:41
2826 won the creativity, all-star, and the highest rookie seed at 10000 lakes. It is missing in your data. Other wise all the teams i looked at were correct and this thing is AWESOME! Thanks alot as always.

Peter Matteson
08-04-2009, 10:01
Glad to see the Simbots coming through for us again with their excellent scouting data.
Thanks, and see you next week!
Pete

Brandon Holley
08-04-2009, 10:23
Simbots have outdone theirselves again...very impressive scouting system Karthik!

-Brando

nahstobor
08-04-2009, 10:30
very very very nice, this should be helpful when the division list come out.

Question: Is there any way to sort teams going to Championship by offensive and defensive rankings?

Karthik
08-04-2009, 10:40
Question: Is there any way to sort teams going to Championship by offensive and defensive rankings?

Yup, sort descending by which ever ranking you wish, then sort ascending by division. All the teams who are registered for the Championship are listed as "Division still pending" and move to the top of the list.

Jared Russell
08-04-2009, 15:15
Karthik,

It looks like you have the finalists and winners for the SBPLI Long Island Regional reversed (else the FIRST site and Blue Alliance have them backwards).

Thanks!

davidfv
08-04-2009, 19:17
Wondering if you missed data on Team 236?
TBA has them at 5-5-0 in the Conn. Regional.

Thanks

Karthik
08-04-2009, 19:35
Wondering if you missed data on Team 236?
TBA has them at 5-5-0 in the Conn. Regional.

The records shown in the database are for qualifying matches only. TBA shows qualifying + elimination records.

harleywhite
08-04-2009, 21:23
Thanks Karthik and 1114! This is a great resource and will definitely help when scouting divisions in Atlanta.

Karthik
08-04-2009, 21:24
The updated files with the divisions and a few corrections are now uploaded. Please see Version 2.0.

Thanks to all who provided updated information, especially IndySam!

115inventorsam
08-04-2009, 21:44
Thanks a bunch! This certainly makes a scout's job a little easier.

Another little mistake I noticed for SAC, 115 was 3rd pick, and 692 was 14th pick, not the other way around. And 766 was 4th pick, and 2063 was 13th pick, that got switched around as well, but those mistakes are not too surprising, nor should they really matter, hopefully.

MrForbes
08-04-2009, 22:05
I looked at Newton....aack!!!!

Cuse
08-04-2009, 22:37
Karthik, this is staggering--incredible job by you and Simbotics. This entire system is incredibly elegant and well done, not to mention useful.

Small correction, 175 was a Finalist at CT, not Semi-Finalist.

Akash Rastogi
08-04-2009, 23:12
Woot 1114!

Thanks so much, now I have more things to waste my time with! :D

Analyzing divisions is fun with this.

AlexD744
09-04-2009, 00:28
Nevermind, found what I was looking for.

kulisb
10-04-2009, 01:23
Thanks for sharing your database, 1114! This will be a great tool for sorting through some data in preparation for Atlanta. You made all of our jobs a whole lot easier!

Gaurav27
10-04-2009, 11:38
This is an extremely powerful program, very impressive scouting system Karthik! :cool:

Just added all filters for every column in the sheets. This is just amazing!:ahh:

Akash Rastogi
12-04-2009, 00:08
Karthik mathematically explained a lot of holes in OPR during FIRSTcast this week. I suggest teams check out why. Good show Karthik.

http://www.firstcast.org/