paper: Caleb's Scouting Database 2018

Thread created automatically to discuss a document in CD-Media.

Caleb’s Scouting Database 2018
by: Caleb Sykes

This is a scouting database which provides calculated contributions (OPRs) and other metrics for all teams at each event using the data from the TBA API.

This is a scouting database which provides calculated contributions (OPRs) and other metrics for all teams at each event using the data from the TBA API. Each sheet contains data from a distinct FRC event. A new database will be published weekly within a day or two of all of that week’s events being completed. For sheets which contain events that have not yet occurred, seed values for each category are available in order to aid in pre-scouting.

Caleb’s_scouting_database_2018.0.0.xlsx (6.13 MB)
Caleb’s_scouting_database_2018.0.1.xlsx (6.15 MB)
Caleb’s_scouting_database_2018.1.0.xlsx (7.87 MB)
Caleb’s_scouting_database_2018.2.0.xlsx (13.6 MB)
Caleb’s_scouting_database_2018.3.1.xlsx (14.2 MB)
Caleb’s_scouting_database_2018.4.2.xlsx (17.1 MB)
Caleb’s_scouting_database_2018.5.1.xlsx (18.2 MB)
Caleb’s_scouting_database_2018.6.1.xlsx (19.1 MB)
Caleb’s_scouting_database_2018.6.2.xlsx (17.1 MB)
Caleb’s_scouting_database_2018.6.3.xlsx (17 MB)
Caleb’s_scouting_database_2018.7.1.xlsx (17.2 MB)
Caleb’s_scouting_database_2018.8.1.xlsx (17.5 MB)
Caleb’s_scouting_database_2018.9.1.xlsx (17.7 MB)

1 Like

But like… OPR is as meaningless as it’s ever been…

The other metrics will be cool though! Thanks Caleb!

This is a scouting database which provides calculated contributions (OPRs) and other metrics for all teams at each event using the data from the TBA API. Each sheet contains data from a distinct FRC event. A new database will be published weekly within a day or two of all of that week’s events being completed. For sheets which contain events that have not yet occurred, seed values for each category are available in order to aid in pre-scouting.

This workbook is a continuation of the 4536 scouting databases which I published in 2016 and 2017. However I am no longer affiliated with 4536 and don’t want anyone asking them questions instead of directly asking me, so I have renamed it “Caleb’s Scouting Database”.

Changes since the 2017 version include:
I moved what I consider the most important metrics to the first few columns.
Changed the way I handle sheet names
I have added a “Chairman’s Strength” column, which uses the methodology described in my Chairman’s Predictions books.

For week 1, the only things I would put that much faith in are the Elo ratings and the Chairman’s Strengths. The contributions are just shots in the dark based off of last season’s performances and attributes of the game based on the week 0 competition. Many of the metrics are filled in with -1, these are metrics that I will develop in the coming weeks as we learn more about the game. Most notably, I have left the entire vault section unused since *(https://www.chiefdelphi.com/forums/showthread.php?t=162927) the API data for this section.

If you notice any bugs or have any suggestions for improvements I am more than happy to hear them. I do this for all of you, so I want it to be as helpful as possible.*

We’ll see after week 1. That’s one of the first things I want to find out. I’m reasonably confident my Elo predictions will be better than OPR predictions this year.

Completely agreed. Did you ever get a chance to look into the CA predictions based on only the past 3-5 years rather than from the full life of a team? I know there was that one and a few other potential tweaks that we talked about before kickoff.

It’s definitely on my to-do list, but I’m unlikely to put any effort into it until after the competition season is over. I was just too busy to get around to it in the past few weeks and for the next few weeks I need all my efforts to go into maintaining/improving the scouting database and event simulator. I’ll get around to it eventually though. :wink:

Thank you for doing this, it really helps our small team where getting 5 scouts is a challenge. We find that this data and our scouting data show mostly the same trends in performance but there’s occasional fliers on both sides of the data. The fliers is something we watch closely on day two to narrow down our pick list.

I can’t thank you enough Caleb. This really makes my life as head scouter so much easier!

Caleb, our team enjoyed your event simulator last year for pulling blue alliance data down and analyzing. Are you planning an update for this year? we’d be grateful…

It will be out by tonight. Adapting it for this year has taken a bit more time than I originally anticipated, but I’ve got the night free tonight to wrap everything up. It might not have ranking projections for week 1, but I’ll finish everything else.

Glad you liked it last year, hopefully it will help you this year as well. :slight_smile:

I added a week 0 update with some minor changes. I forgot to add the “calculated contribution to win” metric so that one has been added (I’ve used one of my three placeholders). I also created a navigation sheet since I’m sick of scrolling and trying to use the right-click screen on the bottom left.

Caleb, after an exciting week 1 PWN district and with excitement, I downloaded Caleb’s_scouting_database_2018.0.1.xlsx. Pretty nice and useful.
I’m trying to analyze results for Wamou. The columns on the data import tab seems to cover blue alliance only. Are you able to get the red data as well?

I retract this question. I see red alliance data begins in columns EF and onward. I have what I need and will play with some pivot tables next.

thanks
Ericn99

A week 1 spreadsheet has been uploaded. If you notice any errors please let me know, I’ve made a lot of behind the scenes changes since last year, so I expect that something will be incorrect. I have added in the simple vault metrics since I didn’t notice any problems with vault data like I saw at the week 0 competition. My hope is to have all other unused categories completed by the week 2 update. I also have updated seed values for all teams, even ones that didn’t compete in week 1.

Thanks so much for pulling all this data together, Caleb. You ROCK!

A week 2 book has been added. The big changes are:
The “rate” type metrics have all been added.
Elo was re-calculated for week 1 teams because the standard deviation of scores I used last week was just an estimate based on 8ish events, and not based on scores for all week 1 events.
Hatboro-Horsham was moved to week 5
Chairman’s strengths are pretty messed up right now. Basically, they don’t show up for teams that have already competed. I’m hopeful that I’ll have this sorted out by next week.

Let me know if you notice any errors.

It’s my pleasure.

Awesome; thank you, Caleb!

Hi again, Caleb. Thanks again for putting all this wonderful data together. Thought I’d share how my team 1918 NC GEARS uses your Golden Spreadsheet:

https://public.tableau.com/profile/1918firstroboticsscouting#!/vizhome/2018FIRSTRoboticsCompetitionResultsOverview/MainDashboard?publish=yes

https://public.tableau.com/profile/1918firstroboticsscouting#!/vizhome/2018FIRSTRoboticsCompetitionResultsOverview/BubbleChart

A week 3 book has been added. Chairman’s strengths should all be fixed. It’s looking less and less likely that I’ll ever get around to adding in the alternative Elo metrics. They would have been cool, but the amount of effort it takes to backfill 3 weeks worth of data without causing errors to other things makes me wary of developing these metrics. Maybe near the end of the season I’ll revisit them.

These look really slick. I always think it’s really cool when people take my data and put them into more visually appealing formats. I’d do more of that myself, but maintaining everything properly seems to take up enough of my time.

The seed values were messed up in the book I just published, I’ll have an update posted shortly.

Thanks, Caleb. The work you’re doing to maintain the Excel Workbook and all the tabs is AMAZING and MUCH appreciated, and plenty. There are enough of us who love data that we can each have a role in the “raw data to story-telling” process :slight_smile: HIVE FIVE to group effort!

Here are links to some updated charts based on your most recent file:

Overview:
https://public.tableau.com/views/2018FIRSTRoboticsCompetitionResultsOverview/MainDashboard?:embed=y&:display_count=yes&publish=yes

Bubble Chart, Ownership to OPR:
https://public.tableau.com/views/2018FIRSTRoboticsCompetitionResultsOverview/BubbleChart-OwnershiptoOPR?:embed=y&:display_count=yes

Bubble Chart, Ownership to Ranking Points:
https://public.tableau.com/views/2018FIRSTRoboticsCompetitionResultsOverview/BubbleChart-OwnershiptoRankingPoints?:embed=y&:display_count=yes

Heat Map, Ownership to Ranking Points:
https://public.tableau.com/views/2018FIRSTRoboticsCompetitionResultsOverview/HeatMap-ScaleOwnershipSecondstoRankingPoints?:embed=y&:display_count=yes