Okay, let's look at all FRC geography in 2024

Now that my California thread is resolved, it’s time to look at the FRC landscape more broadly. While this isn’t strictly a regionals vs. districts thread, I think the data are helpful.

“There are so many numbers, just tell me what I should know”

-@KennySandon, probably

Some of you don’t like reading. That’s cool, it’s the holiday season. Here are the points I’ll make below:

  • District teams need to travel an average of 28 miles to their closest event while regional teams need to travel 140 miles to their closest event.
  • If you exclude outliers (teams from countries with no events), we’re talking 28 miles vs. 83 miles. Still a huge difference!
  • Do the same math for teams’ five closest events, and we’re talking about 67 miles on average for district teams and 532 miles for regional teams (or 384 miles if you exclude those international outliers).
  • Only 44% of teams have an event within 15 miles of them. 73% have an event within 50 miles, 85% have an event within 100 miles, and 96% have an event within 500 miles.
  • 4% of teams (152 total) don’t have an event within 500 miles, including 44 teams from Brazil.
Methodology

Geocoding teams and events

Unfortunately, The Blue Alliance no longer geocodes teams returns an empty lat and lng in the API responses. Time to find a new tool!

I was going to geocode the addresses using the Google Maps API, but there are aggressive rate limit/daily quotas even with an enterprise account, and I didn’t feel like waiting. Also, the responses were slower than I had hoped.

So, I used the geocode.maps.co API which is free and has a rate limit of one request per second, which I could deal with. The API utilizes OpenStreetMap which is slightly less accurate than Google Maps, but still quite accurate overall.

Using the API, I geocoded all teams currently registered for an event in 2024. As of the time of running the numbers, that’s 3,566 teams!

Using the same API, I geocoded all events, excluding the FIRST Championship and individual district championships. So, all Regional events and District events are included.

It’s worth noting that for teams, all we have is city, state/province, country, and postal code. I used those points to geocode, so generated latitude and longitude is approximate to the nearest city and there might be a few errors.

Some events have full addresses and other just have a city and state/province. If the full address was available, I used it, and if not, the city/state was a fallback.

Calculating distance

I ruled out the Google Maps driving distance API (which I used for the California analysis) for two reasons. First, the aforementioned rate limits apply to this API, too. Second, I wanted to calculate distances even when driving isn’t possible (for example, calculating all but the closest events for Hawaiian teams means calculating distance as the crow flies).

I then looked at the Open Source Routing Machine which is a super cool tool, also based on OpenStreetMap data, but it’s mostly focused on driving/biking/walking distance and was also a bit overengineered for this project.

I then realized (for which I’m sure the geologists in the crowd are going to say “duh!”) that I can use latitude, longitude, and math I barely understand to do the calculations. Yay!

The formula, to do this math in a spreadsheet (don’t forget to convert the radians!) is:

=ACOS((SIN(RADIANS({Lat1})) * SIN(RADIANSP{Lat2}))) + (COS(RADIANS({Lat1})) * COS(RADIANS({Lat2}))) * (COS(RADIANS({Lng2}) - RADIANS({Lng2})))) * 6371*0.621371

That formula calculates distance between two pairs of coordinates in miles.

I performed that calculation for each event-team pair. 162 events * 3,566 teams = 577,692 calculations! My computer had a lot to say about that, but ultimately did what it was told.

Calculating stats

I looked at the results of the above calculation and filtered values above or below certain thresholds and did some basic average calculations.

Caveats

  • Address data is best guess as described above.
  • No effort was made to determine teams’ eligibility to attend events (e.g. if a team in California is close to a PNW district event, they’ll be counted as close to that event, even though they can’t compete).

On to the data…

Data Points

I’ll use the following data points in the rest of my post:

Data Point Definition
Events <15 mi Number of events less than 15 miles (24 km) from the team
Events <50 mi Number of events less than 50 miles (80 km) from the team
Events <100 mi Number of events less than 100 miles (160 km) from the team
Events <500 mi Number of events less than 15 miles (805 km) from the team
Avg closest 5 events Mean distance of the closest five (5) events to the team

Differences in the Regional and District model, and between districts

Minimum distance to closest event

District Avg minimum distance to an event Avg minimum distance to an event (excluding countries with no events)
FMA 14 mi. 14 mi.
FIM 18 mi. 18 mi.
CHS 19 mi. 19 mi.
NE 21 mi. 21 mi.
FIN 31 mi. 31 mi.
ONT 33 mi. 33 mi.
PNW 33 mi. 33 mi.
FNC 33 mi. 33 mi.
ISR 35 mi. 35 mi.
PCH 35 mi. 35 mi.
FIT 60 mi. 60 mi.
Regional 140 mi. 83 mi.

There is a huge disparity between regional teams and district teams when it comes to the minimum distance required to get to one’s closest event. On average, district teams have just 28 miles to their closest event while regional teams have an average of 140 miles.

In fairness, there are some very isolated regional teams in remote areas. To give the greatest benefit of the doubt, I added a column for regional teams if I exclude any countries without a regional event. This excludes Azerbaijan, Barbados, Belize, Bulgaria, China, Colombia, Croatia, Czech Republic, Dominican Republic, Ecuador, France, Gambia, India, Japan, Lesotho, Netherlands, Pakistan, Panama, Philippines, Poland, Romania, Singapore, South Africa, Sweden, Switzerland, Taiwan, United Kingdom and Vietnam.

When we do that, that, the average minimum distance for regional teams drops to 83 miles which is still way more than the district average of 28 miles.

Interestingly, most districts have a number of <=35 miles, with FIRST In Texas being the only exception. This can be attributed to both Texas’ geography and the fact that FIT has teams from both Texas and New Mexico (which, unlike the other multi-state districts, spans pretty wide geography).

Minimum distance to five closest events

Now, all the same data, but using the five closest events to teams instead of just one:

District Average of Avg closest 5 events Average of Avg closest 5 events (excluding countries with no events)
FMA 28 mi. 28 mi.
FIM 39 mi. 39 mi.
NE 48 mi. 48 mi.
ONT 58 mi. 58 mi.
CHS 63 mi. 63 mi.
PNW 71 mi. 71 mi.
FIN 82 mi. 82 mi.
FNC 85 mi. 85 mi.
PCH 116 mi. 116 mi.
FIT 122 mi. 122 mi.
ISR 171 mi. 171 mi.
Regional 532 mi. 384 mi.

Here we start to see more of a discrepancy between districts (FMA, FIM, NE all have 5+ events in “driving distance” while in PCH, FIT, ISR, this number is over 100 miles). Not all districts are created equally.

When we compare to regionals, the discrepancies get even greater with this number being either ~400 or ~500 miles depending on how you slice the numbers.

Obviously, in neither model are most teams attending five events, but this negates the “you have so many events out there” argument and teams don’t have very many choices where they go. If you look at the California thread, you’ll see an example of how some events are over-subscribed and others under-subscribed.

Events with a certain distance - by district

District Average # of Events <15 mi Average # of Events <50 mi Average # of Events <100 mi Average # of Events <500 mi
CHS 1 2 4 66
FIM 2 7 14 62
FIN 0 1 4 60
FIT 1 2 3 15
FMA 1 7 12 59
FNC 0 1 3 40
ISR 1 3 4 4
NE 1 3 8 40
ONT 1 4 7 72
PCH 0 1 2 25
PNW 1 3 4 11
Regional 1 1 3 23

This chart doesn’t show a ton of helpful data, but there are some interesting tidbits (like how ISR is really an island of a district and how PCH, FIN, and FNC have events >15 mi on average…showing their relative size and geographic differences, etc.)

More details on events within a certain distance next…

Events within a certain distance - Overall

Events <15 mi

Most teams (56%) do not have an event within 15 miles. On average, teams have 0.7 events within 15 miles of them. There are two (2) teams who have a whopping seven (7) events within 15 miles of them! Both of those teams are located in Southfield, Michigan, USA.

Events <50 mi

This is probably around what most consider “reasonable driving distance” to an event. Of course, there are places where this equates to multiple hours of traffic, but on the average, this is roughly the commuting vs. overnight breaking point. Most teams (73%) have at least one event within 50 miles. On average, teams have 2.6 events within 50 miles of them. There are 22 teams who have a whopping 15 events within 50 miles of them, and they’re all somewhere in Michigan, USA!

Events <100 mi

We’re starting to get out of daily commuting distance, but are still in what most would consider “local.” Even more teams (85%) have at least one event within 100 miles. On average, teams have 5.1 events within 100 miles of them. This is cool, because teams who want to stay close-ish should, in theory, be able to do that. There are two (2) teams who have a whopping 24 events within 100 miles of them and you guessed it…they’re both from Michigan, USA.

Events <500 mi

Most teams will max out well before reaching this threshold (except maybe for the FIRST Championship), so these data points are less helpful practically, but still interesting. Nearly all teams (96%) have at least one event within 500 miles. That still means there are 152 teams with no events in that range! Of these 152 teams, 44 teams are from Brazil, 16 teams are from China, another 16 teams are from Mexico, 12 teams are from Taiwan, and the rest are scattered in over two dozen additional countries. Most of these teams are one of, or the only, team in their local area. I didn’t realize how spread out Brazil is until I embarked on this project. I’d love to hear about the geography as it relates to FRC from someone who knows the landscape better.

On average, teams have 34 events within 500 miles of them. There are five (5) teams who have a whopping 83 events within 500 miles of them. Curveball! They’re all from Pennsylvania, USA which seems to be the sweet spot to be kind-of-close-ish to many districts.

38 Likes

This is very interesting, thank you for putting this together and summarizing it so nicely!
What would a table for “minimum distance to three closest events” look like? 5 events is a bit much, but 3 is a somewhat common occurrence that we’ve seen a fair bit of discussion about, over in the Cali thread.

6 Likes

I wish:

QiAhVj81

Interesting stuff though. Nice to reinforce one of the positives of having districts (more teams having events near them).

2 Likes
District Average of Avg closest 3 events Average of Avg closest 3 events (excluding countries with no events)
FMA 22 mi 22 mi
FIM 30 mi 30 mi
ISR 35 mi 35 mi
NE 36 mi 36 mi
CHS 39 mi 39 mi
ONT 47 mi 47 mi
PNW 47 mi 47 mi
FIN 59 mi 59 mi
FNC 59 mi 59 mi
PCH 89 mi 89 mi
FIT 90 mi 90 mi
Regional 420 mi 289 mi

Unfortunately, there will always be outliers. Even if we had an FRC team in every high school, there are so many high schools that are very far away from the next high school. No matter of team growth can ever solve long distances for everyone.

5 Likes

Thanks for doing that, even if it made this data so much more :face_with_diagonal_mouth:

anyways

IMG_1848

3 Likes

For my similar Minnesota/FUM analysis, I first attempted to geocode “[team school] High School, [City], [State]” which worked pretty well. I’ve run through a ton of google maps free credit, so if I was to ever re-do my analysis I was planning on moving to OpenStreetMap as well.

Yes, looking at the data for the average team districts is far better for travel distance (and therefore travel cost). FIRST also seems to be giving districts slightly more championship slots (and a far more satisfying qualification structure).

Interesting. Gut reaction is “Oh, so districts are districts because they have a higher density of teams for their events.”

Then I see FIT.

I don’t know the history of when each district was established, and how their expansion (or contraction) of events played out after that. But it would be interesting to see. But, get some sleep first, Jared.

4 Likes

Our team is about 800 (1300km) miles out from the regional… Its certainly a fun road trip to get the robot over, but we’re not even the worst off in Australia.
The furthest teams in Australia would be from Perth, which is >2000 miles.

1 Like

Yeah, although California is an outlier for the Regional system. California’s geography – its team and event density – is much more like a district system than other areas in the regional system. The main difference is that CA doesn’t have as many events per team as a district.

The areas where we see the regional average distance really dragged down are areas with no events (e.g. China, Alaska, Kansas) areas with a lot of spread out teams and only a small number of events (e.g. Idaho, Hawaii, Colorado).

This seems like a great project for someone else. Sounds like really interesting data, requiring lots of different types of research. If only posting on the Internet was a full time job…

1 Like

There is an event in Kansas, and has been for a number of years.

I agree with and understand the point you’re making, but just wanted to correct this small error :slight_smile:

4 Likes

Great flag. A better example would have been Kentucky with its nine FRC teams and no events (though maybe something is brewing??? idk, someone who knows the area can weigh in)

To provide a different point, New Mexico is the US state with the most teams and no events, which is (I suppose) what happens when a state joins a district with all of its events in another state. This would be like if Hawaii joined a hypothetical California district and all events were in CA, albeit with much higher travel costs than the TX/NM situation.

Interesting. As someone in the regional system, I’d be interested in breaking down the regional travel distances by the team’s state.

2 Likes

Thanks for doing this!

That would be geographers, cartographers, and geodesists, (and yeah, oceanographers like me), not geologists who (roughly) study what the earth is made of and how it got this way.

Also, with a population with this much skew (asymmetry) and particularly kurtosis (having a very long tail), I found the percentile-based statistics far more useful than anything based on the mean.

The growth and density of FRC is similar to that of humanity at least since the invention of agriculture. Areas with rich soil (manufacturing and technology and science) support more farmers (teams) than resource poor areas. Villages (regionals) pop up where the population is sufficiently dense. This in turn increases the nearby population as it then takes fewer resources to be successful, thus enabling more villages and eventually cities (districts).

5 Likes

I’d be curious to see how many of the events near regional teams are district events which they cannot attend. I know this would take a lot more work to figure out, so it’s understandable why you didn’t, but this could very easily skew the math making things even worse on the side of regional teams which already have much further travel. I think it’s also a big barrier to areas like California and New York going district due to the amount of visitors they get.

I really had my “psychics at Caltech” C.J. Cregg moment, huh?

2 Likes

Eligible Events Only

For Regional teams, Eligible events are defined as any Regional event and no district events. For District teams, Eligible events are defined as events within one’s own district, excluding Regional events and events in other districts. While this is not strictly how eligibility works for advancement, awards, etc., this was the only reasonable way for me to perform this analysis.

Number of events within x miles

District Avg. # of ELIGIBLE Events <15 mi Avg. # of ALL Events <15 mi Avg. # of ELIGIBLE Events <50 mi Avg. # of ALL Events <50 mi Avg. # of ELIGIBLE Events <100 mi Avg. # of ALL Events <100 mi Avg. # of ELIGIBLE Events <500 mi Avg. # of ALL Events <500 mi
CHS 1 1 2 2 4 4 7 66
FIM 1 2 6 7 13 14 26 62
FIN 0 0 1 1 2 4 4 60
FIT 1 1 2 2 2 3 9 15
FMA 1 1 5 7 8 12 8 59
FNC 0 0 1 1 3 3 5 40
ISR 1 1 3 3 4 4 4 4
NE 1 1 3 3 7 8 12 40
ONT 1 1 3 4 5 7 9 72
PCH 0 0 1 1 1 2 4 25
PNW 1 1 3 3 3 4 8 11
Regional 1 1 1 1 2 3 9 23
Regional (excluding international outliers) 1 1 1 1 2 3 10 24

In almost all cases, the number of very local events (<15 miles) a team is eligible for is similar to the total number of such events. The only exception here is FIM where the average number of events within 15 miles drops from two to one when considering only local events. This is due to events in ONT which are very close to teams in FIM.

As we approach the 50 mile and 100 mile range, these variances remain relatively small, but do increase, for the same reasons. Just as there are ONT and FIM events/teams nearby, same happens with many other district pairs.

When we get to 500 miles, we start to see the most difference in this chart. For example, CHS teams have an average of 66 events within 500 miles, but only 7 of those events are in-district (since CHS only has 7 total events). Though not restricted by all the same borders, the number does still go down for Regional teams who had an average of 23 events within 500 miles when considering all events, but only have 9 events when considering eligible events.

Average minimum distance to an event

District Avg. Minimum distance to ELIGIBLE events Avg. Minimum distance to ALL events
FIM 18 mi 18 mi
FMA 19 mi 14 mi
CHS 20 mi 19 mi
NE 21 mi 21 mi
FNC 33 mi 33 mi
FIN 33 mi 31 mi
PNW 34 mi 33 mi
ISR 35 mi 35 mi
PCH 38 mi 35 mi
ONT 57 mi 33 mi
FIT 62 mi 60 mi
Regional (excluding international outliers) 85 mi 83 mi
Regional 144 mi 140 mi

We don’t expect there to be much difference in the minimum distance to an event when considering eligibility or ineligibility considering that, in theory, the closest event to teams should generally be one within their own system in which they are eligible to attend. Of course, there are exceptions to this rule. There are teams just on the outside of district borders who find themselves in the regional system but closer to a district event. Or, the biggest discrepancy in the chart above, we see teams who are closer to events in a different district (ONT) than their own (FIM).

With a few exceptions, these differences aren’t major. For those few teams who are the exception, though, this is major, and an argument that teams should be able to opt in to a nearby district even if they are a regional team or assigned to a different district. It’s not a common occurence, but there are certainly teams who would benefit from this.

Average distance to three closest events

District Avg. Distance to three (3) closest ELIGIBLE events Avg. Distance to three (3) closest ALL events
FMA 26 mi 22 mi
FIM 31 mi 30 mi
ISR 35 mi 35 mi
NE 37 mi 36 mi
CHS 42 mi 39 mi
PNW 48 mi 47 mi
FNC 64 mi 59 mi
ONT 78 mi 47 mi
FIN 86 mi 59 mi
FIT 104 mi 90 mi
PCH 115 mi 89 mi
Regional (excluding international outliers) 312 mi 289 mi
Regional 444 mi 420 mi

The chart above shows sadness. In many cases, the distance to a team’s closest three events is not major when considering eligibility. For the reasons outlined below the previous chart, however, the discrepancies get larger when we consider multiple events. In the case of ONT, the average distance to a team’s three closest events is just 47 miles, but if you count in-district events only, it skyrockets to 78 miles. For FIN, it goes from 59 miles to 86 miles. FIT and PCH are both joining the 100+ mile club going from 90 miles to 104 miles and 89 miles to 115 miles respectively.

For regional teams, this problem is bad, too. When excluding international outliers (teams in countries with zero events), the average distance to a team’s three closest events is 289 miles. But, when you exclude the district events at which these teams cannot compete, that jumps to 312 miles. These differences aren’t astronomical, but are not insignifcant, either.

2 Likes

Sure thing.

Data by state - Regional teams and regional events only

District teams are excluded. Regional team distance to district events is excluded, too.

US State Avg. # of Events <15 mi Avg. # of Events <50 mi Avg. # of Events <100 mi Avg. # of Events <500 mi Avg. Min distance to an event (mi) Avg. distance to three (#) closest events (miles)
Alabama 0 0 1 10 61 160
Alaska 0 0 0 0 1360 1725
Arizona 1 1 2 12 37 92
Arkansas 0 0 1 13 93 164
California 1 2 4 15 17 42
Colorado 1 1 1 3 35 297
Florida 0 1 1 4 52 171
Hawaii 1 1 1 1 43 1608
Idaho 0 1 1 4 110 287
Illinois 0 1 2 18 28 87
Iowa 0 0 1 16 85 131
Kansas 0 1 2 13 45 83
Kentucky 0 0 1 14 86 166
Louisiana 0 0 1 6 69 186
Minnesota 0 1 2 13 41 87
Mississippi 0 0 1 9 75 153
Missouri 0 1 2 16 23 81
Montana 0 0 0 3 280 390
Nebraska 0 0 0 16 182 216
Nevada 1 1 1 17 30 154
New York 1 2 2 8 19 58
North Dakota 0 0 1 7 75 207
Ohio 0 1 1 14 53 127
Oklahoma 1 1 2 9 25 118
Pennsylvania 1 1 1 12 27 121
South Dakota 0 0 0 4 317 390
Tennessee 0 1 1 12 46 152
Utah 1 1 1 7 26 228
West Virginia 0 0 0 13 102 154
Wisconsin 0 1 2 15 38 100
Wyoming 0 0 0 3 197 310

Average minimum distance to an event, sorted by distance

This is the same data as the second to last column above, just sorted:

Row Labels Avg. Min distance to an event (mi)
California 17
New York 19
Missouri 23
Oklahoma 25
Utah 26
Pennsylvania 27
Illinois 28
Nevada 30
Colorado 35
Arizona 37
Wisconsin 38
Minnesota 41
Hawaii 43
Kansas 45
Tennessee 46
Florida 52
Ohio 53
Alabama 61
Louisiana 69
Mississippi 75
North Dakota 75
Iowa 85
Kentucky 86
Arkansas 93
West Virginia 102
Idaho 110
Nebraska 182
Wyoming 197
Montana 280
South Dakota 317
Alaska 1360
5 Likes

That’s fascinating. Not as much of a difference as I’d expect, but definitely not insignificant either, especially when looking at closest 3 events. I think this issue is only to to get worse as more regions move to districts, eventually boxing out regions entirely. I’d also be curious to see for international teams how many events are within driving distance of an international airport, but that’s a much weirder statistic and would be tricky to measure. Still. Love the data! Keep at it.

We definitely could reach a point where regionals become too small of a minority and make it much harder to be in that system. Of course, if California, Minnesota, and New York all go to districts (lol, I know), the remaining regional teams would struggle more.

I don’t know if it would financially or operationally make sense for FIRST, but there could certainly be a logistical benefit to moving everyone to districts before we reach that point. That leaves big questions, including what to do with teams far from any events, how to build capacity where it doesn’t currently exist, etc. There are undoubtedly some major benefits to the current system in which becoming a district is largely self-determined, so this would be tough.

Universal points, anyone?

Was curious about how multiregional states switching to districts would affect the overall regional ecosystem. So I took a look at how many slots in each state are being taken up by Regional teams not from said state. I had to combine Missouri and Kansas because Kansas city makes things really complicated to split up and they I feel would be a package deal if they were to transfer to districts because of it.

California: 58 of 531
Missouri/Kansas: 39 of 158
New York: 38 of 253
Minnesota: 33 of 224
Ohio: 32 of 105
Florida: 30 of 153
Oklahoma: 29 of 91
Illinois: 29 of 88
Arizona: 25 of 82
Wisconsin: 25 of 108

1 Like