Pulling a List of Teams into Excel

For the past year, I’ve been using Excel’s web queries to pull a list of teams attending the event. The process is relatively simple, go to frclinks.com/event/gadu, point to the table that contains the information, and bam, done.

However, recently, the web queries have been failing, and the information isn’t showing up.

I’m at a bit of a loss and am wondering if anyone else is either (a) seeing a similar failure and (b) has a solution that let’s me continue to avoid parsing through the HTML.

Thanks,

  • Sunny G.

The Blue Alliance has CSV dumps of all events in our database. For your GADU example, the team list would be here: https://raw.githubusercontent.com/the-blue-alliance/tba-data-backup/master/tba-data-backup/2014/2014gadu/2014gadu_teams.csv

Complete data cam be found at: GitHub - the-blue-alliance/the-blue-alliance-data: Periodic data dumps from The Blue Alliance

Edit: We will provide a CSV dump of all teams (name, sponsors, location, etc.) in the near future.

My need is more along the lines of getting the most up-to-date information straight from FIRST’s webpages.

  • Sunny G.

FIRST added a check on the HTTP Referer header around the time that divisions came out, preventing direct access to event team lists unless linked there from the FIRST website itself. This was presumably to save their server from the deluge of traffic from the various “are divisions out yet” scrapers.

I have a workaround that makes FRCLinks still work, but unfortunately Excel will keep running into that HTTP Referer issue, since it tries to access the data directly at that https://my.usfirst.org URL that FRCLinks redirects to.

So rather than actually provide an easy way for the community to get to the information, they’ve intentionally obscured it.

Dear Frank,

Please get your IT guys straightened out and provide a method for FIRST developers to get their info! Thanks,

All of FIRST

Is the block just for event teams list? I didn’t seem to have a problem with my scouting database macros when I access Team Standing and Match Results.

I believe it is just for the team lists. I didn’t have a problem with my standing and match result queries either…

Anything with a my.usfirst URL is affected.

I have another spreadsheet that gets the team lists from usfirst website directly and it still works. How do we access through the my.usfirst for team list? I must not have used it that way.

Team list for CASJ here: Silicon Valley Regional FRC Event Web : Event Information

You can’t directly navigate to https://my.usfirst.org/myarea/index.lasso?page=teamlist&event_type=FRC&sort_teams=number&year=2014&event=casj due to the reason Pat mentioned above. FRCLinks has a redirect hack that makes it work.

What page do you use to get team lists that’s not a my.usfirst page?

If you need only current data (and not anything historical):

https://my.usfirst.org/frc/scoring/index.lasso?page=eventlist contains and ID_event column, which you can use like https://my.usfirst.org/frc/scoring/index.lasso?page=event_teamlist&ID_event=10837 (for Traverse City). Is this good enough?

Looks like Ed pulls his information from the event pages (www2.usfirst.org) and not from the event listing.

This actually works pretty well. I’m wondering (a) where do we get event ids and (b) why this data pulling isn’t using an HTTP referrer check.

Interesting,

  • Sunny G.

This is interesting. It works well.

I was tired last night and got confused. I thought I was pulling team list data from another place. It turns out my other spreadsheet pulls in match schedule and determine the team list that way. It is only useful after match schedule is out and before matches begin.

Oh I was using the www2.usfirst URLs so I didn’t realize it. Thanks for the info

The referrer check is just on https://my.usfirst.org/myarea URLs. The https://my.usfirst.org/frc/scoring URLs are used by FMS to download team lists when setting up at an event. They’re tab-separated tables, so they’re easier to parse but uglier to look at which is why I implemented the hack in FRCLinks instead of redirecting to them.

You can get the event IDs by looking at the list of all events: https://my.usfirst.org/frc/scoring/index.lasso

I’m guessing there’s no referrer check on them because then FMS would have had to add that header in its data download requests.

Didn’t FRCLinks previously have a link to these more ugly events? I use them for a lot of my analysis as the data is easier to parse but I always have to search for the links.

I had a link to them for a very brief period of time one year when divisions were posted there but not on the regular team listings, but not apart from that.

Any chance we could get a flag for it? (Sorry to add work)

Also, it turns out some my.usfirst pages not only check for the Referer header, but also a Cookie header.

See https://github.com/the-blue-alliance/the-blue-alliance/commit/63090e3caefee1d900b4efc1d3cbafc41df67abc