Introducing Dataset Colab: An Object Detection Dataset Collaboration Software

Dataset Colab is a software that allows all FRC teams to submit their annotated images of the FRC game objects and other robots to a large dataset which they then can download to train object detection models. By signing up you get access to the current 2023 (and 2024 coming soon!) datasets and the ability to upload your datasets to grow the common datasets.

Advantages

  • Benefits of Large Datasets - When training YOLO or other custom neural network based object detection models, larger datasets with more edge cases lead to higher accuracy. Currently, the 2023 dataset is the combination of all the public datasets on Roboflow, and has over 30,000 images which could never be collected by one team.

  • Upload Methods - Dataset Colab currently accepts any uploads in the COCO dataset format and any Roboflow datasets (simply provide the URL and Dataset Colab will fetch the dataset).

  • Community Support - Dataset Colab is a fully open source project (available on Github) developed and maintained by an active and welcoming Discord community. Feel free to ask any questions there!

Comparison with Roboflow

Roboflow is a great tool for annotating images and exporting them to a variety of dataset formats. While Roboflow does allow users to make their datasets open source through Roboflow Universe, it unfortunately does not allow users to collectively collaborate on public datasets. Dataset Colab fills this gap by allowing any user to contribute to collective datasets.

Current Features

  • Uploading both COCO datasets and any public Roboflow dataset.
  • Viewing all of your uploaded datasets and the combined communal dataset.
  • Download the combined dataset via the website or API.

Planned improvements

  • Upload and download datasets in the YOLO format. (January 15th)
  • Uploading private Roboflow datasets. (February 1st)
  • Improved dataset viewing with galleries of images and other stylistic changes to the site. (January 15th)
7 Likes

Cool idea. Would you want to just host it on Roboflow Universe?

I’m one of the founders and we’d be happy to give you increased limits so you can combine/curate the open source datasets into a better one & collaborate with others.

Just send me an email: brad@ domain of the company

3 Likes

This is absolutely revolutionary! I’m using one of the datasets to train a model, will get back with the results!

1 Like

Great resource! Thank you.

That said, the site appears to be broken, failing to fetch projects.

I downloaded the robot dataset in the YOLO format but it also came with the note class. Also, the note and robot classes are switched.

Hi @JaiCodes,

We are working to resolve this now. Sorry for the inconvenience. While the robot dataset came with the note class, the note class is never used in any of the annotations. We will remove it from the robot dataset.

Thank you,
Sean

Dataset Colab now has pretrained object detection models! Download the model weights and follow our documentation to start detecting notes and robots. We currently have 4 pretrained models:

  • YOLOv5n (95.92% mAP)
  • YOLOv5s (96.01% mAP)
  • YOLOv8n (96.25% mAP)
  • YOLOv8s (95.60% mAP)

On 2/1/2024, we will be releasing Limelight + Coral compatible (tensorflow) models: ssd-mobilenet-v2, efficientdet-d0, and ssd-mobilenet-v2-fpnlite-320. Signup to get an email notification when these are released.

1 Like

Is that mAP50? Or 50-95?

mAP50

Do you have the 50-90 numbers as well? They’re usually better at differentiating between models.

mAP50-95 is now added to the Dataset Colab models page.

Seems like the v8s model could use more training. It’s bigger so in order to get better results you have to train it for longer.

1 Like

Update: Dataset Colab now has only note and only robot models in addition to the note and robot models. Visit the models page to download and run these new models.

1 Like

New Google Coral Compatible Tensorflow Models

Dataset Colab now has Tensorflow object detection models (SSD Mobilenet v2 & EfficientDet) that can be used with the Google Coral and a variety of other accelerators! To download and get started using these models visit our models page to download the model and our documentation page for a guide to get started.

  • SSD Mobilenet v2 (92.20% mAP50)
  • EfficientDet (94.50% mAP50)

New Only-Note and Only-Robot Models

In addition to these models we also trained only-note and only-robot models. These models are more specialized and have greater performance. To download and use these models visit our models page .

Recommend New Models

If there are any other models that you would like to see on Dataset Colab, DM me and we would be happy to train them for you!

Can you compile the SSD Mobilenet and EfficientDet models to an EdgeTPU Google Coral?

Looks like the API SSL cert expired, it’s causing errors with the main page.

Hello, this should be fixed, thanks for letting us know!

The site appears to have been shutdown and redirected… does anyone still have access to the original note dataset?

Hello,

We expanded Dataset Colab to Synodic to support non-FRC datasets. I’ll upload the note dataset now.

Thanks,
Sean

Thanks! Where will I be able to find it once uploaded?