...

The ThinkChicago workbench provides access to five datasets from the City of Chicago. These datasets are also available via the City of Chicago Data Portal.

Dataset                              API       Size (CSV)            Description
2FM Tech Challenge                             534M
Crimes 2001 - present                REST API  1.4G CSV; 2.9G JSON
Divvy Trips                          REST API  2.4G CSV; 4.0G JSON
Divvy Bicycle Stations (historical)  REST API  9.4G CSV; 19G JSON
Taxi Trips                           REST API  8.0GB CSV


Each of these datasets is available in the /shared directory of any running application in Workbench. 

Note that some of these files are larger than the memory available to your account, so you will not be able to read an entire file into memory. Please plan accordingly (see Resource limits below).
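One way to work with a file that does not fit in memory is to stream it one row at a time instead of loading it whole. The following is a minimal sketch using only the Python standard library; the function name count_by_column is hypothetical, and the file and column names you pass in must match what is actually present in /shared.

```python
import csv
from collections import Counter


def count_by_column(path, column):
    """Stream a CSV file row by row and count the values in one column.

    Only one row is held in memory at a time, so the file size does not
    matter; memory use grows only with the number of distinct values.
    """
    counts = Counter()
    # newline="" is how the csv module documentation says to open files.
    with open(path, newline="", encoding="utf-8") as f:
        for row in csv.DictReader(f):
            counts[row[column]] += 1
    return counts
```

For example, count_by_column(path, column).most_common(5) would give the five most frequent values of a column, where path points at one of the CSV files in /shared and column is one of its header names (both left for you to fill in).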

What applications are available?

  • Cloud9 development environments for popular languages including Python, Java, PHP, and Node.js
  • Data analysis environments including Jupyter Notebooks and RStudio
  • Database and data management software including MySQL, PostgreSQL, and MongoDB

Resource limits

Each account is limited to 4 cores, 8GB of RAM, and 10GB of storage. This means that you will not be able to read the larger datasets into memory or run many applications at the same time.
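Before loading a file, it can help to compare its size on disk with the memory limit. The sketch below assumes that 8GB means 8 * 1024**3 bytes and uses a hypothetical helper named files_over_budget. Size on disk is only a rough guide: parsed data usually takes more memory than the file itself, so a much smaller budget than the full limit is a safer choice.

```python
import os

# Assumption: the 8GB RAM limit is interpreted as 8 * 1024**3 bytes.
MEMORY_LIMIT_BYTES = 8 * 1024**3


def files_over_budget(directory, budget_bytes=MEMORY_LIMIT_BYTES):
    """Return sorted (name, size_in_bytes) pairs for files at or above the budget."""
    flagged = []
    for entry in os.scandir(directory):
        if entry.is_file():
            size = entry.stat().st_size
            if size >= budget_bytes:
                flagged.append((entry.name, size))
    return sorted(flagged)
```

Calling files_over_budget("/shared", budget_bytes=2 * 1024**3) would list the files of 2GB or more in the top level of the shared directory, which are candidates for streaming or for loading into a database rather than reading in one go.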

Getting data on/off Workbench

  • GitHub: We strongly encourage you to use GitHub (or a similar service) to store all of your source code and data.
  • File Manager: The Workbench File Manager application can be used to upload/download data.

...