What is the ThinkChicago Workbench?
The ThinkChicago Workbench is a cloud-based service with a set of general-purpose development and data analysis environments to help you to explore your ideas with the ThinkChicago data. All applications run as Docker containers on a system hosted by the National Center for Supercomputing Applications (NCSA) at the University of Illinois at Urbana-Champaign.
A few guidelines:
- Start only the services that you need. Each account has limited resources. You will likely only need to run one or two of the provided services.
- For the larger datasets, don't try to read them all at once. You'll need to work with subsets of the data.
- If you have questions/problems, post to Slack (https://thinkchicago.slack.com/)
What data is available?
The ThinkChicago workbench provides access to the following datasets. Many of these datasets are also available via the City of Chicago Data Portal REST API.
Dataset | Description | API | Size (Format) |
---|---|---|---|
2FM Tech Challenge | Fleet and Facilities Management (2FM) vehicle and equipment data. | N/A | 534M CSV |
Array of Things Locations | Locations of Array of Things sensor nodes. | REST API | 6.5K CSV; 28K JSON |
Incidents of crime since 2001. | REST API | 1.4G CSV | |
Individual Divvy bike sharing trips, including the origin, destination, and timestamps for each trip | REST API | 2.4G CSV | |
Historical availability of bicycles and docks to return bicycles at the Divvy stations. | REST API | 9.4G CSV | |
Taxi trips reported to the City of Chicago. | REST API | 8.0GB CSV |
Each of these datasets is available in the /shared directory of any running application in Workbench.
...