Citi Bike is the nation's largest bike share program, with 10,000 bikes and 600 stations across Manhattan, Brooklyn, Queens, and Jersey City. This dataset includes Citi Bike trips since Citi Bike launched in September 2013 and is updated daily. The data has been processed by Citi Bike to remove trips that are taken by staff to service and inspect the system, as well as any trips below 60 seconds in length, which are considered false starts.
This public dataset is hosted in Google BigQuery and is included in BigQuery's 1TB/mo of free tier processing. This means that each user receives 1TB of free BigQuery processing every month, which can be used to run queries on this public dataset. Watch this short video to learn how to get started quickly using BigQuery to access public datasets. What is BigQuery
Try the sample queries below in the BigQuery UI.
What are the most popular Citi Bike stations?
First, let's look at the most popular Citi Bike stations, including their name, location, and number of trips. Run this query.
What are the most popular routes by subscriber type?
Next, let's look at the most popular routes by subscriber type, where “Subscribers” are Citibike members and “Customers” are one-off users. This query uses CONCAT to get the route.
Run this query.
What are the top routes by gender?
This query looks at top routes by gender. Here we get the top female routes in 2016. This query can easily be edited for different years and genders.
Run this query.
For more examples using NYC data in BigQuery see this tutorial
For public use, subject to Citi Bike’s Data License Agreement
Google Cloud Console has failed to load JavaScript sources from www.gstatic.com.
Possible reasons are: