Accessing Public Datasets in BigQuery
Prerequisites
Open the Cloud Console.
Open Menu > Big Query > SQL Workspace
data:image/s3,"s3://crabby-images/e5032/e5032219a1f64493c03384257e43b022ffaec9e4" alt=""
Click on Add Data > Explore Public Data sets.
data:image/s3,"s3://crabby-images/91c55/91c55b8378c9ba0c0a4203b7fd6eb48929850ad9" alt=""
Type NYC bike.
It will show New York citi bike trips data. Click it.
data:image/s3,"s3://crabby-images/10af9/10af97f3e78d6e9c11985d77c848a6de17faebae" alt=""
Click on View dataset.
data:image/s3,"s3://crabby-images/d83b3/d83b3513f2796c351831eb675a27754a18d5b88f" alt=""
The dataset will be added.
Expand the added data set
data:image/s3,"s3://crabby-images/5635c/5635cbff793d6e84e5374200d5f688878e118b9c" alt=""
Goto new_york_citibike > citibike_trips
data:image/s3,"s3://crabby-images/91a44/91a44e5da707b071a8e2abecd5c5efc961be20c7" alt=""
Click on Schema. It will show the schema of table.
data:image/s3,"s3://crabby-images/59160/591601cec413e5af8936d6a90204297b29f55e4b" alt=""
Click on Details. It will show the details of the Table.
data:image/s3,"s3://crabby-images/bfa9c/bfa9c766a1f05d1cb7cb0f43fddb7d3cbe97a489" alt=""
Click on Preview. It will show the table.
data:image/s3,"s3://crabby-images/6c71c/6c71cfa4d9ecdc0a8d63da51f071353a611f5031" alt=""
Click on Compose new Query.
data:image/s3,"s3://crabby-images/de1b7/de1b77add44086bc670eccad42aa3ccf329bedee" alt=""
Paste the below code in query editor.
SELECT
MIN(start_station_name) AS start_station_name,
MIN(end_station_name) AS end_station_name,
COUNT(tripduration) AS num_trips
FROM
bigquery-public-data.new_york_citibike.citibike_trips
WHERE
start_station_id != end_station_id
GROUP BY
start_station_id,
end_station_id
ORDER BY
num_trips DESC
LIMIT
10
Click RUN
data:image/s3,"s3://crabby-images/a10ef/a10eff8ebdd4447da2cffdb236557031afa1f4c5" alt=""
It will show the Results of query.
data:image/s3,"s3://crabby-images/4ec3d/4ec3d4451236c36d7beb99e1d1eded8ad05b0c59" alt=""
Click on Query history. It will show the query history.
data:image/s3,"s3://crabby-images/dea02/dea02e0f615f6f9c1f17ce49f0f4e9bfaf948a0a" alt=""
Click on Job Information.
It will show the job description.
data:image/s3,"s3://crabby-images/12235/122356f0d3e159ebb9d6e7074c758aab7a92f623" alt=""
Click on Execution Details. It will Show the execution details
data:image/s3,"s3://crabby-images/1dbad/1dbad91093a4e67a79da6c32cf60d5979b194208" alt=""
To save Query,
Click on Save Query button.
data:image/s3,"s3://crabby-images/cc652/cc652e542bcfb29fbc21f2c1dcbdbd837edf15fc" alt=""
Give the query name.
Choose the visibility mode. Click Save.
data:image/s3,"s3://crabby-images/0210f/0210f98c12403e6a1cd0ab53d66e8d419c2a1737" alt=""
To see the Query settings, Click on More > Query Settings.
data:image/s3,"s3://crabby-images/2938f/2938f0837f817f75768e938262f55b7236c619d0" alt=""
In Query settings, you can choose the engine for query.
You can change the destination for your query result. Instead of temporary table, you can choose a table in the project itself.
Select the destination table for query dataset.
Choose the project and dataset.
Give the table name and write preference.
Click Save.
data:image/s3,"s3://crabby-images/25b7a/25b7af0906997a14217c5a4e287401d2cb01129e" alt=""
Write the below query.
WITH
trip_distance AS (
SELECT
bikeid,
ST_Distance(ST_GeogPoint(s.longitude,s.latitude),
ST_GeogPoint(e.longitude,e.latitude)) AS distance
FROM
bigquery-public-data.new_york_citibike.citibike_trips,
bigquery-public-data.new_york_citibike.citibike_stations as s,
bigquery-public-data.new_york_citibike.citibike_stations as e
WHERE
start_station_id = s.station_id
AND end_station_id = e.station_id)
SELECT
bikeid,
SUM(distance)/1000 AS total_distance
FROM
trip_distance
GROUP BY
bikeid
ORDER BY
total_distance DESC
LIMIt
5
Click Run.
data:image/s3,"s3://crabby-images/29b37/29b376931a8f6ac135031683b4e71d2da6ebd3f9" alt=""
The result of query will be displayed in query results. It will be saved into new table named table_query_result
data:image/s3,"s3://crabby-images/f18b4/f18b405d59fd7844b7af68a81c039798408069e5" alt=""
SELECT
wx.date,
wx.value/10.0 AS prcp
FROM
bigquery-public-data.ghcn_d.ghcnd_2015 AS wx
WHERE
id = ‘USW00094728’
AND qflag IS NULL
AND element = ‘PRCP’
ORDER BY
wx.date
Click Run.
data:image/s3,"s3://crabby-images/39911/39911ed5e993d5d2d23e09493a41c0f28aa98683" alt=""
The query results will be displayed.