
Portfolio

Using Spark to create a data warehouse

Code

For this project, I created two small data warehouses using Spark. The project was run on Databricks and exported as a notebook. The data comes from two sources: the US Gazetteer files provided by the US Census Bureau, and flight data.

Requirements

Python 3.6

Python Libraries
