Using Spark to create a data warehouse
For this project, I created two small data warehouses using Spark. The project was run on Databricks and exported as a notebook. The data comes from two sources: the US Gazetteer files provided by the US Census Bureau, and flight data.
Requirements
Python 3.6
Python Libraries
- pyspark
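
Before opening the notebook outside Databricks, it can help to confirm that the local environment meets the requirements above. The following is a minimal sketch, not part of the original project: the helper names `missing_libraries` and `check_environment` are hypothetical, and it assumes that Python 3.6 is a minimum version rather than an exact pin.

```python
import sys
from importlib import util

# Assumption: the README's "Python 3.6" is treated as a minimum version.
MIN_PYTHON = (3, 6)
# Libraries listed under "Python Libraries" in this README.
REQUIRED_LIBRARIES = ["pyspark"]


def missing_libraries(names):
    """Return the top-level module names that cannot be found for import."""
    return [name for name in names if util.find_spec(name) is None]


def check_environment(version_info=sys.version_info, libraries=REQUIRED_LIBRARIES):
    """Return a list of human-readable problems; an empty list means none found."""
    problems = []
    if tuple(version_info[:2]) < MIN_PYTHON:
        problems.append(
            "Python %d.%d or newer is expected, found %d.%d"
            % (MIN_PYTHON[0], MIN_PYTHON[1], version_info[0], version_info[1])
        )
    for name in missing_libraries(libraries):
        problems.append("Missing library: %s (try: pip install %s)" % (name, name))
    return problems
```

Calling `check_environment()` with no arguments checks the running interpreter and returns an empty list when nothing is missing. On Databricks this check is usually unnecessary, since the runtime ships with Spark.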