Create emr_pipeline folder

In this module we will create a folder named: “emr_pipeline” that will store scripts and raw data

Make sure you are in the root directory of your project:

mkdir emr_pipeline

S3

cd .\emr_pipeline\
mkdir data
mkdir scripts

S3

mkdir data/sales_data_raw/
code data/sales_data_raw/sales_data.csv

Paste the values of this file

Link to the data

S3