Set Up Apache Spark
cd ~/
mkdir spark
cd spark
Download the Spark binary. In our class, we use Spark 3.0.0 preview built for Hadoop 2.7, which matches the Hadoop 2.7 instance we already have up and running.
Unpack the tgz file:
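The download step has no command in these notes, so here is a minimal sketch of one way to do it. It assumes the release is hosted on the Apache release archive at the URL built below and that wget is installed; neither comes from the original notes, so check the URL before relying on it.

```shell
# Assumption: the Apache release archive hosts this build (not stated in the notes).
SPARK_VERSION="spark-3.0.0-preview"
SPARK_PACKAGE="${SPARK_VERSION}-bin-hadoop2.7"
SPARK_URL="https://archive.apache.org/dist/spark/${SPARK_VERSION}/${SPARK_PACKAGE}.tgz"

# Skip the download if the tgz file is already in the current directory (~/spark).
if [ ! -f "${SPARK_PACKAGE}.tgz" ]; then
  # -c resumes a partially downloaded file instead of starting over.
  wget -c "$SPARK_URL" || echo "Download failed; check the URL and your network."
fi
```

The file lands in the current directory under the name used by the tar command in the next step.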
tar -xvzf spark-3.0.0-preview-bin-hadoop2.7.tgz
Create a shorter name by soft link:
ln -s spark-3.0.0-preview-bin-hadoop2.7 spark
cd spark
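To confirm that the unpack and the soft link worked, a quick check such as the following can be run from inside the linked directory. This is a sketch, not part of the original notes: the SPARK_SUBMIT and SPARK_CHECK variables are illustrative names, and printing the version also requires a working Java installation.

```shell
# Run from inside ~/spark/spark, the soft link created above.
SPARK_SUBMIT="./bin/spark-submit"   # launcher script shipped in the Spark binary package

if [ -x "$SPARK_SUBMIT" ]; then
  # Print the Spark version banner to confirm the installation is usable.
  "$SPARK_SUBMIT" --version
  SPARK_CHECK="ok"
else
  echo "bin/spark-submit not found; recheck the unpack and soft link steps."
  SPARK_CHECK="missing"
fi
```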