Setup Apache Spark
Setup Apache Spark
cd ~/
mkdir spark
cd spark
Download spark binary, in our class, we use spark 3.0.0 preview with compatibility to hadoop 2.7, which we already have hadoop 2.7 instance up and running.
wget http://apache-mirror.8birdsvideo.com/spark/spark-3.0.0-preview/spark-3.0.0-preview-bin-hadoop2.7.tgz
Unpack the tgz file
tar -xvzf spark-3.0.0-preview-bin-hadoop2.7.tgz
Create a shorter name by soft link:
ln -s spark-3.0.0-preview-bin-hadoop2.7 spark
cd spark
Last updated