index

Install Python findspark library to be used in standalone Python script or Jupyter notebook to run Spark application outside PySpark.

Install Jupyter notebook Spylon kernel to run Scala code inside Jupyter notebook interactively.

To install findspark library for Python, open an Anaconda command prompt.

Go to virtual environment spark you have created that has Python 3.6 and Jupyter notebook

conda activate spark

For the first time, update pip

pip install pip --upgrade

pip install findspark

Next is to install Spylon library for Jupyter notebook to run Scala commands inside Jupyter-notebook

Open an Anaconda command prompt as administrator

conda activate spark

pip install spylon-kernel

Create a kernel spec for Jupyter notebook

python -m spylon_kernel install --user

Last updated