# Dev environment setup, task list

### Dev environment setup, task list

Install Java devilment toolkit 1.8 (required, higher version Java may not work properly with Spark, we found out jdk1.8.0\_191 works best, therefore, recommend jdk1.8.0\_191. (Do NOT install JDK 9 or above, Spark is not currently compatible with JDK 9 or above.)

Download and install Anaconda Python and create virtual environment with Python 3.6 (work best with most of the deep learning libraries)

Download and install Spark

Download and install Scala (optional, as Spark has scala)

Setup Spylon-kernel on jupyter-notebook, we will be using jupyter-notebook for both Python and Scala

Setup Elipcse Scala IDE

#### I have detailed video presentation on how to the developer workstation on Windows 10

[Video presentation in the Appendix](/data-science-and-apache-spark/appendix-video-presentations.md)


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://george-jen.gitbook.io/data-science-and-apache-spark/dev_setup4.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
