Connect Tableau to Spark SQL running in VM with VirtualBox with NAT

This section is specific only to Spark running on Virtualbox VM which network adapter is set to NAT, with IP address to be 10.0.2.15. If your Spark SQL does not run on the VirtualBox VM or if the network adapter of the VirtualBox VM is not set to NAT, then this section does not apply.

First, Hadoop, HIVE, Spark must be up and running:

(spark) [hadoop@master tmp]$ jps
21520 Jps
3110 SecondaryNameNode
3447 NodeManager
3847 RunJar
4120 Worker
3290 ResourceManager
4010 Master
12490 SparkSubmit
2684 NameNode
2861 DataNode
4893 SparkSubmit

Next Spark thrift server needs to be up and running, if not start it by:

$SPARK_HOME/sbin/start-thriftserver.sh

that run the process such as below

hadoop     4893      1  1 17:09 pts/0    00:01:33 /usr/java/default/bin/java -cp /opt/spark/conf/:/opt/spark/jars/* -Xmx1g org.apache.spark.deploy.SparkSubmit --class org.apache.spark.sql.hive.thriftserver.HiveThriftServer2 --name Thrift JDBC/ODBC Server spark-internal

Then you need to with telnet to ensure port 10000 is being listened to:

(spark) [hadoop@master tmp]$ telnet master 10000
Trying fe80::e85c:6b26:5c3d:b72e%enp0s3...
Connected to master.
Escape character is '^]'.

You need to make sure you have setup port forward in VirtualBox network settings, in this example, add a port forwarding rule to forward TCP packets from port 36 of host machine to port 10000 of guest machine

Also, on your windows command window, test with telnet with host port 36 that is set to forward to port 10000 of VirtualBox guest machine

telnet localhost 36

Then specify port 36 in Tableau login window

You should be able to log in from Tableau

Last updated